Skip to content

Parsing and building fixed width flat files for Spark Datasets.

Notifications You must be signed in to change notification settings

basilwong/spark-fixed-width

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Spark Fixed Width

Framework for parsing fixed width column files into Spark Datasets and generating fixed width column files from Spark Datasets in Java.

Dependencies

Required dependencies added to the pom. Note that these classes will work on any Spark Version containing Datasets < 2.0.

<dependencies>
    <dependency>
        <groupId>org.apache.spark</groupId>
        <artifactId>spark-core_2.11</artifactId>
        <version>2.3.3</version>
    </dependency>
    <dependency>
        <groupId>org.apache.spark</groupId>
        <artifactId>spark-sql_2.11</artifactId>
        <version>2.3.3</version>
    </dependency>
</dependencies>

Utilization

FlatFileGenerator is used to generate flat files from a Dataset. FlatFileParser is used to parse a flat file into a Dataset.

Examples of how to use the classes are in the main.java file.

About

Parsing and building fixed width flat files for Spark Datasets.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages