datasampler

Name                  Modified    Size
Readme.txt            2018-01-23  557 Bytes
datasampling_0_1.jar  2018-01-23  11.8 kB

 -c,--keyColumn <arg>          Key Column for stratified/keylist sampling
 -f,--fraction <arg>           Sample fraction size
 -fm,--fractionMapping <arg>   comma separated pairs of key,fraction size
 -h,--help                     show this help.
 -i,--input <arg>              Input Folder/File path
 -if,--inputFormat <arg>       input file format
 -o,--output <arg>             Output Folder path
 -of,--outFormat <arg>         output file format
 -t,--type <arg>               Sampling type - random/stratified/keylist
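
The options above do not say how the --fractionMapping value is laid out or how stratified sampling uses it. The sketch below is one possible reading, not the tool's actual code: it assumes a flat list such as "a,0.5,b,0.1", keeps each row independently with the fraction mapped to its key, and drops rows whose key is not listed. The names parse_fraction_mapping and stratified_sample are hypothetical and exist only for this illustration.

```python
import random


def parse_fraction_mapping(spec):
    """Parse 'a,0.5,b,0.1' into {'a': 0.5, 'b': 0.1}.

    Assumption: keys and fractions alternate in one flat comma separated list.
    """
    parts = [p.strip() for p in spec.split(",")]
    if len(parts) % 2 != 0:
        raise ValueError("expected an even number of comma separated items")
    mapping = {}
    for key, frac in zip(parts[0::2], parts[1::2]):
        value = float(frac)
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"fraction for {key!r} must be between 0 and 1")
        mapping[key] = value
    return mapping


def stratified_sample(rows, key_column, fractions, seed=None):
    """Keep each row with the probability mapped to its key.

    Assumption: rows whose key is missing from the mapping are dropped.
    """
    rng = random.Random(seed)
    return [
        row for row in rows
        if rng.random() < fractions.get(row[key_column], 0.0)
    ]


if __name__ == "__main__":
    rows = [{"country": "US", "id": i} for i in range(100)]
    rows += [{"country": "IN", "id": i} for i in range(100)]
    fractions = parse_fraction_mapping("US,0.5,IN,0.1")
    sample = stratified_sample(rows, "country", fractions, seed=42)
    print(len(sample), "rows kept out of", len(rows))
```

Because every row is kept or dropped independently, the sample size per key is only approximately fraction times the number of rows for that key. A sampler that must return exact counts per key would need a different approach.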
Source: Readme.txt, updated 2018-01-23