csvcleaner [OPTIONS]
csvcleaner normalizes a CSV file based on the options selected. It helps to address issues like variable number of columns, leading/trailing spaces in columns, and non-UTF-8 encoding issues.
By default input is expected from standard in and output is sent to standard out (errors to standard error). These can be modified by appropriate options. The csv file is processed as a stream of rows so minimal memory is used to operate on the file.
-V, -verbose write verbose output to standard error
-comma if set use this character in place of a comma for delimiting cells
-comment-char if set, rows starting with this character will be ignored as comments
-examples display example(s)
-fields-per-row set the number of columns to output right padding empty cells as needed
-generate-markdown-docs generation markdown documentation
-h, -help display help
-i, -input input filename
-l, -license display license
-left-trim If set to true leading white space in a field is ignored.
-o, -output output filename
-output-comma if set use this character in place of a comma for delimiting output cells
-quiet suppress error messages
-reuse if false then a new array is allocated for each row processed, if true the array gets reused
-right-trim If set to true trailing white space in a field is ignored.
-stop-on-error exit on error, useful if you're trying to debug a problematic CSV file
-trim If set to true leading and trailing white space in a field is ignored.
-use-crlf if set use a charage return and line feed in output
-use-lazy-quoting If LazyQuotes is true, a quote may appear in an unquoted field and a non-doubled quote may appear in a quoted field.
-v, -version display version
Normalizing a spread sheet’s column count to 5 padding columns as needed per row.
cat mysheet.csv | csvcleaner -field-per-row=5
Trim leading spaces.
cat mysheet.csv | csvcleaner -left-trim
Trim trailing spaces.
cat mysheet.csv | csvcleaner -right-trim
Trim leading and trailing spaces
cat mysheet.csv | csvcleaner -trim
csvcleaner v0.0.20-pre