| Crates.io | csv2arrow |
| lib.rs | csv2arrow |
| version | 0.22.3 |
| created_at | 2021-02-26 19:27:51.085936+00 |
| updated_at | 2025-02-16 15:12:37.26998+00 |
| description | Convert CSV files to Arrow |
| homepage | https://github.com/domoritz/arrow-tools/tree/main/crates/csv2arrow |
| repository | https://github.com/domoritz/arrow-tools |
| max_upload_size | |
| id | 361041 |
| size | 45,561 |
Convert CSV files to Apache Arrow. This package is part of Arrow CLI tools.
You can get the latest releases from https://github.com/domoritz/arrow-tools/releases.
brew install domoritz/homebrew-tap/csv2arrow
cargo install csv2arrow
To avoid re-compilation and speed up installation, you can install this tool with cargo binstall:
cargo binstall csv2arrow
Usage: csv2arrow [OPTIONS] <CSV> [ARROW]
Arguments:
<CSV>
Input CSV file, stdin if not present
[ARROW]
Output file, stdout if not present
Options:
-s, --schema-file <SCHEMA_FILE>
File with Arrow schema in JSON format
-m, --max-read-records <MAX_READ_RECORDS>
The number of records to infer the schema from. All rows if not present. Setting max-read-records to zero will stop schema inference and all columns will be string typed
--header <HEADER>
Set whether the CSV file has headers
[default: true]
[possible values: true, false]
--delimiter <DELIMITER>
Set the CSV file's column delimiter as a byte character
--escape <ESCAPE>
Specify an escape character
--quote <QUOTE>
Specify a custom quote character
--comment <COMMENT>
Specify a comment character.
Lines starting with this character will be ignored
--null-regex <NULL_REGEX>
Provide a regex to match null values
-p, --print-schema
Print the schema to stderr
-n, --dry
Only print the schema
-h, --help
Print help (see a summary with '-h')
-V, --version
Print version
The --schema-file option uses the same file format as --dry and --print-schema.
For usage examples, see the csv2parquet examples which shares a similar interface.