Crates.io | csv-index |
lib.rs | csv-index |
version | 0.1.6 |
source | src |
created_at | 2017-05-23 01:30:02.209734 |
updated_at | 2019-06-26 23:55:23.115001 |
description | On disk CSV indexing data structures. |
homepage | https://github.com/BurntSushi/rust-csv |
repository | https://github.com/BurntSushi/rust-csv |
max_upload_size | |
id | 15641 |
size | 20,463 |
A collection of data structures for indexing CSV data, with a focus on data structures that can be easily serialized to and deserialized from disk.
Dual-licensed under MIT or the UNLICENSE.
Add this to your Cargo.toml
:
[dependencies]
csv-index = "0.1.6"
The RandomAccessSimple
index is a simple data structure that maps record
indices to the byte offset corresponding to the start of that record in CSV
data. This example shows how to save this index to disk for a particular CSV
file.
use std::error::Error;
use std::fs::File;
use std::io::{self, Write};
use csv_index::RandomAccessSimple;
fn main() {
example().unwrap();
}
fn example() -> Result<(), Box<dyn Error>> {
// Open a normal CSV reader.
let mut rdr = csv::Reader::from_path("data.csv")?;
// Create an index for the CSV data in `data.csv` and write it
// to `data.csv.idx`.
let mut wtr = io::BufWriter::new(File::create("data.csv.idx")?);
RandomAccessSimple::create(&mut rdr, &mut wtr)?;
wtr.flush()?;
// Open the index we just created, get the position of the last
// record and seek the CSV reader to the last record.
let mut idx = RandomAccessSimple::open(File::open("data.csv.idx")?)?;
if idx.is_empty() {
return Err(From::from("expected a non-empty CSV index"));
}
let last = idx.len() - 1;
let pos = idx.get(last)?;
rdr.seek(pos)?;
// Read the next record.
if let Some(result) = rdr.records().next() {
let record = result?;
println!("{:?}", record);
Ok(())
} else {
Err(From::from("expected at least one record but got none"))
}
}