The goal here is to compress/search save bioinfromatics data on nucleotide polymorphism using BitMagic Library. Tech.notes: http://bitmagic.io/succinct-snp-search.html SNP RS columnar sparse vector example: 1. Download Human SNP report from NCBI FTP ftp://ftp.ncbi.nlm.nih.gov/snp/organisms/human_9606/chr_rpts/ 2. Take Chr1 for a large experiment chr_1.txt.gz ftp://ftp.ncbi.nlm.nih.gov/snp/organisms/human_9606/chr_rpts/chr_1.txt.gz 3. unzip the SNP report file unzip chr_1.txt.gz 4. Load/parse data into compressed sparse vector ./xsample03 -isnp chr_1.txt -svout rs_chr1.sv (wait, wait, wait, ...) 5. Build RS compressed sparse vector ./xsample03 -svin rs_chr1.sv -rscout rs_chr1.rssv -t 6. Print diagnostics (memory usage, statistics, etc) ./xsample03 -svin rs_chr1.sv -rscin rs_chr1.rssv -t -d 7. Search performance benchmark ./xsample03 -svin rs_chr1.sv -rscin rs_chr1.rssv -b -t