| Crates.io | seqpls |
| lib.rs | seqpls |
| version | 0.1.1 |
| created_at | 2025-04-21 00:30:00.497346+00 |
| updated_at | 2025-04-21 00:40:21.811661+00 |
| description | My sequences please - a paired fastq grepper with regex support |
| homepage | |
| repository | https://github.com/noamteyssier/seqpls |
| max_upload_size | |
| id | 1642135 |
| size | 30,137 |
This is a paired FASTQ grep tool for sequence analysis.
It's built using the same matching algorithm as bqtools grep, and was developed to demonstrate the performance difference between BINSEQ and FASTQ formats.
It accepts FASTQ files (compressed or uncompressed) and lets you match on regular expressions or fixed strings on either the R1, R2 or both.
Under the hood it uses the paraseq crate for efficient parallel processing of FASTQ records.
It's fast - but don't use it because using bqtools on binseq files is significantly faster.
cargo install seqpls
# See full help menu
seqpls --help
# Search for a fixed string in an unpaired FASTQ file
seqpls -e "ACGT" <some_fastq>
# Search for a string in the R1 and a regex in the R2
seqpls -T3 -e "ACGT" -R "[AC][TG][AC][TG]" <some_r1> <some_r2>
# Filter sequences without string in either using 3 threads
seqpls -T3 -v -F "ACGT" <some_r1> <some_r2>
Don't contribute to this project, it's doomed.