| Crates.io | fasta-filter |
| lib.rs | fasta-filter |
| version | 0.2.0 |
| created_at | 2024-09-02 16:39:59.137624+00 |
| updated_at | 2026-01-06 21:44:47.836389+00 |
| description | Filter a (multi-sequence) FASTA file and output a subset of the records on STDOUT |
| homepage | |
| repository | |
| max_upload_size | |
| id | 1360796 |
| size | 34,079 |
Filter a FASTA file and output a subset of the records on STDOUT
Usage: fasta-filter [OPTIONS] [FILE]

Arguments:
  [FILE]

Options:
  -c, --count <COUNT>
  -p, --pattern <PATTERN>
  -x, --exclude
  -m, --min <MIN>
  -n, --max <MAX>
  -h, --help               Print help
  -V, --version            Print version

This simple program reads FASTA data from STDIN or from a named file and writes a subset of the records to STDOUT.
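To make the input handling concrete, here is a minimal sketch of reading FASTA records from either a named file or STDIN. It is not the crate's actual source: the names `open_input` and `read_records` are hypothetical, and representing a record as a `(header, sequence)` pair with line breaks removed from the sequence is an assumption made for illustration.

```rust
use std::fs::File;
use std::io::{self, BufRead, BufReader};

/// Open the named file, or fall back to STDIN when no file is given.
/// (Hypothetical helper, not taken from the crate.)
fn open_input(path: Option<&str>) -> io::Result<Box<dyn BufRead>> {
    let reader: Box<dyn BufRead> = match path {
        Some(p) => Box::new(BufReader::new(File::open(p)?)),
        None => Box::new(BufReader::new(io::stdin())),
    };
    Ok(reader)
}

/// Parse FASTA text into (header, sequence) pairs.
/// The header is stored without the leading '>'; sequence lines are
/// concatenated. Lines before the first header are ignored.
fn read_records<R: BufRead>(reader: R) -> io::Result<Vec<(String, String)>> {
    let mut records: Vec<(String, String)> = Vec::new();
    for line in reader.lines() {
        let line = line?;
        let line = line.trim_end();
        if let Some(header) = line.strip_prefix('>') {
            // A new record starts at every header line.
            records.push((header.to_string(), String::new()));
        } else if let Some((_, seq)) = records.last_mut() {
            // Any other line extends the sequence of the current record.
            seq.push_str(line);
        }
    }
    Ok(records)
}
```

With these helpers, `read_records(open_input(None)?)` would read from STDIN and `read_records(open_input(Some("test.fasta"))?)` from a file.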
If -c or --count is specified, only the first COUNT sequences that pass the other filters are output. Thus, at most COUNT sequences will be output (fewer if the input doesn't contain that many sequences after the other filters are applied).
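The order of operations matters here: the count limit applies after filtering, not before. A minimal sketch of that ordering with iterator adapters follows; the function `select` and the `(header, sequence)` record type are hypothetical and not taken from the crate.

```rust
/// Keep the records accepted by `keep`, then stop after `count` of them.
/// With `count == None`, every accepted record is returned.
/// (Hypothetical helper illustrating "filter first, then count".)
fn select<'a>(
    records: &'a [(String, String)],
    keep: impl Fn(&str, &str) -> bool,
    count: Option<usize>,
) -> Vec<&'a (String, String)> {
    records
        .iter()
        // Apply the other filters first...
        .filter(|rec| keep(rec.0.as_str(), rec.1.as_str()))
        // ...and only then limit the number of records.
        .take(count.unwrap_or(usize::MAX))
        .collect()
}
```

Because `take` comes after `filter`, records rejected by the filter do not use up the COUNT budget.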
Records can be filtered by minimum (-m, --min) and maximum (-n, --max) sequence length, and by a pattern (-p, --pattern) that must be present in the record header. With -x or --exclude, the pattern must instead be absent from the header.
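The filter rules can be sketched as a single predicate. This is an illustration, not the crate's implementation, and it rests on assumptions the text above does not confirm: the length bounds are treated as inclusive, and the pattern is treated as a plain substring (the real tool may interpret it differently, for example as a regular expression). The `Filter` struct and its `keep` method are hypothetical names.

```rust
/// Hypothetical filter settings mirroring the command-line options.
struct Filter<'a> {
    pattern: Option<&'a str>, // -p, --pattern
    exclude: bool,            // -x, --exclude
    min: Option<usize>,       // -m, --min
    max: Option<usize>,       // -n, --max
}

impl Filter<'_> {
    /// Returns true if a record with this header and sequence is kept.
    fn keep(&self, header: &str, sequence: &str) -> bool {
        let len = sequence.len();
        // Assumption: both length bounds are inclusive.
        if self.min.map_or(false, |min| len < min) {
            return false;
        }
        if self.max.map_or(false, |max| len > max) {
            return false;
        }
        match self.pattern {
            // Assumption: plain substring match on the header.
            // Without -x the pattern must be present; with -x it must be absent.
            Some(p) => header.contains(p) != self.exclude,
            None => true,
        }
    }
}
```

Under these assumptions, `Filter { pattern: Some("mol:protein"), exclude: false, min: Some(5), max: Some(2000) }` corresponds to the options `--min 5 --max 2000 --pattern "mol:protein"`.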
Example: fasta-filter --min 5 --max 2000 --pattern "mol:protein" test.fasta