| Crates.io | tuni |
| lib.rs | tuni |
| version | 0.1.1 |
| created_at | 2024-06-08 19:53:41.729921+00 |
| updated_at | 2024-06-09 17:29:16.82414+00 |
| description | Unify transcripts across different samples |
| homepage | |
| repository | https://github.com/dzhang32/tuni |
| max_upload_size | |
| id | 1265936 |
| size | 83,868 |
The goal of tuni is to unify transcripts across different samples.
Transcript assembly tools can generate arbitary transcript IDs, which may lead to the same transcript being labelled with a different ID across samples.
For example, given two samples sample_1.gtf and sample_2.gtf:
sample_1.gtf
chr1 test transcript 1 100 . + . transcript_id "A";
chr1 test exon 1 40 . + . transcript_id "A";
chr1 test exon 50 100 . + . transcript_id "A";
--snip--
sample_2.gtf
chr1 test transcript 1 100 . + . transcript_id "B";
chr1 test exon 1 40 . + . transcript_id "B";
chr1 test exon 50 100 . + . transcript_id "B";
--snip--
The transcript displayed above is identical between the two samples, however the provided transcript_id is different for each sample, "A" vs "B".
tuni generates a .tuni.gtf/.tuni.gff for each input .gtf/.gff. These output files will contain an additional attribute field tuni_id which contains a unified ID that will be same for identical transcripts across different samples.
sample_1.tuni.gtf
chr1 test transcript 1 100 . + . transcript_id "A"; tuni_id "tuni_0";
chr1 test exon 1 40 . + . transcript_id "A"; tuni_id "tuni_0";
chr1 test exon 50 100 . + . transcript_id "A"; tuni_id "tuni_0";
--snip--
sample_2.tuni.gtf
chr1 test transcript 1 100 . + . transcript_id "B"; tuni_id "tuni_0";
chr1 test exon 1 40 . + . transcript_id "B"; tuni_id "tuni_0";
chr1 test exon 50 100 . + . transcript_id "B"; tuni_id "tuni_0";
--snip--
Download the latest binary for Linux or macOS (ARM) from releases.
Install Rust then run:
cargo install tuni
Usage: tuni [OPTIONS] --gtf-gff-path <*.txt> --output-dir </output/dir/>
Options:
-g, --gtf-gff-path <*.txt> A text file containing GTF/GFF paths
-o, --output-dir </output/dir/> Directory where outputted GTF/GFFs will be stored
-v, --verbose Print log messages
-h, --help Print help
-V, --version Print version
Note: currently, only version 2 .gff files are accepted by tuni.