| Crates.io | efetch2jsonl |
| lib.rs | efetch2jsonl |
| version | 1.0.0 |
| created_at | 2024-06-18 10:39:53.105436+00 |
| updated_at | 2024-06-18 10:39:53.105436+00 |
| description | Convert EFetch XML to JSON Lines |
| homepage | |
| repository | |
| max_upload_size | |
| id | 1275370 |
| size | 18,274 |
A simple parser to convert from the output from efetch to a more parseable format.
This tool was built for converting from a query of SRA biosamples, though it should be applicable beyond that due to the configurable nature.
Example use case: You want to get the metadata from all the SRA runs of a single bioproject:
BIOPROJECT_ID=1081646
efetch -db bioproject -id $BIOPROJECT_ID -format xml | elink -db bioproject -target sra > bioproject_query.elink
cat bioproject_query.elink | efetch -mode xml -format xml > biosamples.out.xml
efetch2jsonl -i biosamples.out.xml -o biosamples.jsonl -k . -r EXPERIMENT_PACKAGE
Further processing to a clean table can be done trivially with something like Polars.