| Crates.io | oscar-tools |
| lib.rs | oscar-tools |
| version | 0.4.0 |
| created_at | 2023-08-30 10:03:32.216725+00 |
| updated_at | 2023-08-31 11:20:03.205899+00 |
| description | Tools for processing OSCAR Corpora |
| homepage | |
| repository | https://github.com/oscar-project/oscar-tools |
| max_upload_size | |
| id | 958849 |
| size | 182,723 |
This is a new set of tools for common tasks on the OSCAR corpus.
The program has a different set of tools for each corpus version:
- v1: OSCAR 2019-like, text only (.txt files)
- v2: OSCAR 22.01-like, JSONLines, document-oriented with annotations and line-level identifications
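
To make the difference between the two layouts concrete, here is a minimal sketch in Rust using only the standard library. Every name in it (`Identification`, `DocumentV2`, `lines_with_ids`, `v1_records`) is hypothetical and chosen for illustration; none of it is the oscar-tools API, and the field names are not the actual OSCAR schema. It also assumes that a v1 file can be treated as one record per non-empty line, which may not hold for every v1 distribution.

```rust
// Hypothetical types for illustration only: NOT the oscar-tools API
// and NOT the real OSCAR field names.

/// A language identification: a label plus a confidence score.
#[derive(Debug, Clone, PartialEq)]
struct Identification {
    label: String,
    prob: f32,
}

/// A v2-style record: one whole document, with document-level
/// annotations and one optional identification per content line.
#[derive(Debug, Clone, PartialEq)]
struct DocumentV2 {
    content: String,
    annotations: Vec<String>,
    line_identifications: Vec<Option<Identification>>,
}

impl DocumentV2 {
    /// Pair each line of the content with its line-level identification.
    /// If the two sequences differ in length, the extra items are dropped.
    fn lines_with_ids(&self) -> Vec<(&str, Option<&Identification>)> {
        self.content
            .lines()
            .zip(self.line_identifications.iter().map(|id| id.as_ref()))
            .collect()
    }
}

/// A v1-style corpus is plain text with no metadata. Assumption: each
/// non-empty line is treated as one record.
fn v1_records(text: &str) -> Vec<&str> {
    text.lines().filter(|line| !line.is_empty()).collect()
}
```

In this sketch the v1 side carries nothing but text, while the v2 side keeps each document together with its annotations and per-line identifications. That difference is one plausible reason for having a separate set of tools per corpus version.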