Crates.io | oscar-tools |
lib.rs | oscar-tools |
version | 0.4.0 |
source | src |
created_at | 2023-08-30 10:03:32.216725 |
updated_at | 2023-08-31 11:20:03.205899 |
description | Tools for processing OSCAR Corpora |
homepage | |
repository | https://github.com/oscar-project/oscar-tools |
max_upload_size | |
id | 958849 |
size | 182,723 |
This is a new set of tools to do common tasks on the OSCAR corpus
The program has a different set of tools for each corpus version:
v1
: OSCAR 2019-like, text only (.txt files)v2
: OSCAR 22.01-like, JSONLines, document-oriented with annotations and line-level identifications