| Crates.io | conllu-utils |
| lib.rs | conllu-utils |
| version | 0.1.8 |
| created_at | 2020-03-22 09:52:04.438908+00 |
| updated_at | 2020-12-07 10:42:12.471733+00 |
| description | Utilities for working with the CoNLL-U dependency format |
| homepage | |
| repository | https://github.com/danieldk/conllu-utils.git |
| max_upload_size | |
| id | 221370 |
| size | 62,813 |
This is a set of utilities to process files in the CoNLL-U format. The
conllu command provides the following subcommands:

* `accuracy`: compute the accuracy of a system based on two treebanks
* `cleanup`: normalize Unicode and replace Unicode punctuation
* `compare`: compare two treebanks on one or more layers
* `from-text`: convert tokenized text files to CoNLL-U
* `merge`: merge CoNLL-U files
* `partition`: partition a CoNLL-U file into N files
* `shuffle`: shuffle the sentences in a CoNLL-U file
* `to-text`: convert CoNLL-U to tokenized plain text

Each subcommand prints usage information when given `--help` as an
argument.
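
For readers unfamiliar with the format, the following Python sketch illustrates the kind of conversion that `to-text` describes: turning CoNLL-U into one line of space-separated tokens per sentence. It is not the tool's implementation and may differ from its actual behavior. The function name `conllu_to_text` and the sample data are hypothetical. The sketch assumes the standard tab-separated CoNLL-U layout (ID in the first column, FORM in the second), comment lines starting with `#`, and blank lines between sentences. As a further assumption, it skips multiword token ranges (IDs such as `1-2`) and empty nodes (IDs such as `1.1`).

```python
def conllu_to_text(conllu: str) -> str:
    """Return one line of space-separated word forms per sentence.

    Illustrative only: this is a hypothetical helper, not part of conllu-utils.
    """
    sentences = []
    forms = []
    for line in conllu.splitlines():
        if not line.strip():
            # A blank line ends the current sentence.
            if forms:
                sentences.append(" ".join(forms))
                forms = []
            continue
        if line.startswith("#"):
            # Sentence-level comments carry no tokens.
            continue
        columns = line.split("\t")
        if len(columns) < 2:
            # Not a token line; ignore it in this sketch.
            continue
        token_id, form = columns[0], columns[1]
        if "-" in token_id or "." in token_id:
            # Assumption: skip multiword ranges (1-2) and empty nodes (1.1).
            continue
        forms.append(form)
    if forms:
        # Handle input that does not end with a blank line.
        sentences.append(" ".join(forms))
    return "\n".join(sentences)


# Hypothetical two-sentence sample in CoNLL-U layout.
sample = (
    "# text = Dogs bark.\n"
    "1\tDogs\tdog\tNOUN\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tbark\tbark\tVERB\t_\t_\t0\troot\t_\tSpaceAfter=No\n"
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
    "\n"
    "1\tHello\thello\tINTJ\t_\t_\t0\troot\t_\t_\n"
)

print(conllu_to_text(sample))
```

The sketch keeps every token separated by a single space and ignores `SpaceAfter=No`, since the goal is tokenized text rather than a reconstruction of the original surface string.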