conllu-utils

Crates.io: conllu-utils
lib.rs: conllu-utils
version: 0.1.8
source: src
created_at: 2020-03-22 09:52:04.438908
updated_at: 2020-12-07 10:42:12.471733
description: Utilities for working with the CoNLL-U dependency format
repository: https://github.com/danieldk/conllu-utils.git
id: 221370
size: 62,813
Daniël de Kok (danieldk)

documentation

https://github.com/danieldk/conllu-utils

README

CoNLL-U Utilities

Introduction

This is a set of utilities to process files in the CoNLL-U format. The conllu command provides the following subcommands:

  • accuracy: compute the accuracy of a system based on two treebanks
  • cleanup: normalize Unicode and replace Unicode punctuation
  • compare: compare two treebanks on one or more layers
  • from-text: convert tokenized text files to CoNLL-U
  • merge: merge CoNLL-U files
  • partition: partition a CoNLL-U file into N files
  • shuffle: shuffle the sentences in a CoNLL-U file
  • to-text: convert CoNLL-U to tokenized plain text
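
To illustrate the kind of conversion that a subcommand such as to-text performs, here is a minimal Rust sketch. It is not the crate's actual implementation: the function names sentence_to_text and conllu_to_text are hypothetical, and skipping multiword token ranges and empty nodes is a choice made for this sketch. It assumes standard CoNLL-U input, where token lines are tab-separated, the second column holds the word form, comment lines start with '#', and sentences are separated by a blank line.

```rust
/// Extract the FORM column (second field) of each word line in one
/// CoNLL-U sentence and join the forms with single spaces.
/// Hypothetical helper for illustration, not part of conllu-utils.
fn sentence_to_text(sentence: &str) -> String {
    sentence
        .lines()
        // Drop blank lines and sentence-level comments such as "# text = ...".
        .filter(|line| !line.is_empty() && !line.starts_with('#'))
        .filter_map(|line| {
            let mut fields = line.split('\t');
            let id = fields.next()?;
            let form = fields.next()?;
            // Assumption of this sketch: skip multiword token ranges ("1-2")
            // and empty nodes ("1.1"), keeping only ordinary words.
            if id.contains('-') || id.contains('.') {
                None
            } else {
                Some(form)
            }
        })
        .collect::<Vec<_>>()
        .join(" ")
}

/// Convert a CoNLL-U document to tokenized text, one sentence per line.
/// Hypothetical helper for illustration, not part of conllu-utils.
fn conllu_to_text(document: &str) -> String {
    document
        // Sentences are separated by a blank line.
        .split("\n\n")
        .map(sentence_to_text)
        .filter(|s| !s.is_empty())
        .collect::<Vec<_>>()
        .join("\n")
}
```

Calling conllu_to_text on a document with two sentences returns two lines of space-separated tokens. The real to-text subcommand may treat multiword tokens, comments, or line endings differently.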

Usage

Each subcommand prints usage information when run with the --help argument.

Commit count: 46
