wiki_corpus_parser

Crates.iowiki_corpus_parser
lib.rswiki_corpus_parser
version1.0.0
created_at2024-11-18 08:04:12.92399+00
updated_at2024-11-18 08:04:12.92399+00
descriptionExtract text from Wikipedia dumps (.bz2) and convert it to JSONLines format.
homepage
repositoryhttps://github.com/akitenkrad/wiki-corpus.git
max_upload_size
id1451951
size11,541
akitenkrad (akitenkrad)

documentation

README

Commit count: 0

cargo fmt