wordmarkov

Crates.io: wordmarkov
lib.rs: wordmarkov
Version: 0.1.4
Source: src
Created at: 2022-10-21 02:13:53.344122
Updated at: 2022-10-21 08:05:19.989114
Description: A simple but flexible Markov chain library, specifically for text sentences, which handles punctuation and whitespace
Repository: https://github.com/wallabra/neurs_project
ID: 693122
Size: 62,795
Gustavo Ramos Rehermann (wallabra)


README

wordmarkov

Author: Gustavo Ramos Rehermann

A Markov chain library which is tailored for sentences.

This library is a part of the Neurs Project.

Specifics

Unlike a general-purpose Markov chain, a Markov chain in WordMarkov retains information about punctuation and whitespace.

The same pair of words can be connected by more than one edge when the text separates them in different ways. For example, "high priest" and "high-priest" both link the tokens "high" and "priest", but they produce two distinct links, one for each kind of separation (a space and a hyphen).
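The idea of separator-aware edges can be illustrated with a minimal sketch. This is not WordMarkov's actual API: the `SeparatorGraph` type and its `observe`, `count`, and `separators` methods are hypothetical names chosen for illustration, and the sketch assumes that an edge is identified by the triple (first word, second word, separator text).

```rust
use std::collections::HashMap;

/// Hypothetical illustration type, not part of WordMarkov.
/// Counts how often one word is followed by another with a given separator.
#[derive(Default, Debug)]
pub struct SeparatorGraph {
    // Key: (from word, to word, separator text); value: number of observations.
    edges: HashMap<(String, String, String), usize>,
}

impl SeparatorGraph {
    /// Records one observation of `from`, then `separator`, then `to`.
    pub fn observe(&mut self, from: &str, separator: &str, to: &str) {
        let key = (from.to_string(), to.to_string(), separator.to_string());
        *self.edges.entry(key).or_insert(0) += 1;
    }

    /// Returns how many times this exact (from, separator, to) edge was seen.
    pub fn count(&self, from: &str, separator: &str, to: &str) -> usize {
        let key = (from.to_string(), to.to_string(), separator.to_string());
        self.edges.get(&key).copied().unwrap_or(0)
    }

    /// Returns every distinct separator seen between the pair, sorted.
    pub fn separators(&self, from: &str, to: &str) -> Vec<&str> {
        let mut seps: Vec<&str> = self
            .edges
            .keys()
            .filter(|(f, t, _)| f.as_str() == from && t.as_str() == to)
            .map(|(_, _, s)| s.as_str())
            .collect();
        seps.sort();
        seps
    }
}
```

After observing "high priest" and "high-priest", `separators("high", "priest")` in this sketch would list both the space and the hyphen, so a generator could reproduce whichever separation it samples instead of always joining words with a space.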

There are also two special tokens, START and END, which mark the beginning and end of a sentence. The Markov chain can be walked both forwards and backwards. When walking in either direction, ideally, the corresponding special token (END when walking forwards, START when walking backwards) is reached within a finite number of steps (words walked).
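A bidirectional walk between the two special tokens might look like the following sketch. It is not WordMarkov's implementation: `Token`, `Chain`, `learn`, and `walk` are hypothetical names, separators are ignored for brevity, and the `pick` closure stands in for random sampling. Because a chain with cycles is not guaranteed to reach a terminal token, the sketch also assumes a `max_steps` cap and returns `None` when the cap is hit.

```rust
use std::collections::HashMap;

/// Hypothetical token type: two sentence boundaries plus ordinary words.
#[derive(Clone, Debug, PartialEq, Eq, Hash)]
pub enum Token {
    Start,
    Word(String),
    End,
}

/// Hypothetical chain that stores links in both directions.
#[derive(Default)]
pub struct Chain {
    forward: HashMap<Token, Vec<Token>>,
    backward: HashMap<Token, Vec<Token>>,
}

impl Chain {
    /// Records START -> w1 -> ... -> wn -> END and the reversed links.
    pub fn learn(&mut self, sentence: &str) {
        let mut path = vec![Token::Start];
        path.extend(sentence.split_whitespace().map(|w| Token::Word(w.to_string())));
        path.push(Token::End);
        for pair in path.windows(2) {
            self.forward.entry(pair[0].clone()).or_default().push(pair[1].clone());
            self.backward.entry(pair[1].clone()).or_default().push(pair[0].clone());
        }
    }

    /// Walks from `from` until END (forwards) or START (backwards) is reached.
    /// Returns the words visited in walk order, excluding `from` itself, or
    /// `None` if there is no link to follow or `max_steps` runs out first.
    /// `pick` receives the number of candidates and chooses an index.
    pub fn walk(
        &self,
        from: Token,
        forwards: bool,
        max_steps: usize,
        mut pick: impl FnMut(usize) -> usize,
    ) -> Option<Vec<String>> {
        let (links, goal) = if forwards {
            (&self.forward, Token::End)
        } else {
            (&self.backward, Token::Start)
        };
        let mut words = Vec::new();
        let mut current = from;
        for _ in 0..max_steps {
            let options = links.get(&current)?;
            current = options[pick(options.len()) % options.len()].clone();
            if current == goal {
                return Some(words);
            }
            if let Token::Word(w) = &current {
                words.push(w.clone());
            }
        }
        None
    }
}
```

Note that a backwards walk yields words in reverse sentence order, so a caller that starts from a word in the middle of a sentence would reverse the backwards half before joining it with the forwards half.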

License

For licensing information, see the Neurs Project main repository.

Commit count: 109
