langsan

Crates.iolangsan
lib.rslangsan
version0.0.10
sourcesrc
created_at2024-10-16 19:24:48.362904
updated_at2024-10-19 10:20:12.886267
descriptionA library for sanitizing language model input and output.
homepagehttps://github.com/mdegans/langsan
repositoryhttps://github.com/mdegans/langsan
max_upload_size
id1412234
size154,836
Michael de Gans (mdegans)

documentation

README

langsan is a sanitization library for language models

Build Status codecov

Out of a desire to be first to market, many companies from OpenAI to Anthropic are releasing language models without proper input or output sanitization. This can lead to a variety of safety and security issues, including but not limited to human-invisible adversarial attacks, data leakage, and generation of harmful content.

langsan provides immutable string wrappers guaranteeing their contents are within restricted unicode ranges, generally those only officially supported by a particular language model. Almost all unicode code blocks are available as features (crates.io has a limit set at 300).

Commit count: 12

cargo fmt