Crates.io | simstring_rust |
lib.rs | simstring_rust |
version | |
source | src |
created_at | 2024-12-06 00:49:50.39988 |
updated_at | 2024-12-06 00:49:50.39988 |
description | A native Rust implementation of the SimString algorithm |
homepage | https://github.com/PyDataBlog/simstring_rs#readme |
repository | https://github.com/PyDataBlog/simstring_rs |
max_upload_size | |
id | 1473784 |
Cargo.toml error: | TOML parse error at line 18, column 1 | 18 | autolib = false | ^^^^^^^ unknown field `autolib`, expected one of `name`, `version`, `edition`, `authors`, `description`, `readme`, `license`, `repository`, `homepage`, `documentation`, `build`, `resolver`, `links`, `default-run`, `default_dash_run`, `rust-version`, `rust_dash_version`, `rust_version`, `license-file`, `license_dash_file`, `license_file`, `licenseFile`, `license_capital_file`, `forced-target`, `forced_dash_target`, `autobins`, `autotests`, `autoexamples`, `autobenches`, `publish`, `metadata`, `keywords`, `categories`, `exclude`, `include` |
size | 0 |
A native Rust implementation of the CPMerge algorithm, designed for approximate string matching. This crate is particularly useful for natural language processing tasks that require the retrieval of strings/texts from very large corpora (big amounts of texts). Currently, this crate supports both character and word-based N-grams feature generation, with plans to allow custom user-defined feature generation methods.
Add simstring_rs
to your Cargo.toml
:
[dependencies]
simstring_rs = "0.1.0"
For the latest features, you can add the master branch by specifying the Git repository:
[dependencies]
simstring_rs = { git = "https://github.com/PyDataBlog/simstring_rs.git", branch = "main" }
Note: Using the master branch may include experimental features and potential breakages. Use with caution!
To revert to a stable version, ensure your Cargo.toml specifies a specific version number instead of the Git repository.
Here is a basic example of how to use simstring_rs in your Rust project:
Contributions are welcome! Please open an issue or submit a pull request on GitHub. License
This project is licensed under the MIT License.
Inspired by the SimString.jl project.