| Crates.io | kizzasi-inference |
| lib.rs | kizzasi-inference |
| version | 0.1.0 |
| created_at | 2026-01-19 00:28:03.160172+00 |
| updated_at | 2026-01-19 00:28:03.160172+00 |
| description | Unified autoregressive inference engine for Kizzasi AGSP |
| homepage | https://github.com/cool-japan/kizzasi |
| repository | https://github.com/cool-japan/kizzasi |
| max_upload_size | |
| id | 2053377 |
| size | 584,276 |
Unified autoregressive inference engine for Kizzasi AGSP.
Production-grade inference pipeline with sampling strategies, batching, streaming, and constraint enforcement. Supports all Kizzasi model architectures.
use kizzasi_inference::{InferenceEngine, Pipeline, GreedySampler};
// Create inference pipeline
let pipeline = Pipeline::builder()
.model(my_model)
.tokenizer(my_tokenizer)
.sampler(GreedySampler::new())
.build()?;
// Single-step prediction
let input = Array1::zeros(32);
let output = pipeline.predict(&input)?;
// Multi-step rollout
let predictions = pipeline.predict_n(&input, 100)?;
// Streaming inference
let mut stream = pipeline.stream(input_stream).await?;
while let Some(prediction) = stream.next().await {
// Process prediction
}
Licensed under either of Apache License, Version 2.0 or MIT license at your option.