| Crates.io | paddler |
| lib.rs | paddler |
| version | 2.1.1 |
| created_at | 2024-11-21 11:49:38.748755+00 |
| updated_at | 2025-08-18 09:32:12.141+00 |
| description | Open-source LLMOps platform for hosting and scaling AI in your own infrastructure |
| homepage | |
| repository | https://github.com/intentee/paddler |
| max_upload_size | |
| id | 1456059 |
| size | 878,639 |
Digital products and their users need privacy, reliability, cost control and an option to be independent from third party vendors.
Paddler is an open-source LLMOps platform for organizations that host and scale open-source models in their own infrastructure.
Visit our documentation page to install Paddler and get started with it.
API documentation is also available.
There are multiple ways to install Paddler, but the goal is to obtain the paddler binary and make it available in your system.
You can:
The entire Paddler functionality is available through the paddler command.
You can run paddler --help to see the available commands and options.
Read more about installation and initial setup
Paddler is built for an easy set up. It comes as a self-contained binary with only two deployable components, the balancer and the agents.
The balancer exposes the following:
Agents are usually deployed on separate instances. They further distribute the incoming requests to slots, which are responsible for generating tokens and embeddings.
Paddler uses a built-in llama.cpp engine for inference, but has its own implementation of llama.cpp slots which keep their own context and KV cache.
Paddler comes with a built-in web admin panel.
You can use it to monitor your Paddler fleet:
Add and update your model and customize the chat template and inference parameters:
And use GUI to test the inference:
We initially wanted to use Raft consensus algorithm (thus Paddler, because it paddles on a Raft), but eventually dropped that idea. The name stayed, though.
Later, people started sending us the "that's a paddlin'" clip from The Simpsons, and we just embraced it.
We keep everything simple and on GitHub. Please use GitHub discussions for community conversations, and feel free to contribute.