| Crates.io | tower-http-cache |
| lib.rs | tower-http-cache |
| version | 0.4.3 |
| created_at | 2025-11-09 07:39:17.686127+00 |
| updated_at | 2025-11-11 05:59:11.594583+00 |
| description | Tower-compatible caching layer with pluggable backends (in-memory, Redis, and more) |
| homepage | |
| repository | https://github.com/sadco-io/tower-http-cache |
| max_upload_size | |
| id | 1923777 |
| size | 502,157 |
Tower middleware for HTTP response caching with pluggable storage backends (in-memory, Redis, and more). tower-http-cache brings a production-grade caching layer to Tower/Axum/Hyper stacks, with stampede protection, stale-while-revalidate, header allowlisting, compression, and policy controls out of the box.
`CacheLayer`: wrap any Tower service; caches GET/HEAD by default.

```toml
[dependencies]
tower-http-cache = "0.4"

# Enable Redis support if required
tower-http-cache = { version = "0.4", features = ["redis-backend"] }

# With admin API support
tower-http-cache = { version = "0.4", features = ["admin-api"] }
```
```rust
use std::time::Duration;
use tower::ServiceBuilder;
use tower_http_cache::prelude::*;

let cache_layer = CacheLayer::builder(InMemoryBackend::new(10_000))
    .ttl(Duration::from_secs(120))
    .negative_ttl(Duration::from_secs(10))
    .stale_while_revalidate(Duration::from_secs(30))
    .refresh_before(Duration::from_secs(5))
    .min_body_size(Some(1024))
    .max_body_size(Some(256 * 1024))
    .respect_cache_control(true)
    .build();

let svc = ServiceBuilder::new()
    .layer(cache_layer)
    .service(tower::service_fn(|_req: http::Request<()>| async {
        Ok::<_, std::convert::Infallible>(http::Response::new("hello world"))
    }));
```
Efficiently cache and serve large files with byte-range support - perfect for video streaming:

```rust
use std::time::Duration;
use tower_http_cache::prelude::*;
use tower_http_cache::streaming::StreamingPolicy;

let cache_layer = CacheLayer::builder(InMemoryBackend::new(500))
    .policy(
        CachePolicy::default()
            .with_ttl(Duration::from_secs(3600))
            .with_streaming_policy(StreamingPolicy {
                enable_chunk_cache: true,
                chunk_size: 1024 * 1024,              // 1 MiB chunks
                min_chunk_file_size: 5 * 1024 * 1024, // Only chunk files >= 5 MiB
                ..Default::default()
            }),
    )
    .build();
```
See `examples/chunk_cache_demo.rs` for a complete working example.
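For instance, a client exercising the byte-range path sends a standard `Range` header (the path below is hypothetical):

```rust
// Hypothetical client request: ask for the first 1 MiB chunk of a
// large cached file via a standard Range header.
let req = http::Request::get("/media/demo.mp4")
    .header(http::header::RANGE, "bytes=0-1048575")
    .body(())
    .expect("valid request");
```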
Redis backend (requires the `redis-backend` feature):

```rust
use std::time::Duration;
use tower_http_cache::prelude::*;

async fn build_redis_layer(redis_url: &str) -> CacheLayer<RedisBackend> {
    let client = redis::Client::open(redis_url).expect("valid Redis URL");
    let manager = client.get_tokio_connection_manager().await.expect("connect");

    CacheLayer::builder(RedisBackend::new(manager))
        .ttl(Duration::from_secs(30))
        .stale_while_revalidate(Duration::from_secs(10))
        .build()
}
```
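A sketch of wiring this in at startup, assuming `inner` is your application service:

```rust
// Hypothetical startup wiring: build the Redis-backed layer once,
// then wrap the application service with it.
let cache_layer = build_redis_layer("redis://127.0.0.1:6379/").await;
let svc = tower::ServiceBuilder::new()
    .layer(cache_layer)
    .service(inner);
```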
Auto-refresh proactively refreshes frequently-accessed cache entries before they expire, reducing cache misses and latency for hot endpoints:
```rust
use std::time::Duration;
use tower_http_cache::prelude::*;
use tower_http_cache::refresh::AutoRefreshConfig;

let cache_layer = CacheLayer::builder(InMemoryBackend::new(10_000))
    .ttl(Duration::from_secs(120))
    .refresh_before(Duration::from_secs(30))
    .auto_refresh(AutoRefreshConfig {
        enabled: true,
        min_hits_per_minute: 10.0,
        check_interval: Duration::from_secs(10),
        max_concurrent_refreshes: 5,
        ..Default::default()
    })
    .build();

// Initialize auto-refresh with the service instance.
cache_layer.init_auto_refresh(my_service.clone()).await?;
```
Group related cache entries and invalidate them together:
```rust
use tower_http_cache::prelude::*;
use tower_http_cache::tags::TagPolicy;

let cache_layer = CacheLayer::builder(backend)
    .policy(
        CachePolicy::default()
            .with_tag_policy(TagPolicy::new().with_enabled(true))
            .with_tag_extractor(|_method, _uri| {
                // Derive tags from the request (static here for brevity).
                vec!["user:123".to_string(), "posts".to_string()]
            }),
    )
    .build();

// Later: invalidate all entries with a tag (or several at once).
backend.invalidate_by_tag("user:123").await?;
backend.invalidate_by_tags(&["user:123", "posts"]).await?;
```
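A sketch of triggering invalidation from a write path, assuming the backend handle is `Clone` and its error type implements `Debug` (both assumptions, not confirmed API):

```rust
// Hypothetical write path: after persisting an update, evict every
// entry tagged for the affected user so the next read repopulates it.
async fn on_user_updated(backend: InMemoryBackend, user_id: u32) {
    let tag = format!("user:{user_id}");
    if let Err(err) = backend.invalidate_by_tag(&tag).await {
        eprintln!("cache invalidation failed: {err:?}");
    }
}
```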
Combine fast in-memory cache with larger distributed storage:
```rust
use std::time::Duration;
use tower_http_cache::backend::MultiTierBackend;
use tower_http_cache::prelude::*;

let backend = MultiTierBackend::builder()
    .l1(InMemoryBackend::new(1_000))  // Hot data (fast)
    .l2(RedisBackend::new(manager))   // Cold storage (large)
    .promotion_threshold(3)           // Promote after 3 L2 hits
    .promotion_strategy(PromotionStrategy::HitCount)
    .write_through(true)
    .build();

let cache_layer = CacheLayer::builder(backend)
    .ttl(Duration::from_secs(300))
    .build();
```
Automatically prevent large files from overwhelming your cache:
```rust
use std::collections::HashSet;
use tower_http_cache::prelude::*;
use tower_http_cache::streaming::StreamingPolicy;

let cache_layer = CacheLayer::builder(backend)
    .policy(
        CachePolicy::default()
            .with_streaming_policy(StreamingPolicy {
                enabled: true,
                max_cacheable_size: Some(1024 * 1024), // 1 MiB limit
                excluded_content_types: HashSet::from([
                    "application/pdf".to_string(),
                    "video/*".to_string(),
                    "audio/*".to_string(),
                    "application/zip".to_string(),
                ]),
                ..Default::default()
            }),
    )
    .build();
```
Size detection relies on the response's `Content-Length` header and the body's `size_hint()`.

Enable cache introspection and management endpoints:
```rust
use axum::Router;
use tower_http_cache::admin::{admin_router, AdminConfig};

let admin_config = AdminConfig::builder()
    .require_auth(true)
    .auth_token("your-secret-token")
    .build();

// Mount admin routes (Axum example).
let admin_routes = admin_router(backend.clone(), admin_config);
let app = Router::new()
    .nest("/admin/cache", admin_routes)
    .layer(cache_layer);

// Available endpoints:
// GET  /admin/cache/health
// GET  /admin/cache/stats
// GET  /admin/cache/hot-keys
// GET  /admin/cache/tags
// POST /admin/cache/invalidate
```
Enable structured logging for ML model training:
```rust
use tower_http_cache::logging::MLLoggingConfig;

let cache_layer = CacheLayer::builder(backend)
    .policy(
        CachePolicy::default()
            .with_ml_logging(MLLoggingConfig {
                enabled: true,
                sample_rate: 1.0,         // Log 100% of operations
                hash_keys: true,          // Hash keys for privacy
                include_request_id: true, // Correlate with X-Request-ID
            }),
    )
    .build();

// Logs are emitted in JSON format:
// {
//   "timestamp": "2025-11-10T12:00:00Z",
//   "request_id": "550e8400-...",
//   "operation": "cache_hit",
//   "latency_us": 150,
//   "tags": ["user:123"],
//   "tier": "l1"
// }
```
| Policy | Description |
|---|---|
| `ttl` / `negative_ttl` | Cache lifetime for successful and error responses |
| `stale_while_revalidate` | Serve stale data while a refresh is in progress |
| `refresh_before` | Proactively refresh the cache shortly before expiry |
| `auto_refresh` | Automatically refresh frequently accessed entries before expiration |
| `tag_policy` | Configure cache tags and invalidation groups |
| `multi_tier` | Enable multi-tier caching with L1/L2 backends |
| `ml_logging` | Enable ML-ready structured logging |
| `allow_streaming_bodies` | Opt into caching streaming responses |
| `min_body_size` / `max_body_size` | Enforce size bounds for cached bodies |
| `header_allowlist` | Restrict which headers are stored alongside cached bodies |
| `method_predicate` / `statuses` | Customize cacheable methods and status codes |
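As an illustration of combining several of these knobs on one builder; the `header_allowlist` and `statuses` calls below are hypothetical, named after the table entries rather than confirmed signatures:

```rust
use std::time::Duration;
use tower_http_cache::prelude::*;

// Sketch only: `header_allowlist` and `statuses` are assumed builder
// methods mirroring the policy names above; verify against the docs.
let cache_layer = CacheLayer::builder(InMemoryBackend::new(10_000))
    .ttl(Duration::from_secs(60))
    .negative_ttl(Duration::from_secs(5))
    .header_allowlist(["content-type", "etag"])
    .statuses([200, 404])
    .build();
```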
For the full API surface, see the generated docs: `cargo doc --open`.

Benchmarks are powered by Criterion and can be reproduced with:

```sh
cargo bench --bench cache_benchmarks
```

Latest results (macOS / M3 Pro / Rust 1.85, `redis-backend` disabled unless noted):
| Group | Benchmark | Median | Notes |
|---|---|---|---|
| `layer_throughput` | `baseline_inner` | 1.41 ms | Underlying service without caching |
| | `cache_hit` | 0.67 µs | Cached GET; body already materialized |
| | `cache_miss` | 0.68 µs | Miss with immediate store |
| `key_extractor` | `path` | 23.8 ns | GET/HEAD path only |
| | `path_and_query` | 97.4 ns | Path + query concatenation |
| | `custom_hit` | 84.7 ns | User extractor returning `Some` |
| | `custom_miss` | 1.35 ns | User extractor returning `None` |
| `backend/in_memory` | `get_small_hit` | 309 ns | 1 KiB entry |
| | `get_large_hit` | 327 ns | 128 KiB entry |
| | `set_small` | 676 ns | 1 KiB write |
| | `set_large` | 660 ns | 128 KiB write |
| `stampede` | `cache_layer` | 5.92 ms | 64 concurrent requests with caching |
| | `no_cache` | 5.76 ms | Same workload without layer |
| `stale_while_revalidate` | `stale_hit_latency` | 33.6 ms | Serve-stale branch |
| | `strict_refresh_latency` | 33.7 ms | Force refresh branch |
| `codec/bincode` | `encode_small` | 362 ns | 1 KiB payload |
| | `decode_small` | 381 ns | 1 KiB payload |
| | `encode_large` | 146 µs | 128 KiB payload |
| | `decode_large` | 174 µs | 128 KiB payload |
| `negative_cache` | `initial_miss` | 14.0 µs | First miss populates negative entry |
| | `stored_negative_hit` | 21.9 ms | TTL-expired negative pathways |
| | `after_ttl_churn` | 5.66 µs | Subsequent positive hit |

Full raw output, including outlier analysis, is captured in `initial_benchmark.md`.
```sh
# Library unit tests + integration tests
cargo test

# Redis integration tests
REDIS_URL=redis://127.0.0.1:6379/ cargo test --features redis-backend --test redis_example

# Redis smoke test (launches the example service, verifies cache hit/miss behaviour)
docker compose -f docker-compose.redis.yml up -d redis
python3 scripts/redis_smoke.py
docker compose -f docker-compose.redis.yml down

# Examples
cargo run --example axum_basic --features middleware
cargo run --example axum_custom --features middleware
cargo run --example redis_smoke --features redis-backend
```
| Feature | Description | Default |
|---|---|---|
| `in-memory` | Enables the Moka-powered in-memory backend | ✓ |
| `redis-backend` | Enables the Redis backend, codec, and async utilities | ✗ |
| `admin-api` | Enables admin REST API endpoints (requires axum) | ✗ |
| `serde` | Derives serde traits for cached entries/codecs | ✓ |
| `compression` | Adds optional gzip compression for cached payloads | ✗ |
| `metrics` | Emits metrics counters (hit/miss/store/etc.) | ✗ |
| `tracing` | Adds tracing spans around cache operations | ✗ |
MSRV: 1.75.0 (matching the crate's `rust-version` field).
The MSRV will only increase with a minor version bump and will be documented in release notes.
tower-http-cache is under active development. Expect API adjustments while we stabilize the 0.x series. Contributions and feedback are welcome: feel free to open an issue or PR!
This project is dual-licensed; see the repository for the license texts. You may choose either license to suit your needs. Unless explicitly stated otherwise, any contribution intentionally submitted for inclusion in the crate shall be dual-licensed under the same terms, without additional terms or conditions.
You will need `cargo`, `rustup`, and Docker (if you plan to run the Redis tests). Before opening a PR, run:

```sh
cargo fmt --all
cargo clippy --all-targets --all-features
cargo test
python3 scripts/redis_smoke.py
cargo bench
```

Bug reports and feature requests are welcome in the issue tracker. For larger design changes, please start a discussion thread to align on API shape before submitting code.