iro-cuda-ffi-kernels

Crates.ioiro-cuda-ffi-kernels
lib.rsiro-cuda-ffi-kernels
version0.2.1
created_at2026-01-18 15:25:45.092231+00
updated_at2026-01-18 15:25:45.092231+00
descriptionReference CUDA kernels for iro-cuda-ffi
homepagehttps://github.com/iro/iro-cuda-ffi
repositoryhttps://github.com/iro/iro-cuda-ffi
max_upload_size
id2052488
size98,045
Tribe IRO (tribe-iro)

documentation

https://docs.rs/iro-cuda-ffi-kernels

README

iro-cuda-ffi-kernels

Reference CUDA kernels for iro-cuda-ffi.

These kernels are compiled with nvcc at build time and demonstrate the expected ABI and wrapper patterns. They are intended as examples and integration tests, not as a production math library.

Notes:

  • Requires a CUDA toolkit (nvcc) to build.
  • Tests require --features cuda-tests.
  • Benchmarks (including cudarc cross-validation) live in iro-cuda-ffi-benchmarks.
  • iro-cuda-ffi vs cudarc benchmarks are sanity checks, not a competition.
  • Run benchmarks in release mode and serially to avoid GPU contention.

Docs: https://docs.rs/iro-cuda-ffi-kernels

Commit count: 0

cargo fmt