What would you want to see in a new tensor crate?
github.com·3d·
Discuss: r/rust
🧮Memory Allocators
Preview
Report Post

tensors and such

Tensor primitives with arbitrary view, shape, stride. Cuda and CPU backends

Goal is high performance ML stack with minimal dependencies and maximal flexibility

todo

  • Backend abstraction
  • Nicer syntax. macro time tset!(tensor.view_mut(), v: 99, 1, 2) and tget!(tensor.view(), 1, 2)
  • Basic GPU backend
  • Slicing with ranges tensor.view().slice(0, 1..3) etc.
  • Test more slicing syntaxes
  • Slicing macro
  • Elementwise broadcasting
  • Basic linear algebra helpers
  • Accelerated backends (GPU / parallel) in progress
  • x86 SIMD paths (currently relying on llvm auto-vectorization for CPU which only works for contiguous memory)
  • Multiple gpu devices allowed
  • Do not lock thread on GPU dispatch
  • strides and offset as bytes
  • Broadcasting
  • Idx should not be re…

Similar Posts

Loading similar posts...