meta-pytorch/segment-anything-fast: A batched offline inference oriented version of segment-anything
github.com·5m
FlashAttention 4: Faster, Memory-Efficient Attention for LLMs
digitalocean.com·21h
Co-optimization Approaches For Reliable and Efficient AI Acceleration (Peking University et al.)
semiengineering.com·15h
Loading...Loading more...