Enabling Trillion-Parameter Models on AWS EFA
research.perplexity.ai·1h·
Discuss: Hacker News
🤖AI
Flag this post
Dual 5090 work station for SDXL
reddit.com·14h·
Discuss: r/LocalLLaMA
🤖AI
Flag this post
Cutting LLM Batch Inference Time in Half: Dynamic Prefix Bucketing at Scale
daft.ai·8h·
Discuss: Hacker News
🤖AI
Flag this post
Inside Pinecone: Slab Architecture
pinecone.io·8h·
Discuss: Hacker News
🏛️Software Architecture Patterns
Flag this post
Topographical sparse mapping: A training framework for deep learning models
sciencedirect.com·4h·
Discuss: Hacker News
🤖AI
Flag this post
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·2d·
Discuss: Hacker News
🤖AI
Flag this post
Inline vs. Pipeline Ray Tracing
evolvebenchmark.com·11h·
Discuss: Hacker News
🤖AI
Flag this post
Lazy loading isn't the magic pill to fix AI Inference
tensorfuse-docs.mintlify.dev·11h·
Discuss: Hacker News
🏛️Software Architecture Patterns
Flag this post
Parallel achieves 70% accuracy on SEAL, benchmark for hard web research
parallel.ai·6h·
Discuss: Hacker News
🤖AI
Flag this post
Design of quasi phase matching crystal based on differential gray wolf algorithm
arxiv.org·20h
🏛️Software Architecture Patterns
Flag this post
Dive into Systems
diveintosystems.org·1d·
Discuss: Hacker News
⚙️DevOps Practices
Flag this post
NVIDIA Sends a Powerful GPU to Space
spectrum.ieee.org·1d·
🏛️Software Architecture Patterns
Flag this post
Hydra: Dual Exponentiated Memory for Multivariate Time Series Analysis
arxiv.org·20h
🏛️Software Architecture Patterns
Flag this post
Tetris: An SLA-aware Application Placement Strategy in the Edge-Cloud Continuum
arxiv.org·20h
🏛️Software Architecture Patterns
Flag this post
Hybrid Quantum-Classical Optimization of the Resource Scheduling Problem
arxiv.org·20h
🤖AI
Flag this post
Geonum – geometric number library for unlimited dimensions with O(1) complexity
github.com·1d·
Discuss: Hacker News
💻Programming
Flag this post
Disciplined Biconvex Programming
arxiv.org·20h
🏛️Software Architecture Patterns
Flag this post
Exploring a space-based, scalable AI infrastructure system design
research.google·8h·
Discuss: Hacker News
🏛️Software Architecture Patterns
Flag this post