GEN-0: SoTA 10B+ Foundation Model for Robotics with Harmonic Reasoning
📊Gradient Accumulation
Flag this post
Inference Acceleration from the Ground Up
semiwiki.com·6d
🧠CPU Architecture
Flag this post
Speech-DRAME: A Framework for Human-Aligned Benchmarks in Speech Role-Play
arxiv.org·21h
🔄ONNX
Flag this post
Spiking Neural Networks: The Next Leap in AI Power Efficiency by Arvind Sundararajan
⚡ONNX Runtime
Flag this post
Real-time Semantic Segmentation for AR Glasses: Dynamic Occlusion Handling via Bayesian Fusion
🏎️TensorRT
Flag this post
Show HN: ReadMyMRI DICOM native preprocessor with multi model consensus/ML pipes
🏎️TensorRT
Flag this post
flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R
arxiv.org·21h
🔄ONNX
Flag this post
World Simulation with Video Foundation Models for Physical AI
arxiv.org·21h
🏎️TensorRT
Flag this post
Contrastive Knowledge Transfer and Robust Optimization for Secure Alignment of Large Language Models
arxiv.org·1d
🎓Model Distillation
Flag this post
Probabilistic Robustness for Free? Revisiting Training via a Benchmark
arxiv.org·21h
📊Gradient Accumulation
Flag this post
A Comparative Analysis of LLM Adaptation: SFT, LoRA, and ICL in Data-Scarce Scenarios
arxiv.org·21h
🎓Model Distillation
Flag this post
DEER: Disentangled Mixture of Experts with Instance-Adaptive Routing for Generalizable Machine-Generated Text Detection
arxiv.org·21h
🏎️TensorRT
Flag this post
Scalable In-Memory Associative Processing for Graph Neural Network Inference
⚡Flash Attention
Flag this post
Building WriteRight: My Journey Creating an AI Writing Assistant with Mastra
🤖AI Coding Tools
Flag this post
T3: Test-Time Model Merging in VLMs for Zero-Shot Medical Imaging Analysis
arxiv.org·1d
🏎️TensorRT
Flag this post
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
arxiv.org·21h
📊Gradient Accumulation
Flag this post
Loading...Loading more...