Brain Float, Mixed Precision, Numeric Format, TPU, Training Stability

OlmoEarth: A new state-of-the-art Earth observation foundation model family
allenai.org·4h
ONNX Runtime
The Noise and the Signal
russmiles.substack.com·13h
Flash Attention
Fine-Tuning an AI – Part I
zwischenzugs.com·1h
🤖AI Coding Tools
New AI models Cursor and Cognition (Windsurf) built on Chinese base models
linkedin.com·1d
🤖AI Coding Tools
A Thesis and Playbook for Edge AI
ondeviceguy.substack.com·1d
ONNX Runtime
Magnetic Materials for Transcranial Magnetic Stimulation (TMS)
arxiv.org·13h
⏱️Benchmarking
The mind-boggling valuations of AI companies
theguardian.com·3h
ONNX Runtime
The Riddle of Reflection: Evaluating Reasoning and Self-Awareness in Multilingual LLMs using Indian Riddles
arxiv.org·13h
🛠Ml-eng
CoT-Saliency: Unified Chain-of-Thought Reasoning for Heterogeneous Saliency Tasks
arxiv.org·13h
🏎️TensorRT
Secure Distributed RIS-MIMO over Double Scattering Channels: Adversarial Attack, Defense, and SER Improvement
arxiv.org·13h
🏎️TensorRT
High Resolution Seismic Waveform Generation using Denoising Diffusion
arxiv.org·13h
🏎️TensorRT
Adversarial Spatio-Temporal Attention Networks for Epileptic Seizure Forecasting
arxiv.org·13h
👁️Attention Optimization
Hybrid channel attention network for auditory attention detection
nature.com·1d
🧩Attention Kernels
NOMAD - Navigating Optimal Model Application to Datastreams
arxiv.org·13h
ONNX Runtime
DC4GS: Directional Consistency-Driven Adaptive Density Control for 3D Gaussian Splatting
arxiv.org·1d
🧮cuDNN
Weak-To-Strong Generalization
lesswrong.com·2d
📉Model Quantization