Why Alpha Arena was a bad benchmark
borisagain.substack.com·10h·
Discuss: Substack
🤖AI
Flag this post
My fan worked fine, so I gave it WiFi
ellis.codes·1h·
Discuss: Hacker News
⚛️Plasma Physics
Flag this post
When Your Hash Becomes a String: Hunting Ruby's Million-to-One Memory Bug
mensfeld.pl·1d·
Discuss: Hacker News
🦀Rust
Flag this post
High-speed and ultra-low-power superconductive neuron with ReLU activation
iopscience.iop.org·10h·
Discuss: Hacker News
🤖AI
Flag this post
Building our geospatial database in production
radar.com·11h·
Discuss: Hacker News
📊HFT
Flag this post
Fair human-centric image dataset for ethical AI benchmarking
nature.com·9h
🤖AI
Flag this post
Hephaestus: AI workflows that discover and create their own tasks as they work
reddit.com·16h·
Discuss: r/LocalLLaMA
🤖AI
Flag this post
OneNote's Genesis (2004)
web.archive.org·21h·
Discuss: Hacker News
🤖AI
Flag this post
Can AI be truly creative?
nature.com·15h
🤖AI
Flag this post
Notes Apps
cao.sh·2h·
Discuss: Hacker News
🧮Mathematics
Flag this post
Eight millennia of continuity of a previously unknown lineage in Argentina
nature.com·9h
🦀Rust
Flag this post
Split Learning-Enabled Framework for Secure and Light-weight Internet of Medical Things Systems
arxiv.org·1d
🤖AI
Flag this post
Secretome translation shaped by lysosomes and lunapark-marked ER junctions
nature.com·9h
⚛️Plasma Physics
Flag this post
Show HN: ReadMyMRI DICOM native preprocessor with multi model consensus/ML pipes
github.com·1d·
Discuss: Hacker News
🤖AI
Flag this post
A Hybrid Deep Learning and Forensic Approach for Robust Deepfake Detection
arxiv.org·2d
🤖AI
Flag this post
Self-Harmony: Learning to Harmonize Self-Supervision and Self-Play in Test-Time Reinforcement Learning
arxiv.org·1d
🤖AI
Flag this post
MedRECT: A Medical Reasoning Benchmark for Error Correction in Clinical Texts
arxiv.org·1d
🤖AI
Flag this post
DEER: Disentangled Mixture of Experts with Instance-Adaptive Routing for Generalizable Machine-Generated Text Detection
arxiv.org·1d
🤖AI
Flag this post
The Riddle of Reflection: Evaluating Reasoning and Self-Awareness in Multilingual LLMs using Indian Riddles
arxiv.org·1d
🤖AI
Flag this post
FairAIED: Navigating Fairness, Bias, and Ethics in Educational AI Applications
arxiv.org·1d
🤖AI
Flag this post