Can LLMs subtract numbers?
🧪Property-based testing
Flag this post
The AI Localhost
🧩WebAssembly
Flag this post
AI for pRedicting Exacerbations in KIDs with aSthma (AIRE-KIDS)
arxiv.org·1d
🔵TypeScript
Flag this post
The AI Capability Gap
🎨WebGL
Flag this post
Diagnosing Hallucination Risk in AI Surgical Decision-Support: A Sequential Framework for Sequential Validation
arxiv.org·1d
🎨WebGL
Flag this post
Automatically Finding Rule-Based Neurons in OthelloGPT
arxiv.org·1d
🎨WebGL
Flag this post
Accumulating Context Changes the Beliefs of Language Models
arxiv.org·1d
🎨WebGL
Flag this post
Realistic pedestrian-driver interaction modelling using multi-agent RL with human perceptual-motor constraints
arxiv.org·2d
🎨WebGL
Flag this post
Touring_test: A Cucumber Extension for Agentic Usability Testing
🧪Property-based testing
Flag this post
Probabilistic Robustness for Free? Revisiting Training via a Benchmark
arxiv.org·1d
🎨WebGL
Flag this post
I built a leaderboard for Rerankers
❄️Nix
Flag this post
Loading...Loading more...