LLMs show a “highly unreliable” capacity to describe their own internal processes
arstechnica.com·3d·
Discuss: Hacker News
Performance
Flag this post
The Curved Spacetime of Transformer Architectures
arxiv.org·18h·
Discuss: Hacker News
Performance
Flag this post
Show HN: Yansu, Serious Coding
twitter.com·1d·
Discuss: Hacker News
Performance
Flag this post
Google's MCP Toolbox for Databases: A Technical Deep Dive for Engineering Teams
agnost.ai·1h·
Discuss: Hacker News
Performance
Flag this post
The Work of AI, Ourselves
oliverbatemandoesthework.substack.com·2d·
Discuss: Substack
Performance
Flag this post
Why Alpha Arena was a bad benchmark
borisagain.substack.com·1d·
Discuss: Substack
Performance
Flag this post
Nubank announces a new hybrid model for 2026
international.nubank.com.br·58m·
Discuss: Hacker News
Performance
Flag this post
My excellent Conversation with Sam Altman
marginalrevolution.com·1d·
Discuss: Hacker News
Performance
Flag this post
Help with AI Fatigue
news.ycombinator.com·5h·
Discuss: Hacker News
Performance
Flag this post
Coding on Paper
thepalindrome.org·8h·
Discuss: Hacker News
Performance
Flag this post
Experts find flaws in hundreds of tests that check AI safety and effectiveness
theguardian.com·2d·
Performance
Flag this post
Cursor's Composer-1 vs. Windsurf's SWE-1.5: The Rise of Vertical Coding Models
inkeep.com·2d·
Discuss: Hacker News
Performance
Flag this post
The Learning Loop and LLMs
martinfowler.com·2d·
Performance
Flag this post
Thoughts by a non-economist on AI and economics
windowsontheory.org·2d·
Discuss: Hacker News
Performance
Flag this post
GTIG AI Threat Tracker: Advances in Threat Actor Usage of AI Tools
cloud.google.com·1d·
Discuss: Hacker News
Performance
Flag this post
From vibe coding to context engineering: 2025 in software development
technologyreview.com·1d·
Discuss: Hacker News
Performance
Flag this post
Reasoning with Sampling: Your Base Model Is Smarter Than You Think
aakaran.github.io·5h·
Discuss: Hacker News
Performance
Flag this post
30% of workers have put sensitive company data into ChatGPT
sweep.io·1d·
Discuss: Hacker News
Performance
Flag this post
Open Source Context-Aware PII Classifier
corp.roblox.com·2d·
Discuss: Hacker News
Performance
Flag this post
Improving Structured Outputs in the Gemini API
blog.google·1d·
Discuss: Hacker News
Performance
Flag this post