LLM As A Judge is not the shortcut you think
softwaredoug.com·20h·
Discuss: Hacker News
💻computing
Flag this post
Kimi Linear: An Expressive, Efficient Attention Architecture
arxiviq.substack.com·2d·
Discuss: Substack
🤖AI
Flag this post
How LLMs Cheat: Modifying Tests and Overloading Operators
enbao.me·16h·
Discuss: Hacker News
🤖AI
Flag this post
AI Summarization Optimization
schneier.com·1d·
Discuss: Hacker News
🤖AI
Flag this post
The Work of AI, Ourselves
oliverbatemandoesthework.substack.com·8h·
Discuss: Substack
🤖AI
Flag this post
Writing an LLM from scratch, part 27 – what's left, and what's next?
gilesthomas.com·12h·
Discuss: Hacker News
⌨️programming
Flag this post
Deep DIVE: AI progress continues, as IQ scores rise linearly
maximumtruth.org·1d·
Discuss: Hacker News
🤖AI
Flag this post
Our newest model: Chandra (OCR)
datalab.to·2d·
Discuss: Hacker News
💻computing
Flag this post
Small Vs. Large Language Models
semiengineering.com·1d·
Discuss: Hacker News, r/LLM
🤖AI
Flag this post
Yes, you should understand backprop (2016)
karpathy.medium.com·2d·
Discuss: Hacker News
⌨️programming
Flag this post
Mathematics solves problems by pen and paper. CS helps us to go far beyond that
cacm.acm.org·1d·
Discuss: Hacker News
💻computing
Flag this post
Google's Jeff Dean on the Coming Era of Virtual Engineers
sequoiacap.com·1d·
Discuss: Hacker News
🤖AI
Flag this post
AI Models Write Code with Security Flaws 18–50% of the Time, New Study Finds
medium.com·19h·
Discuss: Hacker News
🤖AI
Flag this post
Advancing cybersecurity a comprehensive review of AI-driven detection techniques
journalofbigdata.springeropen.com·6d·
Discuss: Hacker News
🤖AI
Flag this post
Your Transformer is Secretly an EOT Solver
elonlit.com·4d·
Discuss: Hacker News
🤖AI
Flag this post
Researchers build magnetic computer that thinks like a brain
thebrighterside.news·22h·
Discuss: Hacker News
💻computing
Flag this post
Programming for Computations: Matlab/Octave
link.springer.com·1d·
Discuss: Hacker News
⌨️programming
Flag this post
Generation at the Speed of Thought: Speculative Decoding
bittere.substack.com·2d·
Discuss: Substack
💻computing
Flag this post
Experts find flaws in hundreds of tests that check AI safety and effectiveness
theguardian.com·13h·
🤖AI
Flag this post
Introducing IndQA
openai.com·14h·
Discuss: Hacker News
🤖AI
Flag this post