How LLMs Cheat: Modifying Tests and Overloading Operators
enbao.me·4h·Discuss: Hacker News
🤖Software Engineering with AI
What Are Auto-regressive Models? A Deep Dive and Typical Use Cases
blog.pangeanic.com·12h
💬Large Language Models
Building a Production-Ready AI Agent
api.github.com·4h·Discuss: DEV
🤖Software Engineering with AI
How AI Will Quietly Rebuild Our World
future.forem.com·7h·Discuss: DEV
🤖Software Engineering with AI
The Silent Threat: Visually Triggered AI Hijacking
dev.to·6h·Discuss: DEV
🤖Software Engineering with AI
How Powerful AIs Get Cheap
lesswrong.com·7h
🤖Software Engineering with AI
The Threats of Agentic AI Data Trails
blogger.com·1d
🤖Software Engineering with AI
Agentic Entropy-Balanced Policy Optimization
paperium.net·2h·Discuss: DEV
🤖Software Engineering with AI
Now AI fakes are fooling news outlets — and maybe AI pros?
businessinsider.com·4h
⚔️Realist IR Theory
Experts find flaws in hundreds of tests that check AI safety and effectiveness
theguardian.com·1h
🤖Software Engineering with AI
Beyond Brute Force: AI That Thinks Like an Engineer by Arvind Sundararajan
dev.to·14h·Discuss: DEV
🤖Software Engineering with AI
LinEAS: End-to-end Learning of Activation Steering with a Distributional Loss
machinelearning.apple.com·1d
🧬Computational Neuroscience
CyberSlop — meet the new threat actor, MIT and Safe Security
doublepulsar.com·6h
🤖Software Engineering with AI
New prompt injection papers: Agents Rule of Two and The Attacker Moves Second
simonw.substack.com·1d·Discuss: Substack
🤖Software Engineering with AI
Why do some of us love AI, while others hate it? The answer is in how our brains perceive risk and trust
theconversation.com·7h
🤖Software Engineering with AI
Writing an LLM from scratch, part 26 – evaluating the fine-tuned model
gilesthomas.com·5h·Discuss: Hacker News
💬Large Language Models
Incremental AI Risk: A Governance Lens for Digital Infrastructure and Public Policy
circleid.com·10h
🤖Software Engineering with AI
ISC2 Security Congress: The shaky state of AI security today
scworld.com·1d·Discuss: Hacker News
🤖Software Engineering with AI
Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning
arxiv.org·20h
🤖Software Engineering with AI