Alignment Research, Model Robustness, Adversarial Examples, Risk Assessment

Start Speaking AI: Easy Explanations for 15 Common Terms
future.forem.com·1d·
Discuss: DEV
🔤Type Systems
Flag this post
AI coding is moving faster than the guardrails meant to secure it and that's risky business.
blog.codacy.com·1d·
Discuss: r/programming
🔤Type Systems
Flag this post
Generative and Predictive AI in Application Security: A Comprehensive Guide
dev.to·1d·
Discuss: DEV
🔤Type Systems
Flag this post
Show HN: GPU-accelerated sandboxes for running AI coding agents in parallel [video]
youtube.com·1d·
Discuss: Hacker News
🌐WebAssembly
Flag this post
Silent Sabotage: When Hardware Flaws Poison Medical AI by Arvind Sundararajan
dev.to·1d·
Discuss: DEV
💿Operating Systems
Flag this post
Don't Just Fine-tune the Agent, Tune the Environment
paperium.net·1d·
Discuss: DEV
🔤Type Systems
Flag this post
Generative AI Security: The Shared Responsibility Framework
enkryptai.com·3d·
Discuss: Hacker News
🔤Type Systems
Flag this post
AI and Data Virtualization: A Symbiotic Relationship For Smart Data Management
dev.to·3h·
Discuss: DEV
💿Operating Systems
Flag this post
AI Scientists History
diffuse.one·2d·
Discuss: Hacker News
🔤Type Systems
Flag this post
The Hybrid Thinking
muzammil.dev·1d·
Discuss: DEV
🔤Type Systems
Flag this post
Unlocking AI's Potential: Reframing 'Instrumental Goals' as Engineering Opportunities
dev.to·2d·
Discuss: DEV
🔤Type Systems
Flag this post
ACADREASON: Exploring the Limits of Reasoning Models with Academic ResearchProblems
paperium.net·1d·
Discuss: DEV
🔤Type Systems
Flag this post
Why Every AI Project Needs Annotation QA
aitaggers.com.au·4d·
Discuss: DEV
🔤Type Systems
Flag this post
High-Fidelity Simulated Data Generation for Real-World Zero-Shot RoboticManipulation Learning with Gaussian Splatting
paperium.net·20h·
Discuss: DEV
🔤Type Systems
Flag this post
Introducing gpt-oss-safeguard
openai.com·3d·
Discuss: Hacker News
🔤Type Systems
Flag this post
Speedrunning an RL Environment
sidb.in·8h·
Discuss: Hacker News
🌐WebAssembly
Flag this post
Why AI companions exploit the same psychology as teddy bears
lightcapai.medium.com·34m·
Discuss: Hacker News
🔤Type Systems
Flag this post
Let's Poison Your LLM Application: A Security Wake-Up Call
dev.to·2d·
Discuss: DEV
🔤Type Systems
Flag this post
On Epistemic Uncertainty of Visual Tokens for Object Hallucinations in LargeVision-Language Models
paperium.net·21h·
Discuss: DEV
🔤Type Systems
Flag this post
Agents Rule of Two: A Practical Approach to AI Agent Security
ai.meta.com·22h·
Discuss: Hacker News
🔤Type Systems
Flag this post