Alignment Research, Model Robustness, Adversarial Examples, Risk Assessment

DS-STAR: A state-of-the-art versatile data science agent
research.google·10h
🤖AI
Flag this post
Evaluating Generative AI as an Educational Tool for Radiology Resident Report Drafting
arxiv.org·23h
🤖AI
Flag this post
We Started with Jax but Moved to PyTorch
mlechner.substack.com·11h·
Discuss: Substack
🤖AI
Flag this post
New AI security tool lays out key exposures
reversinglabs.com·12h
🤖AI
Flag this post
My Hands-On Review of Kimi K2 Thinking: The Open-Source AI That's Changing the Game
reddit.com·1h·
Discuss: r/LocalLLaMA
🤖AI
Flag this post
AI's capabilities may be exaggerated by flawed tests, according to new study
nbcnews.com·12h·
Discuss: Hacker News
🤖AI
Flag this post
Reflection
alexpolozov.com·8h·
Discuss: Hacker News
🤖AI
Flag this post
Neural Physics: Using AI Libraries to Develop Physics-Based Solvers for Incompressible Computational Fluid Dynamics
arxiv.org·23h
🤖AI
Flag this post
How reliable are AI agents?
droidrun.ai·16h·
Discuss: DEV
🤖AI
Flag this post
Great, now even malware is using LLMs to rewrite its code, says Google, as it documents new phase of 'AI abuse'
pcgamer.com·15h
🤖AI
Flag this post
Learning to Model the World with Language
dynalang.github.io·4h·
Discuss: Hacker News
🤖AI
Flag this post
## Adaptive Multi-Heuristic Intrusion Detection for Collaborative Welding Robot Networks
freederia.com·8h
🔗Systems Thinking
Flag this post
Efficiency vs. Alignment: Investigating Safety and Fairness Risks in Parameter-Efficient Fine-Tuning of LLMs
arxiv.org·2d
🧵Concurrency
Flag this post
The OWASP AI/LLM Top 10: Understanding Security and Privacy Risks in AI-Powered Mobile Applications
nowsecure.com·1d
🤖AI
Flag this post
GTIG AI Threat Tracker: Advances in Threat Actor Usage of AI Tools
cloud.google.com·1d·
Discuss: Hacker News
🤖AI
Flag this post
ChatGPT Glossary: 60 AI Terms Everyone Should Know
cnet.com·10h
🤖AI
Flag this post
Moonshot's Kimi K2 Thinking emerges as leading open source AI, outperforming GPT-5, Claude Sonnet 4.5 on key benchmarks
venturebeat.com·9h
🤖AI
Flag this post
There might be something fundamentally wrong with many AI systems, scientists say
the-independent.com·1d
🤖AI
Flag this post
Trusted enterprise AI at scale depends on robust cybersecurity
nordot.app·1d
🔗Microservices
Flag this post