Alignment Research, Model Robustness, Adversarial Examples, Risk Assessment

AI Safety at the Frontier: Paper Highlights of October 2025
lesswrong.com·1d
🤖AI
LLMs Add Safety Risks To Physical AI
semiengineering.com·12h
🤖AI
Decoupling Augmentation Bias in Prompt Learning for Vision-Language Models
arxiv.org·15h
🤖AI
The Rise of the Specialist: Why Small Language Models are the Future of Enterprise AI
dev.to·4h·
Discuss: DEV
🤖AI
Oxford Study Says AI Safety Should Build on Existing Global Standards
pymnts.com·53m
🔗Systems Thinking
Microsoft Aims Beyond AI, Wants to Develop Superintelligence
omni.se·4h
🤖AI
An anomaly detection method for gas turbines in power plants using conditional variational autoencoder optimized with self-attention
sciencedirect.com·4h
🤖AI
Your AI-driven threat hunting is only as good as your data platform and pipeline
cybersecuritydive.com·10h
🤖AI
Emulating human-like adaptive vision for efficient and flexible machine visual perception
nature.com·20h
🤖AI
The Production Generative AI Stack: Architecture and Components
thenewstack.io·4h
🤖AI
Marketers: Stop Anthropomorphizing AI, Learn What It Actually Does Under the Hood
cmswire.com·7h
🤖AI
The Complexity Cliff: Why Reasoning Models Work Right Up Until They Don't
rewire.it·20h·
Discuss: Hacker News
🔗Systems Thinking
Continuous Autoregressive Language Models
shaochenze.github.io·1d·
Discuss: Hacker News
🤖AI
OpenAI Model Spec
model-spec.openai.com·12h·
Discuss: Hacker News
🤖AI
New AI security tool lays out key exposures
reversinglabs.com·4h
🤖AI
DS-STAR: A state-of-the-art versatile data science agent
research.google·2h
🤖AI
Evaluating Generative AI as an Educational Tool for Radiology Resident Report Drafting
arxiv.org·15h
🤖AI
We Started with Jax but Moved to PyTorch
mlechner.substack.com·3h·
Discuss: Substack
🤖AI
Trusted enterprise AI at scale depends on robust cybersecurity
nordot.app·23h
🔗Microservices