🛡️ AI Safety - aholstenson · Scour

DeepSight: An All-in-One LM Safety Toolkit

arxiv.org·1d

🧵Concurrency

rhollick.wordpress.com·17h

GTIG AI Threat Tracker: Distillation, Experimentation, and (Continued) Integration of AI for Adversarial Use

cloud.google.com·1d

SafeNeuron: Neuron-Level Safety Alignment for Large Language Models

arxiv.org·1d

Formal Verification First: How AI Supports But Cannot Replace It

semiengineering.com·23h

AI News Roundup: GPT-5.2 Makes Physics Discovery, Gemini 3 Deep Think Drops, and an AI Agent Published a Hit Piece

buildrlab.com·6h

Is AI self-aware?

lesswrong.com·14h

On-Device AI Tools

trendhunter.com·16h

MHub.ai: Standardizing AI for Reproducible Medical Imaging

cbirt.net·4h

Forge: Scalable Agent RL Framework and Algorithm

minimax.io·22h

🧵Concurrency

The democratization of AI data poisoning and how to protect your organization

csoonline.com·20h

Quality Assurance in AI Assisted Software Development: Risks and Implications

dev.to·1d

Does AI Really Understand What You’re Asking? New Study Raises Doubts

studyfinds.org·20h

Owning the AI Pareto Frontier

latent.space·1d

The Facade of AI Safety Will Crumble

lesswrong.com·1d

🔗Systems Thinking

Ask HN: Best practices for AI agent safety and privacy

news.ycombinator.com·1d

The Weapons of Mass Destruction AI Security Gap

rand.org·1d

Data Engineering for Large Models: Architecture, Algorithms & Projects

github.com·6h

🧵Concurrency

DaVinci-Agency: A Shortcut to Long-Horizon AI Agents

hackernoon.com·7h

Artificial Insecurity: threats to information integrity

accessnow.org·1d
