🎯 AI Alignment - flicksinfants1y · Scour

Neglected Basics of AI Alignment

lesswrong.com·

Personal-Values Alignment Tech: Some Initial Motivations

🛡LLM safety News Blog

blog.danielsosebee.com··Hacker News

The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model

🤖AI Academic

The Ghost of Alignment — Why AI Should Never Fully Obey Humanity

🛡LLM safety Blog

·

AI Governance Tools: How To Achieve Compliance and Visibility

⚖️AI Governance Blog

What is AI Governance? (10 minute read)

⚖️AI Governance Blog

Mechanistic Governance: Mapping and Securing the Agentic Reasoning Trajectory

⚖️AI Governance Blog

·

[Recorded talk] "AI Alignment Versus AI Ethical Treatment: 10 Challenges"

⚖️AI Governance Blog

meditationsondigitalminds.substack.com··Substack

xAI fired an engineer who raised alarms about Grok safety, new lawsuit claims

🛡️Red Teaming

techcrunch.com·

My Oslo Freedom Forum Keynote: Authoritarians and AI

⚖️AI Governance Blog

redpacket.substack.com··Substack

Mechanistic Interpretability: The Key to Trusting Agentic AI

🤖AI Discussion

bradenkelley.com·

Anthropic’s 30-Day Data Policy Exposes Enterprise AI Governance Gaps

⚖️AI Governance

Ethical Considerations and AI Governance

⚖️AI Governance Blog

blog.domb.net·

VFUSE: Virulent Feature Understanding with Sparse autoEncoders

🧬Embeddings Academic

Advanced AI Safety Addendum

⚖️AI Governance

cloud.google.com··Hacker News

OpenAI says it will comply with Trump's order to let the government review AI models before release

⚖️AI Governance

Organizations can’t see much of their mobile AI activity

⚖️AI Governance

helpnetsecurity.com·

Shadow AI Governance: How to Secure Employee AI Use in 2026

⚖️AI Governance Blog

cswithsanjay.blogspot.com·

NTT Data’s Tom Winstanley: UK & Japan Can Rewrite AI Rules

⚖️AI Governance News

aimagazine.com·

Germany to create AI safety agency

⚖️AI Governance

techxplore.com·

Log in to enable infinite scrolling