🎯 AI Alignment - faruk · Scour

Designer babies. Self-improving AI. Are we ready for either?

🧩Epistemics News

·

umair-tareen/philosopher-council: An eleven-philosopher LLM council - ask it questions or point it at AI-research trends. Claude-powered deliberation through the four classical branches of philosophy. Methodology, not metaphysics.

🧠LLMs Code

github.com··r/SideProject

A Unifying Lens on Reward Uncertainty in RLHF

🧠LLMs Academic

Guardian Angels: LLM Personalization for Productivity and Security

📊AI Monitoring

gwern.net··Hacker News

High Dynamic Range DIY Air Testing

🧑‍💻Indie Hackers

Coelho Mollo and Millière: The Vector Grounding Problem

philosophyofbrains.com·

Neglected Basics of AI Alignment

lesswrong.com·

Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance

🧠LLMs Academic

OpenClaw Won: How Big Tech Adopted the AI Agent

📊AI Monitoring

thelettertwo.com·

Finding Inner Stillness at the Jinmandir

💡Framework Thinking

srmdwpsitelive.kinsta.cloud·

(VERY PARTIAL) CROSSPOST: ALEX HEATH: SubStack Is Opening Up to AI: Interviewing CEO Chris Best

🚀Startups News Blog

braddelong.substack.com

A Regret Minimization Framework on Preference Learning in Large Language Models

🧠LLMs Academic

A Mike's-Eye View of ARC's Research

⚙️AI Infrastructure

lesswrong.com·

SLUUG Talk: Demystifying Large Language Models on Linux

🧠LLMs Code

github.com··DEV

SecureBio Detection is Hiring Software Engineers

🧑‍💻Indie Hackers

Representation-Aware Advantage Estimation: Your Reward Model Provides More Than A Scalar Output

🧠LLMs Academic

Iliad is Hiring

lesswrong.com·

Hidden Consensus:Preference-Validity Compression in Human Feedback

🧠LLMs Academic

Learnings from starting an AI safety research team

📊AI Monitoring

lesswrong.com·

Trajectory Geometry of Transformer Representations Across Layers

🧠LLMs Academic

Log in to enable infinite scrolling