Back to article

[2212.08073] Constitutional AI: Harmlessness from AI Feedback (opens in new tab)

Covered by 9 sources including DEV Community, lesswrong.com

Covered in 11 articles

DEV Community·

When SafetyCo Goes to War: Anthropic, the DOD, and the Limits of Ideals-Based Frameworks

Discussed on DEV

lesswrong.com·

How I think developers of frontier AI systems and regulators ought to act in the face of existential AI risk

lesswrong.com·

Synthetic Persona Pretraining: Alignment from Token Zero

blogs.cisco.com·

Cisco AI Defense Policy Studio: Turning Unwritten Policy into Adaptive AI Guardrails

owainlewis/awesome-artificial-intelligence

Does ChatGPT need a psychiatrist? Similarities between human psychopathology and errors in large language models

Discussed on Hacker News

·

State media control influences large language models

Discussed on Hacker News, Hacker News, and r/science

dayafter.substack.com·

The Universe just wants to learn

Discussed on Substack

Strange Loop Canon·

Who Audits the Auditors?

wafer.systems·

Evolutionary Data Making – How to train embedding models

Discussed on Hacker News