Alignment Research, Model Robustness, Adversarial Examples, Risk Assessment
Musings: Data poisoning
joshs.bearblog.dev·23h
The 5% Playbook: Turning Generative AI Potential into Provable Business Value
pub.towardsai.net·3h
Profanity causes emergent misalignment, but with qualitatively different results than insecure code
lesswrong.com·8h
Smarter navigation: AI helps robots stay on track without a map
techxplore.com·21h
AI Agents Can Talk, But Can We Trust Them?
thenewstack.io·19h
Evaluating image segmentation models for background removal for Images
blog.cloudflare.com·2h
How Do You Teach an AI Model to Reason? With Humans
blogs.nvidia.com·17h
AI Effort And Money Misplaced
semiengineering.com·9h
How to create content that works for search and generative engines
searchengineland.com·4h
I’ve been working on something new:
threadreaderapp.com·3h