Alignment Research, Model Robustness, Adversarial Examples, Risk Assessment
Safe Pruning LoRA: Robust Distance-Guided Pruning for Safety Alignment in Adaptation of LLMs
arxiv.org·20h
I Tested LLM Agents on Simple Safety Rules. They Failed in Surprising and Informative Ways.
lesswrong.com·2h
Who Would Win: A State-of-the-Art Foundation Model or a Neural Net?
pub.towardsai.net·8h
A single person with AI and a grudge could create a virus to wipe us out.
threadreaderapp.com·11h
How AI/LLMs Can Help, Hinder Developers
cacm.acm.org·3h
HW Security: Multi-Agent AI Assistant Leveraging LLMs To Automate Key Stages of SoC Security Verification (U. of Florida)
semiengineering.com·17h
What Inflection AI Learned Porting Its LLM Inference Stack from NVIDIA to Intel Gaudi
thenewstack.io·6h
Audit smarter: Introducing Google Cloud's Recommended AI Controls framework
cloud.google.com·8h
AI Is Already Crushing the News Industry
theatlantic.com·3h
From HAL 9000 to M3GAN: what film's evil robots tell us about contemporary tech fears
theconversation.com·4h
It's elementary: Problem-solving AI approach tackles inverse problems used in nuclear physics and beyond
phys.org·7h
ReMAR-DS: Recalibrated Feature Learning for Metal Artifact Reduction and CT Domain Transformation
arxiv.org·20h