🐿️ Scour
🏆 LLM Benchmarking

Model Evaluation, Leaderboards, Capability Assessment, AI Competition

AI and the Fight Between Democracy and Autocracy
theatlantic.com · 14h
🛡️ AI Security
OpenAI widely thought to be Broadcom's mystery $10 billion custom AI processor customer - order could be for millions of AI processors
tomshardware.com · 2h
🖥 GPUs
A tournament tried to test how well experts could forecast AI progress. They were all wrong.
vox.com · 13h
🆕 New AI
Are LLMs better suited for PR reviews than full codebases?
news.ycombinator.com · 5h · Discuss: Hacker News
🕳 LLM Vulnerabilities
DeepSeek's New AI Model Brings Fight to Alibaba
bloomberg.com · 19h
🆕 New AI
Are LLM Agents Behaviorally Coherent? Latent Profiles for Social Simulation
arxiv.org · 20h
🛡️ AI Safety
Clingy chatbots, AI recruiters and other new research findings
restofworld.org · 14h · Discuss: Hacker News
🛡️ AI Security
In Defense of the Mediocre Developer (are we overestimating averages?)
pugsiman.github.io · 5h · Discuss: r/programming
👨‍💻 Software development practices
Hiring Top AI Talent When You’re Not a Tech Giant
hbr.org · 7h
🚀 Startups
Swiss launch open source AI model as “ethical” alternative to big US LLMs
nordot.app · 23h
🤖 AI
If you write blog posts that use LLMs for facts and you do not independently verify what the LLM tells you, you are writing with as much authority as the LLM, w...
bsky.app · 9h · Discuss: Bluesky
📋 Text Quality
Reflections on Random Kitchen Sinks
archives.argmin.net · 2h · Discuss: Hacker News
📊 Vector Databases
I used an AI triage bot to close 85 GitHub issues in a weekend
bagerbach.com · 9h · Discuss: Hacker News
👨‍💻 AI Coding
When LLMs Grow Hands and Feet, How to Design our Agentic RL Systems?
reddit.com · 2h · Discuss: r/LocalLLaMA
🆕 New AI
Retraining AI to fortify itself against rogue rewiring even after key layers are removed
techxplore.com · 16h
🛡️ AI Safety
AI can detect and interpret social situations between people from images and videos almost as reliably as humans, and even more consistently than just a single ...
utu.fi · 14h
🔍 AI Interpretability
Awesome AI Agent Frameworks
github.com · 16h · Discuss: Hacker News
🆕 New AI
No Thoughts Just AI: Biased LLM Recommendations Limit Human Agency in Resume Screening
arxiv.org · 20h
🛡️ AI Security
GPT-5 bio bug bounty call
openai.com · 15h
🚀 Startups
MTQA: Matrix of Thought for Enhanced Reasoning in Complex Question Answering
arxiv.org · 20h
🧠 LLM Inference