Decker 1.61
internet-janitor.itch.io·3h
Introducing Metrax: performant, efficient, and robust model evaluation metrics in JAX
developers.googleblog.com·20h
BlueCodeAgent: A blue teaming agent enabled by automated red teaming for CodeGen AI
microsoft.com·3d
🟨javascript
Correction for Hazime et al., Nanoscale restructuring of the immune synapse with an engager enhances NK cell function
pnas.org·11h
DevOps Eng Looking for Collaboration: Exchange High-Perf US-East Infra for Project Ideas
Intentional, Not Reflexive: A Manager's Thoughts on AI
Dutch police seize thousands of servers used for ransomware, child sex abuse footage
nltimes.nl·5h
Evaluation Avoidance: How Humans and AIs Hack Reward by Disabling Evaluation Instead of Gaming Metrics
lesswrong.com·18h
The Code Vault
ChatGPT 5.1 is here.
threadreaderapp.com·1d
MINDS: A Cross-cultural Dialogue Corpus for Social Norm Classification and Adherence Detection
arxiv.org·13h
BUILDING A BIGRAM LANGUAGE MODEL
🟨javascript