Project Aletheia: Verifier-Guided Distillation of Backtracking for Small Language Models
arxiv.org·48m
Experiments on Reward Hacking Monitorability in Language Models
lesswrong.com·52m
Making a Language
thunderseethe.dev·7h
CodeSOD: Validation Trimmed Away
thedailywtf.com·23h
Randomization in Typst
idraluna-archives.bearblog.dev·10h
t2x - a CLI tool for AI-first text operations
shruggingface.com·1d
Use of Assertions
blog.regehr.org·14h
Building a Self-Healing Data Pipeline That Fixes Its Own Python Errors
towardsdatascience.com·16h
Dealing with alternatives
jemarch.net·1d
Loading...Loading more...