Profanity causes emergent misalignment, but with qualitatively different results than insecure code
lesswrong.com·4d
Decoupling Support Enumeration and Value Discovery in Non-Binary ISD
eprint.iacr.org·6d
Song recommendations with C# free monads
blog.ploeh.dk·6h
Incremental query updating in adhesive categories
topos.institute·22h
I’ve been working on something new:
threadreaderapp.com·3d
Thrifty wide-context models of B cell receptor somatic hypermutation
elifesciences.org·3d
Disabling Self-Correction in Retrieval-Augmented Generation via Stealthy Retriever Poisoning
arxiv.org·4d
Building a simple reranker
blog.veitheller.de·5d
Loading...Loading more...