Serious Data From Testing LLMs
satisfice.comยท4h
โšกProof Automation
Flag this post
Show HN: Daily Clash Royale Card Guessing Game
clashle.appยท1dยท
Discuss: Hacker News
๐Ÿ“ŠHyperLogLog
Flag this post
Secondhand embarrassment
robinsloan.comยท3d
๐Ÿฆ€Rust Macros
Flag this post
Artist uses vintage typewriters as drawing tool
mymodernmet.comยท1dยท
Discuss: Hacker News
๐Ÿ” Terminal Fonts
Flag this post
Signed Backdoor Hiding in Plain Sight on Framework Devices
eclypsium.comยท16hยท
Discuss: Hacker News
๐Ÿ”’Secure Boot
Flag this post
The Country That Broke Kotlin
sam-cooper.medium.comยท2dยท
๐ŸŒ€Brotli Dictionary
Flag this post
Europe's Digital Sovereignty Paradox โ€“ "Chat Control" Update
process-one.netยท4hยท
Discuss: Hacker News
๐Ÿ‡ฉ๐Ÿ‡ฐDanish Computing
Flag this post
Paper2Agent: Research Papers as Interactive AI Agents
huggingface.coยท4dยท
Discuss: Hacker News
๐Ÿค–AI Curation
Flag this post
MAI-Image-1, debuting in the top on LMArena
microsoft.aiยท2hยท
Discuss: Hacker News
๐Ÿ“ธTIFF Evolution
Flag this post
How One Project is Making Philippine Laws Actually Accessible
diff.wikimedia.orgยท2d
๐Ÿ“ฒDigitization
Flag this post
Generative AI Systems Miss Vast Bodies of Human Knowledge, Study Finds
slashdot.orgยท18h
๐ŸŒCultural Algorithms
Flag this post
[D] Should I attend EMNLP 2025 in-person?
reddit.comยท22hยท
๐Ÿด๓ ง๓ ข๓ ณ๓ ฃ๓ ด๓ ฟScottish Computing
Flag this post
Deploying a Flask Email API on Render
dev.toยท22hยท
Discuss: DEV
โšกgRPC
Flag this post
Complete Guide to Imdone Pull
dev.toยท1dยท
Discuss: DEV
๐ŸŒณGit Internals
Flag this post
Do LLMs Know They Are Being Tested? Evaluation Awareness and Incentive-Sensitive Failures in GPT-OSS-20B
arxiv.orgยท2d
๐Ÿ”Concolic Testing
Flag this post
Automated Process Optimization via Hybrid Symbolic-Numerical Simulation and HyperScore Validation
dev.toยท6hยท
Discuss: DEV
๐Ÿ”งCassette Engineering
Flag this post
Krish Naik: Complete RAG Crash Course With Langchain In 2 Hours
dev.toยท2dยท
Discuss: DEV
๐ŸŒ€Brotli Internals
Flag this post
ChatGPT will soon allow erotica for verified adults
bbc.comยท10hยท
๐Ÿš€Indie Hacking
Flag this post
Unifying Deductive and Abductive Reasoning in Knowledge Graphs with Masked Diffusion Model
arxiv.orgยท1d
๐Ÿง Computational Logic
Flag this post
Reimagine Libraries management as Apps using Agentic Executable framework
dev.toยท2dยท
Discuss: DEV
๐ŸงฑImmutable Infrastructure
Flag this post