Normalized Entropy or Apply Rate? Evaluation Metrics for Online Modeling Experiments
engineering.indeedblog.com·1h
📊data science
The model underlying R-hat and a Bayesian estimator
statmodeling.stat.columbia.edu·14h
📊data science
Show HN: I got Stability AI's small audio model into a consumer iOS app
news.ycombinator.com·14h·
Discuss: Hacker News
🔥PyTorch
A Deep Dive into the Morris Worm
rapid7.com·1d·
Discuss: Hacker News
🕸small web
Disassembling Terabytes of Random Data with Zig and Capstone to Prove a Point
jstrieb.github.io·2d
📊data science
Using Coding Agents to Decompile Nintendo 64 Games
blog.chrislewis.au·1d·
Discuss: Hacker News
📝Markdown
Handling Noisy Plaintext Checking Oracles with SPiRiT
eprint.iacr.org·1d
🕸small web
AILA--First Experiments with Localist Language Models
arxiv.org·1d
🤗Hugging Face
Cyclic Proofs for iGL via Corecursion
arxiv.org·2d
🤖llm
CaRF: Enhancing Multi-View Consistency in Referring 3D Gaussian Splatting Segmentation
arxiv.org·5h
🤗Hugging Face
Create a MCP server from scratch
dev.to·18h·
Discuss: DEV
📊data science
flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R
arxiv.org·3d
📝Markdown
Automated Prompt Generation for Code Intelligence: An Empirical study and Experience in WeChat
arxiv.org·1d
📝Markdown
Challenging DINOv3 Foundation Model under Low Inter-Class Variability: A Case Study on Fetal Brain Ultrasound
arxiv.org·2d
🔥PyTorch
Do Androids Dream of Unseen Puppeteers? Probing for a Conspiracy Mindset in Large Language Models
arxiv.org·1d
📊data science
Magika 1.0 Goes Stable As Google Rebuilds Its File Detection Tool In Rust
developers.slashdot.org·10h
🕸small web
Unclonable Cryptography in Linear Quantum Memory
arxiv.org·5h
🔥PyTorch
Large language models require a new form of oversight: capability-based monitoring
arxiv.org·1d
🤖llm