RedCodeAgent: Automatic red-teaming agent against diverse code agents
microsoft.com·13h
🛡️ Parser Security
The promise of AI chat assistants: they solve 90% of the problems users have (by looking up the docs and telling them)
bsky.app·13h·
Discuss: Bluesky
💬 Interactive REPLs
Beyond Standard LLMs
magazine.sebastianraschka.com·16h·
Discuss: Hacker News, r/LLM
🔄 Subinterpreters
What to Do When Your Credit Risk Model Works Today, but Breaks Six Months Later
towardsdatascience.com·11h
🔢 Algebraic Datatypes
Cutting LLM Batch Inference Time in Half: Dynamic Prefix Bucketing at Scale
daft.ai·12h·
Discuss: Hacker News
📡 Erlang BEAM
Post-training methods for language models
developers.redhat.com·22h
🪜 Recursive Descent
Perceived Femininity in Singing Voice: Analysis and Prediction
arxiv.org·1h
🌱 Minimal Interpreters
[TUI] Ricing the original Rogue
github.com·9h·
🏷️ Symbol Mangling
A Unified Model for Human Mobility Generation in Natural Disasters
arxiv.org·1h
🚂 Error Propagation
My dumb prompts that worked better
blog.nilenso.com·1d
🔄 Subinterpreters
I Got Tired of Deceptive Casino Bonuses, So I Built a "Truth Calculator" with Vanilla JavaScript. Here's How You Can Too.
dev.to·13h·
Discuss: DEV
✨ Code Formatting
Automated Variant Calling Refinement via Multi-Modal Neuro-Symbolic Integration (AMVR-MNSI)
dev.to·11h·
Discuss: DEV
📋 JSON Parsing
The Next Frontier in NLP: Smarter Agents, Not Just Bigger Models
pub.towardsai.net·38m
🪜 Recursive Descent
Understanding New-Knowledge-Induced Factual Hallucinations in LLMs: Analysis, Solution, and Interpretation
arxiv.org·1h
✨ Gleam
Lowering in Reverse
buttondown.com·1d
🗃️ Query Compilation
Teaching My Team How to Build LINQ from Scratch
dev.to·15h·
Discuss: DEV
📋 Souffle Datalog
Balancing Cost, Power, and AI Performance
oreilly.com·11h
⚡ Tokenizer Optimization
I built a leaderboard for Rerankers
reddit.com·9h·
Discuss: r/LocalLLaMA
💬 Interactive REPLs