Whole-Program Optimization, ML Compiler, Defunctionalization, Performance

Cure - Verification-First Programming for the BEAM
cure-lang.orgยท2dยท
Discuss: Lobsters
๐Ÿ“กErlang BEAM
Flag this post
Building our geospatial database in production
radar.comยท11hยท
Discuss: Hacker News
๐Ÿ“‹JSON Parsing
Flag this post
Benchmarking the Thomson Reuters legal agent
thomsonreuters.comยท10hยท
Discuss: Hacker News
๐ŸŽฎLanguage Ergonomics
Flag this post
The Exhaust Port of Cohesion: Precision Provocation in LLMs
blog.gopenai.comยท1dยท
Discuss: Hacker News
๐Ÿ”ฎMetacircular Evaluators
Flag this post
Benchmarking Large Language Models and Privacy Protection
priv.gc.caยท2d
โšกTokenizer Benchmarks
Flag this post
My dumb prompts that worked better
blog.nilenso.comยท2d
๐Ÿ”„Subinterpreters
Flag this post
0055: consulting, sql needed structure, slow forum, on the line, out of thin air, papers, other stuff
scattered-thoughts.netยท5d
๐Ÿ”„Bootstrapping
Flag this post
Running MiniMax-M2 locally - Existing Hardware Advice
reddit.comยท1dยท
Discuss: r/LocalLLaMA
โšกPerformance
Flag this post
100 Techniques for Writing Readable Rust Code
reddit.comยท22hยท
Discuss: r/rust
โš™๏ธTOML Parsers
Flag this post
Topographical sparse mapping: A training framework for deep learning models
sciencedirect.comยท1dยท
Discuss: Hacker News
๐Ÿ—บ๏ธRegion Polymorphism
Flag this post
GDM: Consistency Training Helps Limit Sycophancy and Jailbreaks in Gemini 2.5 Flash
lesswrong.comยท1d
๐ŸŽฒParser Fuzzing
Flag this post
Formal Verificationโ€™s Value Grows
semiengineering.comยท18h
๐ŸŽญProgram Synthesis
Flag this post
Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm EnablesFine-Grained Policy Optimization
paperium.netยท3dยท
Discuss: DEV
๐ŸชœRecursive Descent
Flag this post
Iterative Foundation Model Fine-Tuning on Multiple Rewards
arxiv.orgยท1d
โœจEffect Inference
Flag this post
Show HN: Refusal-Aware Logical Framework for LLMs
github.comยท1dยท
Discuss: Hacker News
๐ŸงฉConstraint Logic
Flag this post
The AI Speed Illusion
dev.toยท22hยท
Discuss: DEV
โšกLive Coding
Flag this post
DialectalArabicMMLU: Benchmarking Dialectal Capabilities in Arabic and Multilingual Language Models
arxiv.orgยท2d
๐Ÿ—๏ธMLIR
Flag this post
MedRECT: A Medical Reasoning Benchmark for Error Correction in Clinical Texts
arxiv.orgยท1d
๐ŸŒฑMinimal ML
Flag this post
What I Learned From Working on Legacy Codebases (And How It Made Me a Better Developer)
dev.toยท7hยท
Discuss: DEV
๐ŸบCode Archeology
Flag this post