🚀 MLton - abnv · Scour

Cure - Verification-First Programming for the BEAM

cure-lang.org·2d·

Discuss: Lobsters

📡Erlang BEAM

Building our geospatial database in production

radar.com·11h·

Discuss: Hacker News

📋JSON Parsing

Benchmarking the Thomson Reuters legal agent

thomsonreuters.com·10h·

Discuss: Hacker News

🎮Language Ergonomics

The Exhaust Port of Cohesion: Precision Provocation in LLMs

blog.gopenai.com·1d·

Discuss: Hacker News

🔮Metacircular Evaluators

Benchmarking Large Language Models and Privacy Protection

priv.gc.ca·2d

⚡Tokenizer Benchmarks

My dumb prompts that worked better

blog.nilenso.com·2d

🔄Subinterpreters

0055: consulting, sql needed structure, slow forum, on the line, out of thin air, papers, other stuff

scattered-thoughts.net·5d

🔄Bootstrapping

Running MiniMax-M2 locally - Existing Hardware Advice

reddit.com·1d·

Discuss: r/LocalLLaMA

100 Techniques for Writing Readable Rust Code

reddit.com·22h·

Discuss: r/rust

⚙️TOML Parsers

Topographical sparse mapping: A training framework for deep learning models

sciencedirect.com·1d·

Discuss: Hacker News

🗺️Region Polymorphism

GDM: Consistency Training Helps Limit Sycophancy and Jailbreaks in Gemini 2.5 Flash

lesswrong.com·1d

🎲Parser Fuzzing

Formal Verification’s Value Grows

semiengineering.com·18h

🎭Program Synthesis

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

paperium.net·3d·

Discuss: DEV

🪜Recursive Descent

Iterative Foundation Model Fine-Tuning on Multiple Rewards

arxiv.org·1d

✨Effect Inference

Show HN: Refusal-Aware Logical Framework for LLMs

github.com·1d·

Discuss: Hacker News

🧩Constraint Logic

The AI Speed Illusion

dev.to·22h·

Discuss: DEV

A Dual Large Language Models Architecture with Herald Guided Prompts for Parallel Fine Grained Traffic Signal Control

arxiv.org·1d

DialectalArabicMMLU: Benchmarking Dialectal Capabilities in Arabic and Multilingual Language Models

arxiv.org·2d

MedRECT: A Medical Reasoning Benchmark for Error Correction in Clinical Texts

arxiv.org·1d

What I Learned From Working on Legacy Codebases (And How It Made Me a Better Developer)

dev.to·7h·

Discuss: DEV

🏺Code Archeology
