[D] Best videos of talks on using RL to train reasoning models
reddit.com·6h·
📼Tape Linguistics
How to store ordered information in a Relational Database (2015)
softwareengineering.stackexchange.com·2d·
🧮Algebraic Datatypes
RND1: Simple, Scalable AR-to-Diffusion Conversion
radicalnumerics.ai·2d·
Discuss: Hacker News
💻Local LLMs
Haskell Weekly Issue 493
haskellweekly.news·2d·
Discuss: Hacker News
🧬Functional Programming
What's the Role of Trust in AI?
algorithmictradeoff.substack.com·1d·
Discuss: Substack
🔲Cellular Automata
Beyond Vector Search: Building a RAG That *Actually* Understands Your Data
dev.to·2d·
Discuss: DEV
🗂️Vector Databases
The Hidden Bias: A Study on Explicit and Implicit Political Stereotypes in Large Language Models
arxiv.org·1d
🤖Grammar Induction
The Rise of the Knowledge Sculptor: A New Archetype for Knowledge Work in the Age of Generative AI
arxiv.org·1d
🗺️Competency Maps
Generalized Orders of Magnitude (GOOMs)
github.com·11h·
Discuss: Hacker News
🕸️Tensor Networks
To Sink or Not to Sink: Visual Information Pathways in Large Vision-Language Models
arxiv.org·1d
📊Learned Metrics
Randomized and quantum approximate matrix multiplication
arxiv.org·1d
🔐Quantum Cryptography
Unraveling LCRE-Mediated Chromatin Loops: A Predictive Model for Gene Expression Fine-Tuning in Desert Genomes
dev.to·1d·
Discuss: DEV
📥Feed Aggregation
Enhanced SoC Design via Adaptive Topology Optimization with Reinforcement Learning
dev.to·1d·
Discuss: DEV
🧩RISC-V
LexiCon: a Benchmark for Planning under Temporal Constraints in Natural Language
arxiv.org·3d
🧮Kolmogorov Complexity
MaNGO - Adaptable Graph Network Simulators via Meta-Learning
arxiv.org·3d
🕸️Tensor Networks
Which Heads Matter for Reasoning? RL-Guided KV Cache Compression
arxiv.org·1d
📼Cassette Combinators
The Alignment Waltz: Jointly Training Agents to Collaborate for Safety
arxiv.org·1d
🔲Cellular Automata
Fixed Points and Stochastic Meritocracies: A Long-Term Perspective
arxiv.org·1d
🔲Cellular Automata