Generalised solutions and law of conservation of difficulty (2008)
terrytao.wordpress.com·2h·
Discuss: Hacker News
🔗Concatenative Theory
Zero-Based Numbering
en.wikipedia.org·8h·
Discuss: Hacker News
🪢Rope Data Structures
An Age of AI Enlightenment
xiangfu.co·3h·
Discuss: Hacker News
✨Effect Inference
Status Week 39
blogs.gnome.org·12h
🐛Interactive Debuggers
MDD-Thinker: Towards Large Reasoning Models for Major Depressive Disorder Diagnosis
arxiv.org·1d
✨Effect Inference
Correct Reasoning Paths Visit Shared Decision Pivots
arxiv.org·2d
🧮Theorem Provers
Large language models management of medications: three performance analyses
arxiv.org·1d
🎮Language Ergonomics
ChessArena: A Chess Testbed for Evaluating Strategic Reasoning Capabilities of Large Language Models
arxiv.org·1d
🎭Racket
Learning to Reason in Structured In-context Environments with Reinforcement Learning
arxiv.org·1d
🪜Recursive Descent
Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training
arxiv.org·2h
🪜Recursive Descent
LatentEvolve: Self-Evolving Test-Time Scaling in Latent Space
arxiv.org·1d
🪜Recursive Descent
Robust Preference Optimization: Aligning Language Models with Noisy Preference Feedback
arxiv.org·1d
🪜Recursive Descent
Agentic Exploration of Physics Models
arxiv.org·1d
✨Gleam
Dual Mechanisms of Value Expression: Intrinsic vs. Prompted Values in LLMs
arxiv.org·1d
🎰Parsing Machines
Knowledge Homophily in Large Language Models
arxiv.org·1d
🪜Recursive Descent
Hardening Your AI Agent Against Prompt Injection via MCP
dev.to·9h·
Discuss: DEV
🔄Subinterpreters
RoleConflictBench: A Benchmark of Role Conflict Scenarios for Evaluating LLMs' Contextual Sensitivity
arxiv.org·2h
🔗Lexical Scoping
Making sense of parameter-space decomposition
lesswrong.com·3d
🔢Algebraic Datatypes
How to Make Large Language Models Generate 100% Valid Molecules?
arxiv.org·1d
🎯Finite Automata