PORTool: Tool-Use LLM Training with Rewarded Tree
arxiv.orgยท1d
๐Ÿ”ML Language
Flag this post
Milestones in open weights AI: what models shaped your journey?
reddit.comยท7hยท
Discuss: r/LocalLLaMA
๐ŸŒฑMinimal Interpreters
Flag this post
Counteracting Matthew Effect in Self-Improvement of LVLMs through Head-Tail Re-balancing
arxiv.orgยท1d
๐Ÿ“ˆEarley Parsing
Flag this post
Reward Collapse in Aligning Large Language Models
arxiv.orgยท1d
โš–๏ธWeighted Automata
Flag this post
Turbocharge Your AI: A Smarter Way to Explore Decision Trees
dev.toยท1dยท
Discuss: DEV
๐ŸšถTree-walking
Flag this post
ReLook: Vision-Grounded RL with a Multimodal LLM Critic for Agentic Web Coding
paperium.netยท4hยท
Discuss: DEV
๐Ÿ’ฌInteractive REPLs
Flag this post
Daily Artificial Intelligence Digest - Oct 31, 2025
dev.toยท1dยท
Discuss: DEV
๐ŸŽญProgram Synthesis
Flag this post
Anthropic Research Shows How LLMs Perceive Text via @sejournal, @martinibuster
searchenginejournal.comยท1d
๐Ÿ”ML Language
Flag this post
Context-Bench: Benchmarking LLMs on Agentic Context Engineering
letta.comยท10hยท
Discuss: Hacker News
๐ŸLanguage Benchmarks
Flag this post
Roadmap for Improving the Type Checker
forums.swift.orgยท1dยท
โœ…Type Checking
Flag this post
Deep Reinforcement Learning Book
deepreinforcementlearningbook.orgยท1dยท
Discuss: Hacker News
๐ŸŽฏFinite Automata
Flag this post
Do Not Step Into the Same River Twice: Learning to Reason from Trial and Error
arxiv.orgยท1d
๐ŸŽญErlang OTP
Flag this post
Show HN: Everything it took to run an LLM at 10k tok/s on H200s
relace.aiยท2dยท
Discuss: Hacker News
โœจGleam
Flag this post
Do LLMs Signal When They're Right? Evidence from Neuron Agreement
arxiv.orgยท1d
๐Ÿ”ML Language
Flag this post
**Breaking the Curse of Dimensionality: A Game-Changer for L
dev.toยท12hยท
Discuss: DEV
๐Ÿ”„Subinterpreters
Flag this post
Understanding Hardness of Vision-Language Compositionality from A Token-level Causal Lens
arxiv.orgยท1d
๐Ÿ“‹S-Expression
Flag this post
Building a Rules Engine from First Principles
towardsdatascience.comยท1d
โš–๏ธInference Rules
Flag this post
Wednesday 26 November - 11am
informatics.ed.ac.ukยท3d
๐Ÿ”ML Language
Flag this post
ReForm: Reflective Autoformalization with Prospective Bounded Sequence Optimization
arxiv.orgยท3d
โšกPartial Evaluation
Flag this post