PORTool: Tool-Use LLM Training with Rewarded Tree
arxiv.orgยท1d
๐ML Language
Flag this post
Milestones in open weights AI: what models shaped your journey?
๐ฑMinimal Interpreters
Flag this post
Counteracting Matthew Effect in Self-Improvement of LVLMs through Head-Tail Re-balancing
arxiv.orgยท1d
๐Earley Parsing
Flag this post
Reward Collapse in Aligning Large Language Models
arxiv.orgยท1d
โ๏ธWeighted Automata
Flag this post
ReLook: Vision-Grounded RL with a Multimodal LLM Critic for Agentic Web Coding
๐ฌInteractive REPLs
Flag this post
Anthropic Research Shows How LLMs Perceive Text via @sejournal, @martinibuster
searchenginejournal.comยท1d
๐ML Language
Flag this post
Context-Bench: Benchmarking LLMs on Agentic Context Engineering
๐Language Benchmarks
Flag this post
Questionnaire meets LLM: A Benchmark and Empirical Study of Structural Skills for Understanding Questions and Responses
arxiv.orgยท1d
๐ฑMinimal ML
Flag this post
Roadmap for Improving the Type Checker
โ
Type Checking
Flag this post
Deep Reinforcement Learning Book
๐ฏFinite Automata
Flag this post
Do Not Step Into the Same River Twice: Learning to Reason from Trial and Error
arxiv.orgยท1d
๐ญErlang OTP
Flag this post
Do LLMs Signal When They're Right? Evidence from Neuron Agreement
arxiv.orgยท1d
๐ML Language
Flag this post
Understanding Hardness of Vision-Language Compositionality from A Token-level Causal Lens
arxiv.orgยท1d
๐S-Expression
Flag this post
Building a Rules Engine from First Principles
towardsdatascience.comยท1d
โ๏ธInference Rules
Flag this post
Wednesday 26 November - 11am
informatics.ed.ac.ukยท3d
๐ML Language
Flag this post
ReForm: Reflective Autoformalization with Prospective Bounded Sequence Optimization
arxiv.orgยท3d
โกPartial Evaluation
Flag this post
Loading...Loading more...