This Puzzle Shows Just How Far LLMs Have Progressed in a Little Over a Year
towardsdatascience.comยท1d
๐Ÿ“Mathematical Art
Teaching Models to Decide When to Retrieve: Adaptive RAG, Part 4
blog.reachsumit.comยท2dยท
Discuss: Hacker News
๐Ÿง Learned Indexing
Read Between the Lines: A Benchmark for Uncovering Political Bias in Bangla News Articles
arxiv.orgยท1d
โš™๏ธCompression Benchmarking
Categorical Invariants of Learning Dynamics
arxiv.orgยท1d
๐Ÿ•ธ๏ธAlgebraic Topology
Shaken or Stirred? An Analysis of MetaFormer's Token Mixing for Medical Imaging
arxiv.orgยท17h
๐Ÿ“„Document Streaming
Mitigating Premature Exploitation in Particle-based Monte Carlo for Inference-Time Scaling
arxiv.orgยท17h
๐ŸงฎKolmogorov Bounds
Show HN: A Field Report on Teaching a Chinese AI to Deconstruct Its Censorship
github.comยท1hยท
Discuss: Hacker News
๐Ÿ”ฒCellular Automata
Python 3.14 Released with Template String Literals, Deferred Annotations, and
socket.devยท23hยท
Discuss: Hacker News
๐Ÿ’งLiquid Types
Pathology-CoT: Learning Visual Chain-of-Thought Agent from Expert Whole Slide Image Diagnosis Behavior
arxiv.orgยท1d
๐Ÿค–Advanced OCR
GRACE: Generative Representation Learning via Contrastive Policy Optimization
arxiv.orgยท1d
๐Ÿ“ŠHyperLogLog
LLMs as Policy-Agnostic Teammates: A Case Study in Human Proxy Design for Heterogeneous Agent Teams
arxiv.orgยท17h
๐Ÿ”ฒCellular Automata
Evaluating LLM Safety Across Child Development Stages: A Simulated Agent Approach
arxiv.orgยท17h
๐Ÿ’ปProgramming languages
Large Language Models Achieve Gold Medal Performance at International Astronomy & Astrophysics Olympiad
arxiv.orgยท1d
๐Ÿš€SIMD Text Processing
Collaborative and Proactive Management of Task-Oriented Conversations
arxiv.orgยท17h
๐Ÿ“Linear Logic
Efficient Test-Time Scaling for Small Vision-Language Models
arxiv.orgยท1d
๐Ÿ—œ๏ธLZW Variants
Learning to Route: A Rule-Driven Agent Framework for Hybrid-Source Retrieval-Augmented Generation
arxiv.orgยท2d
๐Ÿ”Information Retrieval
Can an LLM Induce a Graph? Investigating Memory Drift and Context Length
arxiv.orgยท1d
๐Ÿ“‹Document Grammar
PsycholexTherapy: Simulating Reasoning in Psychotherapy with Small Language Models in Persian
arxiv.orgยท1d
๐Ÿ’ปProgramming languages