Think for Yourself
Digital Minimalism
Dive into Systems
CS
Good abstractions for humans turn out to be good abstractions for LLMs
Program Synthesis
Writing an LLM from scratch, part 26 - evaluating the fine-tuned model
Automata Learning
iFlyBot-VLA Technical Report
arxiv.org·1h
Automata Learning
Panther: A Cost-Effective Privacy-Preserving Framework for GNN Training and Inference Services in Cloud Environments
arxiv.org·1d
λFunctional Programming
RLAC: Reinforcement Learning with Adversarial Critic for Free-Form Generation Tasks
arxiv.org·1d
Formal Verification
Adding New Capability in Existing Scientific Application with LLM Assistance
arxiv.org·1d
Compiler Design
LLM-Centric RAG with Multi-Granular Indexing and Confidence Constraints
arxiv.org·2d
OCaml
Measuring the Intrinsic Dimension of Earth Representations
arxiv.org·1h
Graph Theory
Self-Harmony: Learning to Harmonize Self-Supervision and Self-Play in Test-Time Reinforcement Learning
arxiv.org·1d
Verification Games
Surfacing Subtle Stereotypes: A Multilingual, Debate-Oriented Evaluation of Modern LLMs
arxiv.org·1d
Existential Types
Diagnosing Hallucination Risk in AI Surgical Decision-Support: A Sequential Framework for Sequential Validation
arxiv.org·1d
Hoare Logic
AI Progress Should Be Measured by Capability-Per-Resource, Not Scale Alone: A Framework for Gradient-Guided Resource Allocation in LLMs
arxiv.org·1d
Hindley-Milner
RIS-Assisted 3D Spherical Splatting for Object Composition Visualization using Detection Transformers
arxiv.org·1h
Microcontrollers
I Want to Break Free! Persuasion and Anti-Social Behavior of LLMs in Multi-Agent Settings with Social Hierarchy
arxiv.org·1h
Cellular Automata
Spot The Ball: A Benchmark for Visual Social Inference
arxiv.org·1d
Verification Games