Cross-Platform Evaluation of Reasoning Capabilities in Foundation Models
arxiv.org·1d
🧠Automated Reasoning
Flag this post
StreetMath: Study of LLMs' Approximation Behaviors
arxiv.org·1d
🔍CBMC
Flag this post
Building a Visual Diff System for AI Edits (Like Git Blame for LLM Changes)
🔤Language Design
Flag this post
Empirical Bayesian Multi-Bandit Learning
arxiv.org·1d
📚Automata Learning
Flag this post
How Data Mixing Shapes In-Context Learning: Asymptotic Equivalence for Transformers with MLPs
arxiv.org·2d
🔁Fixed-Point Theory
Flag this post
Reflection for Aggregates (2020)
🔢Algebraic Data Types
Flag this post
De Bruijn Numerals
🧮Lambda Calculus
Flag this post
zFLoRA: Zero-Latency Fused Low-Rank Adapters
arxiv.org·1d
🧮Z3 Solver
Flag this post
Enhanced Knowledge Graph Reasoning via Multi-Modal Data Fusion and Automated Verification
🧠Automated Reasoning
Flag this post
Approximating Heavy-Tailed Distributions with a Mixture of Bernstein Phase-Type and Hyperexponential Models
arxiv.org·1d
📐Linear Algebra
Flag this post
Framework for Machine Evaluation of Reasoning Completeness in Large Language Models For Classification Tasks
arxiv.org·4d
✓Automated Theorem Proving
Flag this post
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com·20h
🔄Finite State Machines
Flag this post
Scalable Static Analysis Framework – hardening large C++ codebases (LLVM/Apple)
🔬Static Analysis
Flag this post
Do LLMs Signal When They're Right? Evidence from Neuron Agreement
arxiv.org·1d
🧩Parser Combinators
Flag this post
From Scripts to Scale: Python, Mypy, and the Rise of Static Typing
🔀Brzozowski Derivatives
Flag this post
Loading...Loading more...