Bottleneck Analysis, Memory Optimization, Throughput Measurement, Benchmarking
RePaCA: Leveraging Reasoning Large Language Models for Static Automated Patch Correctness Assessment
arxiv.org·3d
Explainability Through Systematicity: The Hard Systematicity Challenge for Artificial Intelligence
arxiv.org·3d
LLM-Crowdsourced: A Benchmark-Free Paradigm for Mutual Evaluation of Large Language Models
arxiv.org·3d
Causal Identification of Sufficient, Contrastive and Complete Feature Sets in Image Classification
arxiv.org·2d
Good Learners Think Their Thinking: Generative PRM Makes Large Reasoning Model More Efficient Math Learner
arxiv.org·2d
Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance
arxiv.org·3d
Invisible Architectures of Thought: Toward a New Science of AI as Cognitive Infrastructure
arxiv.org·2d
Loading...Loading more...