AGCI: A Framework for Evaluating Artificial General Coding Intelligence
dropstone.io·14h·
Flag this post
🚀LLM Overthinking? DTS makes LLM think shorter and answer smarter
reddit.com·4h·
Discuss: r/LLM
Flag this post
Show HN: Autogenerate efficient backward kernels for Triton
github.com·1d·
Discuss: Hacker News
Flag this post
Rethinking Explanation Evaluation under the Retraining Scheme
arxiv.org·15h
Flag this post
Inclusion of Role into Named Entity Recognition and Ranking
arxiv.org·1d
Flag this post
Analysing European Soccer Data with Deepnote in Windsurf IDE
dev.to·1d·
Discuss: DEV
Flag this post
Deep Learning Analysis of Prenatal Ultrasound for Identification of Ventriculomegaly
arxiv.org·15h
Flag this post
Automatic Paper Reviewing with Heterogeneous Graph Reasoning over LLM-Simulated Reviewer-Author Debates
arxiv.org·15h
Flag this post
REACT-LLM: A Benchmark for Evaluating LLM Integration with Causal Features in Clinical Prognostic Tasks
arxiv.org·1d
Flag this post
Optimistic Online-to-Batch Conversions for Accelerated Convergence and Universality
arxiv.org·1d
Flag this post
AI-Driven Contribution Evaluation and Conflict Resolution: A Framework & Design for Group Workload Investigation
arxiv.org·15h
Flag this post
Textual Self-attention Network: Test-Time Preference Optimization through Textual Gradient-based Attention
arxiv.org·1d
Flag this post
Announcing BigQuery-managed AI functions for better SQL
cloud.google.com·3h
Flag this post
Quantum Approximate Walk Algorithm
arxiv.org·15h
Flag this post
Building Intelligent Game AI with CXXGraph: From Grid Pathfinding to Strategic Navigation
dev.to·1d·
Discuss: DEV
Flag this post
Explaining Bayesian Neural Networks
arxiv.org·1d
Flag this post
Interaction Dynamics as a Reward Signal for LLMs
arxiv.org·15h
Flag this post
Anatomy-VLM: A Fine-grained Vision-Language Model for Medical Interpretation
arxiv.org·15h
Flag this post