Hearing More with Less: Multi-Modal Retrieval-and-Selection Augmented Conversational LLM-Based ASR
arxiv.org·18h
UrBLiMP: A Benchmark for Evaluating the Linguistic Competence of Large Language Models in Urdu
arxiv.org·18h
Rethinking Evidence Hierarchies in Medical Language Benchmarks: A Critical Evaluation of HealthBench
arxiv.org·1d
Trustworthy Reasoning: Evaluating and Enhancing Factual Accuracy in LLM Intermediate Thought Processes
arxiv.org·4d
Beyond Manually Designed Pruning Policies with Second-Level Performance Prediction: A Pruning Framework for LLMs
arxiv.org·18h
Lightweight Backbone Networks Only Require Adaptive Lightweight Self-Attention Mechanisms
arxiv.org·18h
Bounded fuzzy logic control for optimal scheduling of green hydrogen production and revenue maximisation
arxiv.org·18h
Loading...Loading more...