Model Quantization, Inference Optimization, GGUF Format, Privacy-preserving AI
Study reveals how much energy AI uses to answer your questions
mercurynews.com · 14h
EQuARX: Efficient Quantized AllReduce in XLA for Distributed Machine Learning Acceleration
arxiv.org · 2d
Less Data Less Tokens: Multilingual Unification Learning for Efficient Test-Time Reasoning in LLMs
arxiv.org · 2d
Aligning Frozen LLMs by Reinforcement Learning: An Iterative Reweight-then-Optimize Approach
arxiv.org · 2d
RecLLM-R1: A Two-Stage Training Paradigm with Reinforcement Learning and Chain-of-Thought v1
arxiv.org · 1d
Surgery-R1: Advancing Surgical-VQLA with Reasoning Multimodal Large Language Model via Reinforcement Learning
arxiv.org · 1d
Reinforcement Learning from Human Feedback, Explained Simply
towardsdatascience.com · 2d
LOGICPO: Efficient Translation of NL-based Logical Problems to FOL using LLMs and Preference Optimization
arxiv.org · 2d
DiaLLMs: EHR Enhanced Clinical Conversational System for Clinical Test Recommendation and Diagnosis Prediction
arxiv.org · 4h