Model Quantization, Inference Optimization, GGUF Format, Privacy-preserving AI
I Built a Confidence-Aware Filter and It Removed 8% of Garbage
askthegame.bearblog.dev·12h
LLMs, Data Dysphoria, and the Global Regulatory Response
hackernoon.com·4d
Aligning Frozen LLMs by Reinforcement Learning: An Iterative Reweight-then-Optimize Approach
arxiv.org·3d
Less Data Less Tokens: Multilingual Unification Learning for Efficient Test-Time Reasoning in LLMs
arxiv.org·3d
RecLLM-R1: A Two-Stage Training Paradigm with Reinforcement Learning and Chain-of-Thought v1
arxiv.org·2d
Generative AI Model Data Pre-Training on Kubernetes: A Use Case Study - DevConf.CZ 2025
youtube.com·1d
scMamba: A Scalable Foundation Model for Single-Cell Multi-Omics Integration Beyond Highly Variable Feature Selection
arxiv.org·22h
Reinforcement Learning from Human Feedback, Explained Simply
towardsdatascience.com·4d
LOGICPO: Efficient Translation of NL-based Logical Problems to FOL using LLMs and Preference Optimization
arxiv.org·3d
Loading...Loading more...