Model Optimization, Inference Engines, LLM Quantization, Privacy-focused Deployments
Finetuning a Weather Foundation Model with Lightweight Decoders for Unseen Physical Processes
arxiv.org·19h
Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics Learning
arxiv.org·1d
SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning
arxiv.org·19h
HE-LRM: Encrypted Deep Learning Recommendation Models using Fully Homomorphic Encryption
arxiv.org·1d
Why Your Next LLM Might Not Have A Tokenizer
towardsdatascience.com·1d
Loading...Loading more...