Model Quantization, Inference Optimization, GGUF Format, Privacy-preserving AI
I Built a Confidence-Aware Filter and It Removed 8% of Garbage
askthegame.bearblog.devยท6h
Distilling On-device Language Models for Robot Planning with Minimal Human Intervention
arxiv.orgยท3d
Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models
arxiv.orgยท4d
LLMs, Data Dysphoria, and the Global Regulatory Response
hackernoon.comยท4d
EQuARX: Efficient Quantized AllReduce in XLA for Distributed Machine Learning Acceleration
arxiv.orgยท3d
Aligning Frozen LLMs by Reinforcement Learning: An Iterative Reweight-then-Optimize Approach
arxiv.orgยท3d
Loading...Loading more...