Optimizing LLMs for Performance and Accuracy with Post-Training Quantization
developer.nvidia.com·3d
Language Arithmetics: Towards Systematic Language Neuron Identification and Manipulation
arxiv.org·3d
SOME: Symmetric One-Hot Matching Elector -- A Lightweight Microsecond Decoder for Quantum Error Correction
arxiv.org·2d
LVM-GP: Uncertainty-Aware PDE Solver via coupling latent variable model and Gaussian process
arxiv.org·3d
Mamba-based Efficient Spatio-Frequency Motion Perception for Video Camouflaged Object Detection
arxiv.org·2d
Good Learners Think Their Thinking: Generative PRM Makes Large Reasoning Model More Efficient Math Learner
arxiv.org·2d
Investigating the Invertibility of Multimodal Latent Spaces: Limitations of Optimization-Based Methods
arxiv.org·2d
Loading...Loading more...