Accelerating LLM inference with post-training weight and activation using AWQ and GPTQ on Amazon SageMaker AI
aws.amazon.com·2d
Automatic Prompt Optimization for Multimodal Vision Agents: A Self-Driving Car Example
towardsdatascience.com·12h