transformer architecture, model training, LLM inference, distributed training