Towards Robust Training in NNGPT AutoML Pipeline: A Loss-Optimizer Pairing Selection Study (opens in new tab)

The choice of loss function and optimizer is an important decision, that shapes further model training. Yet automated architecture search pipelines (AutoML) benefits significantly more from the optimal pairing selection and vice versa. This paper investigates whether a single recipe is sufficient for heterogeneous architecture pools, or whether the optimal pairing varies across structurally diverse models. We conduct a systematic empirical study...

Read the original article