📑 arXiv 3d ago
Benchmarking Optimizers for MLPs in Tabular Deep Learning
A systematic benchmark of multiple optimizers for training MLPs on tabular data finds that Muon consistently outperforms AdamW, the standard choice. Described as the first comprehensive optimizer comparison for tabular deep learning, it challenges the default that practitioners rely on.