ceoAMAN/Sturnus: A horizontal Self supervising sparse MoE architecture (opens in new tab)
A horizontal Self supervising sparse MoE architecture - ceoAMAN/Sturnus
Read the original articleA horizontal Self supervising sparse MoE architecture - ceoAMAN/Sturnus
Read the original article