Does this framework support SFT, RL training, and inference for MoE models?
Does this framework support SFT, RL training, and inference for MoE models?