Researcher, ByteDance Inc.
2 papers at NeurIPS 2025
We identified a safety issue in the MoE architecture and designed experiments to demonstrate it.
We train a token-level neural router that lets an SLM follow LLM reasoning paths by replacing only divergent tokens.