Researcher, University of Pennsylvania
1 paper at NeurIPS 2025
We study the problem of computing an optimal large language model (LLM) policy for a constrained alignment problem.