Researcher, Amazon Web Services
1 paper at NeurIPS 2025
We study the problem of computing an optimal large language model (LLM) policy for a constrained alignment problem.