Research Engineer, Center for AI Safety
1 paper at NeurIPS 2025
We discover that coherent value systems emerge with scale in LLMs and propose the research avenue of utility engineering to analyze and control these emergent value systems.