Postdoc, Carnegie Mellon University (CMU)
3 papers at NeurIPS 2025
Machine learning researchers must urgently work with policymakers to address growing risks from embodied AI by plugging gaps in existing frameworks.
Antidistillation sampling strategically modifies a model's next-token probability distribution to poison reasoning traces, rendering them significantly less effective for distillation while preserving the model's practical utility.
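The core idea of adjusting the next-token distribution can be sketched as follows. This is a toy illustration only, not the actual antidistillation sampling algorithm: the function name and the `poison_scores` input (a hypothetical per-token estimate of how useful emitting that token would be to a would-be distillation student) are assumptions for the sketch, and the real method computes its perturbation differently.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D logit vector.
    e = np.exp(x - x.max())
    return e / e.sum()

def antidistill_sample(logits, poison_scores, lam=1.0, rng=None):
    """Toy sketch: sample from a perturbed next-token distribution.

    `poison_scores` is a hypothetical stand-in for a per-token score of
    distillation usefulness; tokens that would help a student model are
    down-weighted by strength `lam`, shifting probability mass toward
    tokens that are less useful for distillation.
    """
    rng = rng or np.random.default_rng()
    adjusted = logits - lam * poison_scores  # penalize distillation-useful tokens
    probs = softmax(adjusted)
    return int(rng.choice(len(probs), p=probs))
```

With `lam = 0` this reduces to ordinary sampling from the model's distribution; the sketch omits how the per-token scores would actually be computed, which is where the method itself lives.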
We present a data-centric pretraining framework that builds safety into the model from the start.