Adam Gleave

Principal Researcher, FAR.AI

1 paper at NeurIPS 2025

Homepage · OpenReview · Semantic Scholar · Google Scholar

Poster Session 1

1 paper

Wednesday, December 3, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

Preference Learning with Lie Detectors can Induce Honesty or Evasion

#509 · Chris Cundy, Adam Gleave

We incorporate lie detectors into the labelling step of preference learning and characterize the factors that lead the trained policy to be honest or to evade the detector.