Associate Professor, McGill University
2 papers at NeurIPS 2025
This work introduces a fast and efficient method for epistemic uncertainty in ensemble models for regression. Our method significantly improves baseline active learning methods on high-dimensional tasks.
We present a theoretical framework for policy convergence in RL, which permits convergence of return distribution estimates.