Silviu Pitis

PhD student, University of Toronto

1 paper at NeurIPS 2025

Homepage· OpenReview· Semantic Scholar· Google Scholar

Poster Session 2

1 paper

Wednesday, December 3, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

Simulating Viva Voce Examinations to Evaluate Clinical Reasoning in Large Language Models

#1713 · Christopher Chiu, Silviu Pitis, Mihaela van der Schaar

We introduce VivaBench, an extendable benchmark that simulates multi-turn medical conversations. We demonstrate that LLM agents are clinically knowledgeable, but limited in ability to gather information and diagnose from incomplete presentations.