1 paper across 1 session
We present CheXStruct and CXReasonBench: CheXStruct, an automated pipeline for extracting intermediate reasoning steps directly from chest X-rays, and CXReasonBench, a benchmark for evaluating whether models follow structured diagnostic reasoning.