1 paper across 1 session
New dataset and benchmark evaluate LLMs on molecular property reasoning at a fine-grained functional group level