We introduce a bicycle design benchmark that evaluates multiphysics performance, constraint satisfaction, and adherence to human preferences, and we use it to compare LLMs, tabular generative models, and design optimization algorithms side by side.