2 papers across 2 sessions
DRG-Sapphire, an LLM trained with GRPO, achieves SOTA accuracy in DRG coding and demonstrates a logarithmic scaling relationship between SFT examples and RL performance.
Novel visually prompted diffusion model generates spatially controlled histopathology images, extends to unannotated TCGA, and produces synthetic training data that outperforms real data for segmentation models.