Researcher, Google
3 papers at NeurIPS 2025
Leveraging correlations among hierarchies present in natural language processing tasks is used for zero-shot peformance prediction of learning curves for scaling law research.
We proposed a new data selection method for pretraining multilingual Large Language Models