3 papers across 3 sessions
We propose a novel benchmark for pattern recognition for many-shot in-context learning for large language models and conduct extensive empirical analysis with many insights.