Juan Claude Formanek

PhD student, University of Cape Town

2 papers at NeurIPS 2025

OpenReview· Semantic Scholar· Google Scholar

Poster Session 3

Thursday, December 4, 2025 · 11:00 AM → 2:00 PM

Breaking the Performance Ceiling in Reinforcement Learning requires Inference Strategies

#311 · Felix Chalumeau, Daniel Rajaonarivonivelomanantsoa, Ruan John de Kock, Juan Claude Formanek, Sasha Abramowitz, Omayma Mahjoub, Wiem Khlifi, Simon Verster Du Toit, Louay Ben Nessir, Refiloe Shabe, Arnol Manuel Fokam, Siddarth Singh, Ulrich Armel Mbou Sob, Arnu Pretorius

Using search strategies at inference-time can provide massive performance boost on numerous complex reinforcement learning tasks, within only a couple seconds of execution time.

Poster Session 6

1 paper

Friday, December 5, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

Oryx: a Scalable Sequence Model for Many-Agent Coordination in Offline MARL

#210 · Juan Claude Formanek, Omayma Mahjoub, Louay Ben Nessir, Sasha Abramowitz, Ruan John de Kock, Wiem Khlifi, Daniel Rajaonarivonivelomanantsoa, Simon Verster Du Toit, Arnol Manuel Fokam, Siddarth Singh, Ulrich Armel Mbou Sob, Felix Chalumeau, Arnu Pretorius

We extend autoregressive multi-agent sequence models, including Sable and MAT, to the Offline MARL setting and demonstrate that they significanlty outperform current state-of-the-art methods across a diverse set of benchmarks with up to 50 agents.