Vice President, Amazon Web Services
1 paper at NeurIPS 2025
We enable tree-based decoding on SSMs to facilitate speculative decoding with tree-based verification with SSMs