Imputer: Sequence Modelling Via Imputation And Dynamic Programming

The imputer is a sequence-to-sequence model that strikes a balance between fully autoregressive models with long inference times and fully non-autoregressive models with fast inference. The imputer achieves constant decoding time independent of sequence length by exploiting dynamic programming.

Abstract:
This paper presents the Imputer, a neural sequence model that generates output sequences iteratively via imputations. The Imputer is an iterative generative model, requiring only a constant number of generation steps independent of the number of input or output tokens. The Imputer can be trained to approximately marginalize over all possible alignments between the input and output sequences, and all possible generation orders. We present a tractable dynamic programming training algorithm, which yields a lower bound on the log marginal likelihood. When applied to end-to-end speech recognition, the Imputer outperforms prior non-autoregressive models and achieves competitive results to autoregressive models. On LibriSpeech test-other, the Imputer achieves 11.1 WER, outperforming CTC at 13.0 WER and seq2seq at 12.5 WER.

Authors: William Chan, Chitwan Saharia, Geoffrey Hinton, Mohammad Norouzi, Navdeep Jaitly

Links:
YouTube:
Twitter:
BitChute:
Minds:

source

What's Hot

Mistral and ASML forge €1.7bn alliance to shape Europe’s AI future

Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play? – Takara TLDR

OpenAI could leave California in last-ditch effort to avoid political scrutiny

Imputer: Sequence Modelling via Imputation and Dynamic Programming

AGI is not coming!

Context Rot: How Increasing Input Tokens Impacts LLM Performance (Paper Analysis)

Energy-Based Transformers are Scalable Learners and Thinkers (Paper Review)

Leon Black and Leslie Wexner’s Letters to Jeffrey Epstein Released

Anne Imhof Reimagines Football Jerseys with Nike

Jason Wu, Robert Rauschenberg Collaboration for New York Fashion Week

Storied Collector and MoMA Trustee Dies at 92

Mistral and ASML forge €1.7bn alliance to shape Europe’s AI future

Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play? – Takara TLDR

OpenAI could leave California in last-ditch effort to avoid political scrutiny

What's Hot

Imputer: Sequence Modelling via Imputation and Dynamic Programming

Related Posts

Subscribe to Updates