Browsing: Yannic Kilcher
DeepMind’s Agent57 is the first RL agent to outperform humans in all 57 Atari benchmark games. It extends previous algorithms…
Peer review is outdated and ineffective. SOAR is a new and revolutionary way to distribute scientific reviewing and scale to…
My thoughts on the let-the-young-get-infected argument. Abstract: In this article, we present an analysis of a risk-based selective quarantine model…
Dreamer is a new RL agent by DeepMind that learns a continuous control task through forward-imagination in latent space. Videos:…
From the makers of Go-Explore, POET is a mixture of ideas from novelty search, evolutionary methods, open-ended learning and curriculum…
Current NLP models are often “cheating” on supervised learning tasks by exploiting correlations that arise from the particularities of the…
Funny Twitter spat between researchers arguing who was the first to invent an idea that has probably been around since…
Normalization and activation layers have a long history of hand-crafted variants with varying results. This paper proposes an evolutionary…
The enhanced POET makes some substantial and well-crafted improvements over the original POET algorithm and excels at open-ended learning like…
Contrastive learning is an established method in NLP and image classification. The authors show that with relatively minor adjustments,…