Browsing: Yannic Kilcher
DDL is an auxiliary task for an agent to learn distances between states in episodes. This can then be used…
Stunning evidence for the hypothesis that neural networks work so well because their random initialization almost certainly contains a nearly…
The imputer is a sequence-to-sequence model that strikes a balance between fully autoregressive models with long inference times and fully…
FixMatch is a simple, yet surprisingly effective approach to semi-supervised learning. It combines two previous methods in a clever way…
Offline Reinforcement Learning has come more and more into focus recently in domains where classic on-policy RL algorithms are infeasible…
A closer look at the OpenAI microscope, a database of visualizations of the inner workings of ImageNet classifiers, along with…
This paper establishes a framework for looking at out-of-distribution generalization failures of modern deep learning as the models learning false…
Geoffrey Hinton and his co-authors describe a biologically plausible variant of backpropagation and report evidence that such an algorithm might…
The Longformer extends the Transformer by introducing sliding window attention and sparse global attention. This allows for the processing of…
Multi-Task Learning can be very challenging when gradients of different tasks are of severely different magnitudes or point into conflicting…