Browsing: Yannic Kilcher
#summarization #gpt3 #openai Text Summarization is a hard task, both in training and evaluation. Training is usually done by maximizing the…
#ai #chess #alphazero Chess is a very old game and both its rules and theory have evolved over thousands of…
#ai #research #hardware We like to think that ideas in research succeed because of their merit, but this story is…
#ai #research #optimization Optimization is still the domain of hand-crafted, simple algorithms. An ML engineer not only has to pick…
#ai #research #transformers Transformers are Ruining Convolutions. This paper, under review at ICLR, shows that given enough data, a standard…
#ai #research #optimization Deep Learning famously gives rise to very complex, non-linear optimization problems that cannot be solved analytically. Therefore,…
#ai #research #attention Transformers, having already captured NLP, have recently started to take over the field of Computer Vision. So…
#ai #research #attention Transformers have huge memory and compute requirements because they construct an Attention matrix, which grows quadratically in…
#ai #research #nlp Knowledge Graphs are structured databases that capture real-world entities and their relations to each other. KGs are…
#ai #research #machinelearning Deep Learning models are often overparameterized and have many degrees of freedom, which leads to many local…