Overview

This file presents the results of automatic summarization of an online lecture video titled: DeepMind x UCL | Deep Learning Lectures | 7/12 | Deep Learning for Natural Language Processing - YouTube. The abstract is generated using LED (Longformer-Encoder-Decoder), a state-of-the-art Transformer-based language model. This implementation uses a pre-trained model fine-tuned on PubMed, a long-range summarization dataset.

The summary is produced with TextRank, an unsupervised graph-based algorithm that extracts the top-ranked words/phrases and sentences from the original transcript of the video. The summary sentences are returned in their order of occurrence in the transcript, not in ranked order. The summary contains 57% fewer words and 70% fewer sentences than the original transcript.

Sentences are grouped into paragraphs based on their positions in the transcript. A long paragraph indicates several sentences in close proximity with minimal pruning between them. Short paragraphs and orphaned sentences suggest that more context may be needed.

The final section is the full extracted transcript, line by line. Sentences in the 'Summary' section are hyperlinked to the 'Full Transcript' section. Sentences in the 'Full Transcript' section are hyperlinked to the video at the approximate time of utterance.
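
The extractive step described above can be illustrated with a minimal sketch. This file does not say which TextRank implementation or settings were used, so everything here is an assumption made for illustration: the function names (split_sentences, similarity, textrank_scores, summarize, reduction) are hypothetical, the sentence splitter is deliberately naive, the similarity is a word-overlap measure in the style of TextRank, and the damping factor of 0.85 is the conventional PageRank default rather than a value taken from this pipeline.

```python
import math
import re


def split_sentences(text):
    """Naive splitter (assumption): break after '.', '!' or '?' plus whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def similarity(a, b):
    """Shared-word count normalised by the log of the two sentence lengths."""
    if not a or not b:
        return 0.0
    denom = math.log(len(a)) + math.log(len(b))
    if denom <= 0:
        return 0.0
    return len(set(a) & set(b)) / denom


def textrank_scores(sentences, damping=0.85, iterations=50):
    """Score sentences by weighted PageRank over a sentence-similarity graph."""
    tokens = [re.findall(r"[a-z']+", s.lower()) for s in sentences]
    n = len(sentences)
    weights = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            weights[i][j] = weights[j][i] = similarity(tokens[i], tokens[j])
    out_sum = [sum(row) for row in weights]
    scores = [1.0] * n
    for _ in range(iterations):
        scores = [
            (1 - damping) + damping * sum(
                weights[j][i] / out_sum[j] * scores[j]
                for j in range(n) if out_sum[j] > 0
            )
            for i in range(n)
        ]
    return scores


def summarize(text, keep=3):
    """Keep the top-ranked sentences, returned in original transcript order."""
    sentences = split_sentences(text)
    scores = textrank_scores(sentences)
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    kept = sorted(ranked[:keep])  # restore order of occurrence, not rank order
    return [sentences[i] for i in kept]


def reduction(original_count, summary_count):
    """Percentage reduction, e.g. in sentence count, relative to the original."""
    return 100.0 * (1 - summary_count / original_count)
```

Calling summarize(transcript_text, keep=k) returns k sentences in transcript order, and reduction(original_count, summary_count) gives the kind of percentage reported above. Sorting the selected indices before returning is what keeps the summary in order of occurrence rather than rank order.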

Abstract

Deep learning and language understanding form an enormous area of research in machine learning and neural computation. Deep learning has improved performance on many language processing applications over the last few years, which raises the question of why deep learning, and models that have neural computation at the heart of their processing, have been so effective in language processing. In the first section of this lecture we give an overview of neural computation and of language in general, and then give some idea of why neural computation (deep learning) and language might be an appropriate fit, producing the improvements and impressive language processing performance seen over the past few years. We then focus on one neural language model that we think is representative of many of the principles that govern all neural language models: the Transformer, which was released in 2017. In section three we go deeper into a particular application of the Transformer, the well-known BERT model. BERT is an impressive demonstration of unsupervised learning and of the ability of neural language models to transfer knowledge from one training environment to another. In the final section, we look towards the future of language understanding and deep learning. To do that, we delve into work done at DeepMind on grounded language learning, which studies the acquisition of language in deep neural networks that can interact with and move around simulated environments.

Keywords/phrases

language models, word sentences, neural language models, different neural language models, input words, different language understanding models, word meanings, many neural language models, different words, word representations, masked language model prediction, word order, word senses, word, words, representing words, many other words, individual words, transformer models, distributed word representations