This is a LinkMap: a 2D layout of links, controlled by a wiki.

Lecture videos:

- [Lecture 1](https://www.youtube.com/watch?v=qMp3s7D_8Xw&list=PLemsnf33Vij4eFWwtoQCrt9AHjLe3uo9_&index=1)
- [Lecture 2](https://www.youtube.com/watch?v=0X610-2cH8I&list=PLemsnf33Vij4eFWwtoQCrt9AHjLe3uo9_&index=3)
- [Lecture 3](https://www.youtube.com/watch?v=1uLzkX19zPc&list=PLemsnf33Vij4eFWwtoQCrt9AHjLe3uo9_&index=4)
- [Lecture 4](https://www.youtube.com/watch?v=ens7rnhk0e0&list=PLemsnf33Vij4eFWwtoQCrt9AHjLe3uo9_&index=5)
- [Lecture 5](https://www.youtube.com/watch?v=Mx0GMr_3Z4U&list=PLemsnf33Vij4eFWwtoQCrt9AHjLe3uo9_&index=6)
- [Lecture 6](https://www.youtube.com/watch?v=vWg9cWUqSnI&list=PLemsnf33Vij4eFWwtoQCrt9AHjLe3uo9_&index=7)
- [Lecture 7](https://www.youtube.com/watch?v=JEhHvY5k3Q0&list=PLemsnf33Vij4eFWwtoQCrt9AHjLe3uo9_&index=8)
- [Lecture 8](https://www.youtube.com/watch?v=hz7DBfGLDV4&list=PLemsnf33Vij4eFWwtoQCrt9AHjLe3uo9_&index=9)
- [Lecture 9](https://www.youtube.com/watch?v=HU6HtpQL_YA&list=PLemsnf33Vij4eFWwtoQCrt9AHjLe3uo9_&index=10)

Classic papers (the early history):

- [58 Rosenblatt](https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.335.3398&rep=rep1&type=pdf) The perceptron: the original artificial neural network.
- [Review 89 Hinton](http://www.cs.toronto.edu/~fritz/absps/clp.pdf) "Connectionist Learning Procedures". This review covers the state of the art in 1989, including backpropagation, Hopfield models, Boltzmann machines, reinforcement learning, a first mention of autoencoders (self-supervision), some remarks on mutual information, and much more. Marvellous!
- [89 Cybenko](https://link.springer.com/article/10.1007/BF02551274) The original proof that neural networks with at least one hidden layer can approximate arbitrary continuous functions (with sufficiently many neurons).
- [76 Linnainmaa](https://link.springer.com/article/10.1007/BF01931367) Original introduction of the backpropagation algorithm: "Taylor expansion of the accumulated rounding error". Discusses automatic differentiation; not yet applied to the special case of neural networks.
- [86 Rumelhart et al](http://www.cs.utoronto.ca/~hinton/absps/naturebp.pdf) Rumelhart, Hinton, Williams: the famous introduction of backpropagation for neural networks, with practical applications. Already includes gradient descent with 'momentum' (a minimal sketch of that update follows this list).
- [80 Fukushima](https://www.cs.princeton.edu/courses/archive/spr08/cos598B/Readings/Fukushima1980.pdf) Neocognitron: the grandparent of all convolutional neural networks. Far ahead of its time: a multilayer (deep) convolutional structure, with unsupervised learning. Inspired by the human visual cortex.
- [89 LeCun et al](https://papers.nips.cc/paper/1989/file/53c3bce66e43be4f209556518c2fcb54-Paper.pdf) Deep CNN for digit recognition: first to use backpropagation to train a multilayer CNN, here applied to an MNIST-style dataset of handwritten digits.
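The 'momentum' trick from the Rumelhart et al. entry above fits in a few lines. Here is a minimal sketch; the quadratic toy loss and all parameter values are my own choices, not an example from the paper:

```python
import numpy as np

# Gradient descent with momentum on a toy quadratic loss
# f(w) = 0.5 * w^T A w, whose minimum sits at the origin.
# A is a made-up curvature matrix (an assumption for this sketch).

A = np.array([[3.0, 0.0],
              [0.0, 1.0]])

def grad(w):
    return A @ w  # gradient of 0.5 * w^T A w

w = np.array([1.0, 1.0])   # initial "weights"
v = np.zeros_like(w)       # velocity: an exponential average of past gradients
eta, mu = 0.1, 0.9         # learning rate and momentum coefficient

for step in range(200):
    v = mu * v - eta * grad(w)  # mix the old velocity with the fresh gradient
    w = w + v                   # step along the velocity, not the raw gradient

print(w)  # ends up close to the minimum at the origin
```

With mu = 0 this reduces to plain gradient descent; the velocity term damps oscillations across steep directions and accelerates progress along shallow ones.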
Further papers, books, and courses:

- [12 Krizhevsky et al](https://dl.acm.org/doi/10.1145/3065386) The summary paper on the famous deep CNN ('AlexNet') that won the 2012 ImageNet competition, using ReLU activations and dropout.
- [89 Baldi et al](https://www.sciencedirect.com/science/article/abs/pii/0893608089900142?via%3Dihub) Introduces the linear autoencoder and shows that, at its optimum, it performs PCA (a numerical check is sketched at the end of this page).
- [06 Hinton et al](https://www.cs.toronto.edu/~hinton/science.pdf) Deep autoencoders & pretraining: uses restricted Boltzmann machines for pretraining and proposes layer-wise pretraining for deep architectures, with examples from several applications.
- [Review 12 Bengio et al](https://arxiv.org/abs/1206.5538) Representation learning, i.e. unsupervised learning of useful features. Covers the various autoencoder types, PCA, sparse coding, distributed representations, and more.
- [08 van der Maaten et al](http://www.cs.toronto.edu/~hinton/absps/tsne.pdf) t-SNE: everyone's favourite nonlinear dimensionality reduction technique for visualization.
- [95 Hochreiter et al](http://people.idsia.ch/~juergen/FKI-207-95ocr.pdf) Long Short-Term Memory (LSTM): the classic paper describing how to get rid of exploding/vanishing gradients in recurrent networks using gated units. (Technical report version; the journal version appeared in 1997.)
- [13 Mikolov et al](https://arxiv.org/abs/1310.4546) Word vectors: this paper shows you can indeed do basic arithmetic on word vectors and get meaningful results.
- [92 Watkins et al](http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=98ECF011CFFA9E02C015F96A1DF3A471?doi=10.1.1.466.7149&rep=rep1&type=pdf) Introduces Q-learning (a toy implementation is sketched at the end of this page).
- [92 Williams](https://link.springer.com/article/10.1007/BF00992696) Introduces REINFORCE (policy gradient).
- [85 Sejnowski](https://onlinelibrary.wiley.com/doi/abs/10.1207/s15516709cog0901_7) The original paper that introduced Boltzmann machines as a tool for learning to sample from an observed probability distribution. Explains the connections to statistical physics and the learning procedure.
- [Lecture Notes: 20 Marquardt](https://arxiv.org/abs/2101.01759) Lecture notes for the 2019 Les Houches lectures.
- [Book: 16 Goodfellow et al](https://www.deeplearningbook.org) Complete book on deep learning, available online.
- [Book: 15 Nielsen](http://neuralnetworksanddeeplearning.com) A short online book introducing the very basics.
- [Online course: Machine Learning for Physicists](https://pad.gwdg.de/s/Machine_Learning_For_Physicists_2021) Free online course, including a GitHub repository, slides, and videos (see the links to the lecture videos at the top of this page).

LinkMap code by Florian Marquardt, 2020 (MIT license). See florianmarquardt.github.io. Contents by Florian Marquardt (Creative Commons license).
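Two of the claims above are easy to verify numerically. First, the Baldi et al. result that an optimal linear autoencoder performs PCA. The sketch below trains a tied-weight linear autoencoder by plain gradient descent on made-up Gaussian toy data; the data, the tied weights, and all parameter values are my own assumptions, not the paper's setup:

```python
import numpy as np

# Toy check: a trained linear autoencoder x -> W^T W x should match
# projection onto the leading principal components (Baldi et al.).

rng = np.random.default_rng(1)
X = rng.normal(size=(500, 4)) * np.array([3.0, 2.0, 0.5, 0.1])  # anisotropic toy data
X -= X.mean(axis=0)                                             # center the data

k = 2                                 # bottleneck size
W = 0.1 * rng.normal(size=(k, 4))     # encoder weights (decoder is W^T, tied)
eta = 0.001                           # learning rate

for step in range(5000):
    R = X @ W.T @ W - X                            # reconstruction residual
    gradW = 2 * W @ (R.T @ X + X.T @ R) / len(X)   # gradient of the mean squared error
    W -= eta * gradW

# PCA baseline: project onto the top-k principal directions
_, _, Vt = np.linalg.svd(X, full_matrices=False)
P = Vt[:k].T @ Vt[:k]

print(np.mean(np.sum((X @ W.T @ W - X) ** 2, axis=1)))  # autoencoder error
print(np.mean(np.sum((X @ P - X) ** 2, axis=1)))        # PCA error (the optimum)
```

The two printed errors should nearly agree once training has converged. Second, the Q-learning update from the Watkins et al. entry, applied to a hypothetical 5-state chain; the environment and all parameter values are again my own toy choices:

```python
import numpy as np

# Tabular Q-learning on a toy chain of 5 states. Moving right from the
# last state ends the episode with reward 1; every other step gives 0.

n_states, n_actions = 5, 2            # actions: 0 = left, 1 = right
Q = np.zeros((n_states, n_actions))   # Q-table, initialized to zero
alpha, gamma, eps = 0.1, 0.9, 0.2     # learning rate, discount, exploration
rng = np.random.default_rng(0)

for episode in range(2000):
    s = int(rng.integers(n_states))   # random start state ("exploring starts")
    for t in range(100):              # cap the episode length
        # epsilon-greedy action selection
        a = int(rng.integers(n_actions)) if rng.random() < eps else int(np.argmax(Q[s]))
        done = (s == n_states - 1 and a == 1)
        r = 1.0 if done else 0.0
        s_next = max(0, s - 1) if a == 0 else min(n_states - 1, s + 1)
        # Q-learning update: bootstrap from the greedy value of the next state
        target = r + (0.0 if done else gamma * np.max(Q[s_next]))
        Q[s, a] += alpha * (target - Q[s, a])
        s = s_next
        if done:
            break

print(np.argmax(Q, axis=1))  # learned greedy policy: should be 'right' (1) everywhere
```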