Project Awesome project awesome

Deep Learning > Education

Collection 1.8k stars GitHub

Papers

Convolutional Neural Networks

The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation

Merges the ideas of the U-Net and the DenseNet, this new neural network is especially good for huge datasets in image segmentation.

What is the Best Multi-Stage Architecture for Object Recognition?

Awesome for the use of "local contrast normalization".

ImageNet Classification with Deep Convolutional Neural Networks

AlexNet, 2012 ILSVRC, breakthrough of the ReLU activation function.

Visualizing and Understanding Convolutional Networks 34.8k updated 1y ago

For the "deconvnet layer".

Fast and Accurate Deep Network Learning by Exponential Linear Units

ELU activation function for CIFAR vision tasks.

Very Deep Convolutional Networks for Large-Scale Image Recognition

Interesting idea of stacking multiple 3x3 conv+ReLU before pooling for a bigger filter size with just a few parameters. There is also a nice table for "ConvNet Configuration".

Going Deeper with Convolutions

GoogLeNet: Appearance of "Inception" layers/modules, the idea is of parallelizing conv layers into many mini-conv of different size with "same" padding, concatenated on depth.

Highway Networks

Highway networks: residual connections.

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Batch normalization (BN): to normalize a layer's output by also summing over the entire batch, and then performing a linear rescaling and shifting of a certain trainable amount.

U-Net: Convolutional Networks for Biomedical Image Segmentation

The U-Net is an encoder-decoder CNN that also has skip-connections, good for image segmentation at a per-pixel level.

Deep Residual Learning for Image Recognition

Very deep residual layers with batch normalization layers - a.k.a. "how to overfit any vision dataset with too many layers and make any vision model work properly at recognition given enough data".

Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning

For improving GoogLeNet with residual connections.

WaveNet: a Generative Model for Raw Audio

Epic raw voice/music generation with new architectures based on dilated causal convolutions to capture more audio length.

Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling

3D-GANs for 3D model generation and fun 3D furniture arithmetics from embeddings (think like word2vec word arithmetics with 3D furniture representations).

Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

Incredibly fast distributed training of a CNN.

Densely Connected Convolutional Networks

Best Paper Award at CVPR 2017, yielding improvements on state-of-the-art performances on CIFAR-10, CIFAR-100 and SVHN datasets, this new neural network architecture is named DenseNet.

The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation

Merges the ideas of the U-Net and the DenseNet, this new neural network is especially good for huge datasets in image segmentation.

Prototypical Networks for Few-shot Learning

Use a distance metric in the loss to determine to which class does an object belongs to from a few examples.

Complex Numbers & Digital Signal Processing