Authors
Karen Simonyan, Andrew Zisserman
University of Oxford
Abstract
In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3×3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision.
Contribution
- Our main contribution is a rigorous evaluation of networks of increasing depth, which shows that a significant improvement on the prior-art configurations can be achieved by increasing the depth to 16-19 weight layers, substantially deeper than networks used previously. To keep the number of parameters manageable in such very deep networks, all convolutional layers use very small 3×3 filters with the convolution stride fixed to 1 (a minimal architecture sketch is given below).
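The sketch below illustrates what "16 weight layers of stacked 3×3 convolutions" looks like in practice. It follows the 16-layer configuration described in the paper (13 convolutional layers plus 3 fully connected layers, with 2×2 max-pooling between blocks), but the framework choice (PyTorch) and the names `CFG_D`, `make_features`, and `VGG16Sketch` are illustrative assumptions, not the authors' released implementation.

```python
import torch
import torch.nn as nn

# 16-layer configuration: numbers are output channels of 3x3 convolutions,
# "M" marks a 2x2 max-pooling layer. (Channel widths follow the paper;
# the variable name CFG_D is an illustrative choice.)
CFG_D = [64, 64, "M", 128, 128, "M", 256, 256, 256, "M",
         512, 512, 512, "M", 512, 512, 512, "M"]

def make_features(cfg):
    """Stack 3x3 convolutions (stride 1, padding 1) + ReLU, with max-pooling."""
    layers, in_ch = [], 3  # RGB input
    for v in cfg:
        if v == "M":
            layers.append(nn.MaxPool2d(kernel_size=2, stride=2))
        else:
            layers.append(nn.Conv2d(in_ch, v, kernel_size=3, stride=1, padding=1))
            layers.append(nn.ReLU(inplace=True))
            in_ch = v
    return nn.Sequential(*layers)

class VGG16Sketch(nn.Module):
    """Minimal sketch of the 16-weight-layer network: 13 conv + 3 FC layers."""
    def __init__(self, num_classes=1000):
        super().__init__()
        self.features = make_features(CFG_D)
        self.classifier = nn.Sequential(
            nn.Linear(512 * 7 * 7, 4096), nn.ReLU(inplace=True), nn.Dropout(0.5),
            nn.Linear(4096, 4096), nn.ReLU(inplace=True), nn.Dropout(0.5),
            nn.Linear(4096, num_classes),
        )

    def forward(self, x):                 # x: (N, 3, 224, 224)
        x = self.features(x)              # -> (N, 512, 7, 7) after five poolings
        x = torch.flatten(x, 1)
        return self.classifier(x)

if __name__ == "__main__":
    model = VGG16Sketch()
    out = model(torch.randn(1, 3, 224, 224))
    print(out.shape)  # torch.Size([1, 1000])
```

The design rationale for the small filters is that a stack of two 3×3 convolutions has the same effective receptive field as a single 5×5 convolution (and three stacked 3×3 layers match a 7×7), while using fewer parameters and adding extra non-linearities, which is what makes the 16-19 layer depth feasible.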