Not logged in. Login | Signup

ImageNet Large Scale Visual Recognition Challenge 2013 (ILSVRC2013)

Back to Main page  

CitationNEW

When using the dataset, please cite:
    Olga Russakovsky*, Jia Deng*, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg and Li Fei-Fei. (* = equal contribution) ImageNet Large Scale Visual Recognition Challenge. arXiv:1409.0575, 2014. paper | bibtex

Development Kit

    Please be sure to consult the included readme.txt file for competition details. Additionally, the development kit includes

    • Overview and statistics of the data.
    • Meta data for the competition categories.
    • Matlab routines for evaluating submissions.

    Development kit (updated Aug 24, 2013)

Images

Only images from the DET dataset should be used for the detection competition and only images from the CLS-LOC dataset should be used for classification and classification with localization competitions.

DET dataset

There are a total of 395,909 images for training. The number of positive images for each synset (category) ranges from 417 to 66,911. The number of negative images ranges from 185 to 10,073 per synset. There are 20121 validation images, and 40,152 test images. All images are in JPEG format.

    Training images. 40GB. MD5: 516b61e845794133b7e049d59f52a65a

      There is significant overlap between the CLS-LOC training images below (also used for the ILSVRC2012 challenge) and the DET training images. Those who have already downloaded the CLS-LOC data can download just the new DET images here. Please carefully consult the readme.txt in the development kit for the list of images which may be used for the detection challenge.

      Training images not in CLS-LOC data. 14GB. MD5: b093799ab4d9be34662a83a58cd36919

CLS-LOC dataset

This dataset is unchanged from ILSVRC2012. There are a total of 1,281,167 images for training. The number of images for each synset (category) ranges from 732 to 1300. There are 50,000 validation images, with 50 images per synset. There are 100,000 test images. All images are in JPEG format.

Terms of use: by downloading the image data from the above URLs, you agree to the following terms:

  1. You will use the data only for non-commercial research and educational purposes.
  2. You will NOT distribute the above URL(s).
  3. Stanford University and Princeton University make no representations or warranties regarding the data, including but not limited to warranties of non-infringement or fitness for a particular purpose.
  4. You accept full responsibility for your use of the data and shall defend and indemnify Stanford University and Princeton University, including their employees, officers and agents, against any and all claims arising from your use of the data, including but not limited to your use of any copies of copyrighted images that you may create from the data.

Bounding Boxes

Only annotations from the DET dataset should be used for the detection competition and only annotations from the CLS-LOC dataset should be used for classification and classification with localization competitions.

DET dataset

CLS-LOC dataset