Published April 16, 2021 | Version v1
Dataset Open

Image classification in Galaxy with MNIST handwritten digits dataset

Creators

  • 1. Penn State University

Description

Credit: Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. "Gradient-based learning applied to document recognition." Proceedings of the IEEE, 86(11):2278-2324, November 1998

This is a subset of MNIST handwritten digits dataset (http://yann.lecun.com/exdb/mnist/). Training data of composed of 12,000 images of digits 0 to 9. Test data is composed of 6,000 images of digits 0 to 9 (Original dataset has 60,000 training and 10,000 testing images. We are using a subset for a Galaxy tutorial, so the training is not too computationally intensive). Images are grayscale and 28 by 28 pixels. Each pixel has a value between 0 and 255 (0 for color black, 255 for color white, and all other values for different shades of gray).   

Notes

Credit: Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. "Gradient-based learning applied to document recognition." Proceedings of the IEEE, 86(11):2278-2324, November 1998 http://yann.lecun.com/exdb/mnist/

Files

Files (131.6 MB)

Name Size Download all
md5:12f731f0ac1b3197363ae04c555ee704
43.8 MB Download
md5:946db18725bfdeee7f079706635b5595
87.8 MB Download
md5:ffb3f9bb2640bd078691c51a03416ed2
12.0 kB Download
md5:52e85ce2d279b0dec03f4168959fcb3f
24.0 kB Download