5 Datasets


Showing 1 - 5 of 6

MNIST handwritten digits

MNIST: handwritten digits: The most commonly used sanity check. Dataset of 25x25, centered, B&W handwritten digits. It is an easy taskjust because somethi…

natural-image

WikiText

The WikiText language modeling dataset is a collection of over 100 million tokens extracted from the set of verified Good and Featured articles on Wikiped…

language modeling, wiki

Netflix Prize

Netflix released an anonymized version of their movie rating dataset; it consists of 100 million ratings, done by 480,000 users who have rated between 1 a…

movie, ranking

Uber 2B trip data

Uber Movement provides anonymized data from over two billion trips to help urban planning around the world. You need to sign up to download this data.

trips, uber, urban planning

Open Source Biometric Recogni…

A communal biometrics framework supporting the development of open algorithms and reproducible evaluations. OpenBR is a framework for investigating new m…

age estimation, biometric, face detection, gender estimation