Introducing Public Datasets

One of the most painstaking components of getting started with computer vision is getting access to clean, labeled data. For example, when the Roboflow team built BoardBoss, we painstakingly collected hundreds of Boggle board images from various devices (iPhone 7, iPhone X, iPhone 11…) and then labeled each individual letter tile (at least 16 per image!). Many lessons of that problem are now Roboflow features.

Now, you may not be building a Boggle solver app as we were. But you may be simply looking for a place to start. And an object detection problem like Boggle is reminiscent of an object detection version of MNIST.

So, we’re releasing our Boggle images — all 357 images and 7110 labels across those images — under public license. Free to use, but we welcome attribution! 😉

Welcome to public.roboflow.ai

In fact, we’re releasing many datasets, and we encourage you to do the same, when applicable.

https://public.roboflow.ai is a space for anyone to find and share datasets for computer vision. Each dataset includes details about its contents (like how many images and when was it uploaded), the license for reuse, and any Roboflow exports the original author published with the dataset. You can freely download images and annotations in any format: VOC XML, COCO JSON, YOLOv3 flat text files, even TFRecords.

We're launching with three board game-related datasets.

License permitting, anyone is free to fork a given dataset to their own account and apply different preprocess/augmentation space. If you have a dataset you’d like to share with the public, let us know at help@roboflow.ai

We’ll be adding our own datasets and datasets from the community across a wide array of domains: animals, board games, self driving cars, medicine, etc. If you’d like to be aware of these, subscribe to our newsletter.

Show Comments