Introducing Bounding Box Level Augmentations

Having training data that matches the diversity of your task is paramount to the success of your models.

At Roboflow, we’re committed to providing you with state-of-the-art techniques that can improve your deep learning model’s performance -- without needing to collect anymore data or even re-label images. We’re focused on assuring the data and annotations you’ve collected are of the highest quality so that you can focus on the important parts of development, like fine tuning your computer vision models or determining how a model should be used for your business context.

Thus, we’re introducing a new augmentation technique unavailable anywhere else: bounding box level augmentation.

What is Bounding Box Level Augmentation

Image augmentation is the act of increasing your dataset size through manipulating existing training data. Fundamentally, it’s creating real-looking fake data. Image augmentation helps a model generalize better to a wide array of contexts. (More about that in our other blog posts.)

We’ve seen users leverage image augmentation to create production-ready models for industrial tasks with just eight (8!) original source images.

Bounding box augmentation takes this idea and goes one step further: what if we altered the content of an image, but only the content within a given bounding box? For example, vary the brightness or darkness of an object relative to its background. Or, perhaps blur an object relative to its background for tasks that are often capturing rapidly moving objects.

Bounding box level augmentation generating new training data by only altering the content of a source image’s bounding boxes. In doing so, developers have greater control over creating training data that is more suitable to their problem’s conditions.

An original image.
Bounding box only augmentation that has randomly horizontally flipped and altered the brightness of objects. Note that each individual bounding box receives its own settings.

The Genesis of Bounding Box Level Augmentation

We can’t take credit for being the first to consider bounding box only augmentation. A 2019 paper from Google researchers introduces the idea of using bounding box only augmentation to create optimal data for their models. In this paper, researchers showed bounding box only modifications generated systemic improvements, especially for models that were small datasets.

Via the researchers' paper.

Using Bounding Box Level Augmentation

We envision many use cases for more specialized augmentation. Consider modifying colors of only objects in a OCR image, introducing blur for only the moving objects like pets or cars, rotating objects like board game pieces, and flipping the orientation of objects to produce a mirrored effect like those present in most camera situations.

Getting started is easy. Simply toggle on any options of interest, and generate a new dataset.

Here's the settings used for the chess images above.

Our advanced augmentations (like Bounding box level augmentation) are reserved for paid accounts in Roboflow. Get in touch if you’d like access. Free accounts can access all the standard augmentations though, so give it a try!


Want to be the first to know about new content like this? Subscribe.

Roboflow accelerates your computer vision workflow through automated annotation quality assurance, universal annotation format conversion (like PASCAL VOC XML to COCO JSON), team sharing and versioning, and exports directly to file format, like TFRecords. It's free for datasets up to 1GB.Share

Show Comments