Get the latest Science News and Discoveries

A survey of synthetic data augmentation methods in machine vision - EurekAlert


The standard approach to tackling computer vision problems is to train deep convolutional neural network (CNN) models using large-scale image datasets that are representative of the target task. However, in many scenarios, it is often challenging to obtain sufficient image data for the target task. Data augmentation is a way to mitigate this challenge. A common practice is to explicitly transform existing images in desired ways to create the required volume and variability of training data necessary to achieve good generalization performance. In situations where data for the target domain are not accessible, a viable workaround is to synthesize training data from scratch, i.e., synthetic data augmentation. This paper presents an extensive review of synthetic data augmentation techniques. It covers data synthesis approaches based on realistic 3D graphics modelling, neural style transfer (NST), differential neural rendering, and generative modelling using generative adversarial networks (GANs) and variational autoencoders (VAEs). For each of these classes of methods, researchers focus on the important data generation and augmentation techniques, general scope of application and specific use-cases, as well as existing limitations and possible workarounds. Additionally, they provide a summary of common synthetic datasets for training computer vision models, highlighting the main features, application domains and supported tasks. Finally, they discuss the effectiveness of synthetic data augmentation methods. Since this is the first paper to explore synthetic data augmentation methods in great detail, researchers are hoping to equip readers with the necessary background information and in-depth knowledge of existing methods and their attendant issues.

None

Get the Android app

Or read this on Eureka Alert

Read more on:

Photo of EurekAlert

EurekAlert

Photo of survey

survey

Photo of machine vision

machine vision

Related news:

News photo

U of T researchers develop deep-learning model that outperforms Google AI system to predict peptide structures - EurekAlert

News photo

The future of metals research with artificial intelligence - EurekAlert

News photo

Innovative UAV and deep learning method enhances maize tassel detection accuracy - EurekAlert