Data augmentation

Technique to artificially increase the size of the training dataset by applying transformations to existing data

By exposing the model to more variations, it ensures robustness and prevents overfitting by learning to recognise patterns and features that are supposed to be invariant to certain changes such as translations, rotations and scaling.

They can also be used to correct imbalance in distribution of classes by artificially increasing the number of examples of the under-represented classes. This helps when the amount of available training data is limited.

Transformation examples