Data Augmentation for Regression

Question

I want to use deep learning for regression, but I do not have many training samples. In image processing, new samples are generated from the original data through transformations such as rotation, which increases the size of the data set. Is there an algorithm that generates more data from the current samples, i.e. data augmentation for regression?

Michael Brundage · Answer 1 · 2018-05-19T20:01:10.577

score 3

Yes, you can perturb your data (and targets) in ways that you want your model to be robust against, for example by adding small amounts of noise (possibly Gaussian), injecting synthetic anomalies, or creating meaningful aggregates. Some models also improve simply from seeing the same training data more than once, reshuffled on each pass (“epochs”).
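As a rough illustration of the noise idea, here is a minimal NumPy sketch. The function name `jitter_augment`, its parameters, and the default noise levels are my own assumptions for the example, not a standard API; sensible noise scales depend entirely on your data.

```python
import numpy as np

def jitter_augment(X, y, n_copies=3, x_noise=0.05, y_noise=0.0, seed=0):
    """Return the original samples plus n_copies noisy copies of each one.

    x_noise and y_noise are noise standard deviations given as a fraction
    of each column's standard deviation (illustrative defaults, an assumption).
    """
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    x_scale = x_noise * X.std(axis=0)  # per-feature noise level
    y_scale = y_noise * y.std()        # target noise level (0 keeps targets exact)

    X_parts, y_parts = [X], [y]
    for _ in range(n_copies):
        X_parts.append(X + rng.normal(size=X.shape) * x_scale)
        y_parts.append(y + rng.normal(size=y.shape) * y_scale)
    return np.concatenate(X_parts), np.concatenate(y_parts)

# Toy data (made up for the example): 20 samples, 3 features, linear target.
rng = np.random.default_rng(42)
X = rng.normal(size=(20, 3))
y = X @ np.array([1.0, -2.0, 0.5])

X_aug, y_aug = jitter_augment(X, y, n_copies=3)
print(X_aug.shape, y_aug.shape)  # (80, 3) (80,)
```

Keeping `y_noise=0.0` assumes that a small change in the inputs should not change the target; whether that holds is a modelling decision you have to make for your own problem.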

edited May 19 '18 at 20:01

answered May 19 '18 at 14:50


score 0 · Answer 2 · answered Oct 04 '17 at 12:29

You can try something like SMOTE and check whether the newly generated data fits your requirements. If your data follows a known statistical model, you can fit an appropriate parametric model and sample new data from it. You can also try nonparametric density estimation methods such as Parzen windows. All of this depends on the statistical fit of the image data you have processed so far.
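To make the Parzen-window suggestion concrete: drawing from a Gaussian kernel density estimate amounts to picking a training point at random and adding Gaussian noise with the kernel width. The sketch below does this on the joint (features, target) rows. The name `kde_sample`, the diagonal bandwidth, and the Scott's-rule default are assumptions chosen for the example, not the only reasonable choices.

```python
import numpy as np

def kde_sample(X, y, n_new, bandwidth=None, seed=0):
    """Draw n_new synthetic (x, y) pairs from a Gaussian Parzen-window
    estimate fitted on the joint rows [x, y]."""
    rng = np.random.default_rng(seed)
    Z = np.column_stack([X, y]).astype(float)  # joint (features, target) points
    n, d = Z.shape
    if bandwidth is None:
        bandwidth = n ** (-1.0 / (d + 4))      # Scott's rule factor (assumed default)
    h = bandwidth * Z.std(axis=0)              # per-column kernel width (diagonal kernel)

    centers = Z[rng.integers(0, n, size=n_new)]         # pick one kernel per new sample
    Z_new = centers + rng.normal(size=(n_new, d)) * h   # sample from that kernel
    return Z_new[:, :-1], Z_new[:, -1]

# Toy data (made up for the example).
rng = np.random.default_rng(1)
X = rng.normal(size=(20, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=20)

X_new, y_new = kde_sample(X, y, n_new=50)
print(X_new.shape, y_new.shape)  # (50, 3) (50,)
```

A smaller `bandwidth` keeps the synthetic points closer to the originals; a larger one smooths more but can blur the relationship between inputs and target.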

Other methods similar to SMOTE include ADASYN and Borderline-SMOTE.

Note: Apply SMOTE to the already processed image data, such as the training data you already have, not directly to the images themselves.
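SMOTE itself was designed for oversampling a minority class, so for regression you would need an adaptation. The sketch below is one possible SMOTE-style variant, not the original algorithm: it interpolates between a sample and one of its nearest neighbours in feature space and, as an assumption on my part, interpolates the target with the same weight. The name `interpolate_augment` and its parameters are hypothetical.

```python
import numpy as np

def interpolate_augment(X, y, n_new, k=5, seed=0):
    """SMOTE-style interpolation adapted to regression (an assumed variant):
    each new point lies on the segment between a sample and one of its
    k nearest neighbours, and the target is interpolated the same way."""
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n = len(X)
    k = min(k, n - 1)

    # Pairwise squared Euclidean distances; fine for small data sets.
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
    np.fill_diagonal(d2, np.inf)                # a point is not its own neighbour
    neighbours = np.argsort(d2, axis=1)[:, :k]  # indices of the k nearest points

    i = rng.integers(0, n, size=n_new)                 # base samples
    j = neighbours[i, rng.integers(0, k, size=n_new)]  # one random neighbour each
    lam = rng.random(n_new)                            # interpolation weights in [0, 1)
    X_new = X[i] + lam[:, None] * (X[j] - X[i])
    y_new = y[i] + lam * (y[j] - y[i])
    return X_new, y_new

# Toy feature vectors (made up), standing in for already processed data.
rng = np.random.default_rng(7)
w = np.array([1.0, -2.0, 0.5])
X = rng.normal(size=(30, 3))
y = X @ w

X_new, y_new = interpolate_augment(X, y, n_new=40)
print(X_new.shape, y_new.shape)  # (40, 3) (40,)
```

Interpolating the target linearly is only reasonable when the target varies smoothly between neighbouring samples, so it is worth checking the synthetic points against held-out real data.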
