Towards Understanding Deep Generative Models for Continuous Data Augmentation

Mishra, Harsh

doi:10.25417/uic.23661801.v1

Towards Understanding Deep Generative Models for Continuous Data Augmentation

thesis

posted on 2023-05-01, 00:00 authored by Harsh Mishra

Score-based models, a class of Deep Generative models, are considered state-of-the-art for image generation. The key idea behind such models is to gradually add noise to the data and then use a neural network to recover the original data by reversing the noising process. From a computational perspective, existing score-based models can be efficiently trained only if the forward or the corruption process comprises Gaussian noise, i.e., can be computed in closed form. On this, we first propose the use of continuous data augmenting methods, which provide the alternative to Gaussian noise. We also propose a new framework, named Intermediate Generator Optimization (IGO), that explicitly models such Non-Gaussian forward processes, by using the relationship between the corruption process and the layers in a deep Generative model. The main advantage of our framework is that it can be incorporated into any standard autoencoder pipeline for generative tasks. We provide implementation details to apply our framework on benchmark image generation and point-cloud denoising models, as well as the downstream task of Generative PCA.

History

Advisor

Ravi, Sathya N.

Chair

Ravi, Sathya N.

Department

Computer Science

Degree Grantor

University of Illinois at Chicago

Degree Level

Masters

Degree name

MS, Master of Science

Committee Member

Trivedi, Amit R Vamanan, Balajee Parde, Natalie

Submitted date

May 2023

Thesis type

application/pdf

Language

en

Usage metrics

Keywords

Deep Generative models U-Net Score-based models Differential Equations

Licence

In Copyright

Towards Understanding Deep Generative Models for Continuous Data Augmentation

History

Advisor

Chair

Department

Degree Grantor

Degree Level

Degree name

Committee Member

Submitted date

Thesis type

Language

Usage metrics

Categories

Keywords

Licence

Exports