You are here

Unpaired Style Transfer Conditional Generative Adversarial Network for Scanned Document Generation

Description: 
Neural networks are a powerful machine learning tool, especially when trained on a large dataset of relevant high-quality data. Generative adversarial networks, image super resolution and most other image manipulation neural networks require a dataset of images and matching target images for training. Collecting and compiling that data can be time consuming and expensive. This work explores an approach for building a dataset of paired document images with a matching scanned version of each document without physical printers or scanners. A dataset of these document image pairs could be used to train a generative adversarial network or image super resolution neural network to convert a scanned document into a pristine document free of artifacts. It could also be used in optical character recognition of scanned documents to improve understanding of documents with degraded quality. Generating a dataset like this without mechanical hardware saves time and materials and has the potential to build similar paired image datasets for other applications. The proposed approach centers on conditional generative adversarial networks to generate the paired dataset from unpaired document images. This work explores StyleGAN2, CycleGAN, CUT, Pix2PixHD, SPADE and SEAN. I find that the base version of each model is currently insufficient for this task.
Record Format: 
application/pdf
2022-07-07T07:00:00Z
Subject: 
Neural networks (Computer science)
Machine learning
Image processing
Artificial Intelligence and Robotics
Computer Sciences
Type: 
text
Raw Url: 
http://pdxscholar.library.pdx.edu/do/oai/?metadataPrefix=&verb=GetRecord&identifier=oai:pdxscholar.library.pdx.edu:open_access_etds-7263
Source: 
Dissertations and Theses
Repository Record Id: 
oai:pdxscholar.library.pdx.edu:open_access_etds-7263
SetSpec: 
publication:compsci
publication:students
publication:compsci_grad
publication:communities
publication:mcecs
publication:etds
publication:open_access_etds
Record Title: 
Unpaired Style Transfer Conditional Generative Adversarial Network for Scanned Document Generation
https://pdxscholar.library.pdx.edu/open_access_etds/6197
info:doi/10.15760/etd.8042
https://pdxscholar.library.pdx.edu/context/open_access_etds/article/7263/viewcontent/Hawbaker_psu_0180E_13041.pdf
Database: 
Resource OE Format: 
randomness