Show simple item record

dc.contributor.advisorLiu, Chenang
dc.contributor.authorSlater, Kent
dc.contributor.authorLi, Yuxuan
dc.contributor.authorShan, Yongwei
dc.date.accessioned2023-04-11T14:30:03Z
dc.date.available2023-04-11T14:30:03Z
dc.date.issued2023-04-18
dc.identifieroksd_URS_2023_slater
dc.identifier.citationSlater, K., Li, Y., Shan, Y., & Liu, C. (2023, April 18). A Generative Adversarial Network (GAN)-assisted data quality monitoring approach for out-of-distribution detection of high dimensional data. Poster session presented at the Oklahoma State University Undergraduate Research Symposium, Stillwater, OK.
dc.identifier.urihttps://hdl.handle.net/11244/337350
dc.description.abstractData quality monitoring plays a critical role in various real-world engineering system inspection problems. Anomalous or invalid inspection data commonly exist due to computer/human recording errors, sensor faults, etc. Thus, an efficient tool to detect data anomalies is critically needed. However, it is challenging due to high dimensionality, unknown underlying distribution, insufficient sample size, and high level of noise. To address these challenges, an effective approach that can learn the underlying distribution of normal data with anomaly detection rules was developed. In this approach, the Generative Adversarial Network (GAN) was employed to identify the underlying distribution of normal data and filter out noise. After using the trained GAN to generate points of the learned distribution, a k-nearest neighbor-based approach is used to define the anomaly detection rules. In the proposed approach, the normal records are used to train the GAN and establish the control rule. Specifically, after training the GAN using the normal records, the pairwise distances over all the GAN-generated data points are calculated, and the k-nearest neighbors for every single data point are accordingly determined. Then, the average distance from each single data point to its k-nearest neighbors is calculated as the statistics to indicate the data quality and establish a control chart. When a new record comes in, its similarity to the GAN-generated distribution can be evaluated by the established control chart to identify whether the new record is anomalous or not.
dc.formatapplication/pdf
dc.publisherOklahoma State University
dc.rightsIn the Oklahoma State University Library's institutional repository this paper is made available through the open access principles and the terms of agreement/consent between the author(s) and the publisher. The permission policy on the use, reproduction or distribution of the article falls under fair use for educational, scholarship, and research purposes. Contact Digital Resources and Discovery Services at lib-dls@okstate.edu or 405-744-9161 for further information.
dc.titleGenerative Adversarial Network (GAN)-assisted data quality monitoring approach for out-of-distribution detection of high dimensional data
osu.filenameoksd_URS_2023_slater.pdf
dc.description.departmentIndustrial Engineering and Management
dc.type.genrePoster
dc.type.materialText
dc.type.materialImage


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record