Show simple item record

dc.contributor.advisorLavine, Barry K.
dc.contributor.authorDing, Tao
dc.date.accessioned2018-06-29T14:32:28Z
dc.date.available2018-06-29T14:32:28Z
dc.date.issued2016-12
dc.identifier.urihttps://hdl.handle.net/11244/300336
dc.description.abstractProfiling of complex samples using spectroscopic techniques continues to be an active area of research with a large and burgeoning literature. The overall goal of profile analysis is to correlate a characteristic fingerprint pattern in a spectrum with the properties of a sample or in biomedical studies with the presence or absence of disease in a patient or animal from which the sample was taken. Fingerprinting experiments of this type often yield profiles containing hundreds of constituents. Multivariate statistical and pattern recognition techniques can be effective methods for the analysis of such complex data. However, the classification of complex samples on the basis of their spectroscopic profiles is complicated by several factors: (1) confounding of the desired group information by experimental variables or other systematic variations in the data, and (2) the presence of noisy data and irrelevant variables that unnecessarily enlarge the data space and the complexity of the classification model developed from the data, an effect that tends to increase both the error rate and to reduce robustness of data-driven predictions. Several interesting projects involving these effects and methods for dealing with them are highlighted in this dissertation. In one study, the identification of N-linked glycan biomarkers in serum samples measured by MALDI-IMSMS and analyzed by pattern recognition techniques to screen a population at risk for esophageal adenocarcinoma (EAC) is discussed. In another study, search prefilters were developed as part of a prototype pattern recognition library search system to facilitate searching of infrared spectra in the Paint Data Query database and to improve discrimination capability for automotive paint comparisons involving the original equipment manufacturer. A genetic algorithm for variable selection to improve classifications was used in both of these studies. The approach taken by the genetic algorithm for pattern recognition relies heavily on graphics for the presentation of results.
dc.formatapplication/pdf
dc.languageen_US
dc.rightsCopyright is held by the author who has granted the Oklahoma State University Library the non-exclusive right to share this material in its institutional repository. Contact Digital Library Services at lib-dls@okstate.edu or 405-744-9161 for the permission policy on the use, reproduction or distribution of this material.
dc.titlePattern recognition studies of complex spectroscopic data sets
dc.contributor.committeeMemberGelder, John
dc.contributor.committeeMemberEl Rassi, Ziad
dc.contributor.committeeMemberKrishnan, Sadagopan
dc.contributor.committeeMemberZhou, Donghua
osu.filenameDing_okstate_0664D_14972.pdf
osu.accesstypeOpen Access
dc.type.genreDissertation
dc.type.materialText
thesis.degree.disciplineChemistry
thesis.degree.grantorOklahoma State University


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record