Show simple item record

dc.contributor.advisorSan, Omer
dc.contributor.authorVaddireddy, Harsha Vardhan Reddy
dc.date.accessioned2020-09-09T20:58:52Z
dc.date.available2020-09-09T20:58:52Z
dc.date.issued2020-05
dc.identifier.urihttps://hdl.handle.net/11244/325468
dc.description.abstractExtracting governing equations from data can be viewed as reverse engineering of Nature- using data to identify the physical laws/models. This approach is crucial for fields where data is abundant ( such as geophysical flows, finance, and neuroscience) but the physical laws based on the first principles are not available. In recent years, the use of machine learning (ML) methods complemented the need for formulating mathematical models through the application of data analysis algorithms that allow accurate estimation of observed dynamics by learning automatically from the given observations. The neural networks and symbolic regression (SR) based approaches are the most popular ML frameworks used to learn the underlying physical process by only the observing data. While neural network approaches have shown great promise, its black-box nature makes it difficult to interpret the learned models. On the other hand, symbolic regression algorithms are capable of learning/finding an analytically tractable function in symbolic form. Hence to address the functional expressibility, a key limitation of the black-box machine learning methods, this study has explored the use of symbolic regression approaches for identifying relations and operators that accurately represent the underlying physical processes. This study demonstrates the use of an evolutionary algorithm called gene expression programming (GEP) and a sparse optimization algorithm called sequential threshold ridge regression (STRidge) in discovering physical models. The effectiveness of these algorithms is demonstrated on four different applications: (1) partial differential equation (PDE) discovery, (2) truncation error analysis, (3) hidden physics discovery and (4 ) discovering subgrid-scale closure models. This study shows the GEP and STRidge algorithms are able to distill various linear/nonlinear PDEs, truncation error terms and unknown source terms of 1D and 2D PDEs. Furthermore, the classical Smagorinsky model is identified for subgrid-scale (SGS) closure from an array of tailored features in solving the 2D Kraichnan turbulence problem. Our results demonstrate the huge potential of these techniques in distilling complex nonlinear physics models from only observing the data. Furthermore, this study reveals the importance of feature selection/feature engineering and embedding the prior knowledge about the unknown dynamical system in terms of invariances for identifying models.
dc.formatapplication/pdf
dc.languageen_US
dc.rightsCopyright is held by the author who has granted the Oklahoma State University Library the non-exclusive right to share this material in its institutional repository. Contact Digital Library Services at lib-dls@okstate.edu or 405-744-9161 for the permission policy on the use, reproduction or distribution of this material.
dc.titleIdentification of physical processes via data driven methods
dc.contributor.committeeMemberSanthanakrishnan, Arvind
dc.contributor.committeeMemberKara, Kursat
osu.filenameVaddireddy_okstate_0664M_16664.pdf
osu.accesstypeOpen Access
dc.type.genreThesis
dc.type.materialText
dc.subject.keywordscompressive sensing
dc.subject.keywordsgene expression programming
dc.subject.keywordsmodel discovery
dc.subject.keywordsneural networks
dc.subject.keywordssparse regression
dc.subject.keywordssymbolic regression
thesis.degree.disciplineMechanical and Aerospace Engineering
thesis.degree.grantorOklahoma State University


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record