Show simple item record

dc.contributor.advisorMcGovern, Amy
dc.contributor.authorBurke, Amanda
dc.date.accessioned2024-07-18T13:18:08Z
dc.date.available2024-07-18T13:18:08Z
dc.date.issued2024-08-01
dc.identifier.urihttps://hdl.handle.net/11244/340483
dc.description.abstractThis dissertation emphasizes the contribution of expert knowledge in the development and assessment of machine learning (ML) models within the Earth sciences, specifically Meteorology. Despite the common focus on achieving high skill scores, conventional metrics may inadequately capture the nuanced patterns learned by these models. This dissertation underscores the importance of incorporating end-user feedback, demonstrating that with this feedback, tailored yet flexible ML models can effectively learn specific meteorological patterns while remaining applicable to broader contexts. The first focus of ML development is in identification of above-anvil cirrus plumes (plumes). In satellite imagery, plumes serve as critical indicators of impending severe weather, often appearing 30 minutes before reported events. Their real-time identification is particularly valuable in radar-deficient regions, where they offer insights into the convective environment. However, manually labeling plumes is labor-intensive and requires specialized expertise. To streamline this process, I develop a deep learning (DL) model trained on expert-annotated data to create skillful pixel-level plume classifications using remote sensing data that is available globally. This approach was tested on combinations of spectral data across the contiguous United States, showing above-average object correspondence with human-derived labels. Another focus of this dissertation is leveraging ML models for severe hail prediction on localized scales. Existing ML models have demonstrated proficiency across the United States during spring and summer but have struggled to capture the nuanced spatio-temporal dynamics of thunderstorm development in local contexts. Addressing this gap, I develop a novel localization technique that prioritizes storm object weighting without imposing substantial additional burdens on model developers. Results indicate that localized weighting of storm objects matches or outperforms existing ML approaches, while improving the physical relevance of the top predictors in the trained ML model. Lastly, leveraging extensive satellite data archives, this dissertation addresses the challenge of efficiently creating training sets that accurately represent large-scale Earth science datasets. This work explores clustering approaches to capture regional nuances within a vast dataset of remote sensing data, focusing on a straightforward use case with an established baseline: land cover classification. By using surface reflectance bands in a random forest (RF) model, I compare classification outcomes between randomly sampled datasets of varying sizes and datasets created using clustering. The clustering approach produced a training sample that was 200% smaller than the largest sample studied, yet it achieved a 77% increase in F1 score. This suggests that clustering may offer an effective alternative (or addition) to increasing computing power when modeling "Big Data".en_US
dc.languageen_USen_US
dc.rightsAttribution-NoDerivatives 4.0 International*
dc.rights.urihttps://creativecommons.org/licenses/by-nd/4.0/*
dc.subjectMeteorologyen_US
dc.subjectAIen_US
dc.subjectMachine Learningen_US
dc.titleExpert-Guided Machine Learning for Meteorological Predictions Across Spatio-Temporal Scalesen_US
dc.contributor.committeeMemberHomeyer, Cameron
dc.contributor.committeeMemberMartin, Elinor
dc.contributor.committeeMemberRazzaghi, Talayeh
dc.date.manuscript2024-07-15
dc.thesis.degreePh.D.en_US
ou.groupCollege of Atmospheric and Geographic Sciences::School of Meteorologyen_US


Files in this item

Thumbnail
Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record


Attribution-NoDerivatives 4.0 International
Except where otherwise noted, this item's license is described as Attribution-NoDerivatives 4.0 International