Impact of Temporal Order Selection on Clustering Intensive Longitudinal Data Based on VAR Models

dc.contributor.advisor: Song, Hairong
dc.contributor.author: Li, Yaqi
dc.contributor.committeeMember: Shi, Dingjing
dc.contributor.committeeMember: Gronlund, Scott
dc.contributor.committeeMember: Lu, Kun
dc.date.accessioned: 2023-07-19T19:42:57Z
dc.date.available: 2023-07-19T19:42:57Z
dc.date.issued: 2023-08
dc.date.manuscript: 2023-06
dc.description.abstract: In real-world research, intensive longitudinal data (ILDs) are typically collected from a group of individuals of interest, which enables researchers to model not only the within-individual dynamics of the studied processes but also the between-individual differences in those within-individual dynamics. Among the statistical techniques proposed for modeling ILDs of multiple individuals, clustering of intensive longitudinal data provides a meaningful way to quantify sample heterogeneity in dynamic processes, assuming that such heterogeneity reflects the distinct nature of the studied processes. The aims of this dissertation are threefold: (a) to introduce a VAR-based clustering technique, (b) to examine the impact of temporal order selection on clustering accuracy and parameter estimation through a simulation study, and (c) to demonstrate the application of the clustering technique through an empirical analysis. Specifically, I investigated the influence of two temporal order selection strategies on the performance of a two-step model-based clustering procedure: (1) using the most complex structure, or highest order (HO), for all individual processes, and (2) using the most parsimonious structure, or lowest order (LO), for all individuals. This procedure extracted dynamic coefficients from vector autoregressive (VAR) models and applied the Gaussian mixture model (GMM) and K-means clustering algorithms to the coefficients for cluster identification. I also examined whether this influence varied across the two clustering algorithms. The simulation study showed that, regardless of the clustering algorithm used, the LO strategy consistently outperformed the HO strategy in recovering the number of clusters, cluster membership, and cluster-specific AR and CR effects. GMM performed better than K-means when the LO strategy was applied; however, the performance of GMM declined as the temporal order increased. Additionally, GMM was more vulnerable to small numbers of participants. The application of the two-step VAR-based method to affect data yielded a meaningful and informative clustering solution, which provided further insight into the use of the model-based clustering approach. Lastly, suggestions and recommendations were offered based on the results of the simulation and empirical analyses. [en_US]
dc.identifier.uri: https://shareok.org/handle/11244/337940
dc.language: en [en_US]
dc.subject: Vector autoregressive model [en_US]
dc.subject: Temporal order [en_US]
dc.subject: Intensive longitudinal data [en_US]
dc.subject: Clustering [en_US]
dc.thesis.degree: Ph.D. [en_US]
dc.title: Impact of Temporal Order Selection on Clustering Intensive Longitudinal Data Based on VAR Models [en_US]
ou.group: Dodge Family College of Arts and Sciences::Department of Psychology [en_US]
shareok.nativefileaccess: restricted [en_US]
shareok.orcid: 0009-0009-8388-5791 [en_US]
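
Illustrative sketch (not taken from the dissertation): the two-step procedure summarized in the abstract, in which per-individual VAR coefficients are estimated first and then clustered, might look roughly like the following. Everything specific here is an assumption made for illustration: the bivariate VAR(1) model without intercept, the two coefficient matrices A1 and A2, the sample sizes, the helper names simulate_var1, fit_var1, and kmeans, and the deterministic farthest-point initialisation. Only K-means is shown; the GMM step and the HO/LO order-selection comparison are omitted.

```python
import random


def simulate_var1(A, T, rng, burn=50):
    """Simulate a bivariate VAR(1) series y_t = A y_{t-1} + e_t, e_t ~ N(0, I)."""
    y = [0.0, 0.0]
    out = []
    for t in range(T + burn):
        y = [A[0][0] * y[0] + A[0][1] * y[1] + rng.gauss(0, 1),
             A[1][0] * y[0] + A[1][1] * y[1] + rng.gauss(0, 1)]
        if t >= burn:  # discard the burn-in period
            out.append(y)
    return out


def fit_var1(series):
    """Step 1: OLS estimate of the 2x2 VAR(1) matrix (no intercept), flattened."""
    x, y = series[:-1], series[1:]  # lagged predictors and outcomes
    s00 = sum(a[0] * a[0] for a in x)
    s01 = sum(a[0] * a[1] for a in x)
    s11 = sum(a[1] * a[1] for a in x)
    det = s00 * s11 - s01 * s01
    coefs = []
    for i in range(2):
        c0 = sum(b[i] * a[0] for a, b in zip(x, y))
        c1 = sum(b[i] * a[1] for a, b in zip(x, y))
        # Row i of A equals [c0, c1] times the inverse of the 2x2 moment matrix.
        coefs += [(c0 * s11 - c1 * s01) / det, (c1 * s00 - c0 * s01) / det]
    return coefs


def sqdist(p, q):
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k=2, iters=20):
    """Step 2: plain K-means with a deterministic farthest-point initialisation."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(sqdist(p, c) for c in centers)))
    labels = [0] * len(points)
    for _ in range(iters):
        labels = [min(range(k), key=lambda j: sqdist(p, centers[j])) for p in points]
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centers


rng = random.Random(2023)
A1 = [[0.6, 0.2], [0.0, 0.5]]     # hypothetical cluster 1 dynamics
A2 = [[-0.4, 0.0], [0.3, -0.3]]   # hypothetical cluster 2 dynamics
true_labels = [0] * 15 + [1] * 15
coefs = [fit_var1(simulate_var1(A1 if g == 0 else A2, 200, rng)) for g in true_labels]
labels, centers = kmeans(coefs, k=2)

# Cluster labels are arbitrary, so score agreement up to label switching.
agree = sum(l == t for l, t in zip(labels, true_labels))
accuracy = max(agree, len(labels) - agree) / len(labels)
print(f"cluster recovery accuracy: {accuracy:.2f}")
```

In practice one would more likely fit the VAR models and the mixture model with established statistical libraries; the hand-written versions above only keep the sketch self-contained.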

Files

Original bundle
Name:
2023_Li_Yaqi_Dissertation.pdf
Size:
1.4 MB
Format:
Adobe Portable Document Format
License bundle
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission