Impact of Temporal Order Selection on Clustering Intensive Longitudinal Data Based on VAR Models

Li, Yaqi

Impact of Temporal Order Selection on Clustering Intensive Longitudinal Data Based on VAR Models

dc.contributor.advisor	Song, Hairong
dc.contributor.author	Li, Yaqi
dc.contributor.committeeMember	Shi, Dingjing
dc.contributor.committeeMember	Gronlund, Scott
dc.contributor.committeeMember	Lu, Kun
dc.date.accessioned	2023-07-19T19:42:57Z
dc.date.available	2023-07-19T19:42:57Z
dc.date.issued	2023-08
dc.date.manuscript	2023-06
dc.description.abstract	In real-world research, intensive longitudinal data (ILDs) are typically collected from a group of individuals of interest, which enables researchers to model not only the within-individual dynamics of the studied processes but also the between-individual differences on the within-individual dynamics. Among the statistical techniques proposed for modeling ILDs of multiple individuals, clustering of intensive longitudinal data provides a meaningful way to quantify sample heterogeneity in dynamic processes, assuming that such heterogeneity reflects the distinct nature of the studied processes. The aims of this dissertation are threefold: (a) to introduce a VAR-based clustering technique, (b) to examine the impact of temporal order selection on clustering accuracy and parameter estimation by a simulation study, and (c) to demonstrate the application of the clustering technique through an empirical analysis. Specially, I investigated the influence of two temporal order selection strategies: (1) using the most complex structure or highest order (HO) for all individual processes, and (2) using the most parsimonious structure or the lowest order (LO) for all individuals on the performance of two-step model-based clustering procedure. This procedure extracted dynamic coefficients from vector autoregressive (VAR) models and employed the Gaussian mixture model (GMM) and K-means clustering algorithms on the coefficients for cluster identification. Additionally, I also examined whether the influence varied across two clustering algorithms. The simulation study showed that, regardless of the clustering algorithms used, LO strategy consistently outperformed HO strategy in terms of recovering the number of clusters, cluster membership, and cluster-specific AR and CR effects. GMM performed better than K-means when LO strategy was applied; however, the performance of GMM decreased while the temporal orders increased. Additionally, GMM showed more vulnerability with smaller numbers of participants. The application of the two-step VAR-based method to affect data yielded a meaningful and informative clustering solution, which provided further insights of the uses of the model-based clustering approach Lastly, suggestions and recommendations were offered based on the results of the simulation and empirical analyses.	en_US
dc.identifier.uri	https://shareok.org/handle/11244/337940
dc.language	en	en_US
dc.subject	Vector autoregressive model	en_US
dc.subject	Temporal order	en_US
dc.subject	Intensive longitudinal data	en_US
dc.subject	Clustering	en_US
dc.thesis.degree	Ph.D.	en_US
dc.title	Impact of Temporal Order Selection on Clustering Intensive Longitudinal Data Based on VAR Models	en_US
ou.group	Dodge Family College of Arts and Sciences::Department of Psychology	en_US
shareok.nativefileaccess	restricted	en_US
shareok.orcid	0009-0009-8388-5791	en_US

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

OU - Dissertations

SHAREOK^TM

advancing Oklahoma scholarship, research and institutional memory

Impact of Temporal Order Selection on Clustering Intensive Longitudinal Data Based on VAR Models

Files

License bundle

Collections