Impact of inferred comorbidity networks on health outcomes
Abstract
High dimensionality in Big Data can be modeled using network approach. The traditional networks (e.g. online social network) are explicit and easily observed. However, there are certain networks that are implicit and exist by virtue of some underlying collective behavior. Our focus is on these implicit networks, which can be inferred from the secondary data using statistical modeling. An example of such a network is a comorbidity network. In a comorbidity network, diseases form connections based on their co-occurrences in patients. We use data on the health history of 24.7 million patients recorded in US hospitals (2000-2016) to infer comorbidity networks. Since most statistical models depend upon sample size, it is important to study how sample size affects the structure of an implicit network. We study the impact of sample size on comorbidity networks developed using Pearson's Correlation Coefficient (PCC) and Salton Cosine Index (SCI). We present a comparative analysis and show that a network developed using SCI is robust to sample size as compared to the PCC. Our first study of comorbidity networks employs descriptive analytics. We investigate how comorbidity networks are different across population groups. We compare networks based on gender, race and insurance types. Our analysis at the comorbidity level presents health disparities across population groups. These disparities across population groups are considered to study the impact of comorbidity network on patients' hospital length of stay in second study. We develop an explanatory and predictive model to estimate length of stay using features extracted from comorbidity networks and compare with the extant models. We show that our model outperforms the existing models. Finally, we study how an implicit network can help theorize certain phenomenon related to it. With respect to the comorbidity network, we theorize clique property of a network as trap state. The trap state is hypothesized to be related to mortality risk of a patient. We identify eighteen such cliques in a comorbidity network. This dissertation contributes to network science, analytics and healthcare literature but the theory, models, algorithms, and processes developed are generalizable to other inferred networks.
Collections
- OSU Dissertations [11222]