Next Article in Journal
Signs of Dehydration after Hip Fracture Surgery: An Observational Descriptive Study
Next Article in Special Issue
The Combined Effects of Sports Smart Bracelet and Multi-Component Exercise Program on Exercise Motivation among the Elderly in Macau
Previous Article in Journal
Expression Levels of Inflammatory and Oxidative Stress-Related Genes in Skin Biopsies and Their Association with Pityriasis Alba
Previous Article in Special Issue
Combination of a Self-Regulation Module and Mobile Application to Enhance Treatment Outcome for Patients with Acne
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Predicting Old-age Mortality Using Principal Component Analysis: Results from a National Panel Survey in Korea

1
Department of Policy Analysis and Management, College of Human Ecology, Cornell University, Ithaca, NY 14853, USA
2
Department of Preventive Medicine, School of Medicine, Ajou University, Suwon, Gyeonggi-do 16499, Korea
3
Department of Health Administration, College of Health Sciences, Yonsei University, Wonju, Gwangwondo 26493, Korea
4
Department of Health Administration, College of Health Science, Dankook University, Cheonan-si, Chungcheongnam-do 31116, Korea
*
Author to whom correspondence should be addressed.
Submission received: 29 May 2020 / Revised: 14 July 2020 / Accepted: 15 July 2020 / Published: 18 July 2020
(This article belongs to the Special Issue Artificial Intelligence Research in Healthcare)

Abstract

:
Background and Objectives: This study aimed to group diseases classified by the International Classification of Diseases using principal component analysis, and discuss a systematic approach to reducing the preventable death rate from a perspective of public health. Materials and Methods: Using a 10-year follow-up analysis of the Korean Longitudinal Study of Aging (KLoSA) data, this study obtained de-identified data including participants’ data of community-dwelling individuals aged ≥45 years from 2006 to 2016. Participants were randomly selected using a multistage, stratified probability sampling based on geographical area and housing type. We excluded 37 participants with missing information at baseline and included 10,217 study participants. This study used the principal component analysis to extract comorbidity patterns, and chi-square test and Cox proportional hazards models for analyzing the association between the factors of interest. Results: Principal component 1 (diabetes, heart disease, and hypertension) was associated with an increased hazard ratio (HR) of 1.079 (95% confidence interval (CI) 1.031–1.129, p = 0.001). Principal component 3 (psychiatric and cerebrovascular diseases) was related to an increased HR of 1.134 (95% CI 1.094–1.175, p < 0.0001). Moreover, principal component 4 was associated with a high HR of 1.172 (95% CI 1.130–1.215, p < 0.0001). However, among participants aged between 45 and 64 years, principal component 4 showed a meaningfully increased HR of 1.262 (95% CI 1.184–1.346, p < 0.001). In this study, among the four principal components, three were statistically associated with increased mortality. Conclusions: The principal component analysis for predicting mortality may become a useful tool, and artificial intelligence (AI) will improve a value-based healthcare strategy, along with developing a clinical decision support model.

1. Introduction

Aging is a global phenomenon that is particularly observed in industrialized countries [1]. Presently, the proportion of older individuals in these countries is growing at an accelerated pace, mainly due to increasing longevity. Among the industrialized countries, South Korea shows the fastest aging society among the Organisation for Economic Co-operation and Development (OECD) countries, and the related health care cost has been growing [2]. The proportion of individuals aged ≥65 years was 7.2% in 2000 and 14.2% in 2017. In 2025, 20.3% of the total population in Korea will be aged ≥65 years. This aging speed is 7 years faster than that in Japan.
As the population is aging, it is important to estimate the population mortality based on medical conditions. Most estimation methods for mortality are based on a single independent factor (e.g., blood pressure, glycemic variability [3], muscle strength [4], self-rated health [5], health-rated quality of life [6], health literacy, and frailty). Otherwise, other estimators are calculated as a single score using weighted values to predict mortality. For example, Charlson et al. [7] defined numerous clinical conditions through reviewing hospital charts and assessed their relevance in the prediction of 1-year mortality. The Charlson Comorbidity Index (CCI) has been validated in large populations [8,9,10,11]. Similarly, the Elixhauser Comorbidity Index is also widely used to predict mortality [12].
However, since a systematic approach becomes a trend in medical research [13,14], it is questionable as to whether the comorbidity indices may have limitations in providing mortality prevention strategies based on the perspective of public health. In particular, members of the elderly population aged ≥65 years may suffer from multiple noncommunicable diseases. The multiple diseases share common risk factors, such as smoking, alcohol drinking, insufficient physical activity, and obesity [15,16]. Therefore, it is important to group the associated disease entities that may share some disease causes and pathophysiology using a machine-learning or econometric approach [16]. This study aimed to group each disease classified by the International Classification of Diseases Tenth Edition, using principal component analysis, and discuss the systematic approach to reduce preventable death rate from the perspective of public health.

2. Materials and Methods

2.1. Study Sample and Design

This study was conducted by the Korea Labor Institute for this rapidly growing population and obtained de-identified data from the first wave of the 2006 Korean Longitudinal Study of Aging (KLoSA), including participants’ data of community-dwelling Korean individuals aged ≥45 years until follow−up in 2016. Participants were randomly selected using a multistage, stratified probability sampling design based on geographical area and housing type across the country to create a nationally representative sample. As per the KLoSA protocol, trained surveyors obtained informed consent from participants and conducted face-to-face interviews using a computer-assisted personal interviewing program. The data were composed of seven categories, including population, family, health, employment, income, wealth, and subjective and life expectation. In the baseline survey in 2006, 10,254 individuals from 6171 households (1.7 per household) were interviewed.
To forecast the association between frequent comorbidities and all-cause mortality among these individuals, we excluded 37 participants with missing information at baseline 2006 and finally included 10,217 study participants. Thus, comorbidities for these participants were investigated, comorbidity patterns were derived from principal component analysis, and the association between each pattern and mortality was analyzed. This study doesn’t require ethical approval, because it is not a study using human derivatives, and all subjects are encrypted and cannot be identified.

2.2. Independent Variables

With both questionnaires surveyed on the middle-aged and elderly panel and diseases that frequently develop in the elderly population, the diagnoses of hypertension, diabetes, malignant tumor, chronic lung disease, liver disease, heart disease, psychiatric disease, and arthritis or rheumatism; experience of falls in 2 years; and difficulty in daily activities due to visual impairment were analyzed to extract comorbidity patterns. Each comorbidity was categorized according to a “yes” or “no” response to the question “Have you ever been diagnosed by a doctor?”

2.3. Dependent Variable

Death (all-cause mortality) over a maximum follow-up period of 10 years was determined by a death certificate and coroner’s report.

2.4. Control Variables

2.4.1. Socioeconomic and Demographic Factors

Age groups were divided into five categories: 45–54, 55–64, 65–74, and ≥75 years. Educational level was categorized into four groups: elementary school or lower, middle school, high school, and college or higher. Sex was categorized as male and female, and residential region was categorized into urban (administrative divisions of a city: Seoul, Daejeon, Daegu, Busan, Incheon, Kwangju, or Ulsan) or rural (not classified as administrative division of a city). Marital status was divided into two groups: single or married. The single group included separation and separation by death or divorce. Labor was divided into two categories: yes or no. Health insurance was categorized as National Health Insurance or Medical Aid.

2.4.2. Health Status and Behavior Factors

Smoking status was categorized into three groups: current smoker, former smoker, or never smoker. Alcohol use was also divided into three groups: current drinker, former drinker, or never drinker. Moreover, the number of chronic diseases, such as hypertension, diabetes, cancer, chronic obstructive pulmonary disease, liver disease, cardiovascular disease, cerebrovascular disease, psychiatric disease, and arthritis, was also included as a covariate in our analyses.

2.5. Analytical Approach and Statistics

This study conducted a principal component analysis with varimax rotation, which is a statistical technique used at one level of principal component analysis to extract comorbidity patterns [17]. The principal component analysis analyzes the correlations between item and variables to identify principal components with high correlation. In this study, we included principal component scores by calculating a weighted sum of the items. Each item’s weight is derived from its principal component loading and each item’s contribution to the principal component score depends on how strongly it relates to the principal component. Because those weights are all between −1 and 1, the scale of the principal component scores will be very different from each other. Results of the principal component analysis indicated 4 categories of 11 items, which measure disease comorbidity patterns. In addition, chi-square test and Cox proportional hazards models were used to investigate the association between comorbidity patterns and all-cause mortality through a 10-year follow-up database. In all analyses, the criterion for statistical significance was a p-value of <0.05 in a two-tailed test. All analyses were conducted using the SAS software package version 9.4 (SAS Institute Inc., Cary, NC, USA)

3. Results

3.1. Sample Characteristics

Table 1 presents the descriptive statistics of all variables at baseline (2006). In the 10,217 participants included in our study, the incidence of mortality was 1,487 (14.6%). In 2,828 participants (27.7%) with hypertension, the mortality rate was 19.3% (n = 546), and in 1,219 participants (11.9%) with diabetes, the mortality rate was 22.8% (n = 278). In 245 participants (2.4%) with malignant tumor, the mortality rate was 26.5% (n = 65), and in 226 participants (2.2%) with chronic lung disease, the mortality rate was 32.7% (n = 74). These two chronic medical conditions have the highest mortality rate among the included comorbidities (Figure 1).
In terms of sociodemographic factors, in 4,799 participants (47.0%) with an educational level of elementary school, the mortality rate was 21.8% (n = 1047); in 3,571 participants (35.0%) in the rural area, the mortality rate was 17.7% (n = 631); and in 2,281 participants (22.3%) living alone, the mortality rate was 24.8% (n = 24.8%). Participants with Medical Aid (n = 640, 6.3%) had a mortality rate of 25.2% (n = 161). In health behavior, former smokers (n = 976, 9.5%) and former drinkers (n = 687, 6.7%) had high mortality rates, with 23.2% (n = 226) and 27.8% (n = 191), respectively.

3.2. Principal Component Analysis for Grouping Chronic Medical Conditions

Table 2 shows the results of principal component analysis among 11 chronic medical conditions. As a result of calculating the Cronbach alpha coefficient through correlation analysis for each of the four principal components, it was confirmed that the four disease-oriented principal component domains were independent of each other at 0.00067. According to the result, we can identify four distinct principal components: principal component 1 (Disease of the circulatory system: diabetes, heart disease, and hypertension), principal component 2 (Disease of visual and musculoskeletal system: difficulty in daily activities due to visual impairment, arthritis and rheumatism, and fall during the last 2 years), principal component 3 (Disease of mental disorder: psychiatric and cerebrovascular diseases), and principal component 4 (Disease of the respiratory and digestive system: liver disease except fatty liver, diagnosis of malignant tumor, and chronic lung disease).

3.3. Association Between Grouped Principal Components and All-Cause Mortality

In Table 3, we performed a survival analysis to investigate the relationship between all-cause mortality and each grouped principal component. The presence of principal component 1 (diabetes, heart disease, and hypertension) was associated with an increased hazard ratio (HR) of 1.079 (95% CI 1.031–1.129, p = 0.001). The presence of principal component 3 (psychiatric and cerebrovascular diseases) was related to an increased HR of 1.134 (95% CI 1.094–1.175, p < 0.0001). Moreover, principal component 4 was associated with a high HR of 1.172 (95% CI 1.130–1.215, p < 0.0001). In participants aged between 45 and 64 years, principal component 4 (liver disease except fatty liver, diagnosis of malignant tumor, and chronic lung disease) showed a significantly increased HR of 1.262 (95% CI 1.184–1.346, p < 0.001). Other principal components were not significantly associated with an increased mortality rate. In participants aged ≥65 years, principal component 1 (HR = 1.065, 95% CI 1.012–1.120, p = 0.015), principal component 3 (HR = 1.140, 95% CI 1.096–1.187, p < 0.0001), and principal component 4 (HR = −1.132, 95% CI 1.083–1.184, p < 0.0001) were associated with an increased mortality rate. This section may be divided by subheadings. It should provide a concise and precise description of the experimental results, their interpretation as well as the experimental conclusions that can be drawn.

4. Discussion

In this study, we suggest that increased mortality rate is associated with principal component 3 (Disease of mental disorder: psychiatric and cerebrovascular diseases), principal component 1 (Disease of the circulatory system: diabetes, heart disease, and hypertension), and principal component 4 (Disease of the respiratory and digestive system: liver disease except fatty liver, diagnosis of malignant tumor, and chronic lung disease) in the elderly Korean population aged ≥65 years. Similarly, principal component 4 is associated with an increased mortality rate in the population aged between 45 and 64 years.
This result is similar to that in the national statistics of Korea. Among the population aged ≥65 years, malignant tumor has been the leading cause of death since 2007. Cardiovascular disease was the second leading cause of death in 2017. Respiratory disease is the fourth leading cause of death, and neurologic disease is the fifth. Additionally, other diseases comprising principal component 3 and 4 are among the most common causes of death. Although the Difficulty in daily activities due to sight in principal component 3 was excluded based on statistical criteria, it is known that it is closely related to brain disease and psychiatric disease [18]. Therefore, the result of the study is valid based on the crude national statistics.
In the population aged between 45 and 64 years, principal component 1 and 3 are not related to increased mortality, but the result was only marginally insignificant. The two principal component entities both have relatively long disease progression and manageable medical plans compared to principal component 4 entities. For example, liver disease was the third leading cause of death in the middle-aged population in 2017. Since the prevalence rate of hepatitis B and C in Korea is 0.8%, and in hepatitis C, interferon-γ treatment does not have a promising effect, with direct-acting oral regimens only being approved in Korea in 2016, the study population did not have sufficient treatment options during the study period. Without HCV screening, it may progress to liver cirrhosis or malignancy without specific symptoms.
This result is similar to that in the national statistics in Japan. The five leading causes of death in the Japanese population aged between 40 and 59 years are malignant neoplasm, heart disease, suicide, cardiovascular disease, and liver disease [19].
Through the valid mortality results from the principal component analysis, we may have meaningful implications in preventing premature mortality. First, we may consider group-based strategies in public health. Most prevention programs in Korea are based on a single disease or single risk factor approach (e.g., the Korean national diabetes prevention program [20,21] or national smoking cessation program [22,23]). In this study, we found that principal component 1, which includes diabetes, heart disease, and hypertension, is associated with increased mortality rate in individuals aged ≥65 years. These diseases share important risk factors and health behaviors, such as decreased daily activity [24], unbalanced diet [25,26], and smoking [26,27,28,29,30,31]. Therefore, it is necessary to control the group-based target diseases rather than a single-disease approach.
Although we know that it is important to prevent the risk from common and important factors in non-communicable diseases, it is difficult to have a scope that includes all factors. Thus, we suggest that the principal component analysis for mortality may have an effect on researchers and policy makers to review the common factors for prevention strategies in mortality. Second, policy makers in healthcare may have targeted strategies for preventing mortality based on the principal component analysis. However, we can determine which factor may have the largest population involved and the highest association with mortality. In this study, principal component 2 has the largest number of deaths (n = 602), followed by principal component 1 (n = 470), principal component 3 (n = 191), and principal component 4 (n = 184). However, principal component 2 does not have a statistically significant association with mortality. Therefore, it is assumed that principal component 1, which has the second largest number of deaths, has the largest impact on the Korean population. For principal component 1, which includes hypertension, diabetes, and heart disease, we have to implement an educational program for the general population who can be potential patients in principal component 1. In contrast, we have a different strategy for principal component 4. Although principal component 4 has the smallest number of deaths in the study, it has the highest mortality rate in the study population. Therefore, it is important to determine the risk group earlier and provide high-quality healthcare to prevent mortality in the principal component 4 group. Since we can estimate the number of deaths based on the factors and the degree of relationship to mortality through the principal component analysis, it will help policy makers to establish tailored strategies in preventing mortality. Lastly, we can follow the mortality based on the principal component analysis. The grouped principal components are different annually. In the 1970s, the most common cause of death in Korea was cerebrovascular disease, while it was malignancy in the 2010s [32,33]. This means that the causes of deaths have changeable dynamics and should be monitored annually. Using the principal component analysis, we can generate the grouped principal components and can have a proper health policy for preventing mortality. The dynamics also help us to understand the macroscopic health-related issues in Korea and how they can be managed using policy intervention.
In addition, we suggest that studies revealing multiple grouped conditions associated with mortality may improve a value-based healthcare strategy, along with the advanced estimation model using AI. Since 2019, the Centers for Medicare and Medicaid Services (CMS) has launched the projects for AI Health Outcomes Challenge and offered federal grants and contracts to innovators to demonstrate how AI tools—such as deep learning and neural networks—can be used to predict unplanned hospital and skilled nursing facility admissions as well as adverse events [34,35]. A variety of lifestyle and health data enable us to analyze negative health outcomes at once and prevent such harmful triggers as soon as possible. Other healthcare trends also explain tangible AI healthcare service model as well [36].
Through technology, we may provide mortality estimation service among the elderly population. It can be defined this process as AI-based decision support system for healthcare [37]. First, it is possible to calculate a probability or a propensity of mortality using the five principal components, those we have suggested using the health insurance claim data. Second, we can add their socio-demographic factors to increase the predictability of the model and capture unobservable factors related to mortality. It is also possible to obtain this from administrative data. Moreover, we can obtain their functional status using the biennial national health screening in Korea [38]. It contains information of lifestyle (e.g., smoking status, binge drinking, regular exercise, etc.), as well as functional status reflecting frailty among the elderly population (e.g., time to up and go, simple cognitive function test, etc.). Third, we can develop a clinical decision support model to analyze the probability of mortality risk. Finally, we can target the vulnerable population, which has shorter life expectancy than the same age group. Based on the result, we can perform a checkup with respect to more specific medical conditions in order to prevent early mortality cases through national screening. Therefore, we believe that our study can provide an essential step in the entire process of targeting the vulnerable population to increase the efficiency of the healthcare system, in other words, towards a value-based healthcare system [39].
This study has several limitations. First, medical history is self-reported. We believe that fatal diseases, such as malignancy or cerebrovascular disease, have low probability of ambiguity, but there is still a risk that participants may have biased memory or skip recording information. Second, the mortality cases may have duplicated and complex comorbidities. For example, more than half of the population aged ≥65 years have two or more noncommunicable diseases in Korea [40]. It is possible that two or more diseases may have an effect on mortality. Third, although we calculated the Cronbach alpha coefficient through correlation analysis, it was confirmed that the principal component scores for each principal component were independent of each other, but it still remains likely that there are multicollinearity problems. Fourth, the basic difference between principal component analysis and confirmatory factor analysis is the a priori assumption is that each factor is associated with a specified subset of indicator variables. The major limitation behind principal component analysis is its simplicity. Therefore, it is necessary to analyze the results through confirmatory factor analysis in a future study. Lastly, we investigated association rather than causality. Although we could measure the association between the factors and mortality, it is still possible that there are some unknown confounders.

5. Conclusions

We analyzed the association between diseases and mortality using principal component analysis. Among the four principal components, three are statistically significantly associated with increased mortality rate. The principal component analysis for grouping causes of death or disease categories can generate meaningful predictors for the analysis of mortality risk. This approach may become a useful tool for providing comprehensive and targeted strategies to healthcare professionals and policy makers. Further study can develop a clinical decision support model to analyze the probability of mortality risk along with advanced estimation model using AI.

Author Contributions

J.S., K.-S.L., and J.-H.K. interpreted the patient data and were the major contributors in writing the manuscript. J.-H.K. analyzed and interpreted the patient data. J.S. and K.-S.L. were major contributors in patient data collection and the design of the study. J.S. was a major contributor in reviewing the manuscript. J.-H.K. was a major contributor in patient data collection, design of the study and manuscript revision. All authors read and approved the final manuscript.

Funding

The present research was supported by the research fund of Dankook University in 2020.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Murray, C.L.J.; Callender, C.S.K.H.; Kulikoff, X.R.; Srinivasan, V.; Abate, D.; Abate, K.H.; Abay, S.M.; Abbasi, N.; Abbastabar, H.; Abdela, J.; et al. Population and fertility by age and sex for 195 countries and territories, 1950–2017: A systematic analysis for the Global Burden of Disease Study 2017. Lancet 2018, 392, 1995–2051. [Google Scholar] [CrossRef] [Green Version]
  2. OECD Health at a Glance 2017. In Health at a Glance; Economic Cooperation and Development: Paris, France, 2017. [CrossRef]
  3. Krinsley, J.S. Glycemic variability: A strong independent predictor of mortality in critically ill patients*. Crit. Care Med. 2008, 36, 3008–3013. [Google Scholar] [CrossRef] [PubMed]
  4. Metter, E.J.; Talbot, L.A.; Schrager, M.; Conwit, R. Skeletal muscle strength as a predictor of all-cause mortality in healthy men. J. Gerontol. Ser. A Boil. Sci. Med. Sci. 2002, 57, B359–B365. [Google Scholar] [CrossRef] [PubMed]
  5. Mossey, J.M.; Shapiro, E. Self-rated health: A predictor of mortality among the elderly. Am. J. Public Heal. 1982, 72, 800–808. [Google Scholar] [CrossRef] [Green Version]
  6. Rumsfeld, J.S.; Mawhinney, S.; McCarthy, J.M.; Shroyer, A.L.W.; Villanueva, C.B.; O’Brien, M.; Moritz, T.E.; Henderson, W.G.; Grover, F.L.; Sethi, G.K.; et al. Health-Related Quality of Life as a Predictor of Mortality Following Coronary Artery Bypass Graft Surgery. JAMA 1999, 281, 1298. [Google Scholar] [CrossRef] [Green Version]
  7. Charlson, M.E.; Pompei, P.; Ales, K.L.; MacKenzie, C.R. A new method of classifying prognostic comorbidity in longitudinal studies: Development and validation. J. Chronic Dis. 1987, 40, 373–383. [Google Scholar] [CrossRef]
  8. Gabriel, S.E.; Crowson, C.S.; O’Fallon, W. A comparison of two comorbidity instruments in arthritis. J. Clin. Epidemiol. 1999, 52, 1137–1142. [Google Scholar] [CrossRef]
  9. Zhang, J.X.; Iwashyna, T.J.; Christakis, N.A. The Performance of Different Lookback Periods and Sources of Information for Charlson Comorbidity Adjustment in Medicare Claims. Med. Care 1999, 37, 1128–1139. [Google Scholar] [CrossRef] [Green Version]
  10. Thygesen, S.K.; Christiansen, C.F.; Christensen, S.; Lash, T.L.; Soerensen, H.T. The predictive value of ICD-10 diagnostic coding used to assess Charlson comorbidity index conditions in the population-based Danish National Registry of Patients. BMC Med. Res. Methodol. 2011, 11, 83. [Google Scholar] [CrossRef] [Green Version]
  11. Sundararajan, V.; Henderson, T.; Perry, C.; Muggivan, A.; Quan, H.; Ghali, W.A. New ICD-10 version of the Charlson comorbidity index predicted in-hospital mortality. J. Clin. Epidemiol. 2004, 57, 1288–1294. [Google Scholar] [CrossRef]
  12. Ladha, K.S.; Zhao, K.; Quraishi, S.A.; Kurth, T.; Eikermann, M.; Kaafarani, H.M.A.; Klein, E.N.; Seethala, R.; Lee, J. The Deyo-Charlson and Elixhauser-van Walraven Comorbidity Indices as predictors of mortality in critically ill patients. BMJ Open 2015, 5, 008990. [Google Scholar] [CrossRef] [PubMed]
  13. Vincent, C.; Taylor-Adams, S.; Stanhope, N. Framework for analysing risk and safety in clinical medicine. BMJ 1998, 316, 1154–1157. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Bird, A. Systematicity, knowledge, and bias. How systematicity made clinical medicine a science. Synthese 2017, 196, 863–879. [Google Scholar] [CrossRef] [Green Version]
  15. Habib, S.H.; Saha, S. Burden of non-communicable disease: Global overview. Diabetes Metab. Syndr. Clin. Res. Rev. 2010, 4, 41–47. [Google Scholar] [CrossRef]
  16. Hodgkins, C.E.; Egan, B.; Peacock, M.; Klepacz, N.; Miklavec, K.; Pravst, I.; Pohar, J.; Gracia, A.; Groeppel-Klein, A.; Rayner, M.; et al. Understanding How Consumers Categorise Health Related Claims on Foods: A Consumer-Derived Typology of Health-Related Claims. Nutrients 2019, 11, 539. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Choi, J.W.; Kim, J.-H.; Yoo, K.B. Subjective expectations for future and mortality among middle-aged and older adults. Medicine 2020, 99, e19421. [Google Scholar] [CrossRef]
  18. Kim, J.-H.; Yoo, K.-B.; Lee, Y. Development and validation of the Korea Dementia Comorbidity Index (KDCI): A nationwide population-based cohort study from 2002 to 2013. Arch. Gerontol. Geriatr. 2017, 72, 195–200. [Google Scholar] [CrossRef]
  19. Ministry of Health, L., and Welfare. andbook of Health and Welfare Statistics 2017. Available online: https://www.mhlw.go.jp/english/database/db-hh/1-2.html (accessed on June 21 2019).
  20. Rhee, S.Y.; Chon, S.; Kwon, M.K.; Park, I.B.; Ahn, K.J.; Kim, I.J.; Kim, S.-H.; Lee, H.W.; Koh, K.S.; Kim, D.M.; et al. Prevalence of Chronic Complications in Korean Patients with Type 2 Diabetes Mellitus Based on the Korean National Diabetes Program. Diabetes Metab. J. 2011, 35, 504–512. [Google Scholar] [CrossRef] [Green Version]
  21. Yang, S.J.; Hwang, S.Y.; Baik, S.H.; Lee, K.W.; Nam, M.S.; Park, Y.S.; Woo, J.T.; Kim, Y.S.; Park, S.; Park, S.-Y.; et al. Serum Magnesium Level Is Associated with Type 2 Diabetes in Women with a History of Gestational Diabetes Mellitus: The Korea National Diabetes Program Study. J. Korean Med. Sci. 2014, 29, 84–89. [Google Scholar] [CrossRef]
  22. Kim, Y.; Park, I.; Park, J.-S. Meta-Analysis of Effects on Adolescent Smoking Cessation Programs in Korea. J. Korean Acad. Nurs. 2008, 38, 204. [Google Scholar] [CrossRef]
  23. Park, E.J.; Koh, H.K.; Kwon, J.W.; Suh, M.K.; Kim, H.; Cho, S.-I. Secular trends in adult male smoking from 1992 to 2006 in South Korea: Age-specific changes with evolving tobacco-control policies. Public Heal. 2009, 123, 657–664. [Google Scholar] [CrossRef] [PubMed]
  24. Beaglehole, R.; Yach, D. Globalisation and the prevention and control of non-communicable disease: The neglected chronic diseases of adults. Lancet 2003, 362, 903–908. [Google Scholar] [CrossRef]
  25. Beaglehole, R.; Bonita, R.; Horton, R.; Adams, C.; Alleyne, G.; Asaria, P.; Baugh, V.; Bekedam, H.; Billo, N.; Casswell, S.; et al. Priority actions for the non-communicable disease crisis. Lancet 2011, 377, 1438–1447. [Google Scholar] [CrossRef]
  26. Kelishadi, R.; Alikhani, S.; Delavari, A.; Alaedini, F.; Safaie, A.; Hojatzadeh, E. Obesity and associated lifestyle behaviours in Iran: Findings from the First National Non-communicable Disease Risk Factor Surveillance Survey. Public Heal. Nutr. 2008, 11, 246–251. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Lariscy, J.T. Smoking-attributable mortality by cause of death in the United States: An indirect approach. SSM Popul. Heal. 2019, 7, 100349. [Google Scholar] [CrossRef] [PubMed]
  28. Carter, B.D.; Abnet, C.C.; Feskanich, D.; Freedman, N.D.; Hartge, P.; Lewis, C.E.; Ockene, J.K.; Prentice, R.L.; Speizer, F.E.; Thun, M.J.; et al. Smoking and Mortality — Beyond Established Causes. N. Engl. J. Med. 2015, 372, 631–640. [Google Scholar] [CrossRef] [Green Version]
  29. Wu, F.; Guo, Y.; Chatterji, S.; Zheng, Y.; Naidoo, N.; Jiang, Y.; Biritwum, R.B.; Yawson, A.; Minicuci, N.; Salinas-Rodríguez, A.; et al. Common risk factors for chronic non-communicable diseases among older adults in China, Ghana, Mexico, India, Russia and South Africa: The study on global AGEing and adult health (SAGE) wave 1. BMC Public Heal. 2015, 15, 88. [Google Scholar] [CrossRef] [Green Version]
  30. Sarich, P.; Ding, D.; Sitas, F.; Weber, M.F.W. Co-occurrence of chronic disease lifestyle risk factors in middle-aged and older immigrants: A cross-sectional analysis of 264,102 Australians. Prev. Med. 2015, 81, 209–215. [Google Scholar] [CrossRef]
  31. Kinra, S.; Bowen, L.J.; Lyngdoh, T.; Prabhakaran, D.; Reddy, K.S.; Ramakrishnan, L.; Gupta, R.; Bharathi, A.V.; Vaz, M.; Kurpad, A.V.; et al. Sociodemographic patterning of non-communicable disease risk factors in rural India: A cross sectional study. BMJ 2010, 341, c4974. [Google Scholar] [CrossRef] [Green Version]
  32. Jung, K.-W.; Won, Y.-J.; Kong, H.-J.; Oh, C.-M.; Cho, H.; Lee, D.H.; Lee, K.H. Cancer Statistics in Korea: Incidence, Mortality, Survival, and Prevalence in 2012. Cancer Res. Treat. 2015, 47, 127–141. [Google Scholar] [CrossRef]
  33. Lim, D.; Ha, M.; Song, I. Trends in the Leading Causes of Death in Korea, 1983-2012. J. Korean Med. Sci. 2014, 29, 1597–1603. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Matheny, M.E.; Whicher, D.; Israni, S.T. Artificial Intelligence in Health Care. JAMA 2020, 323, 509. [Google Scholar] [CrossRef] [PubMed]
  35. Gerke, S.; Babic, B.; Evgeniou, T.; Cohen, I.G. The need for a system view to regulate artificial intelligence/machine learning-based software as medical device. npj Digit. Med. 2020, 3, 53. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Lee, K.; Wan, T.T.H. Effects of hospitals’ structural clinical integration on efficiency and patient outcome. Heal. Serv. Manag. Res. 2002, 15, 234–244. [Google Scholar] [CrossRef]
  37. Gurupur, V.; Wan, T.T.H. Inherent Bias in Artificial Intelligence-Based Decision Support Systems for Healthcare. Medicina 2020, 56, 141. [Google Scholar] [CrossRef] [Green Version]
  38. Seong, S.C.; Kim, Y.-Y.; Park, S.K.; Khang, Y.-H.; Kim, H.C.; Park, J.H.; Kang, H.-J.; Do, C.-H.; Song, J.-S.; Lee, E.-J.; et al. Cohort profile: The National Health Insurance Service-National Health Screening Cohort (NHIS-HEALS) in Korea. BMJ Open 2017, 7, e016640. [Google Scholar] [CrossRef] [Green Version]
  39. Porter, M.E. Value-Based Health Care Delivery. Trans. Meet. Am. Surg. Assoc. 2008, 126, 144–150. [Google Scholar] [CrossRef]
  40. Affairs, K.I.f.H.a.S. The Survey of Living Conditions and Welfare Needs of Korean Older Persons; Korea Institute for Health and Social Affairs: Sejong, Korea, 2018. [Google Scholar]
Figure 1. General characteristics for mortality by disease.
Figure 1. General characteristics for mortality by disease.
Medicina 56 00360 g001
Table 1. General characteristics of subjects included for analysis.
Table 1. General characteristics of subjects included for analysis.
TotalMortalityp−Value
N%No%Yes%
Age <0.0001
45–54328832.18318796.931013.07
55–64278927.30258092.512097.49
65–74267126.14217081.2450118.76
≥75146914.3879353.9867646.02
Education <0.0001
≤Elementary school479946.97375278.18104721.82
Middle school165616.21150090.581569.42
High school270526.48250492.572017.43
≥College105710.3597492.15837.85
Gender <0.0001
Male445143.56365782.1679417.84
Female576656.44507387.9869312.02
Residential Region <0.0001
Urban664665.05579087.1285612.88
Rural357134.95294082.3363117.67
Marital Status <0.0001
Married793677.67701488.3892211.62
Single (including Separated, divorced)228122.33171675.2356524.77
Labor <0.0001
Yes395038.66369493.522566.48
No626761.34503680.36123119.64
National Health Insurance <0.0001
Health insurance957793.74825186.15132613.85
Medical aid6406.2647974.8416125.16
Smoking Status <0.0001
Never727471.20635287.3292212.68
Former smoker9769.5575076.8422623.16
Smoker196719.25162882.7733917.23
Alcohol Use <0.0001
Never387137.89339787.7647412.24
Former Drinker6876.7249672.2019127.80
Drinker565955.39483785.4782214.53
Total10217100.00873085.45148714.55
Table 2. Results of principal component analysis.
Table 2. Results of principal component analysis.
PC 1PC 2PC 3PC 4
Cerebrovascular Disease0.179−0.1180.7480.006
Psychiatric Disease−0.0560.1860.7340.007
Difficulty in Daily Activities Due to Sight0.2180.4750.1820.069
Heart Disease0.3650.1960.0960.030
Hypertension0.7070.1050.080−0.004
Diagnosis of Cancer and Malignant Tumor (Excluding Slight Skin Cancer)0.023−0.0800.0310.512
Chronic Lung Disease−0.1800.3970.0240.497
Fall for the Last 2 Years−0.0600.5930.014−0.083
Liver Disease (Except Fatty Liver)0.110−0.065−0.0360.711
Diabetes0.729−0.047−0.0460.038
Arthritis and Rheumatism0.2690.594−0.096−0.079
Eigen Value1.6211.0801.0491.012
The highest principal component loadings of more than 0.3 are bold.
Table 3. Survival analysis based on the grouped principal components, based on the age group
Table 3. Survival analysis based on the grouped principal components, based on the age group
Mortality
Total45-64≥65
HRp-ValueHRp-ValueHRp-Value
PC 11.0790.0011.0970.0961.0650.015
PC 20.9680.1580.9820.7620.9680.196
PC 31.134<0.00011.0880.0511.140<0.0001
PC 41.172<0.00011.262<0.00011.132<0.0001
Age
45–541.000 1.000 N/A
55–641.847<0.00011.690<0.0001
65–743.865<0.0001N/A1.000
≥7510.278<0.00012.751<0.0001
Education (≥College)
≤Elementary School1.5610.0002.0950.0011.3310.046
Middle School1.1330.3651.4440.1260.9940.972
High School1.0640.6341.3480.1840.9450.727
Gender (vs Female)
Male2.236<0.00013.338<0.00011.937<0.0001
Residential Region (vs Urban)
Rural1.293<0.00011.4490.0021.2470.000
Marital Status (vs Married)
Single (including Separated, Divorced)1.491<0.00011.930<0.00011.3270.000
Labor (vs No)
Yes0.577<0.00010.486<0.00010.646<0.0001
National Health Insurance (vs Medical Aid)
Health Insurance0.8990.2150.7750.2230.9380.499
Smoking Status (vs Never)
Former Smoker1.3190.0021.2550.2621.3320.003
Smoker1.460<0.00011.3020.1081.458<0.0001
Alcohol use (vs Never)
Former Drinker1.1440.1321.1650.4571.1500.160
Drinker1.1490.0471.1790.2641.1490.083
HR: hazard ration, PC: principal component, N/A: not available.

Share and Cite

MDPI and ACS Style

Shin, J.; Lee, K.-S.; Kim, J.-H. Predicting Old-age Mortality Using Principal Component Analysis: Results from a National Panel Survey in Korea. Medicina 2020, 56, 360. https://0-doi-org.brum.beds.ac.uk/10.3390/medicina56070360

AMA Style

Shin J, Lee K-S, Kim J-H. Predicting Old-age Mortality Using Principal Component Analysis: Results from a National Panel Survey in Korea. Medicina. 2020; 56(7):360. https://0-doi-org.brum.beds.ac.uk/10.3390/medicina56070360

Chicago/Turabian Style

Shin, Jaeyong, Kwang-Soo Lee, and Jae-Hyun Kim. 2020. "Predicting Old-age Mortality Using Principal Component Analysis: Results from a National Panel Survey in Korea" Medicina 56, no. 7: 360. https://0-doi-org.brum.beds.ac.uk/10.3390/medicina56070360

Article Metrics

Back to TopTop