Excess Deaths in Korea During the COVID-19 Pandemic: 2020-2022
Article information
Abstract
Objectives:
Excess deaths, an indicator that compares total mortality rates before and during a pandemic, offer a comprehensive view of the pandemic’s impact. However, discrepancies may arise from variations in estimating expected deaths. This study aims to compare excess deaths in Korea during the coronavirus disease 2019 pandemic using 3 methods and to analyze patterns using the most appropriate method.
Methods:
Expected deaths from 2020 to 2022 were estimated using mortality data from 2015-2019 as reference years. This estimation employed 3 approaches: (1) simple average, (2) age-adjusted average, and (3) age-adjusted linear regression. Excess deaths by age, gender, and cause of death were also presented.
Results:
The number of excess deaths varied depending on the estimation method used, reaching its highest point with the simple average and its lowest with the age-adjusted average. Age-adjusted linear regression, which accounts for both the aging population and declining mortality rates, was considered most appropriate. Using this model, excess deaths were estimated at 0.3% for 2020, 4.0% for 2021, and 20.7% for 2022. Excess deaths surged among individuals in their 20s throughout the pandemic, largely attributed to a rise in self-harm and suicide. Additionally, the results indicated sharp increases in deaths associated with “endocrine, nutritional, and metabolic diseases” and “symptoms, signs, and abnormal clinical and laboratory findings, not elsewhere classified.”
Conclusions:
Substantial variations in excess deaths were evident based on estimation method, with a notable increase in 2022. The heightened excess deaths among young adults and specific causes underscore key considerations for future pandemic responses.
INTRODUCTION
The coronavirus disease 2019 (COVID-19) pandemic, which originated from an unidentified cause of pneumonia in Wuhan, China, in late December 2019, was officially declared a pandemic by the World Health Organization on March 11, 2020. Over the next 3 years, it inflicted considerable damage worldwide. The impact of this pandemic on society has 2 main types: direct damage caused by the disease itself and collateral damage resulting from infection control policies. Direct damage is exemplified by the deaths attributed to COVID-19, while collateral damage includes a broad spectrum of adverse effects on physical and mental health, the economy, education, and various other societal dimensions [1].
Various studies are being conducted to evaluate the impact of the pandemic from different perspectives. One such approach involves estimating the excess deaths experienced by each country. Excess deaths represent a metric for comparing total mortality rates before and during the pandemic. This metric accounts for not only direct deaths from COVID-19 but also those that may have occurred due to the pandemic and its associated policies [2]. Therefore, excess deaths are considered a comprehensive indicator for assessing the overall impact of the pandemic on a country.
However, the number of excess deaths can vary depending on the estimation method. To calculate excess deaths, one must initially establish a reference year prior to the pandemic. Then, the expected number of deaths during the pandemic is determined by applying the mortality rate from the reference year to the population at the time. This expected mortality figure is then compared to the actual number of deaths observed. Notably, the selection of the reference year and the methodology employed to estimate expected deaths can produce different results [3].
For instance, Our World in Data, which consistently provides information on excess deaths by country, has employed 2 methods to calculate excess deaths [4]. In the early stages of the COVID-19 pandemic, their estimation of expected deaths was based on the average mortality rate observed from 2015 to 2019. Later, they utilized regression equations derived from mortality rates between 2015 and 2019 to estimate expected deaths. In Korea, Statistics Korea reported excess deaths by using the highest number of deaths observed weekly over the past 3 years as the expected number and comparing this to the observed deaths during the pandemic [5]. However, this reporting has been suspended.
Although numerous articles have documented excess deaths in various countries during the pandemic [6,7], including Korea [8-10], few studies have compared excess death estimates obtained with different methodologies [11,12]. For example, an analysis of excess deaths in 2020-2021 across 33 countries, using 6 distinct methods, underscored the importance of age adjustment. While there were strong correlation coefficients of around 0.9 between excess death estimates from different methodologies, considerable absolute differences in the estimated excess deaths were observed across most countries [11].
Thus, this study was conducted with 2 primary objectives. First, we compared the differences in excess deaths from 2020 to 2022 using 3 estimation methods, including those similar to the ones commonly employed by publicly available websites such as Our World in Data and Statistics Korea. Second, we presented the excess deaths from 2020 to 2022, categorized by age group, gender, and cause of death, using the method that was determined to be the most appropriate.
METHODS
Data Sources
Data on deaths from 2015 to 2022 were obtained from Statistics Korea’s cause-of-death statistics. This dataset includes information such as age, gender, cause of death, and place of death [13]. Among the 2 431 753 deaths recorded from January 2015 to December 2022, 308 deaths lacking age information were excluded from this study. The monthly resident registered population by age group, segmented into 10-year intervals ranging from 0-9 years to 80 years or older, was acquired from the Korean Statistical Office [13].
Statistical Analysis
In this study, excess deaths were quantified using 2 approaches: (1) by calculating the difference between the observed and expected number of deaths, and (2) by determining the P-score, which is defined as ([(observed deaths−expected deaths)/expected deaths]×100) [14]. The term “observed deaths” refers to the actual number of deaths recorded from 2020 to 2022. In contrast, expected deaths were estimated using 3 distinct methods, based on the reference period of 2015-2019:
(1) Multiplying the average monthly mortality rates from 2015 to 2019 by the monthly population figures from 2020 to 2022; (2) Stratifying by age, in which the average monthly mortality rates from 2015 to 2019 were multiplied by the age-stratified monthly population figures from 2020 to 2022; and (3) Conducting linear regression analysis on the age-stratified monthly mortality rates from 2015 to 2019 to predict mortality rates for 2020 to 2022. These predicted rates were then multiplied by the corresponding age-stratified monthly population figures from 2020 to 2022.
Excess deaths estimated using these methods were based on (1) simple average, (2) age-adjusted average, and (3) age-adjusted linear regression methods.
Age-adjusted linear regression-based excess deaths were deemed the most suitable method among the 3 considered, as this approach accounts for both changes in age structure and mortality rate trends. Therefore, excess deaths were characterized by age, gender, and cause of death using estimates from this method. For the analysis, age was categorized into 9 groups at 10-year intervals. Causes of death were examined according to 22 major categories from the Korean Standard Classification of Diseases (KCD), which have remained consistent from 2015 to 2022 [13]. The analysis was performed using Microsoft Excel (Microsoft Corp., Redmond, WA, USA) and SPSS version 25.0 (IBM Corp., Armonk, NY, USA).
Ethics Statement
This study utilized publicly available secondary data, so ethical approval was not required.
RESULTS
Comparison of 3 Methods for Estimating Excess Deaths
Excess death estimates varied significantly based on the 3 methods used to calculate expected deaths, as illustrated in Table 1 and Figure 1. The simple average-based method, which does not consider changes in age structure, suggested the occurrence of 16 951 excess deaths in 2020, corresponding to a P-score of 5.9%. In contrast, the age-adjusted average method, which accounts for shifts in age structure, yielded an estimate of -27 491 excess deaths and a P-score of -8.3%. Additionally, when calculating age-adjusted excess deaths while also considering changes in mortality trends in each age group, the estimated number of excess deaths was 937, with a P-score of 0.3%. Despite substantial monthly variations in excess deaths, no correlation was observed between the monthly excess deaths and the monthly COVID-19 deaths for any method. Specifically, the correlation coefficient for the age-adjusted linear regression method was -0.119.
Comparison of excess deaths using three methods (simple average, age-adjusted average, and age-adjusted linear regression) from 2020 to 2022.
In 2021, we noted a clear increase in excess deaths compared to 2020, as evidenced by both simple average-based and age-adjusted linear regression-based estimates. The total excess deaths were 30 469 and 12 216, with P-scores of 10.6% and 4.0%, respectively. However, when using the age-adjusted average method, excess deaths did not differ notably from those in 2020. Additionally, a monthly analysis of excess deaths revealed no clear correlation with the number of monthly COVID-19 deaths. The correlation coefficient for the age-adjusted linear regression was +0.151.
When comparing excess deaths in 2022 with those in 2020 and 2021, all 3 methods indicated sharp increases. The simple average-based and age-adjusted average-based methods estimated excess deaths at 86 585 and 13 035, respectively, with corresponding P-scores of 30.2% and 3.6%. In comparison, the age-adjusted linear regression-based method estimated excess deaths at 63 907, with a P-score of 20.7%. Analysis of the monthly excess death data, as determined by the age-adjusted linear regression, revealed a strong correlation with monthly COVID-19 deaths (r=0.929).
Comparison of Excess Deaths by Age Group Using Age-adjusted Linear Regression
Table 2 presents excess deaths by age group. In 2020, the age group with the highest number of excess deaths was the 20s, with 231 excess deaths and a P-score of 9.3%. In 2021, this age group continued to have the highest number, with 371 excess deaths and a P-score of 15.4%, indicating a worsening situation compared to 2020. The 60s and 70s age groups also saw increased excess deaths, with P-scores of 7.8% and 6.1%, respectively. In 2022, a notable increase in excess deaths was observed among the elderly population. The 80s age group experienced the highest number, with 39 354 excess deaths and a P-score of 24.4%. This was followed by the 70s and 60s age groups. The 20s age group also saw an increase, with 433 excess deaths and a P-score of 18.6%.
Comparison of Excess Deaths by Gender Using Age-adjusted Linear Regression
Table 3 presents excess deaths by gender. Due to privacy concerns related to the age distribution of COVID-19 deaths by gender, it was not feasible to calculate excess deaths excluding those from COVID-19. In both 2020 and 2021, excess deaths were more pronounced among men than women, with P-scores for men at 0.7% and 4.5%, respectively, while the scores for women were -0.1% and 3.4%. However, a shift occurred in 2022, with women experiencing more excess deaths than men, evidenced by P-scores of 23.6% for women and 18.1% for men. When analyzing excess deaths by both gender and age, high excess deaths were noted among individuals in their 20s, regardless of gender. In 2020, the excess deaths of women in their 20s exceeded those of men in the same age group, with P-scores of 15.0% for women and 6.0% for men. This trend reversed in 2021 and 2022, with excess deaths among men in their 20s surpassing those among women; P-scores for men were 19.6% and 21.0%, respectively, compared to 8.6% and 14.9% for women.
Analyzing Cause-specific Excess Deaths Using Age-adjusted Linear Regression
Table 4 presents the estimated excess deaths for the 22 categories of causes of death according to the KCD. The U00-U99 category, which includes deaths from COVID-19, displayed a sharp rise in mortality since 2020. Consequently, these deaths were omitted from the cause-of-death analysis, as it was not suitable to estimate expected deaths using regression analysis for this category. Furthermore, we only reported excess deaths for causes where the difference was 1000 or more deaths and the P-score was at least 5.0%. When calculating excess deaths, if the absolute number of deaths is low, the P-score may appear inflated. For instance, if the expected number of deaths for a disease is 10 and the observed number is 20, the excess deaths would be 10, yielding a P-score of 100%. However, such an increase would not be considered meaningful from a public health perspective.
In 2020, we noted an increase in excess deaths associated with endocrine, nutritional, and metabolic diseases (E00-E90; P-score=13.9%), and with symptoms, signs, and abnormal clinical and laboratory findings not elsewhere classified (R00-R99; P-score=7.6%). In contrast, deaths from diseases of the respiratory system (J00-J99) decreased, as indicated by a P-score of -12.3%. Similarly, in 2021, excess deaths were greatest in E00-E90, with a P-score of 32.6%, more than doubling from 2020. Additionally, excess deaths increased significantly in R00-R99, with a P-score of 25.2%, more than tripling from 2020. Conversely, deaths from respiratory system diseases (J00-J99) decreased further (P-score=-16.6%).
In 2022, excess mortality was noted across various causes. The highest excess deaths were recorded in the E00-E90 category, with a P-score of 83.3%. Additionally, the R00-R99 category saw an estimated doubling of excess deaths from the previous year, reaching a P-score of 41.5%. Moreover, diseases of the nervous system (G00-G99), diseases of the digestive system (K00-K93), diseases of the circulatory system (I00-I99), and certain other consequences of external causes (S00-T98) exhibited P-scores of 32.6%, 15.4%, 14.8%, and 6.0%, respectively. Conversely, we observed a continued decline in the J00-J99 category, which had a P-score of -11.8%.
Additionally, an analysis was conducted on the causes of death among individuals in their 20s, who consistently exhibited high P-scores from 2020 to 2022. Within this age group, the highest P-score was for S00-T98, which mainly includes injuries, poisoning, and certain other consequences of external causes, with values of 12.0% in 2020, 21.2% in 2021, and 16.4% in 2022. A detailed examination revealed that the primary cause was self-harm, specifically suicide (Supplemental Material 1).
DISCUSSION
This study revealed significant variations in excess deaths depending on the method used to calculate expected deaths during the pandemic. Among 3 different methods, the approach based on the simple average showed the highest excess deaths. However, this method likely overestimated excess deaths because it failed to account for the rapidly aging population structure. In contrast, the age-adjusted average method, which considered changes in age structure when estimating expected deaths, may have underestimated excess deaths. This potential underestimation is due to the decreasing trends in age-specific mortality rates from 2015 to 2019 in most age groups. Therefore, both methods could potentially mislead the public about the actual extent of excess deaths. Consequently, the age-adjusted linear regression-based method, which considered both changes in age structure and mortality rate trends, appeared to be the most appropriate among the 3 methods.
The estimation of excess deaths has been conducted using a variety of methods. For instance, a study estimated excess deaths in Korea from March 2020 to June 2022 using a quasi-Poisson interrupted time-series model. This model used monthly mortality rates from January 2013 to February 2020 as the reference period [8]. The study reported P-scores of 2.1% for 2020, 6.0% for 2021, and 24.4% for 2022. These figures are generally higher than those from the current study, which found P-scores of 0.3% for 2020, 4.0% for 2021, and 20.7% for 2022, although slight differences were present in the study periods for 2020 and 2022 between the studies. In contrast, other studies focusing on the year 2020 reported no excess deaths [10] or even negative excess deaths [9]. Studies comparing international data have also shown a wide range of excess death estimates, depending on the methodology used [6]. This variability makes it challenging to determine the most valid method.
During the pandemic, P-scores for 2020 and 2021 were notably low but surged in 2022. This trend could be associated with Korea’s COVID-19 strategy, which enforced stringent measures to control the virus until the end of 2021. The strategy then underwent a marked shift with the emergence of the milder Omicron variant [15]. However, other factors may have played a more substantial role, considering that most East Asian countries experienced lower COVID-19 impacts than Europe and America in the first year of the pandemic, despite having diverse policies [16].
The comparison between Korea and Japan is particularly intriguing because Japan adopted a relatively relaxed policy, especially regarding COVID-19 testing [17], in contrast to Korea’s compulsory testing and contact tracing approach [18]. For instance, during the early stages of the pandemic, Japan primarily conducted polymerase chain reaction tests among patients who exhibited persistent symptoms or had risk factors [17]. Given the high proportion of asymptomatic cases [19], Japan likely experienced greater community transmission than Korea. In support of this view, a study of healthcare workers in Japan during mid-2020 revealed that 64% of seropositive cases had not been previously diagnosed with COVID-19 [20]. Despite Japan’s more lenient approach, the age-standardized mortality rate in Japan exhibited a decreasing trend in 2020 and only a slight increase in 2021 [21], which was similar to the trends observed in Korea.
One potential explanation for the favorable outcomes observed in East Asian countries could be that individuals in this region may possess a high level of pre-existing immunity [16]. Prior exposure to other coronaviruses, such as endemic human coronaviruses and severe acute respiratory syndrome coronavirus 1, might confer immunity to COVID-19, resulting in better clinical outcomes [22,23]. Indeed, genomic evidence of past coronavirus epidemics has been found in East Asian populations [24], and a considerable number of people in Southeast Asia are infected annually with previously undetected coronaviruses that originate from bats [25,26]. Therefore, East Asia may represent a region where individuals experience more frequent and repeated exposure to coronaviruses and other related viruses compared to other regions, potentially bolstering their resistance to COVID-19.
One of the key findings from the analysis of excess deaths by age group is the marked excess deaths among individuals in their 20s throughout the pandemic. Further examination of the causes of death indicates that most of these excess deaths were due to self-harm and suicide. Particularly noteworthy is that, while the total number of COVID-19 deaths was only 900 in 2020, the estimated excess deaths in the 20s age group was estimated to be 231. This finding is of critical public health importance, considering that most COVID-19 fatalities occurred among elderly individuals with multiple comorbidities, and the COVID-19 mortality rate for those in their 20s was nearly 0% [27]. Previous studies have consistently reported increases in suicide among adolescents and young adults in Korea [28,29] and other countries [30,31]. Further research is needed to understand the reasons behind the rise in suicide rates among young adults, which may be associated with the deterioration of mental and emotional health during the COVID-19 pandemic [32,33].
In the analysis of cause-specific excess deaths, the categories of “endocrine, nutritional, and metabolic diseases” (E00-E90) and “symptoms, signs, and abnormal clinical and laboratory findings, not elsewhere classified” (R00-R99) exhibited the greatest increases in excess deaths during the COVID-19 pandemic. The number of excess deaths and the P-score for these categories rose by more than fivefold from 2020 to 2022. While the cause of the surge in E00-E90 excess deaths remains uncertain, it may be associated with a general decline in healthy living during the pandemic, including increased obesity rates, reduced physical activity, unhealthy dietary habits, sleep disturbances, and mental stress [15,34]. The increase in excess deaths within the R00-R99 category calls for further investigation to ascertain whether this trend was directly tied to the COVID-19 pandemic or whether other factors contributed. This trend may also be linked to a rise in the number of elderly individuals dying at home rather than in healthcare facilities [35].
Conversely, excess deaths attributed to respiratory diseases (J00-J99) have consistently decreased, with the P-score falling by more than 10%. While this decline may be influenced by various factors, it is important to interpret this finding within the context of COVID-19 deaths. As respiratory disease deaths predominantly occur among the elderly, there may be a competing risk between COVID-19 and other respiratory diseases. If vulnerable elderly individuals primarily die from COVID-19, a corresponding decrease in excess deaths from other respiratory diseases would be an expected observation.
Our study has several limitations. First, we did not consider alternative methods for estimating excess deaths. While we deemed the age-adjusted regression equation-based method the most appropriate of the 3 methods used, more accurate approaches may exist. Notably, however, sophisticated statistical methodologies do not necessarily ensure the validity of excess death estimates. For instance, a study [6] that assessed global excess mortality in 2020 and 2021 using an ensemble of 6 models faced strong criticism for producing unrealistic figures for many countries [36]. Second, excess deaths do not account for the number of life years lost. Deaths occurring at very old ages typically result in fewer life years lost than those at younger ages. Therefore, additional analyses that focus on years of life lost could offer a more detailed perspective on the pandemic’s impact. Third, our analysis of cause-specific excess deaths was limited to several categories with enough deaths. This limitation was due to the small number of deaths across many categories.
In summary, this study revealed significant differences in excess deaths depending on the methodologies employed to determine expected mortality. This underscores the critical importance of incorporating adjustments for shifts in population age structure as well as mortality rate trends. Furthermore, while the annual data on excess deaths revealed a pronounced spike in 2022, age-specific analysis showed a consistent rise in excess deaths among individuals in their 20s throughout the duration of the pandemic. In anticipation of future infectious disease outbreaks, it is imperative to thoroughly evaluate the policies implemented during the COVID-19 pandemic. This should include the formulation of plans that address both the direct health consequences of the pandemic and the collateral damage to society that may arise from protective measures.
Supplemental Materials
Supplemental material is available at https://doi.org/10.3961/jpmph.24.254.
Trends in all-cause mortality, mortality for S00-T98 (injuries, poisoning, and certain other consequences of external causes), and suicide rate in individuals aged 20 to 29 from 2015 to 2022
Notes
Conflict of Interest
The authors have no conflicts of interest associated with the material presented in this paper.
Funding
None.
Author Contributions
Conceptualization: Lee DH. Data curation: Im SJ. Formal analysis: Im SJ. Funding acquisition: None. Methodology: Lee DH. Project administration: Lee DH. Visualization: Im SJ. Writing – original draft: Lee DH, Im SJ. Writing – review & editing: Lee DH, Shin JY, Im SJ.
Acknowledgements
None.
