Article Text
Abstract
Objective To investigate determining factors of happiness during the COVID-19 pandemic.
Design Observational study.
Setting Large online surveys in Japan before and during the COVID-19 pandemic.
Participants A random sample of 25 482 individuals who are representatives of the Japanese population.
Main outcome measure Self-reported happiness measured using a 10-point Likert scale, where higher scores indicated higher levels of happiness. We defined participants with ≥8 on the scale as having high levels of happiness.
Results Among the 25 482 respondents, the median score of self-reported happiness was 7 (IQR 6–8), with 11 418 (45%) reporting high levels of happiness during the pandemic. The multivariable logistic regression model showed that meaning in life, having a spouse, trust in neighbours and female gender were positively associated with happiness (eg, adjusted OR (aOR) for meaning in life 4.17; 95% CI 3.92 to 4.43; p<0.001). Conversely, self-reported poor health, anxiety about future household income, psychiatric diseases except depression and feeling isolated were negatively associated with happiness (eg, aOR for self-reported poor health 0.44; 95% CI 0.39 to 0.48; p<0.001). Using machine-learning methods, we found that meaning in life and social capital (eg, having a spouse and trust in communities) were the strongest positive determinants of happiness, whereas poor health, anxiety about future household income and feeling isolated were important negative determinants of happiness. Among 6965 subjects who responded to questionnaires both before and during the COVID-19 pandemic, there was no systemic difference in the patterns as to determinants of declined happiness during the pandemic.
Conclusion Using machine-learning methods on data from large online surveys in Japan, we found that interventions that have a positive impact on social capital as well as successful pandemic control and economic stimuli may effectively improve the population-level psychological well-being during the COVID-19 pandemic.
- COVID-19
- MENTAL HEALTH
- PUBLIC HEALTH
Data availability statement
No data are available. No additional data are available.
This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.
Statistics from Altmetric.com
STRENGTHS AND LIMITATIONS OF THIS STUDY
This is the first study that investigated the determinants of happiness during the COVID-19 pandemic using data on more than 25 000 individuals in Japan.
We used machine-learning methods to comprehensively investigate various factors that may affect people’s self-reported happiness during the COVID-19 pandemic.
Our surveys using a single question based on a 10-point Likert scale could capture only some aspects of happiness, which is a multifaceted concept.
Given the exploratory design of our research, we could not elucidate causal relationships between the candidate factors and individuals’ levels of happiness during the COVID-19 pandemic.
Our finding may not be generalisable to the populations in other countries or among people who are not experiencing the pandemic.
Introduction
The COVID-19 pandemic has impacted the physical, psychological and social aspects of our lives. Recent studies have reported the negative impacts of the pandemic on mental health across the world.1–3 For example, Zacher and Rudolph4 found that self-reported well-being of the population in Germany has decreased during the COVID-19 pandemic, and Wang and Tang5 reported 44.8% of study participants were depressed under the initial COVID-19 pandemic in China. The deterioration of mental health during the pandemic may be attributable to multiple factors including physical and social isolation from lockdowns imposed to control the outbreaks, and economic downturns due to the pandemic.6 7 Additionally, the psychological stress itself was likely to cause excessive preventive behaviour against the COVID-19, which could exacerbate population psychological state.8 Given that the worsening of population mental health could result in a rise in suicide rates,9 it is critically important for policymakers to design interventions that could effectively buffer the negative impact of the COVID-19 pandemic on the psychological well-being of the population.
However, designing interventions to maintain the well-being of the population during the pandemic is not straightforward because the general foundation of well-being is highly complicated.10 11 According to GENIAL framework, well-being is defined as positive psychological experience, which is impacted by sociocontextual factors beyond the control of each individual,11 and the influence of sociocontextual factors on well-being could be mediated by three major pathways: behavioural, psychological and physiological mechanisms.12 Therefore, in this study, we focused on determinant factors of happiness (ie, psychological well-being) for designing interventions to ameliorate major societal challenges during the dramatic changes in our lifestyle due to the pandemic.
Studies conducted before the COVID-19 pandemic found that sociodemographic factors (eg, academic attainment and income) are among the strongest predictors of happiness.13 14 However, with the dramatic alterations in our lifestyle caused by the COVID-19 outbreak, the perceptions and priorities that determine our happiness also may have changed. Although recent research has identified several factors regarding COVID-19 that could act as psychological stressors—such as longer duration of quarantines, financial loss and stigma1 15–17—few studies have investigated the determinants of happiness during the COVID-19 pandemic.4 16
To address this important knowledge gap, we investigated the factors determining happiness during the COVID-19 pandemic using machine-learning methods on data from large online surveys in Japan.
Methods
Data source and study participants
We retrospectively analysed the data of two sequential, large, cross-sectional, self-reported questionnaire online surveys: (1) the Japan ‘COVID-19 and Society’ Internet Survey (JACSIS) study and (2) the Japan ‘Society and New Tobacco’ Internet Survey (JASTIS) study. Both surveys, conducted by a large internet research agency (Rakuten Insight, Tokyo, Japan),18 included approximately 2.2 million panellists who are representatives of the Japanese population from the perspectives of age, gender and socioeconomic status.19 Using the unique individual number assigned to each participant, we were able to identify the subjects who participated in both the JACSIS and JASTIS studies, providing us data from before and during the pandemic for these participants.
In the JACSIS study, the questionnaires were randomly distributed to 224 389 panellists selected by each category of gender, age and prefecture (covering all 47 prefectures in Japan) among 2.2 million panellists on 25 August 2020. The target numbers of respondents for each gender, age and prefecture category were met on 30 September 2020 (target numbers for each gender, age and prefecture category had been determined in advance based on the population distribution in 2019; 28 000 respondents; and response rate of 12.5%). We finally identified 25 482 respondents—defined as ‘total respondents’—after excluding 2518 respondents with unnatural or inconsistent responses from 28 000 respondents using the original algorithm we developed. We used this cross-sectional data set, including 25 482 respondents, to investigate determinants of happiness during the COVID-19 pandemic as our primary analysis.
Similarly, 11 000 people responded to the questionnaires in the JASTIS study, which was conducted prior to the JACSIS study between 9 February and 2 March 2020 (before the COVID-19 pandemic), and we identified 7434 respondents who replied to both the JASTIS and JACSIS studies using the unique individual number assigned to each participant. Among the 7434 subjects, we defined 6965 individuals with ≥3 out of the 10-point self-reported happiness scale in the JASTIS study as ‘longitudinal respondents’ to investigate the determinants of declined happiness (a decrease by ≥2 on the 10-point self-reported happiness scale) during the COVID-19 pandemic. Given that the state of emergency over COVID-19 was first declared in Japan on 7 April 2020, we defined the COVID-19 pandemic period as beginning after April 2020 in this study. We used this longitudinal data set, including 6965 panellists, to investigate determinants of declined happiness during the COVID-19 pandemic.
Candidate determinants of happiness
We selected the candidate determinants of happiness from the JACSIS study data based on prior studies looking at the factors associated with well-being—a superordinate concept of happiness.10–12 Specifically, we used respondents’ demographics (ie, age,20 21 gender,13 22 body mass index), sociodemographic characteristics (eg, having a spouse,13 academic attainment,13 type of occupation,23 reception of public assistance,24 annual household income more than average in Japan, ≥5 000 000 yen (≥US$48 000 based on an exchange rate of 105 yen per US dollar as of August 2020)13), COVID-19-related sociodemographic characteristics (eg, anxiety about future household income, one-time receipt of the 100 000 yen (US$950) support allowance for the COVID-19 pandemic,25 trust in communities, online interactions, feeling isolated during the COVID-19 pandemic26) and health-related or psychological characteristics (chronic conditions other than psychiatric diseases (ie, hypertension, diabetes, asthma, coronary disease, stroke, chronic obstructive pulmonary disease (COPD), cancer),27 depression,28 psychiatric diseases other than depression (eg, schizophrenia),29 chronic pain,30 meaning in life,31 32 self-reported poor health,33 worsening self-reported health during the pandemic). The respondents with activities of daily living (ADL) intact were defined as those who had no problems walking, washing or dressing by themselves; nor with the usual activities in the EuroQoL 5 Dimensions 5-Level questionnaire used in the JACSIS study.34 All the candidate determinants are shown in table 1.
COVID-19 pandemic in Japan and related policies
We defined the areas most affected by the COVID-19 pandemic as the seven prefectures (Tokyo, Kanagawa, Saitama, Chiba, Osaka, Hyogo and Fukuoka) where the state of emergency over COVID-19 was first declared on 7 April 2020. The Japanese government established the support allowance system (100 000 yen per person) under the COVID-19 outbreak in April 2020.25
Outcome measures (happiness)
Our primary outcome was self-reported happiness. Participants were asked to rate their levels of happiness using a 10-point Likert scale, where higher scores indicated higher levels of happiness.
Statistical analysis
We first delineated the sociodemographic and health-related characteristics of respondents and showed the distribution of the self-reported happiness scale during the COVID-19 pandemic. We then explored the determinants of happiness as a binary outcome using the multivariable logistic regression model. For the logistic regression, the outcome variable was defined as high levels of happiness, defined as ≥8 on the 10-point self-reported happiness scale, with 10 being the best. To address the potential collinearity of the variables included in each model, we calculated the variance inflation factor (VIF) using the car package.35 36
Next, to explore determinants of happiness as a 10-point discrete outcome without model misspecification, we developed two machine-learning-based models (random forest and gradient-boosted decision tree) as a non-parametric approach that can accommodate non-linearities and interactions without prespecification of a particular parametric model. For machine-learning-based models, the outcome measure was defined as the 10-point happiness scale of each responder. Random forest is an ensemble of decision trees using bootstrap aggregation and random feature selection.37 38 Gradient-boosted decision tree is an additive model of decision trees estimated by gradient descent.38 39 To identify the best tuning hyperparameters, we used a grid search strategy using ranger, xgboost and caret packages for the random forest and gradient-boosted decision tree models.40–42 Both 10-fold cross-validation and out-of-bag estimation techniques were used to minimise potential overfitting and improve the internal validity of our machine-learning-based models. We estimated the contribution of each candidate determinant to machine-learning-based models based on the variable importance computed by the random forest and the gradient-boosted decision tree. The variable importance is a scaled measure to have a maximum value of 100. Additionally, we calculated Shapley Additive Explanations (SHAP) values to assess the detailed contribution of each determinant factor on the regression output by using the SHAPforxgboost package.43 44 Higher SHAP values represent higher possibilities that the given determinant factor contributes to higher level of happiness. If a variable is binary, the high feature value of the variable means that respondents answered ‘Yes’ to the question.
Sensitivity analyses
We conducted several sensitivity analyses. First, we used different thresholds for high levels of happiness for multivariable logistic regression: (1) ≥7 and (2) ≥9 on the 10-point scale. Second, to minimise an issue of sample selection bias (eg, younger people may be more likely to enrol in an online study), we developed a weighted multivariable logistic regression model (ie, the multivariable logistic regression model in our primary analysis with inverse probability weighting added) (see online supplemental appendix 1 for details). Third, to address the possibility that our perceptions and determinant factors of happiness may vary depending on age,13 20 45 we described the average of the happiness scale by age and conducted multivariable logistic regression analyses stratified by age: young (15–39 years), middle-aged (40–59 years) and elderly (60–79 years) respondents. Fourth, to investigate whether the determinants of happiness differ by other characteristics of the respondents,22 28 29 we also stratified analyses by gender and history of psychiatric diseases, respectively. Fifth, we investigated the determinants of declined happiness during the COVID-19 pandemic using the multivariable logistic regression model for longitudinal responders. A decline in happiness was defined as a decrease by ≥2 on the 10-point happiness scale during the COVID-19 pandemic. Sixth, to evaluate the association between age and declines in happiness levels, we described the levels of decline in happiness by age and stratified analyses by age.
Supplemental material
A p value <0.05 was considered statistically significant. All analyses were performed using R version 4.0.2. (The R Foundation for Statistical Computing).46 We created the SHAP summary plot using the SHAPforxgboost package,44 other figures using the ggplot2 package.47
Patient and public involvement
No patients were involved in setting the research question or the outcome measures, nor were they involved in developing plans for the design or implementation of the study. No patients were asked to advise on interpretation or writing up of results. There are no plans to disseminate the results of the research to study participants or the relevant patient community. Patient consent was not required for the study.
Results
Characteristics of study participants
Among 25 482 respondents (total respondents), 12 809 (50%) were female, and the median age was 49 years. The median happiness score during the pandemic was 7 (IQR 6–8), and 11 418 (45%) were defined as having high levels of happiness (≥8 on the 10-point Likert scale) (table 1). Among 6965 respondents (longitudinal respondents), 781 (11%) experienced a decline in happiness levels during the pandemic. The distribution of the happiness scale during the pandemic and changes in the score from the prepandemic assessment are shown in online supplemental figure 1.
Determinants of happiness
In the multivariable logistic regression, we found that meaning in life (adjusted OR (aOR) 4.17; 95% CI 3.92 to 4.43; p<0.001), having a spouse (aOR 2.22; 95% CI 1.93 to 2.56; p<0.001), trust in neighbours and communities (aOR 1.59; 95% CI 1.43 to 1.77; p=0.02) and female gender (aOR 1.51; 95% CI 1.42 to 1.60; p<0.001) were associated with higher odds of having high levels of happiness. On the other hand, self-reported poor health (aOR 0.44; 95% CI 0.39 to 0.48; p<0.001), anxiety about household income in the future (aOR 0.60; 95% CI 0.55 to 0.65; p<0.001), psychiatric disorder diseases other than depression (aOR 0.65; 95% CI 0.53 to 0.80; p<0.001) and feeling isolated (aOR 0.66; 95% CI 0.60 to 0.72; p<0.001) were associated with lower odds of having high levels of happiness. We did not observe a statistically significant association between age and happiness (aOR 1.00; 95% CI 1.00 to 1.00; p=0.24) (figure 1 and online supplemental table 1). We found no variables with high VIF (>10) among parameters used in the multivariable logistic regression model, indicating that collinearity among the candidate determinants is not an issue for our analyses (online supplemental table 2).
In our machine-learning models (random forest and gradient-boosted decision tree), the variable importance, indicating the contribution of each candidate determinant, was quantified (online supplemental figure 2). Consistent with the results using logistic regression, meaning in life, self-reported poor health and having a spouse were the common and important determinants of happiness during the pandemic in both models. To identify the detailed contribution of each feature to the model, we depicted the SHAP summary plot of the top 20 variables of the gradient-boosted decision tree model (figure 2). This plot suggested that meaning in life, having a spouse and female gender had positive contributions to the prediction of happiness. In contrast, self-reported poor health, anxiety about household income in the future and feeling isolated had a negative impact on the prediction of happiness.
Sensitivity analyses
When using different thresholds for logistic regression to define high levels of happiness, we found similar variables as determinants of happiness (online supplemental table 1). The weighted multivariable regression model showed results that were overall consistent with our main analysis (online supplemental table 3). With regard to age as the effect modifier of happiness, we found a U-shaped relationship between the average of the self-reported happiness scale and age (online supplemental figure 3) and identified similar determinant factors of happiness among each age group (ie, young, middle-aged and elderly respondents) (figure 3, online supplemental table 4 and figure 4). However, some factors (ie, ADL intact and having any children) may have different impacts on happiness across age groups (figure 3). The results of stratified analyses indicated that observed relationships do not vary substantially by gender and history of psychiatric diseases (online supplemental table 5 and 6 and figure 5 and 6). We found poor health and feeling isolated were the important determinants of a decline in happiness during the COVID-19 pandemic, whereas meaning in life and trust in communities may calm deteriorating happiness during the pandemic (online supplemental table 7 and figure 7). Although the decline in happiness during the COVID-19 pandemic varied with age and seemed to be less serious among elderly respondents than young ones, as shown in online supplemental figure 3, there was no systemic difference in the patterns as to determinants of declined happiness during the pandemic after the stratification by age (online supplemental table 8 and figure 8).
Discussion
Using the machine-learning methods and data collected through large-scale, nationwide surveys conducted before and during the COVID-19 pandemic, we found that meaning in life and social capital (eg, having a spouse and trust in communities) are the strongest positive determinants of happiness during the pandemic. On the other hand, poor health, anxiety about future household income and feeling isolated are the key factors negatively associated with happiness. These associations were consistent across different groups of age, gender and history of psychiatric diseases. Although poor health, anxiety about future household income and isolation were determinants of the decline in happiness during the pandemic, meaning in life and social capital may be able to alleviate deteriorating happiness during the pandemic. These findings should be informative for policymakers to design policies that can minimise the negative impact of the COVID-19 pandemic on the psychological well-being of the population.
Our findings indicated that uncertainty about the future due to the COVID-19 pandemic could be a key cause of low levels of happiness. A recent study found that elderly respondents—who are likely to have greater mindfulness and present-moment perspectives—seemed to have experienced less negative impact on psychological well-being during the pandemic compared with younger people,21 indicating that focusing on the present moment rather than excessive consideration of the future may retain psychological well-being in this unprecedented crisis. Paying attention to the present moment is an essential part of mindfulness, which has the potential to help people live happier as well as reduce mental health problems.48 Accordingly, to cope with future uncertainty from the psychological perspective, meaning in life could be a key factor of happiness during the time when uncertainties about the future are high. Indeed, the positive association between meaning in life and positive psychological well-being has long been known and has encouraged psychiatrists to take salutogenic approaches for their patients’ mental health—focusing on factors related to happiness, instead of factors directly causing diseases.31 As one of the salutogenic approaches, interventions that give people an opportunity to reflect on their life’s meaning, such as life-crafting interventions, could be a potential solution for improving happiness in such an unusual crisis with future uncertainties.49 On top of that, providing counselling on the basis of second wave positive psychology—confronting, accommodating or transforming negative emotions and finding another deep joy in such an unprecedented nightmare based on the reality that life itself is full of evil and suffering—was recently proposed to achieve sustainable well-being for individuals from psychological perspectives.50
Social capital was also identified as a strong determinant of happiness during the COVID-19 pandemic in our study. Given that physical isolation and lockdowns imposed to control the COVID-19 outbreak in many regions of the world could decrease social interactions and increase psychological isolations, we need urgent measures to reinforce social interactions without increasing the risk of infections (eg, social interactions through online networks, social interactions in outdoor settings while maintaining social distancing and wearing masks). A study conducted before the pandemic reported that improving components of individual social capital (eg, trust in communities and social interaction) could enhance both individual well-being and community social capital, which would subsequently create a virtuous circle that improves individual social capital.51 Moreover, although poor health status was negatively associated with happiness during the pandemic in our analyses, prior studies have found that higher levels of social capital are associated with improved health,52 and potentially lead to a higher uptake of preventive measures for infections (eg, wearing masks and washing hands) and social distancing that could reduce the transmission of the virus during the COVID-19 pandemic.53 Indeed, a study by Ye et al reported people who shared their activities with others or interacted with friends were more likely to take preventive measures for the COVID-19.54 However, as misinformation regarding the COVID-19 by non-experts through online networks can be disseminated and sometimes affect negatively the population mental health,55 the reliance only on social networking service for social interactions may be a double-edged sword in such a special time.
Therefore, while it is obviously important to introduce psychological counselling (eg, second wave positive psychology) for each individual, interventions for improving social capital as well as infection control and economic stimulus policies during the pandemic may effectively have positive effects on happiness. Of note, to improve population-level psychological well-being, it is important for policymakers to promote people’s voluntary and collective activities, ideally by developing asset-based communities and schemes, rather than by directly enforcing social capital increases.56
Limitations
Our study has several limitations. First, we used the machine-learning methods to comprehensively investigate various factors that may affect people’s self-reported well-being during the COVID-19 pandemic, and our study was not designed to evaluate causal relationships. Therefore, it remains unclear whether improving important determining factors identified in this study could actually lead to better well-being. Second, it is possible that our surveys using a single question based on a 10-point Likert scale captured only some aspects of happiness, psychological well-being which is a multifaceted concept.10–12 However, as we considered various factors, including indicators of material well-being, eudaimonia and ill-being as our candidate determinants of well-being during the COVID-19 pandemic,11 we believed the strength of our study was exhaustive analyses of determinants of happiness based on multiple domains. Third, given that we used data collected through internet surveys conducted in Japan, it is possible that respondents of the surveys may differ from the general population in meaningful ways (there is a risk of sample selection bias). However, the results of the sensitivity analysis using weighted analyses accounting for the probability of responding to the surveys were not qualitatively different from our main analyses (unweighted analyses), supporting the robustness of our analyses. Fourth, our study was conducted in Japan, and therefore our findings may not be generalised to populations in other countries. Lastly, our study was not pre-registered in an authorised clinical trial registry (eg, UMIN-CTR), but our study protocol was approved by an institutional review board, which did not require us to pre-register our study in such an authorised clinical trial registry beforehand.
Conclusions
Using nationwide online surveys before and during the COVID-19 pandemic in Japan, we identified that meaning in life and social capital components were the strongest positive determinants of happiness. On the other hand, poor health, anxiety about future household income and feeling isolated were key factors that negatively impacted happiness. These findings should be informative for policymakers in designing policies and interventions that could effectively improve the psychological well-being of the population during the COVID-19 pandemic.
Data availability statement
No data are available. No additional data are available.
Ethics statements
Patient consent for publication
Ethics approval
The study protocol was approved by the Institutional Review Board of the Osaka International Cancer Institute (No 20084).
References
Supplementary materials
Supplementary Data
This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.
Footnotes
Twitter @Osawa_Itsuki, @GtoDr, @TakahiroTabuchi, @KogaHayami, @Yusuke_Tsugawa
Contributors IO and TT had full access to all the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. IO, TG, TT and YT conceived and designed the study. IO, TG, HKK, TT and YT interpreted the data, critically revised the manuscript for important intellectual content and approved the final manuscript. IO, TG and YT drafted the initial manuscript. YT supervised the study. IO, TG and TT are the guarantors of the study.
Funding This study was funded by the Japan Society for the Promotion of Science (JSPS) KAKENHI Grants (grant numbers: 17H03589, 19K10671, 19K10446, 18H03107, 18H03062), the JSPS Grant-in-Aid for Young Scientists (grant number: 19K19439), Research Support Program to Apply the Wisdom of the University to Tackle COVID-19 Related Emergency Problems, University of Tsukuba, and Health Labour Sciences Research Grant (grant numbers: 19FA1005, 19FG2001). YT was supported by the National Institutes of Health (NIH)/NIMHD Grant (R01MD013913) and NIH/NIA Grant (R01AG068633) for other work not related to this study.
Disclaimer The findings and conclusions of this article are the sole responsibility of the authors and do not represent the official views of the funders/sponsors.
Competing interests None declared.
Patient and public involvement Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.
Provenance and peer review Not commissioned; externally peer reviewed.
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.