I. Introduction
Is regular physical exercise important for educational achievements? This question relates to key debates in educational policy. In the United States, for example, there is a growing concern that physical education is no longer considered a critical element of schooling: In some areas of the country, half the students report having no physical education in an average week (De La Cruz 2017). This lack of activity may contribute to bad health and obesity and is a major concern in all OECD countries (OECD 2017).
Physical activity may also be important for academic performance through enhanced psychological health, habit formation, and changes in the brain, supporting cognitive function and learning (Centers for Disease Control and Prevention [20…
I. Introduction
Is regular physical exercise important for educational achievements? This question relates to key debates in educational policy. In the United States, for example, there is a growing concern that physical education is no longer considered a critical element of schooling: In some areas of the country, half the students report having no physical education in an average week (De La Cruz 2017). This lack of activity may contribute to bad health and obesity and is a major concern in all OECD countries (OECD 2017).
Physical activity may also be important for academic performance through enhanced psychological health, habit formation, and changes in the brain, supporting cognitive function and learning (Centers for Disease Control and Prevention 2010; Biddle and Asare 2011; Booth, Roberts, and Laye 2012; Reiner et al. 2013; Khan and Hillman 2014; Mura et al. 2015; Wiebe and Karbach 2017; Young et al. 2018). At the same time, it may worsen academic performance by directing time and attention away from studying. While there is research suggesting a positive association between physical activity and academic performance, the evidence is mixed, and even more importantly, very little is known about the causal pathways of the association (Grissom 2005; Aaltonen et al. 2016).
In this paper, we report the results of a large randomized controlled trial that removed a barrier to exercise by providing free gym cards to university students in Bergen, Norway. We measured the effect of this intervention on the students’ academic performance relative to a control group that was not offered a free gym card. We combine complete and precise administrative data on both gym attendance and academic performance with subjective survey data (baseline and follow-up) on self-control, lifestyle habits, study hours, and happiness. The administrative data on academic performance have zero attrition in the main variable of interest, and the randomized controlled trial therefore offers a clean identification of the causal effect of providing a free gym card on academic performance. The administrative data on gym attendance and the detailed survey data (with less than 5% attrition) further allow us to provide evidence on the underlying mechanisms of the relationship between removing a barrier for exercise and academic performance. Our analysis builds on a preanalysis plan registered after the posttreatment data collection was completed but before the researchers had access to the data.1
In our study, removing a barrier for physical activity by providing a free gym card generates a strong and significant increase in exercise and an improvement in academic performance. We find that the treated students completed on average 0.15 standard deviations more courses than students in the control group, which is driven by them dropping fewer classes and failing fewer exams. The finding is robust to the inclusion of a rich set of control variables and multiple-hypothesis testing. The increase in completed courses did not have a negative effect on grade average. Further analysis shows that the increase in exercise was critical in enhancing academic performance, but also highlights that there is an optimal level of exercise for promoting academic performance.
We provide two sets of evidence on the mechanism driving the effect of exercise on academic performance. First, we show that the intervention has strong positive effects on perceived self-control and lifestyle, which are well known to be important dimensions for learning. Second, a prespecified heterogeneity analysis of the treatment effect, based on whether the students are below the median in terms of perceived self-control, lifestyle, happiness, and study hours, shows that there are consistently very strong treatment effects on academic performance for those who struggle in these dimensions at baseline. The students who struggled in all four dimensions have an estimated treatment effect of 0.5 standard deviations in completed courses. Hence, the evidence suggests that our intervention is a powerful tool for human-capital accumulation, by causing positive changes in self-control and lifestyle that ultimately lead to an improvement in academic performance.
The findings are consistent with previous research showing that exercise promotes a healthier lifestyle in terms of sleep and dietary habits (Driver and Taylor 2000; Youngstedt and Kline 2006; Fleig et al. 2011) and strengthens executive functions such as self-control, flexible thinking, working memory, and planning (Oaten and Cheng 2006; Hillman, Erickson, and Kramer 2008; Smith et al. 2010; Audiffren and André 2015; Berryman, Pothier, and Bherer 2017; Salas-Gomez et al. 2020). There is also evidence suggesting that practicing self-control may increase self-efficacy (Bandura 1977; Muraven 2010) and that increased self-control in one domain may carry over to other domains (Oaten and Cheng 2006; Duhigg 2012; Schilbach 2019). Finally, there is a literature documenting that lifestyle and self-control affect academic performance (Ariely and Wertenbroch 2002; Duckworth and Seligman 2005; Short et al. 2013; Kim and Seo 2015; Donnelly et al. 2016; Hysing et al. 2016). Our study connects and integrates these different literatures by providing evidence of an increase in exercise improving academic performance through changes in self-control and lifestyle.
We also provide some suggestive evidence on which barriers were removed by providing the students with a free gym card. At baseline, about 20% of participants reported that they did not exercise because they found the gym card too expensive, and about 13% reported that they did not have time to exercise. We show that the treatment effect on gym visits is larger for the group that has financial constraints but no time constraints than for the rest of the sample, and smaller for the group that has time constraints. However, a large majority of students do not report having financial or time constraints at baseline, which suggests that the free gym card primarily removed psychological barriers to exercise. This is in line with the fact that many participants reported at baseline that they did not exercise at the gym because they were lazy, which may be interpreted as a sign of procrastination. The existing literature suggests different mechanisms that can explain why the free gym card contributed to removing psychological barriers to exercise. Having a free gym card may have increased the disutility from not exercising by removing an excuse for not going to the gym (Exley 2016; Lepper 2022) and by making salient the injunctive norm that one should go to the gym and exercise (Bordalo, Gennaioli, and Shleifer 2012, 2013). It may also have represented a temporal landmark that gave the students a fresh start and increased the motivation for goal setting (Clark et al. 2020).
An important issue for any field study is the external validity of the findings, which rests on the selection of study participants, attrition, naturalness of the study environment, and the possibility for scaling (the SANS conditions; List 2020). In particular, policymakers might wonder whether they can expect a similar treatment effect if the intervention is implemented at scale (Al-Ubaydli, List, and Suskind 2020; Al-Ubaydli et al. 2021), and whether it passes a benefit-cost test. We will argue that the present study indeed satisfies the SANS conditions. In terms of selection of participants, the students in our study span two different institutions and all major study programs. Hence, they can be considered largely representative of the general student population in Norway in terms of relevant observables. Further, we have no attrition in academic performance (which is measured using administrative data) and minimal attrition in the follow-up survey. The study was also implemented in a natural environment and with a relevant time frame. Hence, the finding that providing free gym cards to students who do not exercise regularly improves their academic performance, particularly for struggling students, is likely to replicate and generalize to other student populations.
We also believe that it should be quite easy to scale up the treatment, since providing a free gym card does not require significant human involvement and is unlikely to impose any congestion in the gyms. In a simple back-of-the-envelope calculation, we can compare the costs of providing a free gym card with the additional income that the respective education institutions get from the Norwegian government because of the treated students completing more study points. The higher education institutions in Norway are on average paid 2,350 NOK for each completed study point.2 We find that providing a free gym card causes an average increase of 1.74 study points in the treatment group, which translates into an estimated increase in income for the education institutions of 4,089 NOK per student. If we focus on the students who struggled at the baseline, we find an increase of 5.93 study points and correspondingly an estimated increase in income for the educational institutions of about 14,000 NOK. In comparison, the market price of a gym membership for one semester in Norway is about 1,800 NOK. Hence, given the estimated treatment effect, the increase in income for the educational institutions is greater than the cost of providing the gym card.
This back-of-the-envelope calculation is not meant to represent a comprehensive benefit-cost analysis, and we would like to highlight some important issues that need to be addressed in an overall policy evaluation of this type of intervention. First, as required in scientific studies we had to ask subjects for informed consent, which implies that we have a self-selected sample that might be more positively inclined to exercise (relative to the remaining student population without a gym membership). However, this is a genuine uncertainty of any finding from a randomized controlled trial that adheres to common ethical guidelines. Second, winning a free gym card as part of a research study might be perceived differently compared to simply receiving a free gym card from the educational institution. Third, a general question for this type of intervention and the broader benefit-cost analysis is whether to consider a universal or targeted approach. A universal approach, in line with what is common in the United States, would be to offer a free gym card to all students, while a targeted approach would be to try to screen the students and offer a free gym card to those who are identified as struggling in their studies.
Moving beyond the population of students in higher education, there is growing evidence suggesting that physical activity and exercise is an efficient way to enhance neurocognitive functions throughout life (Berryman, Pothier, and Bherer 2017), which means that the type of intervention studied in this paper may have broad applicability. The insights may be especially important for child development. The World Health Organization (2019) argues that there is a global epidemic of childhood inactivity, and that the benefits of physical activity are particularly high in early childhood (under 5 years of age). In this age group, there is rapid physical and cognitive development and extensive habit formation, potentially with long-term implications. The report argues that early childhood lifestyle behaviors may have lasting effects on physical activity throughout the life course, which suggests that exercise habits should be promoted and preserved early in life (Aarts, Paulussen, and Schaalma 1997). Interestingly, the focus of the report is on the health implications of physical inactivity, while the present study provides evidence suggesting that increased physical activity in early childhood also may have important implications for human-capital accumulation.
Indeed, a growing number of studies have provided evidence suggesting that physical exercise strengthens executive functions in children (Álvarez-Bueno et al. 2017), and it is well established that executive functions, including self-control, are crucial for the formation of academic skills and more broadly also for behavioral and emotional regulation in childhood and adolescence (Berthelsen et al. 2017). In line with this, there are several studies suggesting that there is a positive association between physical activity and academic performance among children (Haapala 2012). Further, a number of studies have shown that socioeconomic status is a strong predictor of physical activity among children (Love et al. 2019). The removal of barriers for physical exercise at an early age may thus be an important dimension in policies aiming at reducing the socioeconomic gradient of human-capital formation in childhood (Heckman 2006).
Increasing physical activity in childhood is likely to involve parents, the schools, and the broader community (Mitchell 2019). Parents are important in terms of both facilitating physical activity for young children outside of school and creating lifestyle habits. They are likely to face many of the same barriers as the students in our study (financial, time, psychological), and thus there are several possible removal-of-barriers strategies that may foster increased physical activity for young children through their parents. In particular, the removal of financial barriers to exercise may be a key strategy for reaching low socioeconomic status parents, and may also initiate some of the same psychological mechanisms discussed in the present study. In schools, a question is whether to reallocate time from academic classes to physical exercise. The present study suggests that this trade-off is more subtle than sometimes recognized, since a certain level of physical exercise is likely to be beneficial for learning. Further, the removal-of-barriers approach may also be important at the school, for example to ensure that all children have the equipment needed to take part in physical activity. Finally, at the community level, the removal-of-barriers approach may involve making parks and playgrounds more accessible for families with small children.
Our paper contributes to several literatures.3 Most importantly, it offers novel causal evidence to the literature studying the relationship between physical activity and academic achievements (Rasberry et al. 2011; Haapala 2012; Howie and Pate 2012; Singh et al. 2012; Felfe, Lechner, and Steinmayr 2016; Álvares-Bueno et al. 2017), including the nonexperimental literature in economics reporting positive effects on educational achievement from participation in sports activities in adolescence (Lipscomb 2007; Pfeifer and Cornelissen 2010; Stevenson 2010). Haapala (2012) identified nine randomized or quasi-randomized controlled trials on the relationship between physical activity and academic performance for the period 1966–2011, all with small samples of children or adolescents and challenging designs.4 He concludes (p. 153), “Physical training may improve academic performance in children and adolescence, but evidence is lacking.” We show that exercise causally improves academic performance among university students, and especially among the students who struggle with lifestyle and self-control at baseline, consistent with the findings in a correlational study of 12,000 university students in Norway (Hayley et al. 2017).
Turning to the experimental literature on behavioral interventions, there is strong evidence that financial incentives can affect behavior at least while these incentives are present (Charness and Gneezy 2009; Gneezy, Meier, and Rey-Biel 2011; Volpp et al. 2011; Acland and Levy 2015; Babcock et al. 2015; Royer, Stehr, and Sydnor 2015; Carrera et al. 2017; Homonoff, Willage, and Willén 2020). However, on the basis of existing evidence from randomized controlled trials, there has been less success in using direct financial incentives to increase academic performance (Angrist, Lang, and Oreopoulos 2009; Fryer 2011; Levitt, List, and Sadoff 2016). Scott-Clayton (2011) and Garibaldi et al. (2012) report more positive results in the educational context using quasi-experimental approaches. A major challenge with direct incentives to improve academic performance is that many policymakers and educators object to paying students to study, arguing that this could crowd out the existing intrinsic motivation to do so. Our approach avoids this criticism by using an indirect approach. We remove the barrier to gym exercise for students and show that it has large positive spillover effects on students’ academic performance. Our findings show that the rate of return of behavioral interventions may be greater than a narrow perspective may indicate (see also Bjorvatn, Ekström, and Pires 2021). Interventions may not only affect the investments in the targeted good (exercise), but they may also produce indirect spillover effects in other types of investment goods (education).
Finally, our paper contributes to the literature studying the importance of noncognitive skills for human-capital investments (Cunha et al. 2006; Heckman, Stixrud, and Urzua 2006; Heckman, Pinto, and Savelyev 2013; Alan and Ertac 2018; Alan, Bonerva, and Ertac 2019). Noncognitive skills may promote academic performance directly, for example by helping students persist with exam preparation during intense study periods. They may also have an indirect effect by helping students maintain a lifestyle that maximizes academic potential. The consensus in the literature is that self-control, broadly defined, is an important factor for academic performance, and perhaps even more important than cognitive skills (Duckworth and Seligman 2005; de Ridder et al. 2012; Heckman and Kautz 2012). Increased self-control seems directly linked to improved focus and concentration (facilitating regulatory behaviors that put aside distractions and unhelpful thoughts) and to healthy lifestyle choices.5 We contribute to this literature by showing that exercise induces improved perceived self-control and a healthier lifestyle, which ultimately causes improvements in academic performance. Several mechanisms may explain our findings, including self-signaling (e.g., Benabou and Tirole 2004) and habit formation.
The paper proceeds as follows. Section II describes the experiment in more detail, section III outlines the hypotheses and the empirical strategy, section IV presents the main results, section V discusses possible mechanisms, and section VI concludes. Supplementary analysis and material are reported in appendixes A–D.
II. The Experiment
We recruited undergraduate students at the campuses of the University of Bergen (UIB) and Bergen City College (HIB), both located in Bergen, the second largest city in Norway. The students were randomly assigned to a treatment or a control group, where the treatment provides free membership at the student gym (SIB). Gym membership is not free at universities in Norway, and the present study thus investigates a different environment than a typical US university, where most previous studies of exercise were conducted.
A. Recruitment
The recruitment took place across the campuses of the UIB and HIB. Students who were interested in participating received a booklet containing an information sheet, a consent form, and a baseline survey. Each participant was asked to sign the consent form and upon returning the booklet was paid 100 NOK (equivalent to $12.50 at the time of the experiment). We informed participants that everyone who fulfilled the stated requirements would have at least a 40% probability of winning a free gym card at SIB for the spring term of 2016. The main requirements were that they be registered as students at UIB or HIB and not presently members of any gym. The regular price of a gym card at SIB was 1,100 NOK in 2016; for comparison, the compulsory membership fee to the student welfare organization is 550 NOK. Participants were also informed that they would have the opportunity to participate in paid follow-up surveys via text messages or emails (see app. C for the information sheet and baseline and follow-up surveys).
B. Timeline
Table 1 gives an overview of important dates and events in the study. The semester started the second week of January. Recruitment for the experiment was done in the first week of February 2016, which has two advantages. First, 30%–40% of the student population at both institutions pays the membership fee at the student gym in each semester. By targeting students who were not yet members of the gym 3 weeks into the semester, we were able to recruit participants who are less likely to have a regular exercise regime. Second, the deadline for course registration was February 1, which implies that the announcement of the study, participation, and treatment assignment cannot influence the courses for which students chose to register.
**Table 1. **
Timeline
| Date | Event | January 11, 2016 | February 1, 2016 | February 1–8, 2016 | February 10–12, 2016 | February 15, 2016 | February 16, 2016 | March 21–28, 2016 | May 2–June 17, 2016 | May 9, 2016 | June 17, 2016 | June 30, 2016 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Start of spring semester | ||||||||||||
| Deadline for course registration | ||||||||||||
| Recruitment of participants and baseline survey | ||||||||||||
| Screen out preexisting SIB members and randomize to treatment | ||||||||||||
| Inform participants on whether they won gym card at SIB or not | ||||||||||||
| Possible to pick up gym card and begin exercise | ||||||||||||
| Easter break | ||||||||||||
| Exam period | ||||||||||||
| Follow-up survey sent via email and SMS to participants | ||||||||||||
| Academic semester ends | ||||||||||||
| Gym card expires |
The students who won the free gym card could collect it from February 16. Since the academic semester ended June 17, the relevant exercise period is roughly 4 months. The academic semester is divided in two parts: the teaching period (January–April) and the exam period (May–June). The follow-up survey was distributed to participants in the middle of May.
C. The Sample
In total, 817 students signed the consent form and returned the booklet. Six students filled in the booklet twice; we only included the first booklet turned in. We excluded 32 students who already had a gym card at SIB according to the administrative records. Another two students were registered at other educational institutions than UIB and HIB, for which we did not have administrative data on academic performance, and one student withdrew from the study after handing in the booklet. After these exclusions, we were left with 782 participants who were included in the study.
We randomly selected 400 students out of the 782 to receive a free gym card, and they constitute the treatment group.6 The remaining participants constitute the control group. All participants in the treatment group received an email and a text message on February 15, in which we informed them that they had been randomly selected to receive a gym card that they could pick up at one of SIB’s gyms. After the randomization had taken place, one participant from the control group withdrew from the study, and three other participants, one control and two treated, turned out not to be registered as students at either UIB nor HIB. We base the analysis on the remaining 778 participants.
Table 2 presents the background characteristics, measured at baseline, for the full sample (col. 1), and separately for the treatment group and the control group (cols. 2 and 3). In column 4, we show that all background characteristics are balanced between the treatment group and the control group. Participants were on average 22 years old and in their second year at the university; about 50% of the sample is female and about 50% studies at each of UIB and HIB.
**Table 2. **
Balance Test
| | Overall | Control | Treatment | p-Value (Treatment-Control) | | (1) | (2) | (3) | (4) | Age | Female (1/0) | Year of study | UIB (1/0) | Self-control (1–10) | Lifestyle | Study hours | Happiness (1–10) | Registered study points | Observations | | | —–– | —–– | ——— | —————————–– | | —– | —– | —– | —– | — | ––––––– | ———–– | ———– | ———————– | ——— | ———– | –––––––––– | ———————– | ———— | | 22.258 | 22.316 | 22.204 | .640 | | .533 | .550 | .518 | .366 | | 2.008 | 2.034 | 1.982 | .561 | | .519 | .500 | .538 | .294 | | 5.814 | 5.751 | 5.875 | .249 | | .004 | −.025 | .032 | .703 | | 22.599 | 22.884 | 22.330 | .475 | | 7.340 | 7.287 | 7.391 | .412 | | 33.838 | 33.868 | 33.809 | .955 | | 778 | 380 | 398 | |
Self-control, lifestyle, study hours, and happiness are prespecified self-reported background variables used in the discussion of mechanisms. “Self-control” is an index based on four questions taken from a larger set of questions that assess the stated willingness to wait in the preference survey module on time discounting by Falk et al. (2016): (1) “I am a person who often acts too hastily.” (2) “I have difficulties resisting temptations.” (3) “I have a tendency to procrastinate on things, even though it would be best to take care of them quickly.” (4) “I am a person who follows my plans.” Each question was answered on a scale of 1–10, expressing the extent to which the participant agreed with the statement (1 = completely disagree, 10 = completely agree). The index is the rescaled average of the responses to questions, with a higher value corresponding to having more self-control. As shown in table A1, the components of the index are significantly correlated.
“Lifestyle” is the combination of the responses to three questions in the baseline survey, where each response is standardized to have a zero mean and a standard deviation of 1. The three questions are as follows: (1) “How many days last week did you go to bed after midnight?” (2) “How many days last week did you feel tired/unrested?” (3) “How satisfied are you currently with your health?” (The scale was 1–10; 1 = very unsatisfied, 10 = very satisfied.) On average, students report at baseline going to bed after midnight and being tired more than 3 days every week, with more than 30% of the students reporting this for at least 5 days every week; they also self-report being somewhat happy with their health. The index is the sum of the responses to these three questions with the values of questions 1 and 2 rescaled such that a higher number means going to bed earlier, feeling less tired, and being more satisfied with your current health. Hence, a higher value on the index serves as a proxy for a healthy lifestyle. In table A1, we show that the components in the lifestyle index are significantly correlated at baseline, and are also significantly correlated with most of the self-control questions.
“Study hours” is the self-reported number of hours studied, including attended lectures the week prior to the baseline survey. On average, participants reported having spent 23 hours studying that week. “Happiness” is the answer to the question, “How satisfied are you currently with your life in general?” (The scale again was 1–10; 1 = very unsatisfied, 10 = very satisfied.)
“Registered study points” is the sum of course credits that a student signed up for at the beginning of the semester. On average students registered for courses worth 34 ECTS (European Credit Transfer and Accumulation System) credits, which indicates that students sign up for slightly more course credits than the 30 ECTS credits that correspond to full-time study (likely reflecting some uncertainty among students about which courses to focus on at the beginning of the semester). Table A2 provides an extended version of the balance table, including variables that were not part of the preanalysis plan. We show that academic performance in the fall semester prior to the intervention balances across treatments.
D. Data Sources
We draw on four data sources in the analysis. First, from the baseline survey we have the background characteristics of the participants, including gender, age, and year of study. In addition, to further explore the underlying mechanisms, we use a battery of questions related to self-control, lifestyle habits, study hours, and happiness at baseline.
The second data source is the scanner data on each occasion a participant swiped their gym card at one of the SIB gyms, which gives us the total number of gym visits during the semester. The student gym created a unique identifier within its IT system for each student participating in the study, which enabled SIB to keep track of whether students in the control group bought a gym card at SIB and began to exercise.
Third, from UIB and HIB we have the complete administrative data related to the academic performance of each student. These include records of which courses a student registered for at the beginning of the semester, which exams a student registered for later in the semester, the number of completed study points that the student obtained at the end of the semester, and the grade received on each exam. In figure 1, we show the link between the variables “registered study points” (RSPs), “exam study points” (ESPs), and “completed study points” (CSPs). The difference between RSPs and ESPs captures the extent to which students drop out of courses during the semester, whereas the difference between ESPs and CSPs captures the extent to which students fail at exams. “Grade average” (GA) is the average grade in a given semester for the courses that the student passes, weighted by study points.7
**Fig. 1. **
Overview of academic variables. The figure shows the timeline for when registered study points (RSPs), exam study points (ESPs), completed study points (CSPs), and grade average (GA) are recorded.
Our final data source is the follow-up survey, which provides self-reported data on relevant outcomes that are not captured in the administrative data. It includes (as the baseline) questions related to self-control, lifestyle, study hours, and happiness, which allows us to identify the causal effect of the intervention on these dimensions.
A concern in any field experiment is a Hawthorne effect, according to which an improvement in the performance of participants could result from their response to the feeling that they are being accorded some attention.8 Several features of our study make it unlikely that there would be a strong Hawthorne effect. First, the students were only given explicit attention when we conducted the surveys at the beginning and at the end of the semester; there was no focus on this study throughout the semester. Second, the emphasis in the interaction with the students was on exercise, not on academic performance, which makes it unlikely that there could be a Hawthorne effect in our main variable of interest.
III. Hypotheses and Empirical Strategy
This section outlines our main hypotheses and the regression model specifications employed in the analysis.9
A. Gym Attendance
On the basis of past research and in line with our preanalysis plan, we expect that the treatment increases gym attendance.
Hypothesis 1.
Offering a free gym card has a positive causal effect on gym attendance.
We test hypothesis 1 by estimating the following OLS regression equation:
(1)Gi=α+βTreatedi+δXi+εi,
where Gi is the number of gym visits (standardized) of individual i in the spring semester of the intervention, or a dummy variable equal to 1 if the student visited the gym at least once. “Treated” is an indicator variable taking the value 1 if individual i were assigned to the treatment group, and Xi is a vector of prespecified covariates (age, gender, institution, year of study, and dummies for being above the median in terms of the lifestyle index, hours studied, happiness, or the self-control index at baseline). The estimated causal effect of the treatment on gym attendance is given by β in (1), which we expect to be positive.
B. Academic Performance
Our main hypothesis, as stated in the preanalysis plan, is that offering a free gym card improves academic performance.
Hypothesis 2.
Offering a free gym card has a positive causal effect on academic performance.
We test hypothesis 2 by estimating the following OLS regression equation:
(2)Ai=α+βTreatedi+δXi+εi,
where Ai is a measure of academic performance of individual i. Throughout the analysis, we standardize the outcome measures by dividing the respective variable with its standard deviation, without subtracting the mean. We include registered study points (RSPs) in Xi, to improve the precision of the estimates. The estimated causal effect of offering a free gym card on academic performance is given by β in (2). We focus on the two prespecified main dimensions of academic performance reported on students’ grade transcript: the number of completed study points (CSPs) and grade average (GA).
C. Lifestyles
To shed further light on underlying mechanisms, we provide two types of analysis. First, we study whether the intervention affects dimensions that we hypothesize are important for succeeding as a student: lifestyle, self-control, happiness, and study hours. Second, we study whether the intervention has a stronger impact on academic performance for people who at the baseline score low on these dimensions.
Formally, we test the following two prespecified hypotheses.
Hypothesis 3.
Offering a free gym card has a positive effect on lifestyle, self-control, happiness, and study hours.
Hypothesis 4.
Offering a free gym card has a stronger causal effect on academic performance for students who score low on lifestyle, self-control, happiness, or study hours than for those who score high on these dimensions.
We test hypothesis 3 by estimating the following OLS regression equation separately for each of the four dimensions:
(3)Yi=α+βTreatedi+δXi+εi.
In (3), Yi represents the standardized value of the lifestyle index, the self-control index, happiness, or study hours, and the estimated causal effect of the intervention on each of these dimensions is given by β in the corresponding regression.
We test hypothesis 4 by estimating the following OLS regression equation:
(4)Ai=α+β1Treatedi+β2Above×Treatedi+δXi+εi.
In (4), we interact the treatment dummy with a dummy variable for being above the median at baseline in terms of the lifestyle index, the self-control index, happiness, and study hours, where the dummy variable for being above median is included in Xi. In (4), the estimated coefficient β2 shows whether the intervention worked differently for those who were below the median in all dimensions and those who were above the median in at least one of the dimensions at baseline. In the analysis, we also report specifications that study each subgroup separately.
D. Robustness Analysis
In the analysis, we report p-values correcting for multiple-hypothesis testing, using the Holm–Bonferroni (Holm 1979), Romano–Wolf (Romano and Wolf 2005), and List–Shaikh–Vayalinkal (List, Shaikh, and Vayalinkal 2023) methods. In appendix A, we report further analysis and robustness checks.
IV. Main Results
We report the main results in this section, with supporting analysis in appendix A.
A. Effect on Exercise
We start by studying whether the intervention, which directly targeted physical exercise, induced students to visit the gym more frequently (hypothesis 1). Figure 2 shows the cumulative distribution of gym visits by treatment. We observe a large difference in gym visits between the treatment and control groups: Few students in the control group visited the gym at all, while the majority of those in the treatment group did visit the gym. The cumulative distribution of gym visits for the treatment group first-order stochastically dominates the cumulative distribution of the control group. On average, treated students visited the gym 7.5 times in the semester, which is a more than 300% increase relative to the 1.8 visits in the control group. We also note that 30% of the students in the treatment group exercised 10 times or more, compared to 7% in the control group. Table 3 reports the effect of the treatment intervention on gym exercise using OLS regression analysis. Column 1 reports the effect on the number of gym visits (standardized) for the whole semester, where we find an average treatment effect of more than 0.6 standard deviations (p<.001).10 We see from columns 2 and 3 that the effect is robust to adding control variables. In columns 4–6, we report the effect of the intervention on the extensive margin, where we observe a highly significant and robust increase in the share of students exercising at least once, from 11.6% in the control group to 68.6% in the treatment group. In the lower rows, we show that the estimated treatment effects are robust to multiple-hypothesis correction.
**Fig. 2. **
Gym visits by treatments. The figure provides the cumulative distribution of gym visits by treatment arm, censored at 20 gym visits.
**Table 3. **
Regression Results on Gym Attendance
Outcome Variable Visits (Standardized)Visited (1/0) (1)(2)(3)(4)(5)(6) Treated.629***.625***.627***.570***.571***.571*** (.0676)(.0677)(.0678)(.0285)(.0285)(.0288) Age−.0175**−.0145−.00445−.00438 (.00855)(.00899)(.00428)(.00436) Female−.0131−.0176.0635**.0644** (.0691)(.0700)(.0289)(.0297) Year of study−.0614***−.0607***−.00357−.00429 (.0208)(.0214)(.0124)(.0123) UIB−.0403−.0527.0164.0213 (.0687)(.0698)(.0289)(.0302) Above_Study−.0135.0171 (.0712)(.0303) Above_Lifestyle−.123*−.0412 (.0660)(.0300) Above_Happiness.0365.00421 (.0791)(.0348) Above_Self-control.0473.00785 (.0686)(.0298) Constant.199***.741***.715***.116***.179*.181* (.0390)(.229)(.250)(.0164)(.0974)(.102) HB p-value: Treated.0003.0007 RW p-value: Treated.0001.0001 LSV p-value: Treated.0003.0003 Observations778778758778778758
Active card users made about one additional gym visit per week from the start of the intervention to the start of the exam period. To put this effect into perspective, Charness and Gneezy (2009) report significant improvements in weight, waist size, BMI, body fat percentage, and pulse rate from a similar treatment effect on gym attendance. Clinical trials in the medical literature also document positive health effects from exercising 60 minutes per week in periods from 4 to 20 weeks (Janssen and LeBlanc 2010), and similar levels of exercise can generate positive effects in terms of depression (Mammen and Faulkner 2013), impulse control (Oaten and Cheng 2006) and memory (Roig et al. 2012).11
To provide some evidence on which barriers were removed by providing the students with a free gym card, we consider how the estimated treatment effects on gym visits depend on whether the students expressed that they did not exercise at the gym at baseline because they were financially constrained or time constrained. In figure 3, we show the estimated treatment effects for those who were financially constrained but not time constrained and for those who were most time constrained, compared to the rest of the students. In table A3, we report the corresponding regression analysis. We observe that the estimated effect of providing a free gym card on gym visits for those who were financially constrained but not time constrained is significantly larger than for the rest of the students (p=.098), and the effect for those who were time constrained is significantly smaller (p=.066). The difference between these two groups is also highly significant (p=.018). This analysis was not prespecified and is therefore exploratory, but it provides some suggestive evidence of the free gym card being particularly powerful in increasing exercise among the most financially constrained, while having a smaller effect on those who were most time constrained.
**Fig. 3. **
Treatment effect on gym visits by barrier to exercise. The figure shows the unconditional standardized treatment effects on gym visits for people who found the SIB card expensive but were not constrained by time (left), and for people who were constrained by time (right). We define the constraint as binding if the answer to the specific question is 8 or higher on a scale from 1 to 10. The horizontal line indicates the average treatment effect in the full sample (0.629 standard deviations).
However, about 65% of the students were not financially constrained or time constrained at baseline, and we show in figure A1 that there is a highly significant estimated treatment effect on gym visits also for these students. We interpret this as suggestive evidence of the free gym card not only removing a financial barrier, but also contributing to reducing psychological barriers to exercise. Potential mechanisms may be that the free gym card removes an excuse to exercise, makes more salient the injunctive norm that one should go to the gym and exercise, or provides students with a fresh start and puts them on a path of goal setting. In line with psychological barriers being an important explanation for why they do not exercise, many students stated at baseline that they were not a member at SIB because they were lazy.
B. Effect on Academic Performance
We now turn to investigate whether the intervention that targeted exercise also had an impact on the students’ academic performance (hypothesis 2).
Figure 4 shows the standardized treatment effect on completed study points (CSPs) and grade average (GA). We see an increase in the CSPs of 0.13 standard deviations in the treatment group, without it affecting GA. In figure A2, we show the cumulative distributions of CSPs and GA. The treatment group first-order stochastically dominates the control group in terms of CSPs. Significantly fewer students achieve 30 CSPs in the control group than in the treatment group, which means that the students in the control group are more likely to delay study progress than students in the treatment group. Given that the education literature stresses the value of completing college, it is important that exercise enables students to avoid delays in their studies.
**Fig. 4. **
Treatment effects on academic outcomes. The figure shows the unconditional standardized (std) treatment effects on completed study points (CSPs) on the left and grade average (GA) on the right.
Table 4 presents the results from OLS regressions of the intention-to-treat effect on academic performance.12 We observe from columns 1–3 that the treatment affects positively CSPs, with an estimated effect of about 0.15 standard deviations when including control variables (p=.027), which is robust to multiple-hypothesis correction (lower rows).
**Table 4. **
Regression Results on Academic Performance
Outcome Variable (Standardized) CSPsGA (1)(2)(3)(4)(5)(6) Treated.128*.139**.146**.0113.0358.0327 (.0715)(.0658)(.0664)(.0742)(.0720)(.0728) Age−.0355***−.0328**−.0356***−.0301*** (.0128)(.0134)(.0114)(.0114) Female.0894.0901.118.115 (.0657)(.0661)(.0729)(.0751) Year of study.0142.00242.140***.138*** (.0313)(.0327)(.0301)(.0312) UIB−.248***−.221***−.337***−.289*** (.0680)(.0707)(.0739)(.0764) RSPs (standardized).363***.368***−.0486−.0549 (.0618)(.0639)(.0371)(.0374) Above_Study.0853.0960 (.0707)(.0777) Above_Lifestyle.0736−.0194 (.0665)(.0788) Above_Happiness.0646.0465 (.0743)(.0847) Above_Self-control.180***.161** (.0672)(.0787) Constant2.097***2.105***1.863***3.349***4.060***3.811*** (.0495)(.300)(.322)(.0517)(.276)(.283) HB p-value: Treated.0553.6380 RW p-value: Treated.0562.6529 LSV p-value: Treated.0487.6380 Observations778778758726726707
To study whether the increase in CSPs reflects that treated students are less likely to drop out from courses or less likely to fail on the exam, we decompose completed study points as follows (see also fig. 1):
(5)CSPsi=RSPsi−(RSPsi−ESPsi)−(ESPsi−CSPsi).
Completed study points (CSPsi) are given by the study points that the student registered for at the start of the semester (RSPsi), minus the study points that the student does not receive because he or she drops out from courses (RSPsi−ESPsi) or fails on exams (ESPsi−CSPsi). As shown in table 2, there is no difference between the treatment group and the control group in terms of RSPsi, and it turns out that 62% of the treatment effect on completed study points comes from treated students dropping fewer courses and 38% from them failing less at exams.
A potential concern is that an increase in completed study points may decrease the grade average (which is derived conditional on passing the exam), but this is not what we observe. As shown in columns 4–6 in table 4, the estimated treatment effect on GA is marginally positive, though not statistically different from zero. The analysis thus shows that the treated students managed to complete more courses, without suffering in their grade average. In appendix A, we show that these results are robust to controlling for academic performance in the previous semester (table A4) and to including alcohol consumption in the lifestyl