Skip to main content

A critical evaluation of systematic reviews assessing the effect of chronic physical activity on academic achievement, cognition and the brain in children and adolescents: a systematic review



International and national committees have started to evaluate the evidence for the effects of physical activity on neurocognitive health in childhood and adolescence to inform policy. Despite an increasing body of evidence, such reports have shown mixed conclusions. We aimed to critically evaluate and synthesise the evidence for the effects of chronic physical activity on academic achievement, cognitive performance and the brain in children and adolescents in order to guide future research and inform policy.


MedLine, Embase, PsycINFO, Cochrane Library, Web of Science, and ERIC electronic databases were searched from inception to February 6th, 2019. Articles were considered eligible for inclusion if they were systematic reviews with or without meta-analysis, published in peer-reviewed (English) journals. Reviews had to be on school-aged children and/or adolescents that reported on the effects of chronic physical activity or exercise interventions, with cognitive markers, academic achievement or brain markers as outcomes. Reviews were selected independently by two authors and data were extracted using a pre-designed data extraction template. The quality of reviews was assessed using AMSTAR-2 criteria.


Of 908 retrieved, non-duplicated articles, 19 systematic reviews met inclusion criteria. One high-quality review reported inconsistent evidence for physical activity-related effects on cognitive- and academic performance in obese or overweight children and adolescents. Eighteen (critically) low-quality reviews presented mixed favourable and null effects, with meta-analyses showing small effect sizes (0.1–0.3) and high heterogeneity. Low-quality reviews suggested physical activity-related brain changes, but lacked an interpretation of these findings. Systematic reviews varied widely in their evidence synthesis, rarely took intervention characteristics (e.g. dose), intervention fidelity or study quality into account and suspected publication bias. Reviews consistently reported that there is a lack of high-quality studies, of studies that include brain imaging outcomes, and of studies that include adolescents or are conducted in South American and African countries.


Inconsistent evidence exists for chronic physical activity-related effects on cognitive-, academic-, and brain outcomes. The field needs to refocus its efforts towards improving study quality, transparency of reporting and dissemination, and is urged to differentiate between intervention characteristics for its findings to have a meaningful impact on policy.


Physical inactivity is an important risk factor for chronic diseases (e.g. cardiovascular disease, depression), obesity, and early deaths [1,2,3,4] placing a high economic burden on society [5]. Conversely, higher levels of physical activity (PA) have been associated with lower risk of mortality, beneficial mental- and cardiovascular health outcomes [6,7,8], and possibly improvements in cognitive and brain health [9]. Increasing PA has therefore been considered a low-cost strategy for global health improvement [10, 11], and has received great interest from scientific and public health communities demonstrated by the 2012 and 2016 Lancet series on PA (2012 series:, and 2016 series:, the United States (US) 2018 Physical Activity Guidelines [12] and the 2018 World Health Organization (WHO) Global Action Plan for Physical Activity [13]. Specifically, both the US and WHO guidelines recommend 150 min of moderate intensity PA (MPA) or 75 min of vigorous intensity PA (VPA) per week for adults and at least 60 min of MVPA per day for children and adolescents (5–18 years), as well as muscle-strengthening activities. However, globally approximately 20–30% of adults [4, 11] and the majority of youth, including 80% of adolescents [11, 14], do not meet recommended levels of PA.

Childhood and adolescence is marked by rapid social, psychological and neurobiological development and provides the foundation for future health [15,16,17]. Consequently, scientists have begun to examine the effects of PA on brain structure and function, using neuroimaging tools such as magnetic resonance imaging (MRI) and electroencephalography (EEG), and cognition in this population [9]. The findings of these studies have been summarised extensively in systematic reviews, which have been evaluated by The 2018 Physical Activity Guidelines Advisory Committee (PAGAC) to inform policy [12]. The report suggested moderate evidence for PA-related beneficial effects on cognitive performance during pre-adolescence, but inconsistent evidence during adolescence [12]. These conclusions were recently updated to also suggest beneficial effects on brain structure and function during pre-adolescence and limited but promising evidence for PA on cognition in adolescence [9]. In contrast, the United Kingdom (UK) equivalent of the PAGAC concluded there was inconclusive evidence for effects of PA on cognitive and academic performance, but beneficial effects on maths performance [18].

Both of these reports were based on conclusions from a small, non-overlapping set of systematic reviews (US: 9 and UK: 2) and did not provide insight into review quality, nor quality of the primary studies on which the reviews’ conclusions were based. A recent systematic review of reviews considered findings from 25 reviews and concluded that the evidence supports a causal link between PA and cognition in young people [19]. However, the authors did not incorporate review quality in their evidence synthesis and based their conclusions on a subset of reviews that included a mixture of observational and interventional evidence. To seek clarity among the contrasting reports, provide specific recommendations for the field, and inform policy, this systematic review of reviews aimed to synthesise the evidence for the effect of PA or exercise on brain structure and function, academic-, and cognitive performance in childhood and adolescence. In this review we aimed to include overall reporting of bias and heterogeneity in the literature, the quality of the primary studies and reporting of intervention fidelity, as well as the consistency of conclusions, limitations and recommendations across the literature.


This systematic review was registered at PROSPERO (ID: CRD42019124472) and conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-analysis (PRISMA) guidelines (checklist is provided in Additional file 1) [20].

Search strategy and selection criteria

An information specialist (NR) performed an electronic search of the following databases, originally on June 14th 2018, and updated on February 6th 2019: MedLine, Embase, PsycINFO, Cochrane Library, Web of Science and ERIC. No date restrictions were applied to the initial search. Reference lists of identified articles were examined for additional relevant articles. Data sources and (medical subject headings) search terms are provided in Additional File 2. In line with the a priori defined selection criteria (PROSPERO, ID: CRD42019124472), systematic reviews with or without meta-analysis that were published in English with clearly defined inclusion criteria were included if they reported on the effects of chronic PA interventions (i.e. more than one PA session, over a set period), including randomised controlled trials (RCT), quasi-experimental studies, controlled and pre-post designs, on cognitive, academic and /or brain MRI outcomes in school-aged children or adolescents. Systematic reviews that contained a mixture of observational and interventional evidence, that reported on a single bout of acute PA (i.e. a single session) or cardiovascular fitness only, or included case reports, were excluded.

The in- and exclusion criteria were adapted post-protocol to exclude systematic reviews that also included observational studies because the findings from interventional and observational studies were generally combined in the evidence synthesis. It was deemed too subjective to extract conclusions regarding the effects of intervention studies only. Studies that distinguished between multicomponent (e.g. PA and diet) and PA only interventions were included, but only results from PA only interventions were considered in this systematic review.

Systematic reviews that also included findings of acute PA interventions were considered for inclusion only if they selectively reported on the effects of chronic PA interventions. Following data extraction, it became clear that no single systematic review had aimed to include MRI studies only. Instead, two out of four systematic reviews included studies that used either MRI or EEG. We therefore decided to adapt our inclusion criteria to also include brain EEG markers as an outcome measure. Because the search criteria included the term MRI, we post-hoc searched the databases (October 1st, 2019) for systematic reviews that included EEG studies. No additional systematic reviews were found.

Review selection and data extraction

Following removal of duplicates, two reviewers (TW, WW) independently screened all titles and abstracts using Abstrackr [21]. Short-listed full-text reviews were then independently assessed using the in- and exclusion criteria. Any disagreements between the authors were discussed with two other authors (CS and CF) and resolved by consensus. A data extraction form was piloted independently by two authors (TW, WW) and adjusted to ensure it captured all relevant data, including: year of publication, type of review (systematic review with or without meta-analysis), review methodology (aim, in/exclusion criteria, number of studies included/excluded, database search, quality and bias assessment method), characteristics of included studies (design, number of included participants, type of participants and countries), outcome measures, reporting on intervention fidelity, limitations and recommendations. A single author extracted the data from selected studies using the data extraction template which were verified by a second author and disagreements were resolved by consensus.

Quality assessment

The methodological quality of all reviews was assessed using the updated AMSTAR checklist, AMSTAR-2 [22]. Two reviewers (TW, WW) independently scored the selected reviews. The quality of each review is reflected by an overall confidence rating, which is determined by an evaluation of non-critical and critical domains (seven critical items: preregistration, literature search, justification for excluding studies, risk of bias assessment, appropriateness of meta-analytical methods, risk of bias in synthesis of results, publication bias). A lack of addressing one or multiple critical domains resulted in a, respectively, low or critically-low confidence rating. If no critical flaws were present, the presence of non-critical weaknesses determined whether the review received a high (no weaknesses) or moderate (one or more weaknesses) confidence rating. While it is not recommended to combine individual item ratings into an overall score [22], we provided the total score to merely acknowledge that a gradient exists between reviews in their study quality.


Search results

Of 908 abstracts that were screened, 71 full-text articles were reviewed. Only 19 articles met inclusion criteria (Fig. 1). An overview of excluded studies, with reasons, is provided in Additional file 3. Twelve articles reviewed academic outcomes [23,24,25,26,27,28,29,30,31,32,33,34], thirteen reviewed cognitive outcomes [24, 25, 27, 29,30,31, 33, 35,36,37,38,39,40], and three evaluated brain outcomes [27, 39, 41]. Seven systematic reviews included randomised controlled trials (RCT) only, and the remainder included a combination of intervention studies (including RCTs, quasi-experimental studies and controlled designs). Eight systematic reviews (42%) also included a meta-analysis. An overview of search details of each systematic review is provided in Additional file 4.

Fig. 1
figure 1

PRISMA flow diagram

The 19 systematic reviews included a total of 118 unique primary studies, of which 84 on cognitive outcomes, 53 on academic outcomes and nine on brain outcomes. The research was predominantly conducted in developed countries (Fig. 2, an overview of countries per outcome is provided in Additional file 5), especially in the USA and Australia. None of the primary studies were conducted in countries in South-America or Africa, with the exception of South Africa. The quality of the primary studies was generally regarded to be low to moderate (details of assessments per review are provided in Additional file 6).

Fig. 2
figure 2

Countries where PA interventions have been conducted. PA interventions included in the 19 systematic reviews have been conducted in 26 countries. The findings for these interventions have been presented in 118 unique publications, the majority (40%) of which were conducted in the USA

Review quality

Of 19 systematic reviews, one received a high confidence rating, one a low confidence rating and 17 a critically low confidence rating (i.e. did not address multiple critical domains, as defined in quality assessment section above; see Table 1, and details in Additional file 7). While 18 reviews performed a comprehensive search (95%) and provided study characteristics (95%), only five (26%) had an a-priori design, three (16%) included an overview of excluded studies, 14 assessed risk of bias (72%), seven of eight (88%) performed a meta-analysis with generally appropriate methods (i.e. used accepted statistical techniques for combination of results and exploration of causes of heterogeneity), and three (of 15 that performed a quality assessment: 20%) considered quality information in their evidence synthesis.

Table 1 Descriptors of systematic reviews


Academic achievement

Twelve systematic reviews evaluated academic outcomes (Table 2), one of which synthesised evidence from overweight/obese children and adolescents [31]. Reviews included between one [30] and 26 primary studies [23] and 19 studies (36%) were included in more than one review (Additional file 8). Four reviews included a meta-analysis, and reviews synthesised findings of studies in terms of overall academic achievement, its sub-domains (e.g. maths, language, reading), or both.

Table 2 Academic outcomes: findings from systematic reviews and meta-analyses (ordered by quality rating)

Across all twelve reviews, six concluded that PA benefits academic performance [23, 24, 26, 29, 32, 33], and the remaining six concluded that there was mixed or inconclusive evidence for PA-related academic changes (Tables 1 and 2) [25, 27, 28, 30, 31, 34].

Only one review received a high confidence rating [31]. Among three RCTs of obese/overweight children, this review found no evidence for PA-related improvements in maths, reading or language performance. Sensitivity analyses for risk of bias (e.g. attrition) and cluster RCT designs were performed, but were less meaningful due to a small number of studies.

Among the 11 low- to critically low- quality reviews (i.e. lack of addressing respectively one versus multiple critical domains), six reviews found (mainly) positive effects of PA on academic performance and five presented mixed evidence and conclusions. All three meta-analyses showed a small beneficial effect of PA on overall academic performance (effect sizes are presented in Table 2), but only one (of two) found significant evidence for small positive effects on its sub-domains [23]. This meta-analysis included 26 intervention studies (RCT and quasi-experimental) and found evidence for beneficial effects of PA on maths, reading and composite scores, albeit with substantial heterogeneity among studies (I2 > 50%, apart from reading). Sensitivity analyses showed changes in effect sizes of reading-, language-related skills and composite scores after removal of studies, and a drop in effect size for maths performance (from d = 0.21 to d =0.12) upon exclusion of low quality quasi-experimental studies. Among the other two meta-analyses, one included three studies [24], whereas the other included a large number of non-peer reviewed dissertations [26].

Subgroup analyses of meta-analyses (Additional file 9) further suggest that academic performance may benefit most from PA during curricular PE [23] and cognitively challenging PA [24, 26]. Intervention duration does not seem to be an important moderator of PA-related academic changes [23, 24, 26].

In summary, evidence from systematic reviews is inconsistent, with conclusions suggesting positive as well as mixed (inconclusive) effects of PA on overall academic achievement and its sub-domains. One high confidence review on overweight and obese children did not report beneficial effects of PA on academic performance [31].

Cognitive function

Thirteen systematic reviews evaluated cognitive outcomes, one of which synthesised evidence from children with ADHD [36] and two evaluated findings from overweight/obese children [31, 39]. Reviews included between one [29] and 36 intervention studies [27, 38], and of 83 published studies, 42 (50%) were included in more than one review (Additional file 8). Six reviews included a meta-analysis and reviews synthesised findings of studies in terms of overall cognitive performance, and / or various sub-domains (e.g. executive functions, memory).

Across all thirteen reviews, eight concluded that PA was beneficial [24, 29, 33, 35, 36, 38,39,40] and five reported mixed or inconclusive evidence [25, 27, 30, 31, 37]. None of the reviews suggested worse cognitive performance following PA (Tables 1 and 3).

Table 3 Cognitive outcomes: findings from systematic reviews and meta-analyses (ordered by quality rating)

One review received a high confidence rating [31]. Among seven RCTs of obese/overweight children, this review reported a significant PA-related benefit on non-verbal memory and composite executive functions based on findings from single studies, but not inhibition control, attention, working memory, cognitive flexibility or visuo-spatial abilities.

Among 12 (of 13) low- [25] to critically low- quality reviews, six reviews found positive effects of PA on cognitive performance and/or its sub-domains in healthy young people [24, 29, 33, 35, 38, 40], and four presented mixed evidence and conclusions [25, 27, 30, 37]. Two meta-analyses showed a positive effect of PA on overall cognitive performance [24, 35], two of three showed PA-related benefits on overall executive functions [24, 37, 38] and inhibition [24, 38, 40], and two of three meta-analyses reported non-significant effect sizes for planning or higher-level cognitive functions [24, 37]. Moreover, a meta-analysis additionally reported beneficial effects of PA on non-executive functions and working memory, but not selective attention or cognitive flexibility [38]. Despite some positive results, the effect sizes are generally small (0.1–0.3) and suffer from substantial heterogeneity. Moreover, only two of five meta-analyses performed a sensitivity analysis [35, 38], showing a lower effect size for overall cognitive functions following removal of outliers [35] and a change in effect size for working memory upon removal of studies [38]. There is some evidence for beneficial effects of PA on cognitive functions in children with overweight and obesity [39] and with ADHD [36].

Subgroup analyses of meta-analyses further suggest that cognitive performance may benefit most from PA during curricular PE [38], with mixed evidence for enhanced (i.e. quantitative increase in PA) or enriched PA (i.e. qualitative manipulation of PA, e.g. increasing coordinative task requirements; Additional file 9) [35, 38]. Intervention duration does not seem to be an important moderator of PA-related cognitive changes [24, 37, 40], although one review reported a negative relationship with working memory [38]. No significant relationships were found between study quality and heterogeneity [38] or effect sizes [37].

In summary, evidence from predominantly low-quality systematic reviews is inconsistent, with conclusions suggesting both positive and mixed (inconclusive) effects of PA on overall cognitive performance and its sub-domains. A single high-quality review [31] showed that PA may benefit executive functions and non-verbal memory of obese/overweight children, but evidence is based on findings from single studies. A comparison between reviews is complicated by the considerable variability in reporting of findings across systematic reviews (e.g. overall cognition vs sub-domains, choice of sub-domains).

Brain structure and function

While four systematic reviews reported on the effects of PA on the brain [27, 33, 39, 41], only three of these systematically explored exercise-related brain changes [27, 39, 41], the findings of which are discussed here. Reviews included five or six primary studies, and three (of 10 unique publications) were included in all reviews (Additional file 8) [42,43,44]. All reviews were classified as critically low quality. Across three reviews, one concluded that PA is beneficial for the brain [39], one reported mixed favourable and null effects [27] and one reported a lack of available evidence (Table 4) [41].

Table 4 Brain outcomes: findings from systematic reviews and meta-analyses

Among healthy children and adolescents, Gunnell et al. [27] stratified findings from six RCTs by brain function and structure and showed some evidence for changes in activation and resting-state synchrony following a PA intervention, but not blood flow and inconsistent changes in white matter structure. Lubans et al. [41] tabulated brain changes from six studies to explore neurobiological mechanisms of cognitive changes, and showed increases, decreases or no changes following PA interventions in widespread brain areas (as well as cognitive changes). The authors did not synthesise brain findings and reported a lack of overlap between studies (e.g. imaging methods, brain regions). Bustamante et al. [39] focused on children and adolescents with overweight/obesity and interpreted PA-related brain changes of four high-quality studies as beneficial, but their conclusions lack anatomical specificity.

In summary, only a small number of studies have examined PA-related brain changes and while PA-related effects have been reported, findings are inconsistent with little methodological or anatomical overlap between studies.

Intervention fidelity reporting

Reporting of fidelity is crucial for accurate interpretation of intervention results [45, 46]. This is particularly the case in behavioural interventions where there is substantial heterogeneity in intervention characteristics. Only two (of 19) reviews reported on intervention fidelity in their results section [28, 32] and noted a lack of fidelity reporting in studies. Several other reviews discuss the lack- and importance of reporting fidelity metrics in their discussion [25, 31, 38]. None of the systematic reviews took intervention fidelity into account when summarising the findings of effects of PA on cognitive-, academic-, or brain outcomes.

Limitations and recommendations


An overview of limitations included in systematic reviews is provided in Additional file 10. A (non-exhaustive) narrative synthesis is provided here, intended to identify limitations that were common to the discussion section of at least two reviews. One of the most reported limitations is the presence of high heterogeneity across studies: in designs of PA interventions (duration, frequency, resources provided, delivery [32, 40]), the (appropriate) control groups [25, 30, 40], and the measurement tools that were used [29, 30, 41]. Reviews often reported a lack of detailed reporting of interventions (e.g. duration, intensity, compliance, resources, delivery), assessments [30, 32, 38], and potential moderators [24, 34], as well as a lack of valid measurement tools [29, 30]. These within study limitations may contribute to a general lack of high-quality studies [26, 35, 38, 39]. Other limitations that were frequently noted are the lack of studies in adolescents [25, 30, 31] and those covering various sub-domains of cognition/academic achievement [24, 35, 37], the presence of relatively small samples [29, 30] and samples being predominantly from high-income countries [31, 32, 34]. While some reviews tested for the presence of publication bias, this too was often reported as a limitation [23, 25, 40].


In addition to resolving the above limitations, authors recommended to include long term follow-up assessments [28, 29, 31, 34, 38], explore the effect of different PA characteristics (intensity, duration) [24, 37], include brain imaging [25, 27, 31], monitor the PA dose that participants receive [25, 37], and explore the influence of effect modifiers (e.g. sex, ethnicity and socioeconomic status [31, 34]). Furthermore, authors suggested to report effect sizes and standardized regression coefficients [25, 41], focus on examining the qualitative aspects of PA [24, 25, 35, 39], explore after-school PA interventions [23, 38] and conduct transitional work, examining what interventions are most effective for implementation in schools [23, 39].


Key findings

The aim of this review was to critically and systematically evaluate systematic reviews that examined the effects of chronic PA interventions on cognitive-, academic- and brain outcomes in children and adolescents. Of 19 systematic reviews, only one received a high confidence rating and reported inconsistent evidence for PA-related effects on academic performance and favourable effects on executive functions in overweight/obese children, albeit based on results from single studies. Reviews with a (critically) low confidence rating presented mixed favourable or null effects. Reviews that evaluated brain outcomes suggested PA-related brain changes, but with little anatomical or methodological overlap which has complicated the synthesis and interpretation of findings.

In general, the quality of the majority of systematic reviews is considered to be critically low with high heterogeneity between systematic reviews (e.g. number of included studies, presentation of findings). Furthermore, only three systematic reviews took the quality of primary studies into account when synthesising evidence and there is a general lack of reporting of intervention fidelity. Systematic reviews and meta-analyses consistently stated that the field suffers from high heterogeneity between primary studies (e.g. in the design, intervention, outcome measures and reporting) and that high-quality studies are required.

In the following sections we will discuss these key findings and argue that the field would benefit from improvements in study quality, reporting and dissemination.

Academic-, cognitive- and brain outcomes

The findings on cognitive outcomes are largely in line with conclusions from the UK Expert Working Group Working Paper on children and young people [18]. In contrast, the US 2018 PA Guidelines [12] suggested moderate evidence for PA-related improvements in cognitive and academic performance (among 5–13 year olds), and Biddle et al. [19] claimed a causal association of PA with cognitive functioning and indicated less clear results for academic achievement. The discrepancy in conclusions is likely due to the set of reviews considered for inclusion and the strategy for evidence synthesis. That is, conclusions of both the UK Working Group and US guidelines were based on a small number of systematic reviews (Additional file 11) without mention of review quality or reasons for selecting this (sub) set of reviews. Biddle et al. [19] based their conclusions on a large number of reviews (n = 25), including some that combined observational and interventional evidence, without reference to review quality or sub-domains of cognitive- or academic performance.

It is clear that the selection of reviews and strategy for synthesis could impact on conclusions, particularly if the synthesis is based on study authors’ conclusions, which commonly emphasise positive findings (e.g. [24, 33, 38]). Future reports would benefit from a nuanced overview, with conclusions that take into account (objective) findings presented in reviews’ results sections.

Study quality and methodological considerations

We observed high variability in quality and methods across systematic reviews. In particular, systematic reviews often lacked an a priori design, an overview of excluded studies, an assessment of publication bias (unless a meta-analysis was included), and did not consider quality information in their evidence synthesis. While the majority of reviews described the sample (e.g. size, age, sex) of the included studies, they did not discuss baseline PA levels, which is important for assessing whether the sample is representative of the general population and for the interpretation of reported effects (e.g. greater effects may be expected for a sample of low active individuals [47, 48]). Only a minority of reviews reported on intervention fidelity, which is crucial if one wishes to understand whether observed effects can be ascribed to the intervention or other environmental factors (e.g. PE enjoyment). Furthermore, reviews hardly distinguished between PA interventions (e.g. dose or type) in presenting results. This practice is useful if the goal is to examine whether any form of PA has an effect on cognitive, academic or brain outcomes, yet provides little insight into the appropriate dose or PA type (e.g. enhanced / quantitative or qualitative / enriched PA manipulations) that may benefit outcomes most. Similarly, reviews varied substantially in whether they presented findings by sub-domains or overall cognitive- or academic performance, as well as the definition of sub-domains. If conclusions are to be drawn regarding specific PA effects from findings across reviews, a more granular and consistent presentation of results is required. Finally, reviews included multiple publications stemming from the same study (and sample), which is rarely considered (but see e.g. [31, 39]). For instance, half of the neuroimaging publications included the same sample of overweight children, and care should be taken in generalising these results.

A subset of reviews included meta-analyses, the appropriateness of which has been questioned due to the heterogeneity between studies [27]. Among those that did include a meta-analyses, we observed variability in whether the meta-analysis was pre-registered, in the number of included studies (2–22), and whether one or multiple studies on the same sample were included and appropriately accounted for (e.g. Cochrane handbook 5.1, section 16.5.4 [49]). Importantly, meta-analyses rarely took study quality or study design ([cluster] RCT [50] and non-RCT) into account. This practice could bias effect sizes and requires separate reporting [22]. In addition, not all meta-analyses used inverse variance weighting and only a few meta-analyses fully explored the high observed heterogeneity using meta-regression and / or subgroup analysis [51]. Finally, small sample sizes of primary studies may present with larger effect sizes and thereby affect meta-analyses results [52, 53]. Although meta-analyses are a useful statistical tool to quantitatively synthesise findings [54, 55], care should be taken in their implementation.

Research gaps

Several gaps and forms of bias were identified in the evidence base. These would need to be addressed before findings about the effects of PA on cognition, academic performance and the brain can be generalised to a wider population and impact on policy.

  • Sample bias The majority of primary studies and by extension systematic reviews included samples of children (7–12 year old), with far fewer studies on adolescents [30]. While observational evidence in adolescents exists [56, 57], high-quality RCTs are needed to bridge the gap.

  • Geographical bias The majority of studies were conducted in the USA, and more generally in developed countries. RCTs in less-developed countries, particularly those from Africa and South America (Fig. 2), but also from Asia, are needed to explore whether findings can be generalised to the wider population. The “Cogni-Action” cross-over randomised trial of an acute PA intervention in Chile is one such attempt to close the geographical gap [58].

  • Publication bias Systematic reviews indicated the presence of publication bias: studies that report on significant, often positive, effects of PA interventions are more likely to be published. For instance, all brain imaging studies included at least one significant outcome. To counteract publication bias, PA researchers should publish non-significant findings and consider preregistering their analysis plans. Moreover, initiatives such as registered reports, journals soliciting negative results, and funder-supported journals that encourage open practices, are warranted.

  • Brain imaging There was a lack of studies that included brain imaging, in particular T1-weighted (structural) MRI. Consequently, the evidence for PA-related brain changes is based on a small number of studies, 50% of which included the same sample of overweight children. High variability in outcome measures and lack of replication further complicate the interpretation of brain findings. Surprisingly, none of studies that were included in the reviews have examined PA effects on hippocampal metrics (e.g. volume), despite overwhelming evidence from animal studies [59] and various reports in human adults [60]. To uncover neuro-biological mechanisms of exercise, brain imaging (MRI and/or EEG) should be considered as an outcome measure in future intervention studies, e.g. Cogni-Action project [58] and Fit to Study [61].

  • Moderators There is little understanding of the influence of potential moderators, such as sex, socio-economic status, ethnicity and genetic background on PA-related cognitive, academic and brain outcomes. Understanding the effect of such moderators could help develop targeted PA interventions for sub-groups of children who may benefit most.


To address research gaps and improve the quality of both primary studies and systematic reviews, the field is increasingly recognising that current research practice needs to evolve. We therefore provide recommendations, including references to resources, in an attempt to facilitate improvements in evidence generation and synthesis.

Future reviews are encouraged to screen for existing or pre-registered systematic reviews (e.g. on PROSPERO: or Open Science Framework, OSF: on the same topic. At least seven new systematic reviews have been published [62,63,64,65,66,67,68] since February 2019, the majority of which reported on the same studies that have been included in previous reviews. Moreover, researchers are encouraged to pre-register their protocol (e.g. on PROSPERO), follow the PRISMA guidelines and consult online resources (e.g. Cochrane handbook [69]), particularly if meta-analyses are conducted [54, 55, 70]. We encourage researchers to think carefully about their research question and inclusion criteria (e.g. type of design, PA interventions and outcomes) which will determine the scope of the review. To further our understanding of PA effects, we recommend researchers to accurately describe the sample of the included studies (including baseline PA levels) and summarise findings by cognitive- or academic sub-domains and PA characteristics, taking into account the quality of the studies, with conclusions providing a balanced summary of findings. For instance, researchers could consider focussing on the high(est) quality studies and, in addition to summarising findings by cognitive sub-domain, show the relationship between dose parameters and outcome measures (e.g. effect sizes, for an example see ref [66]). If a meta-analysis is appropriate, researchers are encouraged to consult guidelines to ensure the correct statistical methods are used (e.g. [54, 55, 70]), heterogeneity and publication bias are assessed, and sub-group and/or sensitivity analyses are conducted (e.g. for cluster-RCT effects [50, 64]).

While a systematic review of reviews allows for an evaluation of the field and provide recommendations for future research and policy [71], it does not allow for a detailed discussion of primary studies. Based on systematic reviews, however, we found that the quality of primary studies is low to moderate. To improve study quality, researchers are advised to consult the protocol- (Standard Protocol Items: Recommendations for Interventional Trials, SPIRIT) [72] and reporting (Consolidated Standards of Reporting Trials, CONSORT) [73, 74] guidelines for trials early on in the design process, and pre-register the study on an independent registry (e.g. An adequate sampling strategy should be followed to ensure an unbiased sample and generalizability of findings [75], and a power analysis ought to be performed a-priori to determine an adequate sample size. PA researchers are encouraged to carefully consider the PA intervention, including the type, the dose, the control group(s) [25], and to implement strategies to measure adherence (e.g. using actigraphy or heart rate monitors). Moreover, the validity of outcome measures should be ensured. Researchers are encouraged to follow the CONSORT guidelines for reporting of RCT outcomes and the Template for intervention description and replication (TIDieR) checklist for intervention reporting [76]. Point estimates, confidence intervals and effect sizes should be provided, and, in addition to an intention-to-treat analysis, instrumental variable or compliance-average causal effect approaches may be used in cases of non-compliance [77, 78]. Missing data should be reported and dealt with appropriately (e.g. using multiple imputation) [79, 80].

Strengths and limitations

Strengths of this review include its evidence synthesis by review quality and its focus on interventional evidence only, thereby only considering studies from which causality can be inferred. At the same time, relevant systematic reviews may have been excluded. In addition, the AMSTAR-2 criteria merely consider methodological quality, not whether the findings were synthesised appropriately. For instance, Martin et al. [31] based their conclusions for beneficial effects of PA on findings from a single study and Singh et al. [54] used vote-counting to summarise findings, the validity of which has been questioned [54]. None of the reviews took intervention characteristics (dose, type) into account when summarising results of primary studies. Therefore, no conclusions could be drawn regarding the potential differential effects of PA characteristics on outcomes. Reviews on acute PA effects were not considered in this review and may benefit from an independent evaluation [81]. Furthermore, systematic reviews used a variety of quality assessments which prevented a direct comparison of quality of primary studies. We further acknowledge that our summary of limitations and recommendations given by systematic reviews may be influenced by our own interpretations, yet its aim was to provide an overview of common discussion points. Finally, differences in reporting of findings across systematic reviews limited the extent to which evidence at the level of individual cognitive and academic domains was summarised.


Based on a single high-quality review and 18 (critically) low-quality reviews, we found inconsistent evidence for PA-related effects on cognitive- and academic performance in children and adolescents. Low-quality reviews suggest PA-related brain changes, but lack an interpretation of these findings. If this field were to inform policy, high-quality systematic reviews and primary studies are needed that provide insight into the effect of dose and PA characteristics on (domains of) cognitive-, academic and brain outcomes, in particular in adolescents and children in developing countries.

Availability of data and materials

All data generated as part of this systematic review are included as additional files in this published article.



Consolidated Standards of Reporting Trials




Moderate physical activity


Magnetic Resonance Imaging


Moderate-to-vigorous physical activity


Open Science Framework


Physical activity


Physical Activity Guidelines Advisory Committee


Preferred Reporting Items for Systematic Reviews and Meta-analysis


Randomised controlled trial


Standard Protocol Items: Recommendations for Interventional Trials


Template for intervention description and replication


Vigorous physical activity


  1. Lee IM, Shiroma EJ, Lobelo F, Puska P, Blair SN, Katzmarzyk PT, et al. Effect of physical inactivity on major non-communicable diseases worldwide: an analysis of burden of disease and life expectancy. Lancet. 2012;380(9838):219–29.

    PubMed  PubMed Central  Google Scholar 

  2. Kohl HW, Craig CL, Lambert EV, Inoue S, Alkandari JR, Leetongin G, Kahlmeier S, Lancet Physical Activity Series Working Group. The pandemic of physical inactivity: Global action for public health. Lancet. 2012;380:294–305.

  3. McMahon EM, Corcoran P, O’Regan G, Keeley H, Cannon M, Carli V, et al. Physical activity in European adolescents and associations with anxiety, depression and well-being. Eur Child Adolesc Psychiatry. 2017;26(1):111–22.

    PubMed  Google Scholar 

  4. Guthold R, Stevens GA, Riley LM, Bull FC. Worldwide trends in insufficient physical activity from 2001 to 2016: a pooled analysis of 358 population-based surveys with 1·9 million participants. Lancet Glob Health. 2018;6(10):e1077–86.

    PubMed  Google Scholar 

  5. Ding D, Lawson KD, Kolbe-Alexander TL, Finkelstein EA, Katzmarzyk PT, van Mechelen W, et al. The economic burden of physical inactivity: a global analysis of major non-communicable diseases. Lancet. 2016;388(10051):1311–24.

    PubMed  Google Scholar 

  6. Ekelund U, Steene-Johannessen J, Brown WJ, Fagerland MW, Owen N, Powell KE, et al. Does physical activity attenuate, or even eliminate, the detrimental association of sitting time with mortality? A harmonised meta-analysis of data from more than 1 million men and women. Lancet. 2016;388(10051):1302–10.

    PubMed  Google Scholar 

  7. Lear SA, Hu W, Rangarajan S, Gasevic D, Leong D, Iqbal R, et al. The effect of physical activity on mortality and cardiovascular disease in 130 000 people from 17 high-income, middle-income, and low-income countries: the PURE study. Lancet. 2017;390(10113):2643–54.

    PubMed  Google Scholar 

  8. Dale LP, Vanderloo L, Moore S, Faulkner G. Physical activity and depression, anxiety, and self-esteem in children and youth: an umbrella systematic review. Ment Health Phys Act. 2019;16:66–79.

    Google Scholar 

  9. Erickson KI, Hillman C, Stillman CM, Ballard RM, Bloodgood B, Conroy DE, et al. Physical activity, cognition, and brain outcomes: a review of the 2018 physical activity guidelines. Med Sci Sports Exerc. 2019;51(6):1242–51.

    PubMed  PubMed Central  Google Scholar 

  10. Bauer UE, Briss PA, Goodman RA, Bowman BA. Prevention of chronic disease in the 21st century: elimination of the leading preventable causes of premature death and disability in the USA. Lancet. 2014;384(9937):45–52.

    PubMed  Google Scholar 

  11. Sallis JF, Bull F, Guthold R, Heath GW, Inoue S, Kelly P, et al. Progress in physical activity over the Olympic quadrennium. Lancet. 2016;388(10051):1325–36.

    PubMed  Google Scholar 

  12. 2018 Physical Activity Guidelines Advisory Committee. 2018 Physical Activity Guidelines Advisory Committee Scientific Report. Washington, DC: US Department of Health and Human Services; 2018.

  13. World Health Organization. Global action plan on physical activity 2018–2030: more active people for a healthier world. 2018. Cited 2019 Nov 13.

    Google Scholar 

  14. Guthold R, Stevens GA, Riley LM, Bull FC. Global trends in insufficient physical activity among adolescents: a pooled analysis of 298 population based surveys with 1·6 million participants. Lancet Child Adolesc Heal. 2019;4642:1–13.

    Google Scholar 

  15. Sawyer SM, Azzopardi PS. The age of adolescence. Lancet Child Adolesc Heal. 2018;2(18):223–8.

    Google Scholar 

  16. Foulkes L, Blakemore SJ. Studying individual differences in human adolescent brain development. Nat Neurosci. 2018;21:1–9.

    Google Scholar 

  17. Patton GC, Sawyer SM, Santelli JS, Ross DA, Afifi R, Allen NB, et al. Our future: a lancet commission on adolescent health and wellbeing. Lancet. 2016;387(10036):2423–78.

    PubMed  PubMed Central  Google Scholar 

  18. Jago R, Fairclough SJ, Mackintosh KA, Mccrorie P, Sebire SJ, Sherar LB, et al. Expert working group working paper children and young people UK physical activity guidelines: draft review and recommendations for children and young people [internet]. 2018.

    Google Scholar 

  19. Biddle SJH, Ciaccioni S, Thomas G, Vergeer I. Physical activity and mental health in children and adolescents: an updated review of reviews and an analysis of causality. Psychol Sport Exerc. 2019;42:146–55.

    Google Scholar 

  20. Moher D, Liberati A, Tetzlaff J, Altman DG. Group the P. preferred reporting items for systematic reviews and meta-analyses:the PRISMA statement. Ann Intern Med. 2009;151(4):264–9.

    PubMed  Google Scholar 

  21. Rathbone J, Hoffmann T, Glasziou P. Faster title and abstract screening? Evaluating Abstrackr, a semi-automated online screening program for systematic reviewers. Syst Rev. 2015;4(80):1–7.

    Google Scholar 

  22. Shea BJ, Reeves BC, Wells G, Thuku M, Hamel C, Moran J, et al. AMSTAR 2: a critical appraisal tool for systematic reviews that include randomised or non-randomised studies of healthcare interventions, or both. BMJ. 2017;358:1–9.

    Google Scholar 

  23. Álvarez-Bueno C, Pesce C, Cavero-Redondo II, Sanchez-Lopez M, Garrido-Miguel M, Martinez-Vizcaino V, et al. Academic achievement and physical activity: a meta-analysis. Pediatrics. 2017;140(6):e20171498.

    PubMed  Google Scholar 

  24. de Greeff JW, Bosker RJ, Oosterlaan J, Visscher C, Hartman E. Effects of physical activity on executive functions, attention and academic performance in preadolescent children: a meta-analysis. J Sci Med Sport. 2018;21(5):501–7.

    PubMed  Google Scholar 

  25. Singh AS, Saliasi E, Van Den Berg V, Uijtdewilligen L, De Groot RHM, Jolles J, et al. Effects of physical activity interventions on cognitive and academic performance in children and adolescents: a novel combination of a systematic review and recommendations from an expert panel. Br J Sports Med. 2019;53(10):640–7.

    PubMed  Google Scholar 

  26. Spruit A, Assink M, van Vugt E, van der Put C, Stams GJ. The effects of physical activity interventions on psychosocial outcomes in adolescents: a meta-analytic review. Clin Psychol Rev. 2016;45:56–71.

    PubMed  Google Scholar 

  27. Gunnell KE, Poitras VJ, LeBlanc A, Schibli K, Barbeau K, Hedayati N, et al. Physical activity and brain structure, brain function, and cognition in children and youth: a systematic review of randomized controlled trials. Ment Health Phys Act. 2018;16:105–27.

    Google Scholar 

  28. Haapala E. Physical activity, academic performance and cognition in children and adolescents. A systematic review. Balt J Health Phys Act. 2012;4(1):53–61.

    Google Scholar 

  29. Lees C, Hopkins J. Effect of aerobic exercise on cognition, academic achievement, and psychosocial function in children: a systematic review of randomized control trials. Prev Chronic Dis. 2013;10(10):1–8.

    Google Scholar 

  30. Li JW, O’Connor H, O’Dwyer N, Orr R. The effect of acute and chronic exercise on cognitive function and academic performance in adolescents: a systematic review. J Sci Med Sport. 2017;20(9):841–8.

    PubMed  Google Scholar 

  31. Martin A, Booth JN, Laird Y, Sproule J, Reilly JJ, Saunders DH. Physical activity, diet and other behavioural interventions for improving cognition and school achievement in children and adolescents with obesity or overweight. Cochrane Database Syst Rev. 2018;3(3):CD009728.

    PubMed  Google Scholar 

  32. Martin R, Murtagh EM. Effect of active lessons on physical activity, academic, and health outcomes: a systematic review. Res Q Exerc Sport. 2017;88(2):149–68.

    PubMed  Google Scholar 

  33. Mura G, Vellante M, Nardi AE, Machado S, Carta MG. Effects of school-based physical activity interventions on cognition and academic achievement: a systematic review. CNS Neurol Disord Drug Targets. 2015;14(9):1194–208.

    CAS  PubMed  Google Scholar 

  34. Pucher KK, Boot NMWM, de Vries NK. Systematic review: school health promotion interventions targeting physical activity and nutrition can improve academic performance in primary- and middle school children. Health Educ. 2013;113(5):372–91.

  35. Vazou S, Pesce C, Lakes K, Smiley-Oyen A. More than one road leads to Rome: a narrative review and meta-analysis of physical activity intervention effects on cognition in youth. Int J Sport Exerc Psychol. 2019;17(2):153–78.

    PubMed  Google Scholar 

  36. Suarez-Manzano S, Ruiz-Ariza A, De La Torre-Cruz M, Martínez-López EJ. Acute and chronic effect of physical activity on cognition and behaviour in young people with ADHD: a systematic review of intervention studies. Res Dev Disabil. 2018;77:12–23.

    PubMed  Google Scholar 

  37. Verburgh L, Königs M, Scherder EJAA, Oosterlaan J. Physical exercise and executive functions in preadolescent children, adolescents and young adults: a meta-analysis. Br J Sports Med. 2014;48(12):973–9.

    PubMed  Google Scholar 

  38. Álvarez-Bueno C, Pesce C, Cavero-Redondo I, Sánchez-López M, Martínez-Hortelano JA, Martínez-Vizcaíno V. The effect of physical activity interventions on Children’s cognition and metacognition: a systematic review and meta-analysis. J Am Acad Child Adolesc Psychiatry. 2017;56(9):729–38.

    PubMed  Google Scholar 

  39. Bustamante EE, Williams CF, Davis CL. Physical activity interventions for neurocognitive and academic performance in overweight and obese youth. A systematic review. Pediatr Clin N Am. 2016;63(3):459–80.

    Google Scholar 

  40. Jackson WM, Davis N, Sands SA, Whittington RA, Sun LS. Physical activity and cognitive development: a meta-analysis. J Neurosurg Anesthesiol. 2016;28(4):373–80.

    PubMed  Google Scholar 

  41. Lubans D, Richards J, Hillman C, Faulkner G, Beauchamp M, Nilsson M, et al. Physical activity for cognitive and mental health in youth: A systematic review of mechanisms. Pediatrics. 2016;138(3):e20161642.

    PubMed  Google Scholar 

  42. Krafft CE, Schaeffer DJ, Schwarz NF, Chi L, Weinberger AL, Pierce JE, et al. Improved frontoparietal white matter integrity in overweight children is associated with attendance at an after-school exercise program. Dev Neurosci. 2014;36(1):1–9.

    CAS  PubMed  PubMed Central  Google Scholar 

  43. Krafft CE, Schwarz NF, Chi L, Weinberger AL, Schaeffer DJ, Pierce JE, et al. An 8-month randomized controlled exercise trial alters brain activation during cognitive tasks in overweight children. Obesity. 2014;22(1):232–42.

    PubMed  Google Scholar 

  44. Davis CL, Tomporowski PD, McDowell JE, Austin BP, Miller PH, Allison JD, et al. Exercises improves executive function and achievement and alters brain activation in overweight children a randomized controlled trial. Health Psychol. 2011;30(1):91–8.

    PubMed  PubMed Central  Google Scholar 

  45. Nelson MC, Cordray DS, Hulleman CS, Darrow CL, Sommer EC. A procedure for assessing intervention fidelity in experiments testing educational and behavioral interventions. J Behav Health Serv Res. 2012;39(4):374–96.

    PubMed  Google Scholar 

  46. Gearing RE, El-Bassel N, Ghesquiere A, Baldwin S, Gillies J, Ngeow E. Major ingredients of fidelity: a review and scientific guide to improving quality of intervention research implementation. Clin Psychol Rev. 2011;31(1):79–88.

    PubMed  Google Scholar 

  47. Minton J, Dimairo M, Everson-Hock E, Scott E, Goyder E. Exploring the relationship between baseline physical activity levels and mortality reduction associated with increases in physical activity: a modelling study. BMJ Open. 2013;3(10):1–6.

    Google Scholar 

  48. Grydeland M, Bergh IH, Bjelland M, Lien N, Andersen LF, Ommundsen Y, et al. Intervention effects on physical activity: the HEIA study - a cluster randomized controlled trial. Int J Behav Nutr Phys Act. 2013;10(1):17.

    PubMed  PubMed Central  Google Scholar 

  49. Higgins JPT, Green S (editors). Cochrane Handbook for Systematic Reviews of Interventions Version 5.1.0 [updated March 2011]: The Cochrane Collaboration; 2011. Available from

  50. Donner A, Klar N. Issues in the meta-analysis of cluster randomized trials. Stat Med. 2002;21(19):2971–80.

    PubMed  Google Scholar 

  51. Vetter D, Rucker G, Storch I. Meta-analysis: a need for well-defined usage in ecology and conservation biology. Ecosphere. 2013;4(6):1–24.

    Google Scholar 

  52. Turner RM, Bird SM, Higgins JPT. The impact of study size on meta-analyses: examination of underpowered studies in Cochrane reviews. PLoS One. 2013;8(3):1–8.

    Google Scholar 

  53. Sterne JAC, Gavaghan D, Egger M. Publication and related bias in meta-analysis: power of statistical tests and prevalence in the literature. J Clin Epidemiol. 2000;53(11):1119–29.

    CAS  PubMed  Google Scholar 

  54. Siddaway AP, Wood AM, Hedges LV. How to do a systematic review: a best practice guide for conducting and reporting narrative reviews, meta-analyses, and meta-syntheses. Annu Rev Psychol. 2019;70(1):747–70.

    PubMed  Google Scholar 

  55. Gurevitch J, Koricheva J, Nakagawa S, Stewart G. Meta-analysis and the science of research synthesis. Nature. 2018;555(7695):175–82.

    CAS  PubMed  Google Scholar 

  56. Ruotsalainen I, Renvall V, Gorbach T, Syväoja HJ, Tammelin TH, Karvanen J, et al. Aerobic fitness, but not physical activity, is associated with grey matter volume in adolescents. Behav Brain Res. 2019;362:122–30.

    PubMed  Google Scholar 

  57. Herting MM, Chu XF. Exercise, cognition, and the adolescent brain. Birth Defects Res. 2017;109(20):1672–9.

    CAS  PubMed  PubMed Central  Google Scholar 

  58. Solis-Urra P, Olivares-Arancibia J, Suarez-Cadenas E, Sanchez-Martinez J, Rodríguez-Rodríguez F, Ortega FB, et al. Study protocol and rationale of the “Cogni-action project” a cross-sectional and randomized controlled trial about physical activity, brain health, cognition, and educational achievement in schoolchildren. BMC Pediatr. 2019;19(1):1–16.

    Google Scholar 

  59. Vivar C, van Praag H. Running changes the brain: the long and the short of it. Physiology. 2017;32(6):410–24.

    CAS  PubMed  PubMed Central  Google Scholar 

  60. Firth J, Stubbs B, Vancampfort D, Schuch F, Lagopoulos J, Rosenbaum S, et al. Effect of aerobic exercise on hippocampal volume in humans: a systematic review and meta-analysis. Neuroimage. 2018;166:230–8.

    PubMed  Google Scholar 

  61. Wassenaar TM, Wheatley CM, Beale N, Salvan P, Meaney A, Possee JB, et al. Effects of a programme of vigorous physical activity during secondary school physical education on academic performance, fitness, cognition, mental health and the brain of adolescents (fit to study): study protocol for a cluster-randomised trial. Trials. 2019;20(1):1–14.

    Google Scholar 

  62. Norris E, van Steen T, Direito A, Stamatakis E. Physically active lessons in schools and their impact on physical activity, educational, health and cognition outcomes: a systematic review and meta-analysis. Br J Sports Med. 2019.

  63. Valkenborghs SR, Noetel M, Hillman CH, Nilsson M, Smith JJ, Ortega FB, et al. The impact of physical activity on brain structure and function in youth: a systematic review. Pediatrics. 2019;144(4):e20184032.

    PubMed  Google Scholar 

  64. Xue Y, Yang Y, Huang T. Effects of chronic exercise interventions on executive function among children and adolescents: a systematic review with meta-analysis. Br J Sports Med. 2019;1:1–9.

    Google Scholar 

  65. Masini A, Marini S, Gori D, Leoni E, Rochira A, Dallolio L. Evaluation of school-based interventions of active breaks in primary schools: a systematic review and meta-analysis. J Sci Med Sport. 2019.

  66. Christiansen L, Beck MM, Bilenberg N, Wienecke J, Astrup A, Lundbye-Jensen J. Effects of exercise on cognitive performance in children and adolescents with ADHD: potential mechanisms and evidence-based recommendations. J Clin Med. 2019;8(6):841.

    PubMed Central  Google Scholar 

  67. Sneck S, Viholainen H, Syväoja H, Kankaanpää A, Hakonen H, Poikkeus A-M, et al. Effects of school-based physical activity on mathematics performance in children: a systematic review. Int J Behav Nutr Phys Act. 2019;16(109):32864.

    Google Scholar 

  68. Bedard C, St John L, Bremer E, Graham JD, Cairney J. A systematic review and meta-analysis on the effects of physically active classrooms on educational and enjoyment outcomes in school age children. PLoS One. 2019;14(6):e0218633.

    CAS  PubMed  PubMed Central  Google Scholar 

  69. Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, Welch VA (editors). Cochrane Handbook for Systematic Reviews of Interventions version 6.0 (updated July 2019): Cochrane; 2019. Available from

  70. Wiernik BM, Dahlke JA. Obtaining unbiased results in meta-analysis: the importance of correcting for statistical artefacts. Adv Methods Pract Psychol Sci. 2020;3(1):94–123.

    Google Scholar 

  71. Aromataris E, Fernandez R, Godfrey CM, Holly C, Khalil H, Tungpunkom P. Summarizing systematic reviews: methodological development, conduct and reporting of an umbrella review approach. Int J Evid Based Healthc. 2015;13(3):132–40.

    PubMed  Google Scholar 

  72. Chan A-W, Tetzlaff JM, Altman DG, Laupacis A, Gøtzsche PC, Krleža-Jerić K, et al. SPIRIT 2013 statement: defining standard protocol items for clinical trials. Ann Intern Med. 2013;158(3):200.

    PubMed  PubMed Central  Google Scholar 

  73. Grant S, Mayo-Wilson E, Montgomery P, Macdonald G, Michie S, Hopewell S, et al. CONSORT-SPI 2018 explanation and elaboration: guidance for reporting social and psychological intervention trials. Trials. 2018;19(1):1–18.

    Google Scholar 

  74. Begg C, Cho M, Eastwood S, Horton R, Moher D, Olkin I, et al. Improving the quality of reporting of randomized controlled trials: the CONSORT statement. J Am Med Assoc. 1996;276(8):637–9.

    CAS  Google Scholar 

  75. Rothwell PM. External validity of randomised controlled trials: “to whom do the results of this trial apply?”. Lancet. 2005;365:82–93.

    PubMed  Google Scholar 

  76. Hoffmann TC, Glasziou PP, Boutron I, Milne R, Perera R, Moher D, et al. Better reporting of interventions: template for intervention description and replication (TIDieR) checklist and guide. BMJ. 2014;348:1–12.

    Google Scholar 

  77. Hernán MA, Hernández-Díaz S. Beyond the intention-to-treat in comparative effectiveness research. Clin Trials. 2012;9(1):48–55.

    PubMed  Google Scholar 

  78. Mostazir M, Taylor RS, Henley W, Watkins E. An overview of statistical methods for handling nonadherence to intervention protocol in randomized control trials: a methodological review. J Clin Epidemiol. 2019;108:121–31.

    PubMed  Google Scholar 

  79. Sterne JAC, White IR, Carlin JB, Spratt M, Royston P, Kenward MG, et al. Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls. BMJ. 2009;338:b2393.

    PubMed  PubMed Central  Google Scholar 

  80. Carpenter JR, Kenward MG. Missing data in randomised controlled trials: a practical guide. Birmingham: Health Technology Assessment Methodology Programme; 2007. p. 199.

    Google Scholar 

  81. Chang Y-K, Erickson KI, Stamatakis E, Hung T-M. How the 2018 US physical activity guidelines are a call to promote and better understand acute physical activity for cognitive function gains. Sports Med. 2019:1625–7.

Download references


Not applicable.


HD is supported by the Elizabeth Casson Trust and the NIHR Oxford Health Biomedical Research Centre. HJB is supported by a Wellcome Principal Research Fellowship (110027/Z/15/Z). The Wellcome Centre for Integrative Neuroimaging is supported by core funding from the Wellcome Trust (203139/Z/16/Z). The views expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health. The funding bodies had no role in the writing of this manuscript.

Author information

Authors and Affiliations



TW, WW, CF and CS conceptualized and designed the study. The literature search was performed by NR. Screening and article review were performed by TW and WW. Data extraction was done by TW and independently evaluated by WW. Data analysis and synthesis was performed by TW. TW drafted the manuscript. All authors revised the manuscript and approved the final manuscript.

Corresponding author

Correspondence to Thomas M. Wassenaar.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

CES has received consulting fees from Jazz Pharmaceuticals and is now a full-time employee at the Alzheimer’s Association. TW, WW, HJB, HD, NR, CF have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1.

PRISMA 2009 checklist.

Additional file 2.

Search strategy and search terms.

Additional file 3.

Excluded studies with reasons.

Additional file 4.

Search details of systematic reviews.

Additional file 5.

Countries where PA interventions were conducted.

Additional file 6.

Quality assessment and bias tools from systematic reviews.

Additional file 7.

Quality assessment AMSTAR-2.

Additional file 8.

Overlap of primary studies across reviews.

Additional file 9.

Supplementary findings from meta-analyses.

Additional file 10.

Limitations and recommendations of included reviews.

Additional file 11.

Studies included in PA committee reports.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wassenaar, T.M., Williamson, W., Johansen-Berg, H. et al. A critical evaluation of systematic reviews assessing the effect of chronic physical activity on academic achievement, cognition and the brain in children and adolescents: a systematic review. Int J Behav Nutr Phys Act 17, 79 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: