Skip to main content

Commercial app use linked with sustained physical activity in two Canadian provinces: a 12-month quasi-experimental study



Top tier commercial physical activity apps rarely undergo peer-reviewed evaluation. Even fewer are assessed beyond six months, the theoretical threshold for behaviour maintenance. The purpose of this study was to examine whether a multi-component commercial app rewarding users with digital incentives for walking was associated with an increase in physical activity over one year.


This 12-month quasi-experimental study was conducted in two Canadian provinces (n = 39,113 participants). Following a two-week baseline period, participants earned digital incentives ($0.04 CAD/day) every day they reached a personalized daily step goal. Mixed-effects models estimated changes in weekly mean daily step count between the baseline period and the last two recorded weeks. Models were fit for several engagement groups and separately by baseline physical activity status within engagement groups.


Nearly half of participants (43%) were categorized as physically inactive at baseline (fewer than 5000 daily steps), and 60% engaged with the app for at least six months [‘Regular’ (24–51 weeks of step data) or ‘Committed’ sub-groups (52 weeks)]. Weekly mean daily step count increased for physically inactive users regardless of engagement status (P < .0001). The increase was largest for ‘Regular’ and ‘Committed’ participants—1215 and 1821 steps/day, respectively. For physically active participants, step count increases were only observed in the ‘Committed’ sub-group (P < .0001). Effect sizes were modest-to-medium depending on the sub-group analyzed.


A commercial app providing small but immediate digital incentives for individualized goals was associated with an increased weekly mean daily step count on a population-scale over one year. This effect was more evident for physically inactive and more engaged participants.


Despite the health benefits of habitual moderate-vigorous physical activity (PA), [1,2,3] global rates are precipitously low [4, 5]. For good reason too—exercise is hard and our built environments discourage it [6]. New research, though, suggests health benefits are not just reserved for higher-intensity, harder-to-achieve moderate-vigorous PA, the traditional public health focus [7]. Light-intensity PA like walking has beneficial effects as well including lower mortality [8, 9]. From a behavioural perspective, regular participation in less strenuous light-intensity PA may be more attainable on a population level. This perspective was adopted in the latest US Physical Activity Guidelines which stress that some PA is better than none—shifting somewhat from the “at least” 150 moderate-vigorous PA minute/week message [10]. To achieve bold global physical inactivity reduction targets (15% by 2030) the World Health Organization recently singled out digital innovation (e.g., smartphone-based programmes) as an important component of a broad “systems-based” solution in their Global Action Plan on Physical Activity 2018–2030 [11]. To capitalize on the steady growth of the smartphone-based mobile health application (mHealth app) market evaluations of commercial apps that promote any intensity PA are needed [12].

This year more than 2.5 billion people worldwide own a smartphone [13]. The number of mHealth apps published in the major app stores continues to rise with 325,000 published in 2017, up 34% from the previous year [14]. This increase in part reflects evolving smartphone capabilities (e.g., built-in accelerometers, geo-location). Access to built-in accelerometer data in particular [15] has transformed PA promotion. For the first time, the majority of adults (approaching 90%) in the US and Canada, for example, carry a PA monitoring device (i.e. a smartphone accelerometer) most of the time [13]. This presents an unprecedented opportunity to deliver more precise public health interventions and bridge well-worn PA divides (e.g., gender PA gaps) [16] using instantaneous PA data to set and adjust realistic PA goals, provide immediate feedback, link users with friends to support long-term change, and so on. Not surprisingly, PA apps make up the bulk of all mHealth apps (30%, or roughly 100,000 apps) [17]. Unfortunately, low PA app engagement leading to small effects and little sustainability have been industry hallmarks [17,18,19].

A 2016 systematic review [18] and a 2019 meta-analysis [20] of studies using apps to improve PA found that few stand-alone app interventions reported positive effects. Another recent meta-analysis [21] and systematic review [19] on the other hand found that app-based interventions increased PA. The still limited number of RCTs in this area (n < 10), due in part to the rapid pace of app development and rollout, may help explain the discrepancies [19, 20]. To enhance our understanding of this rapidly evolving field non-RCT alternatives (e.g., quasi-experimental designs) are needed [19, 22, 23]. Longitudinal designs in particular are warranted given the majority of studies do not exceed three months [19,20,21] even though sustained PA is needed to attain many of the purported health benefits [1]. Rigorous quasi-experimental evaluations of top tier commercial apps (i.e. the top 2% of all apps reporting more than 500,000 Monthly Active Users, MAUs—at least one app view per month) [14] may provide especially valuable insight in a promising field where attrition is unfortunately the norm. Among the 15 studies included in the Petersen et al. (2019) review of PA apps only five examined commercially available ones (e.g., Fitbit, ‘Zombie, Run!’). Other studies have examined the Pokémon Go! [24] and Sweatcoin [25] apps, though important limitations preclude strong conclusions (e.g., unrepresentative samples).

The Carrot Rewards app, created by a private company with support from the Public Health Agency of Canada, [26] presents a unique opportunity to explore the long-term effectiveness of a top tier commercial app. Carrot Rewards was a popular Canadian app (i.e. 1.3+ million downloads, 500,000+ MAUs as of May 2019) leveraging gamification elements [27] and concepts from behavioural economics [28] and self-determination theory [29] to reward users with digital incentives (i.e. loyalty points redeemable for consumer goods like movies or groceries) for engaging in healthy behaviours such as walking. The multi-component Carrot Rewards “Steps” walking program (which included goal setting, biofeedback, daily/weekly incentives, etc.; Fig. 1) provided very small incentives for individualized daily step goal achievements. A three-month “Steps” evaluation was published in 2018 [30]. In this quasi-experimental study of users living in two Canadian provinces, Mitchell et al. (2018) found average daily step count increased by 5% between baseline and the three-month assessment (115.70 steps; P < .001). A more pronounced 32% increase was observed among highly engaged, physically inactive users (1224.66 steps; P < .001). As is commonly reported [18, 19] behavioural decay was noted in the later part of the three-month evaluation.

Fig. 1
figure 1

Carrot Rewards app “Steps” walking program screenshot

Since the effectiveness of PA apps have been shown to wane over time, the primary objective of this study is to examine the impact of the Carrot Rewards app over a longer 12-month period. Longitudinal designs such as this are especially important in a Canadian context, and for other countries, where inclement weather can dampen PA behaviours. Determining whether PA changes are moderated by app engagement is an important secondary objective.


Study design, setting and participants

A 12-month quasi-experimental (pre/post single group) study design was used. The free Carrot Rewards app was made available to British Columbia (BC) and Newfoundland and Labrador (NL) residents on the Apple iTunes and Google Play app stores on March 3 and June 13, 2016, respectively. Only users enabling the “Steps” walking program (i.e. allowing the app to access their step data) during the June 13 to July 10, 2016 recruitment period were included in the study. Additional background information, including recruitment details and theoretical underpinnings are published elsewhere [30]. Carrot Rewards was discontinued in June 2019 due to a lack of funding [31]. The Strengthening the reporting of observational studies in epidemiology (STROBE) statement checklist for cohort studies is provided (Additional file 1). The University of British Columbia Behavioural Research Ethics Board approved this study (H17–02814).

Intervention: “Steps” walking program

Once enrolled in the walking program users were instructed to “wear” their smartphone or Fitbit™ as much as possible during the two-week baseline period. After the baseline period, users could begin to earn incentives for reaching individualized daily step goals (Fig. 1). To set users’ first personalized daily step goal, 1000 steps was added to their baseline daily step count average. The incentives for daily achievements were worth $0.04 CAD in loyalty points. After roughly four weeks of earning daily rewards users could then begin to earn a $0.40 CAD bonus for reaching their daily goal 10 or more times in a 14-day period, called a “Step Up Challenge” (Fig. 1). For users who successfully completed a “Step Up Challenge” a new higher daily step goal was provided. For unsuccessful users, the previous goal persisted. Over the course of the 12-month evaluation, participants could earn a maximum of $25.00 CAD in points. Like with many apps the intervention evolved over time (Table 1). For more app design detail, completed Mobile App Rating Scale (MARS self-score 4.23/5; for understanding app quality, aesthetics and functional appeal) [32] and App Behavior Change Scale (ABACUS self-score 4.5/5; for measuring potential to change behaviour) [12] are provided (Additional files 2 and 3).

Table 1 Carrot Rewards “Steps” walking program timeline and evolution

Outcome measure

The primary outcome was mean daily step count as measured by either built-in smartphone accelerometers or Fitbit trackers (i.e. iPhone 5S or higher [26.21% of users], Android devices [42.78%]), FitbitTM devices [4.43%], Unknown device [26.59%]). In recent validation studies, the iPhone step counting feature (version 6 or newer), as well as those for Android smartphones (e.g., HTC) and Fitbit trackers (e.g., wrist-worn Flex) were accurate in laboratory and field conditions [33,34,35]. While wrist-worn activity trackers may record more daily steps than smartphone-based accelerometers (e.g., given wear time differences), given the small proportion of participants using Fitbit we decided not to examine effects by device.


The majority of demographic variables used to describe the study sample were self-reported (e.g., age, gender). Median personal income was inferred by linking user postal codes with census data (i.e. 2011 National Household Survey) at the Local Health Area level (89 LHAs) in British Columbia and Regional Health Authority level (four Regional Health Authorities) in Newfoundland & Labrador. Baseline daily step count set date was included as a co-variate in our analyses as well to adjust for potential seasonal effects.


Statistical analysis was performed using R 3.3.0 .68 Mavericks build (7202). Two sets of analyses were used to estimate changes in mean daily step count over the intervention period. In our primary analysis, and similar to our first 12-week examination of “Steps”, we estimated changes in mean daily step count between baseline and the last two recorded weeks. We included participants who had valid baseline data (four or more days with step counts in acceptable range, between 1000 and 40,000, during the 14-day baseline period) and at least one other valid week (at least four valid days in a week) between study week 1 and 52 (88% of those enrolling in “Steps” during recruitment period; 39,113/44373) in the analysis. We did not remove any cases or perform imputation to account for missing data since these approaches did not influence results in our 12-week analyses [30]. Time was coded as a three-level categorical variable (baseline = 0, second last recorded week = 1, and last recorded week = 2). Mixed-effects models were used to examine whether there were significant changes in mean daily step counts between baseline and the last two recorded weeks. A full model was fit that included time (with baseline as one of the three levels), demographics, and baseline set date as fixed effects along with participant as a random effect. A post hoc contrast was then estimated for the difference between the average of the last two recorded weekly average daily step count and baseline. A reduced model was also fit that included time and the baseline set date as the fixed effects.

In our secondary analysis, we estimated the longitudinal change in weekly record of mean daily step count across all 52 weeks. The purpose of this analysis was to illustrate how changes in weekly average daily step count varied across one year. The outcome variable was weekly mean recorded daily step count. Time was coded as a categorical variable (baseline = 0, week 1 = 1, …, week 52 = 52) to allow for the non-linear trajectory of daily step count. A mixed-effects model was used to examine on average the overall magnitudes of change across weeks. We fit a full model that included time with demographic variables, baseline set date, and baseline daily step count as fixed-effect co-variates and participant as a random effect.

As results from our 12-week analyses indicated that engagement and PA status had significant moderating effects on changes in weekly mean daily step counts over time, we fit all models separately for several engagement groups and then separately for physically active and physically inactive participants within each engagement group. Four engagement groups were formed based on number of weeks with four or more days of valid step count data: ‘Limited’ users: 1–11 weeks, ‘Occasional’ users: 12–23 weeks, ‘Regular’ users: 24–51 weeks and ‘Committed’ users: 52 weeks. An app view would trigger daily step count data retrieval for the previous four weeks. Two PA status categories were formed as defined by Tudor-Locke et al. [36]: physically inactive users = baseline mean steps per day less than 5000; physically active users = baseline mean steps per day 5000 or more. Cohen’s f2 for local effect sizes of weekly mean daily step counts within the mixed-effects models were calculated, with f2 ≥ 0.02, f2 ≥ 0.15, and f2 ≥ 0.35 representing small, medium, and large effect sizes, respectively [48]. Statistical significance of fixed effects was assessed using Wald statistics. Statistical significance levels were set at P < 0.05.


Baseline characteristics

Of the 39,113 participants, the mean age was 33.7 ± 11.6 years and 66.0% (25,809/39,113) were female (Table 2). Other incentive-based eHealth interventions have reported similar demographic profiles, with females in particular being more likely to adopt with digital wellness interventions in general [26, 27]. Sixty percent of users (23,505/39,113) engaged with the app for at least six months being categorized as either ‘Regular’ or ‘Committed’. The mean personal median income was $29,517 CAD, lower than the 2014 BC and NL means of $31,610 and $30,450 CAD, respectively [38]. Mean daily step count at baseline was 5560 steps per day. Nearly half of users (42.46%, 16,606/39,113) were categorized as physically inactive. Average incentive amount earned over the course of the year ranged from $5.88–$10.83 CAD depending on participant’s age, gender and province.

Table 2 Baseline characteristics of study participants by engagement group, and Canadians in general

Difference between the last two recorded weeks and baseline

As presented in Table 3, the intervention effect was more pronounced among physically inactive and more engaged sub-groups. For the engagement sub-group analysis, average daily step counts from baseline to last recorded weeks significantly increased in ‘Regular’ and ‘Committed’ users, but significantly decreased in ‘Limited’ and ‘Occasional’ users. Cohen’s f2 statistic indicated that the effect was small (0.0563) for ‘Committed’ users and modest for the other engagement groups. PA status also had a significant moderating effect. We observed a significant increase in daily step counts from baseline to the last two recorded weeks in physically inactive participants across engagement groups. However, significant and small decreases were noted for all physically active participants except those categorized as ‘Committed’ (Table 3).

Table 3 Changes in mean daily step counts between baseline and the last two recorded weeks stratified by engagement group and physical activity status within engagement group, least-square means (LSM) and 95% confidence intervals

Longitudinal changes in weekly mean daily step counts

For all models, we observed significant effects of time and baseline daily step count. The overall pattern of change in weekly recorded steps differed by PA status regardless of engagement group. In Fig. 2, we illustrate the longitudinal change in weekly recorded mean steps by engagement group within PA status. For physically inactive users, weekly recorded mean step counts were above baseline levels in most weeks. The magnitude of increase varied across weeks, reaching a peak at week 4 (July/August, summer season), followed by a modest PA decline and plateau from week 5 to week 27 (December, winter season), with a generally positive trend thereafter. For physically active users, weekly recorded mean step counts were below baseline in most weeks. A steeper decline from week 5 to 27 was noted for these physically active users, with daily step count increases in the remaining weeks. The dip in average daily step count in the weeks leading up to the shortest day of the year (Winter Solstice on December 19, 2016; weeks 23 to 26) was generally more pronounced in physically active compared to physically inactive users.

Fig. 2
figure 2

Longitudinal changes in weekly recorded mean step counts by physical activity status and engagement group, with 95% confidence intervals (dotted line). Models adjusted for baseline set date and baseline daily step counts. a-b, ‘Limited’ users; c-d, ‘Occasional’ users; e-f, ‘Regular’ users; g-h, ‘Committed’ users


Main finding

In this large quasi-experimental study examining the impact of the Carrot Rewards app on objectively-assessed PA over one year, we observed a significant intervention effect in the physically inactive users regardless of engagement status. The increase was largest for ‘Regular’ and ‘Committed’ users—1215 and 1821 steps per day, respectively. The clinical implications of these increases are important especially when one considers that the majority of the health benefits of PA (e.g., systolic blood pressure, glycemic control improvements) are reserved for inactive adults who become a little more active [9, 39]. From a public health perspective, a 1% reduction in the number of Canadians classified as physically inactive would yield annual healthcare savings of $2.1 billion CAD [40]. If we generalize our findings to the larger Carrot Rewards user base (1,046,185 users as of April 2019) then we estimate that the number of Canadians classified as physically inactive was reduced by 0.3% (about 100,000 Canadians).

Secondary findings

A dose-response relationship was evident with more favourable effects observed for more engaged users, irrespective of PA status. This highlights the importance of maximizing engagement with evidence-based mHealth intervention designs such as the ones included in the recent and very useful App Behaviour Change Scale, or ABACUS, checklist (e.g., Does the app allow for the setting of goals? Does the app have prompts for activity? Does the app provide material or social incentive?) [12]. Carrot Rewards’ high ABACUS rating (self-score = 4.5/5; see Additional file 3) may partly explain why 60% of the study sample used the app for at least six months (that is, those classified as ‘Regular’ or ‘Committed’)—the theoretical threshold of behaviour maintenance [41]. Maintaining fidelity to two behaviour change theories in particular also likely fostered high initial and sustained engagement (e.g., behavioural economics, by offering rewards instantaneously; self-determination theory, by providing realistic and personalized goals). Few studies in this field have reported engagement metrics, and even fewer have examined the interaction between engagement and health behaviours/outcomes [18,19,20]. Those that have suggest that intervention exposure is imperative and that greater engagement usually yields larger effects [20].

In addition, our longitudinal analysis illustrates great variation in PA over the course of a year. This is consistent with previous research that found seasonality impacts PA patterns in Canada [42]. Notably, seasonality impacts on PA vary across Canadian provinces with season being a stronger predictor of PA in BC then it is in NL. PA fluctuations over the year should therefore be considered in the refinement of PA apps in the future (e.g., PA goals could be reset in the winter to attenuate declines in steps rather than increasing steps). In addition, the longitudinal analysis partly confirms intervention effects among physically inactive users. Weekly mean daily step counts increased above baseline in most weeks in inactive users, but decreased below baseline in most weeks for active participants. In particular, the winter-time drop was less remarkable in ‘Committed’ physically inactive compared to physically active users as found in a recent Pokémon Go! app analysis where ‘players’ (vs. ‘non-players’) did not experience winter-time step count reductions [43]. This suggests that the intervention may have protected against winter-related PA decreases. Future study with a comparison condition is needed for verification.

Similar studies

Our findings are comparable to those of a recent meta-analysis of RCTs testing PA incentives delivered using smartphone/wearable technology (n = 12). In this study Mitchell et al. (2019) concluded that incentives increased mean daily step counts for short and long duration interventions by 607 steps [44]. Sub-group meta-analyses suggested physically inactive adults are especially sensitive to incentive intervention and that PA increases do not necessarily wane for longer interventions, consistent with what was found here. There was little to suggest physically active participants, other than ‘Committed’ ones, increased their steps over the year. In addition, our results build-on the efficiencies noted in the meta-analysis. That is, reward sizes needed to stimulate PA have dropped considerably in recent years due in part to technological advances that make it easier to track and reward activity, and stronger application of behavioural economics concepts. By offering digital incentives instantaneously Carrot Rewards drove incentive cost down (to pennies a day) by exploiting two behavioural economics concepts in particular: (a) the human tendency to prefer payoffs close to the present time (“present bias”) and (b) the tendency for people to equate larger numbers (i.e. the points used in this case) with greater value (“numerosity”).

On the other hand, few rigorous evaluations of top tier PA apps have been published [19]. Sweatcoin, a popular UK-based app (30+ million downloads globally) that converts step counts into a virtual currency, is a notable and relevant exception [25]. In a nine-month observational study (n = 5892), Elliot et al. (2019) determined in the six months following registration that daily step count increased by 18.7% (roughly 1200 steps) compared to baseline. While this study had several strengths (e.g., assessed long-term impact of a commercial app on objectively-measured PA) the main findings must be interpreted with caution. In particular, the Elliot et al. (2019) analysis included only very engaged users (opened app in last seven days) with complete data sets—unlike this study in which all users signing-up for the “Steps” program during the evaluation period and with one other valid week were included. It is unclear whether analyses of this highly engaged sub-sample—only 5892 users out of more than 30+ million were included—can be generalized to the broader user base. In addition, with the majority of the sub-sample in the Sweatcoin study registering in the winter it is unclear if effects are due to typical seasonal PA fluctuations. As well, smartphone wear time during the pre-registration period was not optimized unlike with the present study where users were encouraged to “wear” their smartphones as much as possible during the baseline period. In terms of effect magnitudes, the results of the present study generally align with those of Elliot et al. (2019) with roughly 500 to 1500 daily step count increases observed depending on the sub-group analysed. Notably, and consistent with our findings, physically inactive Sweatcoin users responded the most.


Our results should be interpreted with caution in light of some limitations. First, the internal validity (i.e. the extent to which PA increases were caused by Carrot Rewards) of our findings are limited by the absence of an equivalent control group. To address this limitation, we defined a pre-intervention time period (the two-week baseline period), distinct from the intervention, to reflect the counter-factual in this quasi-experimental setting [45]. The anticipated daily step count increase from the pre-intervention baseline period to intervention Weeks 1 and 2 was observed (Fig. 2) suggesting “Steps” boosted PA when introduced. Analysis-phase strategies were also employed to improve internal validity [45]. All models adjusted for key demographics variables, baseline set date and baseline step count, and accounted for measurements nesting within individuals. As well, a clear dose-response relationship between engagement and PA provides further support for the main conclusion that Carrot Rewards, when used above a threshold level, is associated with an increase in PA. A rival hypothesis may be that participants simply started carrying their smartphones more. The challenge of disentangling “wear time” from actual daily step count increases is a limitation of this and other similar studies [46]. A second limitation is that complete data sets (data for all 52 weeks) were only available for 20% of study participants. Unlike for the ‘Committed’ users (for whom we know the last two recorded weeks occurred exactly one year after baseline because data for all 52 weeks were available) it is not exactly clear when the last two recorded weeks for the other engagement groups occurred given their incomplete data sets. Data could have been recorded during a calendar month/season that was different from baseline, for instance. Third, it is not known at what intensity any extra steps were accumulated. Collecting step count data on a minute-by-minute basis in the future may help establish step cadences that could be classified as at least moderate intensity. Similarly, measuring key clinical variables (e.g., A1C) in at least a sub-sample of users may help establish the expected clinical benefits of app use and facilitate the ‘prescription’ of such an app and inform important health economic analyses.


A multi-component commercial app providing very small (i.e. $5-$10 CAD per person per year) but immediate digital incentives for individualized goals was associated with an increase in weekly mean daily step count on a population-scale over one year. This was particularly the case for physically inactive and more engaged users. The clear dose-response relationship between engagement and changes in daily step count reinforces the fundamental importance of engagement in digital health interventions. The high proportion of ‘Regular’ and ‘Committed’ users over one year suggests some success of the Carrot Rewards app in that regard.

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.


  1. Lear SA, Hu W, Rangarajan S, Gasevic D, Leong D, Iqbal R, et al. The effect of physical activity on mortality and cardiovascular disease in 130 000 people from 17 high-income, middle-income, and low-income countries: the PURE study. Lancet. 2017;390(10113):2643–54.

    Article  Google Scholar 

  2. Blondell SJ, Hammersley-Mather R, Veerman JL. Does physical activity prevent cognitive decline and dementia?: a systematic review and meta-analysis of longitudinal studies. BMC Public Health. 2014;14:510.

    Article  Google Scholar 

  3. Mammen G, Faulkner G. Physical activity and the prevention of depression: a systematic review of prospective studies. Am J Prev Med. 45(5):649–57.

    Article  Google Scholar 

  4. Guthold R, Stevens GA, Riley LM, Bull FC. Worldwide trends in insufficient physical activity from 2001 to 2016: a pooled analysis of 358 population-based surveys with 1.9 million participants. Lancet Glob Health. 2018;6(10):e1077–e86.

    Article  Google Scholar 

  5. Althoff T, Sosic R, Hicks JL, King AC, Delp SL, Leskovec J. Large-scale physical activity data reveal worldwide activity inequality. Nature. 2017;547(7663):336–9.

    Article  CAS  Google Scholar 

  6. Cleland C, Reis RS, Ferreira Hino AA, Hunter R, Fermino RC, Koller de Paiva H, et al. Built environment correlates of physical activity and sedentary behaviour in older adults: a comparative review between high and low-middle income countries. Health Place. 2019;57:277–304.

    Article  Google Scholar 

  7. World Health Organization. Global recommendations on physical activity for health. Switzerland; 2010.

  8. Borgundvaag E, Janssen I. Objectively Measured Physical Activity and Mortality Risk Among American Adults. Am J Prev Med. 52(1):e25–31.

    Article  Google Scholar 

  9. Ekelund U, Tarp J, Steene-Johannessen J, Hansen BH, Jefferis B, Fagerland MW, et al. Dose-response associations between accelerometry measured physical activity and sedentary time and all cause mortality: systematic review and harmonised meta-analysis. BMJ. 2019;366:l4570.

    Article  Google Scholar 

  10. US Department of Health. Physical activity guidelines for Americans 2nd edition. Washington, DC; 2018.

  11. World Health Organization. Global action plan on physical activity 2018–2030: more active people for a healthier world. Geneva; 2018.

  12. McKay FH, Slykerman S, Dunn M. The app behavior change scale: creation of a scale to assess the potential of apps to promote behavior change. JMIR MHealth UHealth. 2019;7(1):e11130.

    Article  Google Scholar 

  13. Trends PRCGA. Smartphone ownership is growing rapidly around the world, but not always equally. 2019.

  14. Research2Guidance. mHealth App Economics 2017: Current status and future trends in mobile health 2017.

  15. Apple. Apple Unveils iOS 8, the Biggest Release Since the Launch of the App Store 2014 [Available from:

  16. The Lancet Public H. Time to tackle the physical activity gender gap. Lancet Public Health. 2019;4(8):e360.

    Article  Google Scholar 

  17. Research GV. mHealth Apps Market Size, Share & Trends Analysis Report by Type (Fitness, Lifestyle Management, Nutrition & Diet, Women's Health, Medication Adherence, Healthcare Providers/Payers), and Segment Forecasts, 2019–2026 2017 [Available from:

  18. Schoeppe S, Alley S, Van Lippevelde W, Bray NA, Williams SL, Duncan MJ, et al. Efficacy of interventions that use apps to improve diet, physical activity and sedentary behaviour: a systematic review. Int J BehavNutr Physical Activity. 2016;13(1):127.

    Article  Google Scholar 

  19. Petersen JM, Prichard I, Kemps E. A comparison of physical activity Mobile apps with and without existing web-based social networking platforms: systematic review. J Med Internet Res. 2019;21(8):e12687.

    Article  Google Scholar 

  20. Romeo A, Edney S, Plotnikoff R, Curtis R, Ryan J, Sanders I, et al. Can smartphone apps increase physical activity? Systematic review and meta-analysis. J Med Internet Res. 2019;21(3):e12053.

    Article  Google Scholar 

  21. Feter N, Dos Santos TS, Caputo EL, da Silva MC. What is the role of smartphones on physical activity promotion? A systematic review and meta-analysis. Int J Public Health. 2019;13:13.

    Google Scholar 

  22. Victora CG, Habicht JP, Bryce J. Evidence-based public health: moving beyond randomized trials. Am J Public Health. 2004;94(3):400–5.

    Article  Google Scholar 

  23. West SG, Duan N, Pequegnat W, Gaist P, Des Jarlais DC, Holtgrave D, et al. Alternatives to the randomized controlled trial. Am J Public Health. 2008;98(8):1359–66.

    Article  Google Scholar 

  24. Baranowski T, Lyons EJ. Scoping review of Pokemon Go: comprehensive assessment of augmented reality for physical activity change. Games Health J. 2019;06:06.

    Google Scholar 

  25. Elliott M, Eck F, Khmelev E, Derlyatka A, Fomenko O. Physical activity behavior change driven by engagement with an incentive-based app: evaluating the impact of Sweatcoin. JMIR MHealth UHealth. 2019;7(7):e12445.

    Article  Google Scholar 

  26. Government of Canada. National Healthy Living Platform: “Carrot Rewards” targets lifestyle improvements 2015 [Available from:

  27. Edwards EA, Lumsden J, Rivas C, Steed L, Edwards LA, Thiyagarajan A, et al. Gamification for health promotion: systematic review of behaviour change techniques in smartphone apps. BMJ Open. 2016;6(10):e012447.

    Article  CAS  Google Scholar 

  28. Camerer CF, Loewenstein G. Behavioral economics: past, present, future. Advances in behavioral economics. Princeton: Princeton University Press; 2003.

    Google Scholar 

  29. Deci E, Ryan R. Handbook of self-determination research. Rochester, NY: University of Rochester Press; 2002.

    Google Scholar 

  30. Mitchell M. Evaluating the carrot rewards app, a population-level incentive-based intervention promoting step counts across two Canadian provinces: a quasi-experimental study. JMIR. 2018.

  31. Marotta S. Ottawa-backed carrot rewards app shutting down after failing to find a buyer. The Globe And Mail 2019 June 19, 2019.

  32. Stoyanov SR, Hides L, Kavanagh DJ, Zelenko O, Tjondronegoro D, Mani M. Mobile app rating scale: a new tool for assessing the quality of health mobile apps. JMIR MHealth UHealth. 2015;3(1):e27.

    Article  Google Scholar 

  33. Duncan MJ, Wunderlich K, Zhao Y, Faulkner G. Walk this way: validity evidence of iphone health application step count in laboratory and free-living conditions. J Sports Sci. 2017:1–10.

  34. Hekler EB, Buman MP, Grieco L, Rosenberger M, Winter SJ, Haskell W, et al. Validation of physical activity tracking via android smartphones compared to ActiGraph accelerometer: laboratory-based and free-living validation studies. JMIR MHealth UHealth. 2015;3(2):e36.

    Article  Google Scholar 

  35. Evenson KR, Goto MM, Furberg RD. Systematic review of the validity and reliability of consumer-wearable activity trackers. Int J Behav Nutr Physical Activity. 2015;12:159.

    Article  Google Scholar 

  36. Tudor-Locke C, Craig CL, Thyfault JP, Spence JC. A step-defined sedentary lifestyle index: <5000 steps/day. Applied physiology, nutrition, & metabolism = Physiologie Appliquee. Nutr Metabol. 2013;38(2):100–14.

    Google Scholar 

  37. Colley RC, Garriguet D, Janssen I, Craig CL, Clarke J, Tremblay MS. Physical activity of Canadian adults: accelerometer results from the 2007 to 2009 Canadian health measures survey. Health Rep. 2011;22(1):7–14.

    PubMed  Google Scholar 

  38. Table 17-10-0005-01 Population estimates on July 1st, by age and sex and Table 11-10-0008-01 Tax filers and dependants with income by total income, sex and age [Internet]. 2017 [cited June 29, 2018]. Available from:

  39. Warburton DE, Charlesworth S, Ivey A, Nettlefold L, Bredin SS. A systematic review of the evidence for Canada's Physical Activity Guidelines for Adults. International Journal of Behavioral Nutrition & Physical Activity.7:39.

  40. Krueger H, Turner D, Krueger J, Ready AE. The economic benefits of risk factor reduction in Canada: tobacco smoking, excess weight and physical inactivity. Can J Public Health. 2014;105(1):e69–78.

    Article  Google Scholar 

  41. Prochaska JO, Velicer WF. The transtheoretical model of health behavior change. Am J Health Promot. 1997;12(1):38–48.

    Article  CAS  Google Scholar 

  42. Merchant AT, Dehghan M, Akhtar-Danesh N. Seasonal variation in leisure-time physical activity among Canadians. Can J Public Health. 2007;98(3):203–8.

    Article  Google Scholar 

  43. Hino K, Asami Y, Lee JS. Step counts of middle-aged and elderly adults for 10 months before and after the release of Pokemon GO in Yokohama. Japan J Med Internet Res. 2019;21(2):e10724.

    Article  Google Scholar 

  44. Mitchell M, Orstad S, Biswas A, Oh P, Jay M, Pakosh M, et al. Financial incentives for physical activity in adults: Systematic review and meta-analysis. Br J Sports Med. 2019.

  45. Handley MA, Lyles CR, McCulloch C, Cattamanchi A. Selecting and improving quasi-experimental designs in effectiveness and implementation research. Annu Rev Public Health. 2018;39:5–25.

    Article  Google Scholar 

  46. Finkelstein EA, Haaland BA, Bilger M, Sahasranaman A, Sloan RA, Nang EE, et al. Effectiveness of activity trackers with and without incentives to increase physical activity (TRIPPA): a randomised controlled trial. Lancet Diabetes Endocrinol. 4(12):983–95.

    Article  Google Scholar 

Download references


Carolyn Tailor for help developing statistical approach and conducting analyses. All who have contributed significantly to the work have been acknowledged.


Public Health Agency of Canada (PHAC) Multi-Sectoral Partnership Approach to Healthy Living and Chronic Disease Prevention, as well as the Canadian Institutes of Health Research (CIHR)-PHAC Chair in Applied Public Health. The funding body did not contribute to any aspect of the study.

Author information

Authors and Affiliations



All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by MM, EL, LW and GF. The first draft of the manuscript was written by MM and EL and all authors commented on previous versions of the manuscript. All authors read and approved the submitted manuscript and have agreed to be personally accountable for their contribution.

Corresponding author

Correspondence to Marc Mitchell.

Ethics declarations

Ethics approval and consent to participate

The University of British Columbia Behavioural Research Ethics Board approved this study (H17–02814). This study involved the secondary use of de-identified data. There was no consent for this secondary data analysis. However, app users were informed of and had to accept the app’s privacy policy describing how de-identified data may be used for reporting purposes and presented in aggregate.

Consent for publication

Not applicable.

Competing interests

MM received consulting fees from Carrot Insights Inc. from 2015 to 2018 as well as travel re-imbursement in January and March 2019. MM had stock options in the company as well but these are now void since Carrot Insights Inc. went out of business in June 2019. LW was employed by Carrot Insights Inc. from March 2016 to June 2019 and also had stock options which are now void. EL and GF declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1:

STROBE Checklist.

Additional file 2:

Mobile Application Rating Scale (MARS) and self-score.

Additional file 3:

App Behavior Change Scale (ABACUS) and self-score.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mitchell, M., Lau, E., White, L. et al. Commercial app use linked with sustained physical activity in two Canadian provinces: a 12-month quasi-experimental study. Int J Behav Nutr Phys Act 17, 24 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: