Skip to main content

Physical activity surveillance in the European Union: reliability and validity of the European Health Interview Survey-Physical Activity Questionnaire (EHIS-PAQ)



The current study examined the reliability and validity of the European Health Interview Survey-Physical Activity Questionnaire (EHIS-PAQ), a novel questionnaire for the surveillance of physical activity (PA) during work, transportation, leisure time, sports, health-enhancing and muscle-strengthening activities over a typical week.


Reliability was assessed by administering the 8-item questionnaire twice to a population-based sample of 123 participants aged 15-79 years at a 30-day interval. Concurrent (inter-method) validity was examined in 140 participants by comparisons with self-report (International Physical Activity Questionnaire-Long Form (IPAQ-LF), 7-day Physical Activity Record (PAR), and objective criterion measures (GT3X+ accelerometer, physical work capacity at 75 % (PWC75%) from submaximal cycle ergometer test, hand grip strength).


The EHIS-PAQ showed acceptable reliability, with a median intraclass correlation coefficient across PA domains of 0.55 (range 0.43–0.73). Compared to the GT3X+ (counts/minutes/day), the EHIS-PAQ underestimated moderate-to-vigorous PA (median difference -11.7, p-value = 0.054). Spearman correlation coefficients (ρ) for validity were moderate-to-strong (ρ’s > 0.41) for work-related PA (IPAQ = 0.64, GT3X + =0.43, grip strength = 0.48), transportation-related PA (IPAQ = 0.62, GT3X + =0.43), walking (IPAQ = 0.58), and health-enhancing PA (IPAQ = 0.58, PAR = 0.64, GT3X + =0.44, PWC75% = 0.48), and fair-to-poor (ρ’s < 0.41) for moderate-to-vigorous aerobic recreational and muscle-strengthening PA.


The EHIS-PAQ showed good evidence for reliability and validity for the measurement of PA levels at work, during transportation and health-enhancing PA.


Insufficient physical activity (PA) is the fourth-leading risk factor for premature mortality in western Europe and among the top 10 globally, causing about 5.3 million deaths per year [13]. Physical inactivity has increased substantially during the past decades in the Western world; about one third of adults worldwide do not engage in physical activity at sufficient levels [2, 4].

As the evidence of the adverse effects of physical inactivity accumulates, international policy frameworks have begun to acknowledge the importance of PA [5, 6]. Policy development and evaluation depend on consistent, understandable assessments of prevalence and trends in physical activity and adherence to PA recommendations. Continued improvements in monitoring PA are needed to guide development of policies and programs to increase activity levels and to reduce the burden of chronic disease. Consequently, the WHO and the European Commission have published policy guidelines to provide support in developing related policies [7, 8].

Recently, a health-enhancing physical activity (HEPA) policy audit tool for successful implementation of a population-wide approach to PA promotion across the life course was released by the WHO Regional Office for Europe [9]. A key element of the HEPA tool is the development and implementation of national PA guidelines [9]. So far, less than 40 % of all 53 countries of the WHO European Region have developed national PA recommendations [10]. The European Health Interview Survey (EHIS) is an integral part of the European Commission’s European Core Health Indicators [11], a component of the EU public health surveillance system, which provides data regarding PA prevalence and trends for cross-country comparisons.

Between 2006 and 2010, population-based data for the first cycle of the EHIS were collected in member states of the European Union. PA was assessed using a modified version of the International Physical Activity Questionnaire – Short Form (IPAQ-SF) [1215]. The IPAQ-SF was developed to facilitate surveillance based on a global standard and is one of the most widely used PA questionnaires [12, 15]. However, although the psychometric properties of the IPAQ-SF have been established and proven to be acceptable [12, 14, 15], the first EHIS wave revealed major difficulties with the IPAQ-SF during data collection [13]. Expert focus groups and cognitive testing studies showed substantial problems with regards to understanding different PA intensity levels, to indicate durations of routine activities such as walking or sitting, and to combine multiple activities to provide the total amount of PA [13, 16]. Consequently, the EHIS Core Group (a group of national health survey experts) commissioned development of the European Health Interview Survey-Physical Activity Questionnaire (EHIS-PAQ), a short, domain-specific PA questionnaire, which allows for the estimation of indicators for total PA, work-related PA, transportation-related PA, and health-enhancing leisure-time PA [13].

The current study examined the reliability and validity of the EHIS-PAQ in a population-based sample of adults drawn from a city in southern Germany. First, 30-day test-retest reliability was examined. Second, to assess concurrent (inter-method) validity, the correlations of the EHIS-PAQ with the International Physical Activity Questionnaire-Long Form (IPAQ-LF), a 7-day PA record (PAR), accelerometer, cardiorespiratory fitness, subjective performance limits, and hand grip strength were tested.


Study sample

A priori sample size calculations for statistical indices of test-retest reliability indicated that about 150 participants would be required to achieve sufficient statistical power [17]. Based on published trends of participation rates of population surveys, we anticipated that approximately 20 % of sampled individuals would agree to participate in the examination [18, 19]. Thus, we sampled 740 individuals aged 18 to 79 years (stratified by age and sex) from the population registry of the city of Regensburg in southern Germany and invited them to participate in the study. Baseline examinations were performed between November 25, 2013 and February 14, 2014 at the University of Regensburg among 140 of the 740 invited individuals (response proportion = 18.9 %). Approximately 30 days after the baseline examination, study participants were sent a resurvey questionnaire and 138 individuals (98.6 %) responded. Complete data were available from 123 participants for the reliability study and, depending on the number of missing values, between 75 and 140 for the validity study.


Data were collected by certified personnel with respect to participants’ socio-demographic characteristics, anthropometry, PA, cardiorespiratory fitness, and hand grip strength. Anthropometric measurements were conducted with participants in light underwear and not wearing shoes and they included height, weight, and hip and waist circumferences. Height was measured in centimeters using a digital stadiometer (SECA Stadiometer 274, SECA, Hamburg, Germany) and weight was measured in kilograms using a digital scale (SECA mBCA 515, SECA, Hamburg, Germany). The body mass index (BMI) was computed using the ratio of weight and height in meters squared (i.e., kg/m2).


The EHIS-PAQ consists of 8 items covering physical activities during work, transportation, and leisure time (including sports activities), aerobic health-enhancing and muscle-strengthening PA during a typical week [13]. Details regarding the development of the EHIS-PAQ and questionnaire items are provided in Finger et al. [13]. The work-related PA item was taken from the U.S. Behavioral Risk Factor Surveillance Survey [20] and it asked about light, moderate, and heavy and physically demanding activities. This particular item was selected because of its high reliability and validity [2123]. Four items focused on commuting and active traveling to get from one place to another and inquired about the number of days per week and the time per day spent walking and cycling. Transportation-related walking and cycling items were adapted from the Global Physical Activity Questionnaire (GPAQ) [24] and the IPAQ-LF [12, 14]. The final section of the EHIS-PAQ asked about sports, fitness, and recreational leisure-time physical activities. Sports, fitness, and recreational physical activities were assessed in line with the aerobic PA definition of the CDC/WHO [2527] but the distinction between moderate and vigorous intensity PA was removed and instead, the item refers to ‘at least moderate intensity’ PA. Participants were asked about how many days and the total duration during a typical week they spent in leisure time sports or fitness pursuits. The last question queried about the days per week the participants engaged in muscle-strengthening PA. The question was adapted from the U.S. National Health Interview Survey - Adult Core questionnaire [28].

We constructed several indices and binary indicators from EHIS-PAQ responses [13]. The work-related PA index summed dichotomous items for light, moderate, and vigorous activities and ranged from 1 to 3. The binary work-related PA indicator compared individuals who ‘mostly sit or stand’ when working with those who perform mostly tasks of ‘at least moderate physical effort’. At an average pace, walking requires about half the energy expenditure of cycling [29]. Thus, we derived a transportation-related PA index (in metabolic equivalent (MET)-minutes per day) by summing the minutes spent walking and cycling, each weighted with MET intensity values (i.e., 3.3 for walking and 6.0 for cycling), provided by Ainsworth’s PA Compendium [29], as suggested by the IPAQ-LF data processing guidelines [30]. A transportation-related PA indicator was generated by grouping participants who fell into the fifth quintile and those who fell into the lower four quintiles of the transportation activity index [13]. According to the WHO recommendations, adults aged 18 years or older should perform at least 150 min of moderate-intensity aerobic PA and at least two occasions of muscle-strengthening PA per week [27]. For constructing an indicator that estimates compliance with the WHO aerobic PA guidelines based on the EHIS-PAQ, information on transportation-related and leisure-time PA can be combined. An index of HEPA was derived by summing the minutes per day spent walking, cycling and engaging in leisure time moderate-intensity PA, where walking minutes were weighted by 0.5 [13, 29]. A recent study indicated that moderate PA is more strongly correlated with objective measurements (accelerometer and heart rate), when walking is excluded [31]. An alternative HEPA index, which was not considered in the current study, could therefore exclude walking from its computation [13]. We also computed the time spent in moderate-to-vigorous aerobic recreational activities (in minutes per day) and muscle-strengthening activities (in occasions per week). Compliance with muscle-strengthening PA guidelines was present when individuals reported at least two occasions of muscle-strengthening PA per week [13, 29]. An indicator of total PA was defined as sufficient aerobic PA according to the WHO recommendation, and being physically active at work.

Self-reported and objective criterion measures

The self-administered IPAQ-LF used in the present study was based on a validated and back-translated German-language version [12, 14]. The IPAQ-LF was chosen because it evaluates the time and frequency spent in activity domains similar to the EHIS-PAQ. The IPAQ covers four domains of PA: work-related, transportation, household/gardening, and leisure-time activity. For each of the four domains, the number of days per week and the time spent in both moderate and vigorous activity were recoded. To achieve comparability with the outcome measures derived from the EHIS-PAQ, we computed the minutes per day spent in the four activity domains instead of the number of MET-minutes per week, as suggested by the IPAQ-LF data processing guidelines [30]. Participants completed a 7-day PAR regarding activities during work, transportation, leisure-time, and household [32]. For each day, the record consisted of lines of activities grouped into the categories sleep and rest periods, activities at work, leisure time plus home activities, and sports. Total time (in minutes per day) and time spent in each of the four PA domains (i.e., work, transportation, leisure-time, household) were estimated and used as analysis variables. It has been demonstrated that the 7-day PAR is highly correlated with the doubly-labelled water method of energy expenditure assessment [32].

Accelerometer is an established simple, non-invasive and cost-efficient method for objectively measuring PA in a detailed and objective manner [3335] and were therefore selected as a reference for the concurrent validity study. Participants were asked to wear a GT3X+ accelerometer (ActiGraph LLC, Pensacola, Florida) on a belt at the natural waistline on the right hip in line with the right axilla during daytime and nighttime for seven consecutive days. Data were processed using standard methods; raw data collected from movement registering on the vertical axis were integrated in 60 s periods (epochs). Non-wear time was defined as an interval of at least 60 consecutive minutes of zero counts, allowing for intervals of 1 to 2 min of relatively low counts (i.e., 1-100 counts). Valid wear days were defined as ≥600 min wear time and participants (n = 6) with less than three or more valid wear days were excluded [35]. 113 participants provided data for the criterion validity analyses comparing accelerometer data with EHIS-PAQ variables. A cut-off of 1,952 counts per minutes was used to differentiate sedentary-to-light and moderate-to-vigorous activity [36, 37].

Submaximal incremental exercise testing for estimating cardiorespiratory fitness was performed using a calibrated electromagnetically braked cycle ergometer (Ergosana Sana Bike 350/450, Ergosana, Bitz, Germany). Cardiorespiratory fitness is a measure of the capacity of the cardiovascular system to transport oxygen and the capacity of the muscle to use it [38, 39]. The self-reported performance limit (in METs) was estimated using the Veterans Physical Activity Questionnaire (VSAQ) [40]. The VSAQ and the Physical Activity Readiness Questionnaire (PAR-Q) were used for protocol alignment, assessment of the eligibility of the physical fitness appraisal, and derivation of a fitness-adjusted ergometer starting value [40, 41]. The WHO protocol included the following steps: one minute of rest, cycling at a starting workload determined by the VSAQ, and stepwise increases in workload of 25 W every two minutes until 85 % of the estimated age-specific maximum heart rate (i.e., 208-age*0.7 [42]) was exceeded or the maximum intensity level was reached (i.e., 100-175 W) or study personnel terminated the exercise testing due to chest pain, followed by two minutes of recovery [41, 43, 44]. Participants (n = 27) were excluded from ergometry if they had a pacemaker, their body weight was >160 kg, they had an elevated blood pressure (≥180/110 mmHg) or resting heart rate (≥100/min), they exhibited ECG abnormalities, arrhythmia, or showed contraindications based on the PAR-Q. The physical work capacity at 75 % (PWC75%) of the age-predicted maximum heart rate was determined according to Finger et al. [41] and was used as a primary measure of cardiorespiratory fitness. And 75 participants provided data for the criterion validity analyses comparing the PWC75% with EHIS-PAQ variables.

Muscular fitness is a component of physical function that consists of muscular strength, endurance and power and was used to assess the validity of the EHIS-PAQ question on resistance activity. Isometric grip strength (in kg) was measured in seated position using a hand dynamometer (Jamar Plus+, Sammons Preston, Rolyon, Bolingbrook, IL). Each participant’s grip strength was measured three times with each hand while sitting in a straight-backed chair, feet flat on the floor, shoulders adducted in neutral position, arms unsupported, elbows flexed at 90°, and forearms in neutral rotation. The maximum value (in kilograms) of the six measurements was used [45].

Statistical analyses

Data on quantitative characteristics are expressed as median (interquartile range) and data on qualitative characteristics are expressed as percent values. Differences between men and women were tested using Kruskal-Wallis and χ2 tests. Test-retest reliability was assessed using two EHIS-PAQ administrations spaced approximately 30 days apart and quantified as intraclass correlation coefficient (ICC) and corresponding 95 % confidence limits, estimated using a mixed model [46]. ICCs of >0.5 and >0.7 were considered acceptable and good, respectively [47]. Spearman’s rank ordered correlation coefficients (ρ) were used to measure concurrent (inter-method) validity and the following benchmarks were used for interpretation: 0–0.20 = poor correlation, 0.21–0.41 = fair, 0.41–0.60 = moderate/acceptable, ≥0.6 = strong [48, 49]. The strength of agreement between methods and systematic mis-measurement was assessed using the Bland-Altman technique, which provides the mean bias and the 95 % limits of agreement (±2 standard deviations (SD) of the difference) and is plotted as the difference between the methods against the mean of the methods for visual inspection of error patterns [50]. Median differences between the EHIS-PAQ and accelerometer data were tested using a median regression model. Effect-measure modification with sex (dichotomous), age (continuous), and BMI (continuous) was tested using multiplicative interaction terms in median regression models. The significance level was set at p < 0.05. Analyses were performed using SAS 9.3 (SAS Institute, Cary, North Carolina) and PASS 13 Power Analysis and Sample Size Software (NCSS, LLC. Kysville, Utah).


Baseline characteristics of study participants

Socio-demographic, behavioral, anthropometric, and measures of cardiorespiratory and musculoskeletal fitness of the 140 study participants are shown in Table 1. The median age was 55 years, about half of the participants were married, more than two thirds had at least 10 years of schooling, and one third reported a history of cardiovascular disease. Men had higher BMI, PWC75%, self-assed performance limits, and handgrip strength than women.

Table 1 Participant characteristics at baseline (n = 140)

Physical activity assessed by the EHIS-PAQ

Table 2 provides information regarding binary PA indicators and PA indices of the EHIS-PAQ. In the total sample, 49 % reported work-related PA, about 24 % performed PA during transportation, 38 % had at least two occasions per week of muscle-strengthening activity, 61 % were compliant with the WHO health-enhancing aerobic PA recommendation, and 79 % reported any PA (i.e., achieved the WHO health-enhancing aerobic recommendation or active at work) (Table 2). Compared to women, men had higher values on the transportation-related PA and health-enhancing PA indices, and longer walking time.

Table 2 Physical activity assessed by the European Health Interview Survey Interview Physical Activity Questionnaire (EHIS-PAQ) (n = 140)

Test-retest reliability

ICCs for the 30-day test-retest reliability of the EHIS-PAQ ranged from 0.43 for the health-enhancing PA index to 0.73 for moderate-to-vigorous aerobic recreational PA (Table 3). The repeatability of the work-related PA index and the transportation-related PA were superior to the recall of walking time, cycling time, and muscle-strengthening activity. A higher reliability was noted among men, participants with BMI ≥ 25 kg/m2, and older adults (Additional file 1: Table S1).

Table 3 Test-retest reliability of the European Health Interview Survey Interview Physical Activity Questionnaire (EHIS-PAQ) (n = 123)

Concurrent validity

Concurrent validity correlation coefficients of the EHIS-PAQ indices with the IPAQ-LF, PAR, GT3X+, PWC75%, and grip strength are shown in Table 4. Strong correlations of the EHIS-PAQ and IPAQ-LF were apparent in the work and transportation domains, and for the HEPA index. Similarly, correlations were moderate in the work domain of the 7-day PAR and between the HEPA index and 7-day leisure time PA. Correlation coefficients were poor-to-fair for the remaining EHIS-PAQ indices.

Table 4 Concurrent validity comparing the European Health Interview Survey Interview Physical Activity Questionnaire (EHIS-PAQ) with the International Physical Activity Questionnaire (IPAQ), 7-day Physical Activity Record (PAR), GT3X+ accelerometer, physical work capacity at 75 % maximum heart rate (PWC75%), and grip strength

Moderate correlations with total accelerometer activity were found for transportation-related PA. Correlations between accelerometry-based moderate-to-vigorous activity and EHIS-PAQ domains were moderately strong for transportation activity. Correlations between accelerometry-based light activity and EHIS-PAQ measures were weak, except for work-related PA. Poor-to-fair correlations with accelerometry-based criterion measures were also found for the remaining EHIS-PAQ indices. A fair correlation with PWC75% was noted for the work-related PA and a moderate correlation was seen with HEPA. The association with grip strength was moderate for work-related PA and it was poor to fair for the remaining EHIS-PAQ indices. The median difference between moderate-to-vigorous PA levels from the EHIS-PAQ and the accelerometer was -11.7 min per day; however, the underestimation did not depend on the activity level (Additional file 1: Table S2, Fig. 1). There were no differences in the underestimation of moderate-to-vigorous PA levels by sex, age, and BMI, as indicated by statistically non-significant p-values for interaction (Additional file 1: Table S2).

Fig. 1
figure 1

Bland-Altman plot for the agreement of moderate-to-vigorous physical activity (MVPA) from the European Health Interview Survey Interview Physical Activity Questionnaire (EHIS-PAQ) and the GTX3+ accelerometer (N = 119). min/d, minutes per day


The current study tested the reliability and validity of the EHIS-PAQ, a short PA questionnaire for use in population-based surveillance in a multinational health interview survey context that was developed with the main goals of being easy to complete and enabling estimation of work-related PA, transportation-related PA, leisure PA, HEPA, and muscle-strengthening PA [13]. It was modeled after similar survey items of the IPAQ-LF [12], U.S. Behavioral Risk Factor Surveillance Survey [51], GPAQ [24], and U.S. National Health Interview Survey [28]. The EHIS-PAQ was developed to replace the IPAQ-SF as a PA measurement instrument for the second wave of the EHIS [13]. Field work during data collection of the first EHIS wave, expert groups and cognitive testing had revealed difficulties of study participants answering the IPAQ-SF [13, 16]. Major problems included misunderstanding of the concepts of ‘(light and) moderate’ and ‘vigorous’ PA, classifying activities into different PA intensities, recalling instances of routine activities such as walking, cycling, and sitting, and combining durations of activities in different domains and intensities for calculating the total amount of PA [16]. These problems were particularly evident in respondents aged 60 years or older. Thus, PA surveillance questionnaires used in general population surveys need to consider the cognitive abilities of older people and those with lower reading levels and cognitive abilities [52, 53]. Subsequently, the EHIS-PAQ was developed and pilot tested with the aim of improving understandability, allowing for reliable and valid estimation of PA levels in all population subgroups (including older adults and people with lower cognitive abilities), removing the distinction between intensity levels of PA, defining a minimum intensity level of at least moderate-to-vigorous intensity, and assessing activities of different PA domains [13].

Reliability analyses revealed a median ICC of the EHIS-PAQ of 0.55, with ICCs ranging from 0.43 to 0.73 for individual activity domains. Reproducibility was not modified by gender or age. Taken together, the EHIS-PAQ showed good evidence for reliability, particularity for work-related PA, transportation-related PA, and moderate-to-vigorous aerobic PA. Validity coefficients comparing the EHIS-PAQ questionnaire with self-report and objective criterion measures were stronger and most consistent for HEPA, showing correlation coefficients of 0.58 (IPAQ-LF), 0.64 (7-day PAR), 0.41 (accelerometry), and 0.48 (cardiorespiratory fitness). This highlights the value of the approach taken by the EHIS-PAQ of combining recreational and transportation activity to arrive at an indicator of compliance with the aerobic PA guidelines. By comparison, correlations were slightly weaker for moderate-to-vigorous aerobic recreational activity (disregarding transportation activity), showing correlations between EHIS-PAQ and reference measures of 0.45 (IPAQ), 0.51 (7-day PAR), 0.32 (accelerometry), and 0.25 (cardiorespiratory fitness).

For work-related PA, comparisons of the EHIS-PAQ with the IPAQ-LF and the 7-day PAR yielded moderate/acceptable and strong correlation coefficients of 0.64 and 0.47, respectively. This indicates that distinguishing between individuals who mostly sit or stand and those who perform mostly tasks of at least moderate physical intensity, as is done in the EHIS-PAQ, is particularly useful when assessing occupational activity. For transportation-related PA, we noted a reasonably strong correlation between the EHIS-PAQ and the IPAQ-LF (0.62), which could in part be due to similar question wording and the resulting correlated error structure between those two instruments [12, 14, 15]. By comparison, we found no correlation for transportation activity between the EHIS-PAQ and the 7-day PAR (ρ =−0.07), which may be explained by the fact that the 7-day PAR inquired about work-related transportation activity only and did not assess transportation in other activity domains [32]. Correlation of the EHIS-PAQ with objective criterion measures from the accelerometer, cardio-respiratory exercise testing, and hand grip strength were weaker. For example, the correlation between muscle-strengthening activity from the EHIS-PAQ and hand grip strength was only 0.10 in the overall sample. However, a more pronounced correlation coefficient of 0.42 was found in participants aged 50 years or older. Also, in those aged 50 years or older, muscle-strengthening activity was more strongly associated with cardiorespiratory fitness (0.63) than with grip strength. This suggests that muscle-strengthening activity represents a marker of fitness in middle-aged and elderly individuals and emphasizes the utility of considering population sub-groups in the current validation study.

Our results show that the EHIS-PAQ exhibited psychometric properties that are comparable to established self-report PA questionnaires. Three recent reviews summarized the reliability and criterion validity of common self-report PA questionnaires for adults [15, 53, 54]. According to Helmerhorst et al. [53], few PA questionnaires scored high on both reproducibility and validity, with median reproducibility coefficients of 0.62 to 0.71, and median validity coefficients of 0.30 to 0.41. The IPAQ-SF showed strong repeatability in adults (ρ = 0.76) [12, 53]. A meta-analysis of the validity of the IPAQ-SF reported modest validity correlation coefficients of the total PA level with objective standards in the range 0.09-0.39; none reached the minimal acceptable standard in the literature (i.e., 0.50 for objective activity measuring device; 0.40 for fitness measures) [15, 53, 55]. Time spent walking from the IPAQ-SF showed the highest validity with counts obtained from objective devices, while moderate or vigorous PA correlated weakly with objective standards [15].

This study has several strengths and limitations that need to be considered when interpreting the findings. Participants were recruited using methodologically rigorous general population sampling methods, which helped improve the generalizability of our results. We used objective instruments as reference measures, including an accelerometer, cardiorespiratory fitness, and handgrip strength, thereby enhancing the criterion validity of the study [55]. One methodological limitation is that accelerometers inadequately measure cycling, which was a common activity in our study and may help explain the weak correlation between cycling for transportation from the EHIS-PAQ and moderate-to-vigorous activity based on accelerometry data [56]. In addition, we employed the IPAQ-LF [12] and a 7-day PAR [32] as reference instruments, which allowed us to evaluate the validity of comparable domains of PA. However, recall periods of the self-report questionnaires differ. While the EHIS-PAQ assesses PA during a ‘typical week’, the IPAQ-LF and the 7-day PAR ask about the time participants spent being physically active in the last 7 days. The discrepancy in different time frames can impede comparability because physical activity during the last 7 days might not necessarily reflect the physical activity undertaken in a typical week. For example, physical activity levels are subject to seasonal variation [57] or might be lower when participants are suffering from an acute illness. The modest correlations observed between certain EHIS-PAQ scores and corresponding activity domains based on reference measures indicate that more suitable external criterion measures would have better reflected the EHIS-PAQ scores.


Overall, the current study examined the reproducibility and validity of the newly developed EHIS-PAQ to monitor PA levels. Findings indicated acceptable-to-good reliability and validity of the questionnaire, which is in good agreement with published review articles and meta-analyses [15, 53, 54]. Notwithstanding some degree of measurement error associated with the EHIS-PAQ, we conclude that the questionnaire quantifies PA and its sub-domains with sufficient validity for use in surveillance studies to inform public policy. Future validation studies should consider using doubly-labeled water as a criterion that despite its high cost remains the recommended standard [15]. Another remaining challenge is to derive harmonized PA measures from the IPAQ-SF and EHIS-PAQ that will allow investigating PA trends based on EHIS waves one and two [58].



body mass index


Center for Disease Control


European Health Interview Survey


European Health Interview Survey-Physical Activity Questionnaire


European Union


Global Physical Activity Questionnaire


health-enhancing physical activity


intraclass correlation coefficient


International Physical Activity Questionnaire-Long Form


International Physical Activity Questionnaire - Short Form


interquartile range


metabolic equivalent


moderate-to-vigorous physical activity


physical activity


7-day Physical Activity Record

PWC75% :

physical work capacity at 75 %


standard deviation


Veterans Physical Activity Questionnaire


World Health Organization


Spearman’s rank ordered correlation coefficients


  1. Ezzati M, Riboli E. Behavioral and dietary risk factors for noncommunicable diseases. N Engl J Med. 2013;369(10):954–64. doi:10.1056/NEJMra1203528.

    Article  CAS  PubMed  Google Scholar 

  2. Hallal PC, Martins RC, Ramirez A. The Lancet Physical Activity Observatory: promoting physical activity worldwide. Lancet. 2014;384(9942):471–2. doi:10.1016/s0140-6736(14)61321-0.

    Article  PubMed  Google Scholar 

  3. Lee IM, Shiroma EJ, Lobelo F, Puska P, Blair SN, Katzmarzyk PT, et al. Effect of physical inactivity on major non-communicable diseases worldwide: an analysis of burden of disease and life expectancy. Lancet. 2012;380(9838):219–29. doi:10.1016/S0140-6736(12)61031-9.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Hallal PC, Andersen LB, Bull FC, Guthold R, Haskell W, Ekelund U, et al. Global physical activity levels: surveillance progress, pitfalls, and prospects. Lancet. 2012;380(9838):247–57. doi:10.1016/S0140-6736(12)60646-1.

    Article  PubMed  Google Scholar 

  5. UN General Assembly. Political declaration of the high-level meeting of the general assembly on the prevention and control of non-communicable diseases. New York: UN; 2011.

    Google Scholar 

  6. World Health Organization. Action Plan for Implementation of the European Strategy for the Prevention and Control of Noncommunicable Diseases, 2012-2016. Copenhagen, Denmark: World Health Organization, Regional Office for Europe; 2012.

  7. EU Working Group "Sport & Health". EU Physical Activity Guidelines Recommended Policy Actions in Support of Health-Enhancing Physical Activity. Brussels, Belgium; 2008.

  8. World Health Organization. Steps to health: A European framework to promote physical activity for health. Copenhagen, Denmark: WHO Regional Office for Europe; 2007.

  9. Bull FC, Milton K, Kahlmeier S. National policy on physical activity: the development of a policy audit tool. J Phys Act Health. 2014;11(2):233–40. doi:10.1123/jpah.2012-0083.

    Article  PubMed  Google Scholar 

  10. Kahlmeier S, Wijnhoven TM, Alpiger P, Schweizer C, Breda J, Martin BW. National physical activity recommendations: systematic overview and analysis of the situation in European countries. BMC Public Health. 2015;15:133. doi:10.1186/s12889-015-1412-3.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Verschuuren M, Gissler M, Kilpelainen K, Tuomi-Nikula A, Sihvonen AP, Thelen J, et al. Public health indicators for the EU: the joint action for ECHIM (European Community Health Indicators & Monitoring). Arch Public Health. 2013;71(1):12. doi:10.1186/0778-7367-71-12.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Craig CL, Marshall AL, Sjostrom M, Bauman AE, Booth ML, Ainsworth BE, et al. International physical activity questionnaire: 12-country reliability and validity. Med Sci Sports Exerc. 2003;35(8):1381–95. doi:10.1249/01.MSS.0000078924.61453.FB.

    Article  PubMed  Google Scholar 

  13. Finger JD, Tafforeau J, Gisle L, Oja L, Ziese T, Thelen J, et al. Development of the European Health Interview Survey - Physical Activity Questionnaire (EHIS-PAQ) to monitor physical activity in the European Union. Arch Public Health. 2015;73:59. doi:10.1186/s13690-015-0110-z.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Kim Y, Park I, Kang M. Convergent validity of the international physical activity questionnaire (IPAQ): meta-analysis. Public Health Nutr. 2013;16(3):440–52. doi:10.1017/S1368980012002996.

    Article  PubMed  Google Scholar 

  15. Lee PH, Macfarlane DJ, Lam TH, Stewart SM. Validity of the International Physical Activity Questionnaire Short Form (IPAQ-SF): a systematic review. Int J Behav Nutr Phys Act. 2011;8:115. doi:10.1186/1479-5868-8-115.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Finger JD, Gisle L, Mimilidis H, Santos-Hoevener C, Kruusmaa EK, Matsi A, et al. How well do physical activity questions perform? A European cognitive testing study. Arch Public Health. 2015;73:57. doi:10.1186/s13690-015-0109-5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Walter SD, Eliasziw M, Donner A. Sample size and optimal designs for reliability studies. Stat Med. 1998;17(1):101–10.

    Article  CAS  PubMed  Google Scholar 

  18. Galea S, Tracy M. Participation rates in epidemiologic studies. Ann Epidemiol. 2007;17(9):643–53. doi:10.1016/j.annepidem.2007.03.013.

    Article  PubMed  Google Scholar 

  19. Schnell R. Survey-Interviews: Methoden standardisierter Befragungen. Berlin, Germany: VS, Verlag für Sozialwiss.; 2012.

  20. US Centers for Disease Control and Prevention PHSBP. Behavioral Risk Factor Surveillance System (BRFSS) Questionnaires. Centers for Disease Control and Prevention, Atlanta. 2015. Accessed 06/15/2015 2015.

  21. Kwak L, Proper KI, Hagstromer M, Sjostrom M. The repeatability and validity of questionnaires assessing occupational physical activity--a systematic review. Scand J Work Environ Health. 2011;37(1):6–29.

    Article  PubMed  Google Scholar 

  22. Yore MM, Bowles HR, Ainsworth BE, Macera CA, Kohl H. Single versus multiple item questions on occupational physical activity. J Phys Act Health. 2006;3(1):102.

    Google Scholar 

  23. Evenson KR, McGinn AP. Test-retest reliability of adult surveillance measures for physical activity and inactivity. Am J Prev Med. 2005;28(5):470–8. doi:10.1016/j.amepre.2005.02.005.

    Article  PubMed  Google Scholar 

  24. Bull FC, Maslin TS, Armstrong T. Global physical activity questionnaire (GPAQ): nine country reliability and validity study. J Phys Act Health. 2009;6(6):790–804.

    PubMed  Google Scholar 

  25. Haskell WL, Lee IM, Pate RR, Powell KE, Blair SN, Franklin BA, et al. Physical activity and public health: updated recommendation for adults from the American College of Sports Medicine and the American Heart Association. Circulation. 2007;116(9):1081–93. doi:10.1161/CIRCULATIONAHA.107.185649.

    Article  PubMed  Google Scholar 

  26. US Department of Health Human Services. Physical activity guidelines advisory committee report. Washington, DC: US Department of Health and Human Services; 2008.

    Google Scholar 

  27. World Health Organization. Global recommendations on physical activity for health. Geneva: WHO Press; 2010.

    Google Scholar 

  28. US Centers for Disease Control and Prevention NCfHS. Adult Physical Activity Questions on the National Health Interview Survey 1975–2012. Centers for Disease Control and Prevention, Atlanta. 2014. Accessed 04/13/2015.

  29. Ainsworth BE, Haskell WL, Herrmann SD, Meckes N, Bassett Jr DR, Tudor-Locke C, et al. 2011 Compendium of Physical Activities: a second update of codes and MET values. Med Sci Sports Exerc. 2011;43(8):1575–81. doi:10.1249/MSS.0b013e31821ece12.

    Article  PubMed  Google Scholar 

  30. IPAQ Research Committee. Guidelines for data processing and analysis of the International Physical Activity Questionnaire (IPAQ)–short and long forms. 2005. Accessed 18 May 2016.

  31. Dahl-Petersen IK, Hansen AW, Bjerregaard P, Jorgensen ME, Brage S. Validity of the international physical activity questionnaire in the arctic. Med Sci Sports Exerc. 2013;45(4):728–36. doi:10.1249/MSS.0b013e31827a6b40.

    Article  PubMed  Google Scholar 

  32. Koebnick C, Wagner K, Thielecke F, Moeseneder J, Hoehne A, Franke A, et al. Validation of a simplified physical activity record by doubly labeled water technique. Int J Obes (Lond). 2005;29(3):302–9. doi:10.1038/sj.ijo.0802882.

    Article  CAS  Google Scholar 

  33. Ainsworth B, Cahalin L, Buman M, Ross R. The current state of physical activity assessment tools. Prog Cardiovasc Dis. 2015;57(4):387–95. doi:10.1016/j.pcad.2014.10.005.

    Article  PubMed  Google Scholar 

  34. Semanik PA, Lee J, Song J, Chang RW, Sohn MW, Ehrlich-Jones LS, et al. Accelerometer-monitored sedentary behavior and observed physical function loss. Am J Public Health. 2015;105(3):560–6. doi:10.2105/AJPH.2014.302270.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Pedisic Z, Bauman A. Accelerometer-based measures in physical activity surveillance: current practices and issues. Br J Sports Med. 2015;49(4):219–23. doi:10.1136/bjsports-2013-093407.

    Article  PubMed  Google Scholar 

  36. Freedson PS, Melanson E, Sirard J. Calibration of the Computer Science and Applications, Inc. accelerometer. Med Sci Sports Exerc. 1998;30(5):777–81.

    Article  CAS  PubMed  Google Scholar 

  37. Gorman E, Hanson HM, Yang PH, Khan KM, Liu-Ambrose T, Ashe MC. Accelerometry analysis of physical activity and sedentary behavior in older adults: a systematic review and data analysis. Eur Rev Aging Phys Act. 2014;11:35–49. doi:10.1007/s11556-013-0132-x.

    Article  PubMed  PubMed Central  Google Scholar 

  38. DeFina LF, Haskell WL, Willis BL, Barlow CE, Finley CE, Levine BD, et al. Physical activity versus cardiorespiratory fitness: two (partly) distinct components of cardiovascular health? Prog Cardiovasc Dis. 2015;57(4):324–9. doi:10.1016/j.pcad.2014.09.008.

    Article  PubMed  Google Scholar 

  39. Myers J, McAuley P, Lavie CJ, Despres JP, Arena R, Kokkinos P. Physical activity and cardiorespiratory fitness as major markers of cardiovascular risk: their independent and interwoven importance to health status. Prog Cardiovasc Dis. 2015;57(4):306–14. doi:10.1016/j.pcad.2014.09.011.

    Article  PubMed  Google Scholar 

  40. Sadik J, Myers J, Froelicher V. A modified nomogram for ramp treadmill testing using the Veterans Specific Activity Questionnaire. Am J Cardiol. 2014;114(5):803–5. doi:10.1016/j.amjcard.2014.06.008.

    Article  PubMed  Google Scholar 

  41. Finger JD, Gosswald A, Hartel S, Muters S, Krug S, Holling H, et al. Measurement of cardiorespiratory fitness in the German Health Interview and Examination Survey for Adults (DEGS1). Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz. 2013;56(5-6):885–93. doi:10.1007/s00103-013-1694-5.

    Article  CAS  PubMed  Google Scholar 

  42. Tanaka H, Monahan KD, Seals DR. Age-predicted maximal heart rate revisited. J Am Coll Cardiol. 2001;37(1):153–6.

    Article  CAS  PubMed  Google Scholar 

  43. Beutner F, Ubrich R, Zachariae S, Engel C, Sandri M, Teren A, et al. Validation of a brief step-test protocol for estimation of peak oxygen uptake. Eur J Prev Cardiol. 2015;22(4):503–12.

    Article  PubMed  Google Scholar 

  44. Myers J, Arena R, Franklin B, Pina I, Kraus WE, McInnis K, et al. Recommendations for clinical exercise laboratories: a scientific statement from the american heart association. Circulation. 2009;119(24):3144–61. doi:10.1161/CIRCULATIONAHA.109.192520.

    Article  PubMed  Google Scholar 

  45. Trampisch US, Franke J, Jedamzik N, Hinrichs T, Platen P. Optimal Jamar dynamometer handle position to assess maximal isometric hand grip strength in epidemiological studies. J Hand Surg Am. 2012;37(11):2368–73. doi:10.1016/j.jhsa.2012.08.014.

    Article  PubMed  Google Scholar 

  46. Hankinson SE, Manson J, Spiegelman D, Willett WC, Longcope C, Speizer FE. Reproducibility of plasma hormone levels in postmenopausal women over a 2-3-year period. Cancer Epidemiol Biomarkers Prev. 1995;4(6):649–54.

    CAS  PubMed  Google Scholar 

  47. De Vet HC, Terwee CB, Mokkink LB, Knol DL. Measurement in medicine: a practical guide. Cambridge: Cambridge University Press; 2011.

    Book  Google Scholar 

  48. Cleland CL, Hunter RF, Kee F, Cupples ME, Sallis JF, Tully MA. Validity of the global physical activity questionnaire (GPAQ) in assessing levels and change in moderate-vigorous physical activity and sedentary behaviour. BMC Public Health. 2014;14:1255. doi:10.1186/1471-2458-14-1255.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159–74.

    Article  CAS  PubMed  Google Scholar 

  50. Bland JM, Altman DG. Measuring agreement in method comparison studies. Stat Methods Med Res. 1999;8(2):135–60.

    Article  CAS  PubMed  Google Scholar 

  51. Yore MM, Ham SA, Ainsworth BE, Kruger J, Reis JP, Kohl 3rd HW, et al. Reliability and validity of the instrument used in BRFSS to assess physical activity. Med Sci Sports Exerc. 2007;39(8):1267–74. doi:10.1249/mss.0b013e3180618bbe.

    Article  PubMed  Google Scholar 

  52. Matthews CE, Moore SC, George SM, Sampson J, Bowles HR. Improving self-reports of active and sedentary behaviors in large epidemiologic studies. Exerc Sport Sci Rev. 2012;40(3):118–26. doi:10.1097/JES.0b013e31825b34a0.

    PubMed  PubMed Central  Google Scholar 

  53. Helmerhorst HJ, Brage S, Warren J, Besson H, Ekelund U. A systematic review of reliability and objective criterion-related validity of physical activity questionnaires. Int J Behav Nutr Phys Act. 2012;9:103. doi:10.1186/1479-5868-9-103.

    Article  PubMed  PubMed Central  Google Scholar 

  54. van Poppel MN, Chinapaw MJ, Mokkink LB, van Mechelen W, Terwee CB. Physical activity questionnaires for adults: a systematic review of measurement properties. Sports Med. 2010;40(7):565–600. doi:10.2165/11531930-000000000-00000.

    Article  PubMed  Google Scholar 

  55. Terwee CB, Mokkink LB, van Poppel MN, Chinapaw MJ, van Mechelen W, de Vet HC. Qualitative attributes and measurement properties of physical activity questionnaires: a checklist. Sports Med. 2010;40(7):525–37. doi:10.2165/11531370-000000000-00000.

    Article  PubMed  Google Scholar 

  56. Lee IM, Shiroma EJ. Using accelerometers to measure physical activity in large-scale epidemiological studies: issues and challenges. Br J Sports Med. 2014;48(3):197–201. doi:10.1136/bjsports-2013-093154.

    Article  PubMed  PubMed Central  Google Scholar 

  57. Pivarnik JM, Reeves MJ, Rafferty AP. Seasonal variation in adult leisure-time physical activity. Med Sci Sports Exerc. 2003;35(6):1004–8. doi:10.1249/01.mss.0000069747.55950.b1.

    Article  PubMed  Google Scholar 

  58. Haskell WL, Troiano RP, Hammond JA, Phillips MJ, Strader LC, Marquez DX, et al. Physical activity and physical fitness: standardizing assessment with the PhenX Toolkit. Am J Prev Med. 2012;42(5):486–92. doi:10.1016/j.amepre.2011.11.017.

    Article  PubMed  PubMed Central  Google Scholar 

Download references


Not applicable.


This work was supported by grants of the Robert Koch Institute Berlin593 (Grant n° 1362-1204) and the European Commission/Eurostat (Grant n° 594 10501.2099.007.2009.890).

Availability of data and materials

Due to property rights data cannot be made available as a public use file.

Authors’ contributions

SEB, SK, BF, JDF and MFL designed the study. JDF developed the EHIS-PAQ. SK and MFL received funding. BF and CT led the data collection. CR performed the statistical analyses. SEB drafted the manuscript. All authors critically revised the manuscript and approved the final version.

Competing interests

All authors declare not having financial or non-financial competing interests.

Consent for publication

Not applicable because the manuscript does not include details, images, or videos relating to individual participants.

Ethics and consent

The investigation was carried out in accordance with the Declaration of Helsinki, including written informed consent of all participants. The Ethics Committee of the University of Regensburg (Germany) (reference # 13-101-0210) approved the study protocol.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Sebastian E. Baumeister.

Additional files

Additional file 1: Table S1.

30 day test-retest reliability of the European Health Interview Survey Interview Physical Activity Questionnaire (EHIS-PAQ) by gender, age, and body mass index (BMI). Table S2 Median difference in moderate to vigorous physical activity between the European Health Interview Survey Interview Physical Activity Questionnaire (EHIS-PAQ) and GT3X+ accelerometer in the total study population and stratified by gender, age, and body mass index (BMI). (DOCX 21 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Baumeister, S.E., Ricci, C., Kohler, S. et al. Physical activity surveillance in the European Union: reliability and validity of the European Health Interview Survey-Physical Activity Questionnaire (EHIS-PAQ). Int J Behav Nutr Phys Act 13, 61 (2016).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: