A comparison of the dietary patterns derived by principal component analysis and cluster analysis in older Australians
International Journal of Behavioral Nutrition and Physical Activity volume 13, Article number: 30 (2016)
Despite increased use of dietary pattern methods in nutritional epidemiology, there have been few direct comparisons of methods. Older adults are a particularly understudied population in the dietary pattern literature. This study aimed to compare dietary patterns derived by principal component analysis (PCA) and cluster analysis (CA) in older adults and to examine their associations with socio-demographic and health behaviours.
Men (n = 1888) and women (n = 2071) aged 55–65 years completed a 111-item food frequency questionnaire in 2010. Food items were collapsed into 52 food groups and dietary patterns were determined by PCA and CA. Associations between dietary patterns and participant characteristics were examined using Chi-square analysis. The standardised PCA-derived dietary patterns were compared across the clusters using one-way ANOVA.
PCA identified four dietary patterns in men and two dietary patterns in women. CA identified three dietary patterns in both men and women. Men in cluster 1 (fruit, vegetables, wholegrains, fish and poultry) scored higher on PCA factor 1 (vegetable dishes, fruit, fish and poultry) and factor 4 (vegetables) compared to factor 2 (spreads, biscuits, cakes and confectionery) and factor 3 (red meat, processed meat, white-bread and hot chips) (mean, 95 % CI; 0.92, 0.82–1.02 vs. 0.74, 0.63–0.84 vs. −0.43, −0.50– −0.35 vs. 0.60 0.46–0.74, respectively). Women in cluster 1 (fruit, vegetables and fish) scored highest on PCA factor 1 (fruit, vegetables and fish) compared to factor 2 (processed meat, hot chips cakes and confectionery) (1.05, 0.97–1.14 vs. −0.14, −0.21– −0.07, respectively). Cluster 3 (small eaters) in both men and women had negative factor scores for all the identified PCA dietary patterns. Those with dietary patterns characterised by higher consumption of red and processed meat and refined grains were more likely to be Australian-born, have a lower level of education, a higher BMI, smoke and did not meet physical activity recommendations (all P < 0.05).
PCA and CA identified comparable dietary patterns within older Australians. However, PCA may provide some advantages compared to CA with respect to interpretability of the resulting dietary patterns. Older adults with poor dietary patterns also displayed other negative lifestyle behaviours. Food-based dietary pattern methods may inform dietary advice that is understood by the community.
Exploring whole dietary patterns, rather than the individual components, has become increasingly important in examining diet and disease relations [1, 2]. With the complex interaction and correlation between nutrients and other food components, dietary pattern analysis has emerged as a more comprehensive assessment of diet . Furthermore, this multi-dimensional and food-based approach can help provide dietary advice that is understood by the community . Three categories of dietary pattern assessment methods exist; theoretical methods, empirical methods and hybrid methods. Theoretical methods, also known as a priori methods, assess diet based on prior knowledge and scientific evidence  for example, the dietary guideline index . Whereas empirical methods, also known as a posteriori methods, use statistical approaches to provide information about existing dietary patterns within the population . Further to these methods are hybrid methods, such as reduced rank regression and partial least squares regression that use a combination of theoretical knowledge and statistical approaches to determine dietary patterns .
Principal component analysis (PCA) and cluster analysis (CA) are two commonly applied empirical dietary pattern methods . PCA uses the correlation matrix of food intake variables to identify common patterns of food consumption within the data in order to account for the largest amount of variation in diet . CA groups individuals with similar dietary patterns into mutually exclusive categories according to the mean of the food intake variables . Several CA algorithms exist, with k-means being the most popular in nutrition research because it can handle a large number of input variables efficiently .
Both PCA and CA have been extensively used for examining dietary patterns [1, 6, 7], however, they take alternative approaches to addressing the issue. Few studies have directly compared the outcomes of PCA and CA within the same data set [8–16]. Those studies show that both PCA and CA are able to identify comparable key dietary patterns, often identifying a fruit and vegetable dominant pattern vs. a red and processed meat pattern. Comparison studies of dietary pattern methodologies can help us to understand the strengths and weaknesses of their application in nutrition research. However, little research has focused on dietary patterns of older adults with only one known comparison study of PCA and CA .
Focusing on health behaviours, such as dietary patterns, among older people has become increasingly important particularly with the ageing population . Prevention campaigns that target diet in older adults may help improve quality of life and reduce morbidity and premature mortality rates . To our knowledge, no studies have explored dietary patterns during the transition period nearing retirement. Peri-retirement, defined as the age of 55 to 65 years, is an important time where major life course transitions occur. Transitional events such as those related to employment, family or health-related circumstances have the potential to impact dietary patterns [19, 20]. Therefore an opportunity exists for public health strategies to be implemented within this population . The objective of the current analysis was to compare dietary patterns derived by PCA and CA and to examine their associations with socio-demographic and health behaviours of a sample of 55 to 65 year old adults.
This study used data collected as part of the Wellbeing Eating and Exercise for a Long Life (WELL) study. The methods of this study has been described in detail elsewhere . In brief, the WELL study is a longitudinal cohort study with data collected via a postal survey. A random sample of 11,256 Australian adults aged 55–65 years at the census date (31 October 2009) in Victoria were selected from the Australian Electoral Commission’s electoral roll, which is compulsory for Australian citizens to be registered on. A stratified random sampling process was used to select the sample according to sex and socio-economic position . A total of 475 could not be delivered or the participant did not meet the studies age criteria, resulting in 10,781 eligible participants. A total of 4082 volunteers returned surveys, (38 % response rate; 48 % male; 53 and 44 % of males and females respectively had obtained an education level of up to year 12, 20 and 28 % had obtained a trade or certificate qualification and 27 and 28 % had obtained a university degree). Those with complete surveys and sufficient dietary data (having completed at least 90 % of the food frequency questionnaire) were included in this study. Ethical approval to conduct the WELL study was approved by Deakin University Human Research Ethics Committee (2009–105).
Dietary intake was assessed using a 111-item Food Frequency Questionnaire (FFQ) adapted from the 1995 Australian National Nutrition Survey [23, 24], based on an existing validated FFQ and has been used in other cohorts in Australia [25–28]. The FFQ assessed participant’s dietary intake over the previous six months, with nine response categories for each item, ranging from ‘never or less than once a month’ to ‘6+ times per day’. No information was gathered on portion sizes. Participants with > 10 % of the FFQ data missing were considered invalid  and not included in this study while all other missing FFQ responses were considered not consumed. FFQ responses were converted to daily equivalents and the 111 items were categorised into 52 food groups according to their nutritional content, culinary usage and the 2013 Australian Dietary Guidelines food groups , in line with previous dietary pattern studies  (Additional file 1: Table S1). FFQ items consumed once per week or more by less than 10 % of the population were combined with other food items where possible or omitted. Only soy beverages were omitted from analyses since a large proportion of the sample (91 %) indicated they never consumed this item. The daily intake frequencies were used to determine dietary patterns as the FFQ did not include portion sizes so grams per day were not available. However servings per day (frequency), is routinely used to determine empirical dietary patterns .
Participant characteristics and health behaviours
Self-reported socio-demographic and health behaviours, including height and weight, were collected in the postal survey. Body mass index (BMI) was calculated and categorized according to the World Health Organization criteria (Underweight: BMI <18.5; Healthy: BMI ≥ 18.5 to < 25 kg/m2; Overweight: BMI ≥ 25 to < 30 kg/m2; Obese: BMI ≥ 30 kg/m2) . Several studies have examined the validity of self-reported height and weight among adults, finding high correlations between self-reported and objectively-assessed weight, including among older adults [32–35].
Self-reported physical activity in the seven days prior to the survey was assessed using the long version of the International Physical Activity Questionnaire (IPAQ) . IPAQ records the frequency, intensity, and duration of leisure time physical activity during the previous week. Minutes of activity per week were calculated by summing the number minutes of moderate intensity physical activity per week and twice the number of minutes per week spent participating in vigorous intensity physical activity per week . Participants were classified as to whether they met the physical activity recommendations of at least 150 min of activity per week .
Principal component analysis
The food groups were entered into the PCA procedure using the software Stata (StataCorp, Version 12.0). Since the PCA output results in a large number of factor solutions (as many as there are food groups), it is important to identify the key dietary patterns. Firstly, factors with eigenvalues >1.0 were considered, then the break in the scree plot was examined to determine the number of key identified dietary patterns and then the interpretability of the identified patterns was assessed . The identified factors were orthogonally rotated to simplify the factor structure and to enhance their interpretability . For each factor, foods with factor loadings of | ≥ 0.2| were considered to contribute significantly to the pattern and used to calculate factors scores . Factor scores were calculated for each of the derived patterns by summing the products of the observed consumption frequency and the factor loading for each of the significant food groups . Factors were numbered and given provisional labels according to the food groups that loaded highly on the pattern.
PCA was initially conducted separately for men and women and Tucker’s coefficient of congruence was used to assess agreement between sexes  to determine if analyses should be stratified by sex. The coefficient of congruence indicated that the dietary factors of men and woman were not similar (data not shown). Therefore all dietary pattern analyses and subsequent tests were stratified by sex.
K-means CA was employed to determine dietary clusters. Frequency of food intake of the 52 food groups were converted to z-scores (standardised), and entered into the cluster algorithm using the software Stata (StataCorp, Version 12). Standardised food intakes were used to ensure all foods have equal influence on the cluster procedure  as cluster analysis is sensitive to outliers . A number of steps were taken to determine the number of identified clusters. Firstly, the Ward’s hierarchical clustering method and the Duda-Hart stopping rule  was considered. Secondly, k-means cluster solutions of 2–8 clusters (the range of clusters found previously in the literature ) were run using the Calinski-Harabasz stopping rule. These stopping rules examine the between- and within-cluster variance to ensure the most distinct clustering k-means cluster solution is obtained [42, 43]. If a cluster contained <10 % of the total sample it was considered too small for adequate statistical power . Finally, the interpretability of clusters was examined to confirm the final solution. The resulting clusters were numbered and given provisional labels according to the food groups that had a significantly higher mean frequency. Since CA is sensitive to small changes  the stability of the final cluster solution was tested. The sample was randomly split in half and the analysis was re-run. Agreement between the clusters of the total sample against a random half was tested with Kappa statistic using standard cut-offs (<0 poor; 0.00–0.20 slight; 0.21–0.40 fair; 0.41–0.60 moderate; 0.61–0.80 substantial; and 0.81–1.00 almost perfect) .
Participant characteristics across tertiles of PCA dietary patterns and dietary clusters were explored using Chi-square analysis. The mean PCA factor scores by clusters were compared using ANOVA and a bonferroni post-hoc test. For ease of interpretation, the factor scores were standardised so that the patterns could be compared on the same scale. Data presented in the text are mean and 95 % confidence intervals (CI) unless otherwise specified.
A total of 3959 (1888 men and 2071 women) participants with complete data were included in this study (Table 1). Compared to men, women were more likely to have a lower BMI, be separated, divorced, widowed or retired, have a lower level of education, were less likely to be smokers and were more likely to be meeting physical activity recommendations (all P < 0.001).
Principal component analysis
PCA identified four dietary patterns in men and two in women (Table 2). For men, factor 1 was characterised by high factor loadings for vegetable dishes, fruit, fish and poultry with a negative loading for potato. Factor 2 was characterised by high loadings for spreads, biscuits, cakes and confectionery. Factor 3 was characterised by high loadings for red and processed meat, white bread, fried fish and hot chips while having negative loadings for muesli or porridge and reduced fat milk. Factor 4 was characterised by high loadings for a range of vegetables (orange, dark green and cruciferous, potato and other vegetables). These patterns explained 5.8, 5.7, 5.6 and 5.6 % of the variation in food intakes, respectively. In women, factor 1 was characterised by vegetables, fruit and fish and factor 2 was characterised by high loadings for cakes, processed meat, hot chips and confectionery. These patterns explained 7.8 and 6.5 % of the variation in food intakes in women.
The three cluster solution produced the best cluster outcome for both men and women as it formed reasonably sized (>10 % of sample size) and well-separated clusters (determined by a high Calinski-Harabasz pseudo F statistic ). The means and standard deviations of the daily food consumption frequency across clusters demonstrated that the identified clustered had varied consumption frequency of key food groups (Table 3). The reliability of the chosen cluster solutions was confirmed by running the analysis on a random 50 % sample, in which the kappa statistic indicated that the random half had good agreement for men (kappa coefficient = 0.72) and very good agreement for women (kappa coefficient = 0.83) in comparison to the total sample (data not shown). Therefore these solutions were considered reliable representations of the dietary clusters in this sample.
In men, cluster 1 (n = 474) was characterised by higher intake of fruit, vegetables, wholegrain bread, fish and poultry. The men within cluster 2 (n = 343) had higher intakes of red and processed meat, white bread, flavoured drinks, cakes, pastries and confectionery. Cluster 3 (n = 1071) was characterised by a lower mean frequency for most food items compared to the other clusters and was called ‘small eaters’. In women, cluster 1 (n = 525) was characterised by higher mean frequency of fruit, vegetables, nuts, legumes and fish. Cluster 2 (n = 409) was characterised by a higher frequency of red and processed meat, white bread, flavoured drinks, cakes, pastries and confectionery. Similar to men, cluster 3 (n = 1137) was labelled ‘small eaters’ and was characterised by a consistently lower mean daily intake frequency for the majority of the food items.
Principal component analysis and participant characteristics
Men in the highest tertile of factor 1 (vegetable dishes, fruit, fish and poultry) were more likely to have been born outside of Australia, obtained a higher level of education, be non-smokers and meet the physical activity recommendations (Table 4). Men in the highest tertile of factor 2 (spreads, biscuits, cakes and confectionery) were more likely to have been born within Australia. Factor 2 also had a weak u-shaped association with meeting physical activity recommendations (P = 0.03). A higher score on factor 3 (red meat, processed meat, white-bread and hot chips) was associated with men who were younger, had a high BMI, a lower level of education and were more likely to be smokers and not meeting physical activity recommendations. Factor 4 (vegetables) was the only PCA pattern associated with relationship status in men, with men living as married more likely to score high on this pattern compared to those separated or never married. Men who also scored high on factor 4 (vegetables) were more likely to have been born in Australia, have a lower level of education and had a weak U-shaped association with meeting physical activity recommendations (P = 0.03). None of the male PCA dietary patterns were associated with retirement status.
For women, a high score on factor 1 (fruit, vegetables and fish) was associated with a lower BMI, a higher level of education, being a non-smoker and meeting the physical activity recommendations (Table 4). Women within the lowest third of factor 2 (processed meat, hot chips cakes and confectionery) tended to be more likely to be born outside of Australia, separated, divorced or widowed, not retired, have a higher education and meeting PA recommendations compared to the middle and highest thirds. No significant associations were shown between PCA dietary patterns and age in women.
Cluster analysis and participant characteristics
Men in cluster 2 (red meat, processed meat, refined grains and high-energy drinks) were more likely to be younger and born in Australia compared to the other clusters (Table 5). A higher proportion of men in cluster 2 were classified as obese (BMI ≥ 30 kg/m2) and cluster 1 (fruit, vegetables, wholegrains, fish and poultry) had a high proportion of men within the healthy range (BMI ≥18.5 < 25 kg/m2). Cluster 1 (fruit, vegetables, wholegrains, fish and poultry) contained men with a higher level of education. Cluster 1 and 3 (small eaters) were more likely to display positive health behaviours (non-smokers and meeting physical activity recommendations) compared to those in cluster 2 (red meat, processed meat, refined grains and high-energy drinks). Relationship status or retirement status of men did not differ between clusters.
The women within cluster 2 (red and processed meat, white bread, flavoured drinks, cakes, pastries and confectionery) were more likely to be overweight (BMI ≥25 < 30 kg/m2) or obese (BMI ≥ 30 kg/m2) compared to the other clusters. Cluster 1 (fruit, vegetables and fish) contained a high proportion of women within healthy weight range (BMI ≥18.5 < 25 kg/m2) (Table 5). Women classified into cluster 2 were more likely to be retired and had achieved a lower level of education compared to the other clusters. Women within cluster 1 (fruit, vegetables and fish) were more likely to be non-smokers and meet physical activity recommendations. No significant differences were found between age, country of birth, and relationship status and clusters in women.
Comparison of principal component analysis and cluster analysis
Men who were grouped into cluster 1 (fruit, vegetables, wholegrain bread, fish and poultry) scored higher on PCA factor 1 (vegetable dishes, fruit, fish and poultry) (mean: 0.92, 95 % CI: 0.82–1.02) and factor 4 (vegetables) (0.74, 0.63–0.84) compared to factor 3 (red meat, processed meat, white-bread and hot chips) (−0.43, −0.50– −0.35) (Fig. 1). Correspondingly, men within cluster 2 (red meat, processed meat, refined grains and high-energy drinks) scored high on factor 3 (red meat, processed meat, white-bread and hot chips) (1.24, 1.12–1.36), followed by factor 2 (spreads, biscuits, cakes and confectionery) (0.60, 0.46–0.74) and scored low on factor 1 (vegetable dishes, fruit, fish and poultry) (−0.12, −0.21– −0.03) and factor 4 (vegetable) (0.14, 0.03–0.25). Men within cluster 3 (small eaters) had negative scores for all four of the PCA patterns. All PCA mean standardized factor scores were significantly different across clusters, except for factor 2 (spreads, biscuits, cakes and confectionery), which did not differ between cluster 1 and cluster 2 (P = 0.16).
Women that were classified into cluster 1 (fruit, vegetables and fish) scored the highest on factor 1 (fruit, vegetables and fish) (1.05, 0.97–1.14) and scored lowest on factor 2 (processed meat, hot chips cakes and confectionery) (−0.14, −0.21– −0.07) (Fig. 2). Women classified into cluster 2 (red meat, processed meat, cereals and confectionery) scored highly on factor 2 (processed meat, hot chips cakes and confectionery) (0.87, 0.73–1.01) compared to factor 1 (fruit, vegetables and fish) (−0.10, −0.18– −0.03). The women in cluster 3 (small eaters) scored low on both the PCA factor 1 and factor 2.
This study demonstrates the comparability between PCA and CA dietary pattern methods with two key dietary patterns (characterised by fruit, vegetables and fish vs. red meat, processed meat and refined grains) identified in both men and women. These dietary patterns are consistent with those previously described in the literature , and showed associations with key socio-demographic variables and health behaviours. These results are consistent with previous comparison studies among adults [8–16] and older adults , however this study is the first to explore dietary patterns in this specific transitional life stage among Australian adults.
Although this study demonstrated consistencies in the identified dietary patterns, some differences were also acknowledged in the outcome from each method. Using PCA, more dietary patterns were identified in men (4) than women (2), perhaps indicating greater variation in dietary intake in men than women of this age group. The PCA-derived factor 2, characterised by spreads, biscuits, cakes and confectionery identified in men, however a corresponding pattern was not evident in CA. There was no difference in the mean scores for this factor across clusters in men, suggesting that all men shared these snacking-type dietary characteristics of factor 2.
In both men and women, the largest cluster identified (small eaters) was characterised by low consumption frequencies for most food groups relative to the other clusters and it contained no dominating food groups. No equivalent pattern was identified in PCA analysis, perhaps as PCA is driven by correlations between input variables (food frequency) rather than the absolute input values. A similar dominating smaller eaters pattern has been described in other studies among older adults aged 65 years and over [46–48] and in adults aged 18 to 64 years . While it has been suggested in other studies that those in the small eaters cluster might be at risk of malnutrition , there is no evidence that the small eaters in the current study are at risk of malnutrition. The small eaters cluster’s mean BMI was 27.7 kg/m2 in men and 26.9 kg/m2 in women and only 0.3 % were considered underweight (BMI < 18 kg/m2). It is possible that this cluster reflects under-reporting, although poorly completed questionnaires were excluded prior to analysis. Furthermore, with respect to under-reporting we would have expect to observe low consumption frequencies for ‘unhealthy’ food groups (e.g., red meat, processed red meat, refined grain) and relatively high consumption frequencies for ‘healthy’ food groups (e.g., fruits, vegetables, fish), which may result particularly from social desirability bias. However, this was not the case in our study. It is also possible that those within the small eater cluster were consuming larger portions but still less frequently than the other clusters. Unfortunately, portion size was not measured in this study, so we cannot investigate the plausibility of this hypothesis. Another possibility is that those in the small eaters cluster have an increased diet variety, consuming low frequencies of many food groups. The authors compared these clusters with adherence to the Australian Dietary Guidelines  in further analyses and found that the small eaters cluster did not demonstrated a higher diet variety, however they did demonstrate higher compliance with the guidelines overall compared to those in cluster 2 (red meat, processed meat, refined grains and high-joule drinks), indicating better diet quality (unpublished results).
Each dietary pattern method has individual strengths and weaknesses. Although CA is good at identifying sub-populations with similar characteristics, it may not always be optimal for looking at the relationship between dietary patterns and health outcomes. The statistical power is limited by the need to use a reference category  and the uneven cluster sizes of the clusters identified in this study limit the power for future analyses. Furthermore, the limited interpretability of these clusters makes it difficult to translate results into practice. In addition, the continuous nature of the PCA factors is advantageous, since they can be assessed as a continuous variable within a regression model and appear more useful in future analyses in the sample.
Our results demonstrate associations with participant characteristics consistent with the current literature. Previous studies in older adult populations (55y+) have found that vegetable-based diets and those consistent with dietary guidelines are associated with being female, a younger age, a higher level of education, physical activity, a higher BMI and not smoking compared to a meat and processed food-type diet less consistent with dietary recommendations [10, 50]. Dietary patterns of this nature have also been associated with increased nutritional status, quality of life and decreased mortality in older adults [51–55].
Due to the small age range (55–65y) we did not expect to find significant relationships between age and dietary patterns. However, we did show that the younger men were more likely to have dietary patterns characterised by red meat, processed meat, white-bread and refined grains. There are mixed results regarding age and dietary patterns [9, 10], and confounding factors such as cultural and social factors may influence the differences in dietary patterns across age between studies.
We showed that men and women with dietary patterns characterised by red meat, processed meat, white-bread and refined grains were more likely to be overweight and obese compared to those whose dietary patterns consisted of high fruit and vegetables consistent with previous results . However, associations between BMI and dietary pattern have been inconsistent across studies [57, 58]. The disparities in results may be a result of the heterogeneous samples characteristics, limitations in dietary pattern measures and the limited ability to determine causality in observational studies.
Our results show a significant association between PCA dietary patterns and relationship status. Women who were married were more likely to have a dietary pattern characterised by processed meat, hot chips cakes and confectionery compared to those who were separated, while married men were more likely to score high on the vegetable pattern compared to those separated. However, relationship status was not associated with clusters. There is limited and inconclusive research available around marital status and dietary patterns . Previous evidence suggests that those living solitary are more likely to have poorer dietary patterns [60–62]. In a longitudinal study improvements in dietary behaviours over 21 years in women were demonstrated whether they remained married or became single . Another longitudinal study demonstrated remarriage or cohabitation had a positive effect on diet while marital break-up had adverse effects on diet and other health behaviours . The barriers to healthy eating may differ by sex, which is important to acknowledge in public health initiatives. Further research in this area is required as living alone may negatively effect diet contributing to poor health outcomes in these individuals .
Retirement status was a significant covariate of dietary patterns for women, but was not important in men. Women who were retired were more likely to have dietary patterns characterised by red and processed meat and refined grains, compared to their non-retired counterparts whose dietary patterns were likely to be characterised by fruit, vegetables, fish and poultry. However, our results are at odds with a previous longitudinal study that found retired women tended to improve dietary patterns post retirement . A review of the evidence on changes in lifestyle behaviours during the transition to retirement concluded that both positive and negative changes occur dependent on the personal circumstances of the retiree , but there is not enough evidence to draw any conclusions on changes in dietary habits . A prospective study compared nutritional patterns 6 months before retirement and 18 months after retirement and found that nutrient consumption did not change after retirement however, there were changes in food-related behaviours such a taking more time for breakfast and lunch, eating out more and having guests for meals more frequently . These social changes among other factors such as presence or absence of illness may play a role in influencing dietary pattern among retirees.
Lower levels of education, a measure of socio-economic position, were associated with poorer dietary patterns in this study, consistent with previous research in adults [56, 66–68]. Unfortunately, in the current study, substantial missing data on income (16 %) restricted further investigation of socio-economic position. The relationship between socio-economic position and diet is complex and the drivers of this relationship are not fully understood. A review on socio-economic position and diet quality highlights that most studies focus on lack of knowledge, cooking skills and motivation, in those with lower levels of education accounting for poorer dietary intakes . However, there is little evidence to confirm these theories since the relationship between socio-economic position and dietary intake is multifactorial . Missing data is a common occurrence with relation to sensitive information such as income .
Poor diet, smoking and low physical activity are key independent risk factor for chronic diseases [1, 18, 71] Consistent with previous studies, those with poorer dietary patterns (charaterised by red and processed meat and refined grains as opposed to fruit and vegetables) were more likely to be smokers and have lower levels of physical activity [4, 16, 68]. This may identify a group of at risk older adults who demonstrate a cluster of poor health behaviours.
Possible limitations of this study should be considered. No causal relationships could be determined due to the cross-sectional design of this study and the study relied on self-reported measures, which may result in measurement error, for example height, weight and BMI. However, self-reported height and weight has previously been shown to be a valid estimate of BMI in large epidemiological studies [32, 33, 72].
Empirically-based dietary pattern techniques have inherent limitations for dietary pattern analysis. Several researcher-determined decisions are required such as the collapsing and format of input variable, the number of derived patterns and assigning labels for example [6, 7]. In the current study, steps were taken to reduce such subjectivity. For example, the FFQ foods were grouped based on approaches used in previous literature and consistent with the Australian Dietary Guidelines . Established criteria and best practice were used to determine the dietary patterns and objective criteria were used to compare the dietary patterns between men and women in PCA. The use of FFQs are known to be susceptible to measurement error of dietary intake, however other methods such as food records or 24-h recalls would have substantially increased subject burden. This FFQ used in this study has previously been used to assess dietary patterns and behaviours, demonstrating that it is a valid predictor of health outcomes and suggesting it has predictive validity [28, 52, 4, 73].
A limitation of the FFQ used is that it did not measure portion sizes and therefore energy intake could not be estimated  and input variables for dietary pattern analysis could not be adjusted for energy intake. However, non energy-adjusted frequency is more sensitive to the intake of important low-energy foods such as fruit and vegetables [5, 8, 75] and previous research has questioned the need for energy adjustment [75–77]. There is conflicting evidence regarding best practice and adjusting for energy may have different implications for different dietary pattern assessment techniques .
Strengths of this study include the population-based design of the WELL study, the focus on older adults and the comparison of different methods. Although the response rate was modest (38 %), the sampling technique resulted in a large sample with characteristics consistent with both state  and national data [79, 80]. For example, at baseline the WELL study participants had similar levels of employment in comparison to national figures (60 vs. 61 % in full time or part time employment) and they were more highly educated (28 vs. 19 % had completed a university degree or higher). Participants were less likely to be overweight or obese in comparison to national data (64 vs. 74 %) and were less likely to be current smokers (12 vs. 15 %) [80–82]. A similar proportions of the WELL sample were meeting fruit (10 vs. 11 %) and vegetable (61 vs. 56 %) recommendations compared to the national population of the same age . Furthermore, the specific age range of 55–65 years captures an understudied population during a transitional life stage and the comparative nature of this study adds to the limited research in this area. Of the studies that have compared PCA and CA, they have concluded that although the dietary assessment methods are different, the dietary patterns identified often have similar qualities including a fruit and vegetable dominant pattern vs. a red and processed meat pattern . In order to enhance the understanding of the dietary patterns identified in this study population, validation against health outcomes or clinical markers of disease would be advantageous .
Both PCA and CA identified two key dietary patterns in peri-retirement aged men and women. These results add to the limited literature on dietary patterns in older adults. Overall, PCA identified dietary patterns that were more interpretable than CA. This study showed that those with poor diets tend to also display negative health behaviours including smoking and not meeting physical activity recommendations, initiatives targeting these collective health behaviours, which are risk factors for chronic disease, may help to improve the health of older adults.
analysis of variance
body mass index
Food Frequency Questionnaire
International Physical Activity Questionnaire
principal component analysis
Wellbeing Eating and Exercise for a Long Life study
Kant AK. Dietary patterns and health outcomes. J Am Diet Assoc. 2004;104(4):615–35.
Slattery ML. Analysis of dietary patterns in epidemiological research. Appl Physiol Nutr Metab. 2010;35(2):207–10.
Waijers PM, Feskens EJ, Ocke MC. A critical review of predefined diet quality scores. Br J Nutr. 2007;97(2):219–31.
McNaughton SA, Ball K, Crawford D, Mishra GD. An index of diet and eating patterns is a valid measure of diet quality in an Australian population. J Nutr. 2008;138(1):86–93.
Ocke MC. Evaluation of methodologies for assessing the overall diet: dietary quality scores and dietary pattern analysis. Proc Nutr Soc. 2013;72(2):191–9.
Newby PK, Tucker KL. Empirically derived eating patterns using factor or cluster analysis: A review. Nutr Rev. 2004;62(5):177–203.
Devlin UM, McNulty BA, Nugent AP, Gibney MJ. The use of cluster analysis to derive dietary patterns: methodological considerations, reproducibility, validity and the effect of energy mis-reporting. Proc Nutr Soc. 2012;71(4):599–609.
Newby PK, Muller D, Tucker KL. Associations of empirically derived eating patterns with plasma lipid biomarkers: a comparison of factor and cluster analysis methods. Am J Clin Nutr. 2004;80(3):759–67.
Costacou T, Bamia C, Ferrari P, Riboli E, Trichopoulos D, Trichopoulou A. Tracing the Mediterranean diet through principal components and cluster analyses in the Greek population. Eur J Clin Nutr. 2003;57(11):1378–85.
Bamia C, Orfanos P, Ferrari P, Overvad K, Hundborg HH, Tjonneland A, et al. Dietary patterns among older Europeans: the EPIC-Elderly study. Br J Nutr. 2005;94(1):100–13.
Crozier SR, Robinson SM, Borland SE, Inskip HM, SWS Study Group. Dietary patterns in the Southampton Women’s Survey. Eur J Clin Nutr. 2006;60(12):1391–9.
Reedy J, Wirfalt E, Flood A, Mitrou PN, Krebs-Smith SM, Kipnis V, et al. Comparing 3 dietary pattern methods-cluster analysis, factor analysis, and index analysis-with colorectal cancer risk. Am J Epidemiol. 2010;171(4):479–87.
Hearty AP, Gibney MJ. Comparison of cluster and principal component analysis techniques to derive dietary patterns in Irish adults. Br J Nutr. 2009;101(4):598–608.
Smith AD, Emmett PM, Newby PK, Northstone K. A comparison of dietary patterns derived by cluster and principal components analysis in a UK cohort of children. Eur J Clin Nutr. 2011;65(10):1102–9.
Cunha DB, Almeida RM, Pereira RA. A comparison of three statistical methods applied in the identification of eating patterns. Cad Saude Publica. 2010;26(11):2138–48.
Stricker MD, Onland-Moret NC, Boer JM, van der Schouw YT, Verschuren WM, May AM, et al. Dietary patterns derived from principal component- and k-means cluster analysis: long-term association with coronary heart disease and stroke. Nutr Metab Cardiovas. 2013;23(3):250–6.
The Lancet. How to cope with an ageing population. Lancet. 2013;382:1225.
Australian Institute of Health and Welfare. Australia’s health 2014. Canberra: AIHW; 2014.
Eng PM, Kawachi I, Fitzmaurice G, Rimm EB. Effects of marital transitions on changes in dietary and other health behaviours in US male health professionals. J Epidemiol Community Health. 2005;59(1):56–62.
Lee S, Cho E, Grodstein F, Kawachi I, Hu FB, Colditz GA. Effects of marital transitions on changes in dietary and other health behaviours in US women. Int J Epidemiol. 2005;34(1):69–78.
Zantinge EM, van den Berg M, Smit HA, Picavet HS. Retirement and a healthy lifestyle: opportunity or pitfall? A narrative review of the literature. Eur J Pub Health. 2014;24(3):433–9.
McNaughton SA, Crawford D, Ball K, Salmon J. Understanding determinants of nutrition, physical activity and quality of life among older adults: the Wellbeing, Eating and Exercise for a Long Life (WELL) study. Health Qual Life Outcomes. 2012;10(1):109.
Ireland P, Jolley D, Giles G, O’Dea K, Powles J, Rutishauser I, et al. Development of the Melbourne FFQ: a food frequency questionnaire for use in an Australian prospective study involving and ethnically diverse cohort. Asia Pac J Clin Nutr. 1994;3:19–31.
Hodge A, Patterson AJ, Brown WJ, Ireland P, Giles G. The Anti Cancer Council of Victoria FFQ: relative validity of nutrient intakes compared with weighed food records in young to middle-aged women in a study of iron supplementation. Aust NZ J Public Health. 2000;24(6):576–83.
Smith KJ, McNaughton SA, Gall SL, Blizzard L, Dwyer T, Venn AJ. Involvement of young Australian adults in meal preparation: cross-sectional associations with sociodemographic factors and diet quality. J Am Diet Assoc. 2010;110(9):1363–7.
Smith KJ, Blizzard L, McNaughton SA, Gall SL, Dwyer T, Venn AJ. Daily eating frequency and cardiometabolic risk factors in young Australian adults: cross-sectional analyses. Br J Nutr. 2012;108(6):1086–94.
Thorpe MG, Kestin M, Riddell LJ, Keast RS, McNaughton SA. Diet quality in young adults and its association with food-related behaviours. Public Health Nutr. 2014;17(8):1767–75.
Smith KJ, Gall SL, McNaughton SA, Blizzard L, Dwyer T, Venn AJ. Skipping breakfast: longitudinal associations with cardiometabolic risk factors in the Childhood Determinants of Adult Health Study. Am J Clin Nutr. 2010;92(6):1316–25.
McLennan W, Podger A. National nutrition survey users’ guide, 1995 Australian Bureau of Statistics Catalogue No. 4801.0. Canberra: Australian Government Publishing Service; 1998.
National Health and Medical Research Council. Eat for health: Australian Dietary Guidelines. Canberra: National Health and Medical Research Council; 2013.
World Health Organization. Global database on Body Mass Index- BMI classification. 2012. http://apps.who.int/bmi/index.jsp?introPage=intro_3.html. Accessed 30 June 2015.
McAdams MA, Van Dam RM, Hu FB. Comparison of self-reported and measured BMI as correlates of disease markers in US adults. Obesity. 2007;15(1):188–96.
Burton NW, Brown W, Dobson A. Accuracy of body mass index estimated from self-reported height and weight in mid-aged Australian women. Aust NZ J Public Health. 2010;34(6):620–3.
Stommel M, Schoenborn CA. Accuracy and usefulness of BMI measures based on self-reported weight and height: findings from the NHANES & NHIS 2001–2006. BMC Public Health. 2009;9:421.
Lin CJ, DeRoo LA, Jacobs SR, Sandler DP. Accuracy and reliability of self-reported weight and height in the Sister Study. Public Health Nutr. 2012;15(6):989–99.
Craig CL, Marshall AL, Sjostrom M, Bauman AE, Booth ML, Ainsworth BE, et al. International physical activity questionnaire: 12-country reliability and validity. Med Sci Sports Exerc. 2003;35(8):1381–95.
Fransen HP, May AM, Stricker MD, Boer JM, Hennig C, Rosseel Y, et al. A posteriori dietary patterns: how many patterns to retain? J Nutr. 2014;144(8):1274–82.
Abdi H. Factor rotations. In: Lewis-Beck M, Bryman A, Futing T, editors. Encyclopedia of social sciences research methods. Thousand Oaks: Sage Publications; 2003. p. 978–82.
Schulze MB, Hoffmann K, Kroke A, Boeing H. An approach to construct simplified measures of dietary patterns from exploratory factor analysis. Br J Nutr. 2003;89(3):409–19.
Kim JO, Mueller CW. Factor analysis: statistical methods and practical issues. Thousand Oaks: Sage Publications; 1978.
Lorenzo-Seva U, ten Berge JMF. Tucker’s congruence coefficient as a meaningful index of factor similarity. Methodology. 2006;2(2):57–64.
Duda RO, Hart PE. Pattern classification and scene analysis. New York: Wiley; 1973.
Calinski T. A dendrite method for cluster analysis. Commun Stat. 1968;3(1):1–27.
Everitt B. Cluster analysis. In: Wiley series in probability and statistics. 5th ed. Chichester: Wiley; 2011.
Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159–74.
Correa Leite ML, Nicolosi A, Cristina S, Hauser WA, Pugliese P, Nappi G. Dietary and nutritional patterns in an elderly rural population in Northern and Southern Italy: (I). A cluster analysis of food consumption. Eur J Clin Nutr. 2003;57(12):1514–21.
Correa Leite ML, Nicolosi A, Cristina S, Hauser WA, Pugliese P, Nappi G. Dietary and nutritional patterns in an elderly rural population in Northern and Southern Italy: (II). Nutritional profiles associated with food behaviours. Eur J Clin Nutr. 2003;57(12):1522–9.
Samieri C, Jutand MA, Feart C, Capuron L, Letenneur L, Barberger-Gateau P. Dietary patterns derived by hybrid clustering method in older people: association with cognition, mood, and self-rated health. J Am Diet Assoc. 2008;108(9):1461–71.
Wirfalt E, Drake I, Wallstrom P. What do review papers conclude about food and dietary patterns? Food Nutr Res. 2013;57.
Hsiao PY, Mitchell DC, Coffman DL, Allman RM, Locher JL, Sawyer P, et al. Dietary patterns and diet quality among diverse older adults: the University of Alabama at Birmingham Study of Aging. J Nutr Health Aging. 2013;17(1):19–25.
Anderson AL, Harris TB, Tylavsky FA, Perry SE, Houston DK, Hue TF, et al. Dietary patterns and survival of older adults. J Am Diet Assoc. 2011;111(1):84–91.
Milte CM, Thorpe MG, Crawford D, Ball K, McNaughton SA. Associations of diet quality with health-related quality of life in older Australian men and women. Exp Gerontol. 2015;64:8–16.
Harriss LR, English DR, Powles J, Giles GG, Tonkin AM, Hodge AM, et al. Dietary patterns and cardiovascular mortality in the melbourne collaborative cohort study. Am J Clin Nutr. 2007;86(1):221–9.
Kant AK, Graubard BI, Schatzkin A. Dietary patterns predict mortality in a national cohort: the National Health Interview Surveys, 1987 and 1992. J Nutr. 2004;134(7):1793–9.
Russell J, Flood V, Rochtchina E, Gopinath B, Allman-Farinelli M, Bauman A, et al. Adherence to dietary guidelines and 15-year risk of all-cause mortality. Br J Nutr. 2013;109(3):547–55.
Arabshahi S, van der Pols JC, Williams GM, Marks GC, Lahmann PH. Diet quality and change in anthropometric measures: 15-year longitudinal study in Australian adults. Br J Nutr. 2012;107(9):1376–85.
Togo P, Osler M, Sorensen TI, Heitmann BL. Food intake patterns and body mass index in observational studies. Int J Obes Relat Metab Disord. 2001;25(12):1741–51.
Hsiao PY, Jensen GL, Hartman TJ, Mitchell DC, Nickols-Richardson SM, Coffman DL. Food intake patterns and body mass index in older adults: a review of the epidemiological evidence. J Nutr Gerontol Geriatr. 2011;30(3):204–24.
Elisabet WirfÄLt AK, Jeffery RW. Using cluster analysis to examine dietary patterns: nutrient intakes, gender, and weight status differ across food pattern clusters. J Am Diet Assoc. 1997;97(3):272–9.
Billson H, Pryer JA, Nichols R. Variation in fruit and vegetable consumption among adults in Britain. An analysis from the dietary and nutritional survey of British adults. Eur J Clin Nutr. 1999;53(12):946–52.
Davis MA, Randall E, Forthofer RN, Lee ES, Margen S. Living arrangements and dietary patterns of older adults in the United States. J Gerontol. 1985;40(4):434–42.
Hanna KL, Collins PF. Relationship between living alone and food and nutrient intake. Nutr Rev. 2015;73(9):594–611.
Haapala I, Prattala R, Patja K, Mannikko R, Hassinen M, Komulainen P, et al. Age, marital status and changes in dietary habits in later life: a 21-year follow-up among Finnish women. Public Health Nutr. 2012;15(7):1174–81.
Helldan A, Lallukka T, Rahkonen O, Lahelma E. Changes in healthy food habits after transition to old age retirement. Eur J Pub Health. 2012;22(4):582–6.
Lauque S, Nourashemi F, Soleilhavoup C, Guyonnet S, Bertiere MC, Sachet P, et al. A prospective study of changes on nutritional patterns 6 months before and 18 months after retirement. J Nutr Health Aging. 1998;2(2):88–91.
Mishra G, Ball K, Arbuckle J, Crawford D. Dietary patterns of Australian adults and their association with socioeconomic status: results from the 1995 National Nutrition Survey. Eur J Clin Nutr. 2002;56(7):687–93.
Thornton LE, Pearce JR, Ball K. Sociodemographic factors associated with healthy eating and food security in socio-economically disadvantaged groups in the UK and Victoria, Australia. Public Health Nutr. 2014;17(1):20–30.
Robinson SM, Crozier SR, Borland SE, Hammond J, Barker DJ, Inskip HM. Impact of educational attainment on the quality of young women’s diets. Eur J Clin Nutr. 2004;58(8):1174–80.
Darmon N, Drewnowski A. Does social class predict diet quality? Am J Clin Nutr. 2008;87(5):1107–17.
Davern M, Rodin H, Beebe TJ, Call KT. The effect of income question design in health surveys on family income, poverty and eligibility estimates. Health Serv Res. 2005;40(5 Pt 1):1534–52.
Ezzati M, Riboli E. Behavioral and dietary risk factors for noncommunicable diseases. N Engl J Med. 2013;369(10):954–64.
Rowland ML. Self-reported weight and height. Am J Clin Nutr. 1990;52(6):1125–33.
Smith KJ, Sanderson K, McNaughton SA, Gall SL, Dwyer T, Venn AJ. Longitudinal associations between fish consumption and depression in young adults. Am J Epidemiol. 2014;179(10):1228–35.
Jakes RW, Day NE, Luben R, Welch A, Bingham S, Mitchell J, et al. Adjusting for energy intake--what measure to use in nutritional epidemiological studies? Int J Epidemiol. 2004;33(6):1382–6.
Bailey RL, Gutschall MD, Mitchell DC, Miller CK, Lawrence FR, Smiciklas-Wright H. Comparative strategies for using cluster analysis to assess dietary patterns. J Am Diet Assoc. 2006;106(8):1194–200.
Northstone K, Ness AR, Emmett PM, Rogers IS. Adjusting for energy intake in dietary pattern investigations using principal components analysis. Eur J Clin Nutr. 2008;62(7):931–8.
Balder HF, Virtanen M, Brants HA, Krogh V, Dixon LB, Tan F, et al. Common and country-specific dietary patterns in four European cohort studies. J Nutr. 2003;133(12):4246–51.
Department of Health. Victorian Health Monitor Food and Nutrition report. Melbourne, State Government of Victoria. 2012. http://www.health.vic.gov.au/healthstatus/survey/vhm.htm.
Australian Bureau of Statistics. Australian Health Survey: Nutrition First Results - Foods and Nutrients, 2011–12, cat. no. 4364.0.55.007. 2014.
Australian Bureau of Statistics. Retirement and Retirement Intentions, Australia, July 2010 to June 2011, cat. no. 6238.0. 2011. http://www.abs.gov.au/ausstats/abs@.nsf/mf/6238.0.
Australian Bureau of Statistics. General Social Survey: Summary Results, Australia, 2010, cat. no. 4159.0. 2011. http://www.abs.gov.au/ausstats/abs@.nsf/mf/4159.0/.
Australian Bureau of Statistics. Australian Health Survey: Updated Results, 2011–12 (Cat. no. 4364.0.55.003). Canberra: ABS; 2013.
Kylie Ball and Jo Salmon contributed to the study design and the development and implementation of the WELL study. Australia The WELL study is funded by The Australian Research Council and Diabetes Australia Research Trust. MGT is funded by an Australian Postgraduate Award scholarship. CMM is funded by a Deakin University Alfred Deakin Postdoctoral Research Fellowship. SAM is funded by an Australian Research Council Future Fellowship (FT100100581). The WELL study is funded by the Diabetes Australia Research Trust and Australian Research Council.
The authors declare that they have no competing interests.
SAM and DC (along with those acknowledged) designed the study and contributed to the development and implementation of the study. MGT conducted the analysis and interpretation of the data with the assistance of CMM, DC and SAM. MGT drafted the manuscript and all authors contributed to editing and reviewing of the final manuscript. All authors read and approved the final manuscript.
About this article
Cite this article
Thorpe, M.G., Milte, C.M., Crawford, D. et al. A comparison of the dietary patterns derived by principal component analysis and cluster analysis in older Australians. Int J Behav Nutr Phys Act 13, 30 (2016). https://doi.org/10.1186/s12966-016-0353-2