Introduction

Aging is the single greatest risk factor for nearly every major cause of morbidity and mortality in living organisms [1,2,3,4], including companion dogs [5]. Even in the absence of clinical disease, loss of function at the cellular and tissue levels underlies progressive impairments in physiological and homeostatic function of organ systems throughout life [6] that result in declining daily function and performance and increasing multimorbidity. Several studies in model organisms have demonstrated that the aging process is malleable and can be slowed with medical interventions and lifestyle alterations to delay mortality and promote a healthy lifespan or “healthspan” [7,8,9,10,11,12,13,14]. However, in contrast to traditional laboratory models, the domestic dog offers a novel translational opportunity to investigate aging biology and potential gerotherapeutics. Companion dogs live in shared environments with humans and have similar age-associated diseases and access to advanced healthcare. Simultaneously, dogs have the greatest intraspecies phenotypic diversity of any mammal, including a 2–2.5-fold variation in lifespan, which enhances the study of risk factors and biological pathways contributing to longevity. Dogs live shorter lives than humans, making evaluation of gerotherapeutics feasible on a meaningful timescale. Most importantly, companion dogs are seen as valued family members, and there is a strong societal drive to increase their healthspan and lifespan. For all these reasons, comparative and translational research in the dog holds the promise of identifying novel strategies that mitigate aging and improving health outcomes in both canines and humans [15, 16].

Quantitative outcome measures to assess the multidimensional process of aging and multifactorial components of healthy lifespan in the companion dog are necessary for advancing mechanistic understanding of aging biology and selecting clinical trial endpoints for candidate gerotherapeutics. An important approach for measuring the impact of aging is to assess healthspan, which we define here as “the period of life spent in good health, free from the chronic diseases and disabilities of aging.” In order to measure healthspan (i.e., quantification of the period of time the organism is free from decline), one must be able to quantify the health-related features that change in an organism due to aging. Two important features that can be measured to quantify healthspan include assessments of frailty, a state of vulnerability and diminished physiological reserve, and health-related quality of life (HRQL), the perceived impact of one’s health on overall well-being. Because there is currently no clear consensus on gold standard methods for assessing healthspan in dogs (Kaeberlein, 2018), operationalizing tools that do so are critical to capturing the negative impact of aging in dogs and advancing translational geroscience. Gerontology research in humans has led to the development of several tools that quantify components of healthspan— e.g., through patient-reported or clinical outcome measures of frailty, quality of life, physical function/capacity, and activity [17,18,19,20]. Only recently has there been an interest to develop analogous outcome measures in veterinary gerontology. Here, we selected two existing tools (Canine Frailty Index (CFI) and the VetMetrica health-related quality of life (HRQL) instrument) to investigate their utility in evaluating two important readouts of healthspan in the companion dog: frailty and health-related quality of life.

The CFI was developed to measure frailty in dogs by scoring age-associated multimorbidity and functional deficits and determining the relationship of that score to short-term mortality risk [21]. The CFI applied in dogs is analogous to the frailty index used in humans or rodents where frailty is represented as a general state of poor health and higher risk of mortality proportional to the number of age-associated deficits accumulated [22, 23]. The CFI consists of 33 discrete clinical health parameters from difficulty climbing stairs to oral disease assessed by a veterinarian, which are added and divided by the total number of items to create a composite score. Higher CFI scores indicate a higher number of age-associated diseases, deficits, and overall greater frailty, which is associated with increased short-term (6 months) mortality risk [21]. In a prospective study in 401 dogs, CFI score was moderately correlated with age, and the distribution of scores also increased with age [21].

In human clinical trials, patient-reported outcomes for health-related quality of life (HRQL) are critical to a patient-centric approach for assessing benefit of an intervention. Analogously, in veterinary clinical trials, valid owner-assessed outcome measures ensure meaningful interpretation when evaluating efficacy of an intervention. These instruments must undergo proper psychometric testing and validation. The VetMetrica™ health-related quality of life (HRQL) instrument was developed by NewMetrica Ltd. to assess the general health-related quality of life of companion dogs as perceived by the pet owner. The VetMetrica instrument is an online questionnaire that covers four domains of health-related quality of life in companion dogs: energetic/enthusiastic (E/E), happy/content (H/C), active/comfortable (A/C), and calm/relaxed (C/R) and has previously been psychometrically validated [24, 25]. VetMetrica instantaneously computes and reports a profile of raw scores across the 4 domains [26]. Importantly, VetMetrica HRQL has previously shown lower quality of life scores in older dogs [27] as well as in dogs with illness [24, 26].

In addition to potentially serving as outcome measures for clinical trial endpoints, assessment tools for healthspan can be used to identify risk factors and explore hypotheses on biological pathways that contribute to healthspan and lifespan differences. A number of studies have now established a consistent inverse correlation found between median lifespan and adult body mass in dogs [28,29,30]. However, it is currently unclear why large dogs have shorter lifespans compared to small dogs. A possible explanation is that large and giant breed dogs have higher mortality risk with increasing age [31] compared to small dogs, but not necessarily differences in multimorbidity [21, 31, 32]. This theoretical model of “truncation” (shortened lifespan without accelerated aging) or “compression” (accelerated aging leading to shortened lifespan) in large-sized dogs has been previously discussed and is an ongoing area of investigation [16, 33]. Tools that are sensitive enough to detect biologically meaningful differences between subpopulations of dogs (such as small and large) but are still applicable to all dogs could be used as clinical trial endpoints or in observational, investigative research to determine if and how biological pathways underlying body size contribute to aging in dogs.

While both the CFI and the VetMetrica HRQL instruments have undergone initial development and validation, further evaluation and refinement in large-scale companion dog populations and multi-center studies are necessary to determine their potential utility in measuring aging-related decline and assessing potential gerotherapeutics in companion dogs. Here, we report a cross-sectional, multi-site veterinary clinical study that aimed to assess the utility of the clinically assessed CFI and the owner-reported VetMetrica HRQL instrument in detecting differences in health-related quality of life and frailty scores between young (≥ 2 and < 6 years of age) and old (≥ 7 years of age) dog groups. Specifically, we hypothesized that the CFI and VetMetrica HRQL tools would show increased frailty score and decreased HRQL scores in older dogs. We also hypothesized that owner-assessed HRQL scores would show an inverse relationship with clinician-assessed CFI scores, demonstrating that owner-assessed HRQL relates to the clinical status of the dog as assessed by a veterinarian (CFI). Finally, given the evidence that dogs of large body size have shorter expected lifespans [28,29,30], we also assessed potential differences in CFI and HRQL between old and young dogs of small (< 11.3 kg; < 25 lbs) and large (> 22.7 kg; 50 lbs) body size to evaluate whether larger dogs show increased frailty score or worse health-related quality of life scores with age compared to smaller dogs.

Materials and methods

IACUC approval, study standards, and animal welfare

The study was performed according to VICH-GL9 Good Clinical Practice (GCP) standards. Approval by an independent IACUC was obtained on 13 Dec 2020 through Veterinary and Biomedical Research Center, Inc. (VBRC) (Approval no.: VACLOY001CLDEFF1PILC). All investigators and study veterinarians were qualified, licensed veterinarians. Owner consent was obtained prior to any study procedures.

Study design and experimental subjects (Fig. 1)

This study was a multi-center, cross-sectional, observational study. Dogs were recruited from 11 different veterinary clinical sites across the continental USA from January 2021 to August 2021 via word-of-mouth, email communication to existing clientele, or social media advertising. Dogs were included if they met the following age and size criteria: ≥ 2 and < 6 years of age (young dogs) or ≥ 7 years of age (old dogs); < 11.3 kg/< 25 lbs (small dogs) or large > 22.7 kg/> 50 lbs (large dogs). Age cutoffs were determined based on previous research norms to facilitate comparison between studies [34,35,36]. Similarly, consistent with previous research studies [37, 38], weight cutoffs were selected based on AKC dog breed classification for small- and large-sized dogs [39]. All dogs had to be cooperative for study procedures without the use of injectable sedation. Accurate age was verified with a documented birth year on a breeding or a medical record from < 6 months of age.

Fig. 1
figure 1

Five hundred and eighty-six dogs were screened for eligibility. Dogs without life-threatening acute disease, hypoadrenocorticism, or diabetes were included if age was verified to be ≥ 2 and < 6 years (young) or ≥ 7 years (old) and if body weight was < 11.3 kg (25 lbs) (small) or > 22.7 kg (50 lbs) (large). Of 586 dogs, 90 dogs were excluded primarily because age could not be verified or when age/weight criteria were not met (n = 87). Two dogs received a prohibited medication, and 1 dog was excluded due to a history of diabetes mellitus. Four hundred and ninety-six met the inclusion criteria for enrollment of which 45 dogs were subsequently excluded due to owner non-compliance (n = 43), subject non-compliance (n = 1), or disease identified on evaluation of physical exam or blood work (n = 1). Four hundred and fifty-one dogs were analyzed of which 55 were young and small, 117 old and small, 127 young and large, and 152 old and large

Dogs were excluded from the study if they had a history or physical examination suggestive of severe illness such as recent dramatic weight loss (≥ 10% of baseline) within the previous 2 weeks, severe dehydration, pale or white mucous membranes, labored breathing, or evidence of fever or hypothermia (rectal temperature > 39.7 °C (103.5 °F) or < 37.2 °C (99 °F)). Dogs were excluded if they were already enrolled in an ongoing clinical trial evaluating a pharmaceutical medication, nutraceutical, or supplement or if the dog had undergone a surgical or anesthetic procedure within 14 days of day 0. In order to prevent confounding with future analysis of blood-based biomarkers (not reported here), dogs were also excluded if they had previously been diagnosed with diabetes mellitus or hypoadrenocorticism or if they received corticosteroids within 14 days of the day 0 study visit or if they had ever received metformin.

Study procedures

Owner informed consent (OIC) was obtained electronically before or on the day of the study visit (day 0). The VetMetrica HRQL survey was completed on day 0 before the dog was examined by a study veterinarian or up to 3 days prior. Owners, investigators, study veterinarians, and all other study personnel were blinded to HRQL scores. On day 0, the investigator or study veterinarian obtained medical history, performed a physical exam, and completed the CFI and evaluated final eligibility based on inclusion/exclusion criteria. Investigators were instructed to collect blood and urine samples from dogs fasted overnight or for at least 6 h for comprehensive CBC, serum biochemical profile, urinalysis, and total thyroid hormone (T4). Duration of fasting was not verified. Investigators and study veterinarians reviewed all clinical laboratory results to finalize the CFI and sign off on final inclusion/exclusion criteria after day 0. The schedule of events is detailed in Table 1.

Table 1 Schedule of events

VetMetrica HRQL

Dog owners were registered in the NewMetrica system and instructed to complete the VetMetrica HRQL survey prior to day 0. If the survey was not completed before day 0, owners were required to complete the HRQL survey on day 0 at the clinical site before the veterinary examination. The electronic survey comprised 22 psychometrically validated questions to capture four domains of quality of life — energetic/enthusiastic (E/E), happy/content (H/C), active/comfortable (A/C), and calm/relaxed (C/R). Each question consisted of a descriptor and was rated on a 7-point Likert scale, with a minimum of 0 (“not at all”) and a maximum of 6 (“could not be more”). The results were calculated and scored using proprietary algorithms to provide raw scores for each of the four quality of life domains. Higher numbers indicate better owner-perceived health-related quality of life.

Canine Frailty Index

The investigator or study veterinarian at each clinical site completed the Canine Frailty Index. Higher CFI scores indicated a greater number of health deficits (higher frailty) as recorded on the 33-item clinical questionnaire. Each item was scored as 0 for “absent”, 0.5 for “mild”, and 1 for “present” or “severe.” CFI scores were calculated by dividing the total sum of item scores by the total number of items scored (33), with a minimum possible score of 0 and maximum of 1. The investigator or study veterinarian completed the initial part of the assessment on day 0 based on the dog’s medical history and PE. The remainder of the assessment was completed after the investigator or study veterinarian reviewed the clinical pathology results from the samples obtained on day 0. The items on the CFI are included in Supplementary Fig. 1.

Statistical methods and analysis

Each individual dog served as the experimental unit. Missing data was not imputed, and only observed data was included in statistical analyses.

Descriptive statistics (mean, standard deviation, median, interquartile range (IQR), and minimum and maximum) of demographic features (age, sex, and percentages of pure or mixed breed dogs) and key outcome variables (CFI score, total HRQL, and individual HRQL domain scores) were summarized for the entire dataset as well as by age-size group stratifications (young, small; old, small; young, large; old, large). CFI scores range from 0 to 1, with 1 indicating most frail. Individual HRQL domain raw scores range from 0 to 6, with larger scores indicating higher quality of life on that domain. The distribution of CFI scores is presented in Supplementary Fig. 2.

To ease interpretation, raw scores from the four domains (energetic/enthusiastic (E/E), happy/content (H/C), active/comfortable (A/C), and calm/relaxed (C/R)) were aggregated into a single score reflecting the total sum of scores across these domains (referred to as “total HRQL”). Following Davies [40], the four domains were first summed together, normalized by applying a logit transformation, standardized, and finally rescaled so that the distribution of the total scores ranges from 0 to 100 with mean 50 and standard deviation of 10. This total HRQL score reflects the overall quality of life for a given dog, where higher scores indicate better quality of life.

Differences in CFI and total HRQL and domain scores across age-size groups were assessed using one-way ANOVA with pairwise two-sample t-tests, adjusting for multiple comparisons using the Holm [41] method. If assumptions of ANOVA were violated, then the nonparametric Kruskal-Wallis tests followed by pairwise comparisons with the Wilcoxon rank sum tests were used.

Multiple linear regression modeling was used to explore the effects of age and body size on CFI, total HRQL, and HRQL domain scores. Due to study recruitment procedures, systematic gaps across continuous variables age and weight were present, precluding the ability to fit standard linear models testing interactions of weight and age. Therefore, to assess age-related rates of change in CFI and HRQL domain scores across age-size groups, we fit the following model:

$${y}_{ik}=\alpha +\beta Ag{e}_i+{\sum}_1^{K-1}{\gamma}_k Age\& Size\ grou{p}_k+{\sum}_1^{K-1}{\delta}_k\Big( Ag{e}_i\ast Age\& Size\ grou{p}_k+{\varepsilon}_i$$
(1)

where i = 1,..., N dogs, k = 1, …, 4 age-size groups indicator variables with small, young dogs as the referent group. Estimated marginal means of linear trends (i.e., slopes for each age-size group) were estimated from this model, and post hoc linear contrast statements were then used to compare rates of change in CFI, total HRQL, and domain scores across the different age-size groups. For any outcomes showing potential interaction effects between age and size, further exploratory data analysis techniques were used to further understand the role of weight as a continuous variable (Supplementary Fig. 3 and Supplementary Fig. 4).

Simple and multivariate linear regression was used to assess the relationship between CFI score and total HRQL score and individual HRQL domain scores. Simple linear regression treated CFI score as the explanatory variable and HRQL total and domain scores as the outcomes. To explore if CFI score was associated with total HRQL and HRQL domain scores above and beyond effects of age and size, age and size-age group variables were also added into the model. Furthermore, Spearman’s correlation coefficients were estimated between Canine Frailty Index (CFI) and health-related quality of life (HRQL) scores (Supplementary Table 1).

An exploratory mediation analysis was also performed with the most prevalent CFI items to identify the potential role of individual CFI deficits as mediators in any interaction effects of age and size group on CFI and HRQL domain scores. CFI items were first dichotomized as absence (“no”) = 0 and presence (“yes”, “mild”, or “severe”) = 1. Relative prevalence for each item was reported across age-size groups. Multivariate logistic regression was then used to identify if age and size groups significantly interacted to predict individual CFI deficits. Deficits for which age and size group significantly interacted were deemed candidate mediators. Finally, candidate mediators were added into the interaction model (Eq. 1) to evaluate if the presence of each deficit mediated any interactions between age and size (Fig. 2).

Fig. 2
figure 2

DAG (directed acyclic graph) represents the mediation analysis performed to evaluate whether individual CFI (Canine Frailty Index) deficits mediate the effect of age-size group on HRQL (health related quality of life) score

All regression models were tested for normality assumptions. All hypothesis tests of significance were performed at alpha = 0.05, two-sided, unless otherwise stated. All analyses and data presentations were performed using R version 4.12 [42].

Results

Subject demographics

Following screening, 451 eligible adult dogs were enrolled of which 46.3% were female and 53.7% male. The mean age of dogs was 7.72 years (range = 2.0 to 17.7, SD = 3.8). Mean weight was 24.0 kg (range 1.3–80, SD 15). The majority (91.1%) of dogs were neutered. The evaluable population consisted of 43.6% mixed breeds and 56.4% purebred dogs. The demographics of the evaluated subjects in each of the age and size groups are presented in Table 2. A table of enrolled breeds < 11.3 kg/< 25 lbs (small dogs) or > 22.7 kg/> 50 lbs (large dogs) is presented in Supplementary Table 2 and Table 3, respectively. Pairwise Wilcoxon’s rank sum comparisons of ages, weights, and outcomes measures (CFI scores and HRQL scores) were tested to investigate potential side effects. There were no significant differences in the distribution of these variables for dogs between sites. Age and weight information of dogs in each site can be found in Supplementary Table 4.

Table 2 Demographics of study population with age, sex, and breed information in 4 age-size groups (young and small, old and small, young and large, old and large)

CFI scores increase with age

Wider distribution of scores in old versus young dogs

The distribution of CFI scores was wider in old dogs compared to young dogs. The distribution of the CFI scores in each age-size group was non-normal with positive skew. CFI scores in the evaluable population of dogs (n = 450) ranged from 0.0 to 0.59, with a median of 0.03, and IQR of 0.09. IQRs of CFI scores in old dog groups were larger than that of young dog groups (Table 3).

Table 3 Canine Frailty Index (CFI) and health-related quality of life (HRQL) scores across age-size groups with mean, standard deviation, interquartile range, range (minimum, maximum), and p values for Kruskal-Wallis’s tests showing significant differences across age-size groups

Higher CFI scores in old versus young dogs

Old dogs had significantly higher CFI scores compared to young dogs. A Kruskal-Wallis test showed a significant difference in CFI scores across age-size groups (H(3) = 182.87, p < 0.001) (Table 3). Pairwise Wilcoxon-rank sum tests demonstrated that CFI scores were significantly higher in old dogs compared to young dogs, in both large (p-adjusted < 0.001) and small (p-adjusted < 0.001) dogs (Fig. 3).

Fig. 3
figure 3

Canine Frailty Index scores in the different age-size groups (young and small, old and small, young and large, old and large) are shown with box and whisker plots. Young dogs were ≥ 2 and < 6 years, old dogs were ≥ 7 years, small dogs were < 11.3 kg (25 lbs), and large dogs were > 22.7 kg (50 lbs). The median is the line in the center of the box, and the X represents the mean. The vertical line at the bottom of the box denotes the first quartile, and the vertical line extending from the top of the box denotes the third quartile. Pairwise differences between young and old groups of corresponding size were compared with Wilcoxon rank sum tests. ****p value < 0.0001

CFI scores increased with age in older dogs but not younger dogs

Scatter plots showed trends of increasing CFI score with age (Fig. 4). Multiple linear regression analysis confirmed that CFI scores significantly increased with age in old dogs (old and small: 𝛽 = 0.02, p < 0.001; old and large: 𝛽 = 0.038, p < 0.001), but not in young dogs (young and small: 𝛽 = 0.00, p = 0.503; young and large: 𝛽 = 0.00, p = 0.383) (Supplementary Table 5). Within the old and young age groups, old dogs had faster age-related increases in CFI score compared to young dogs (𝛽old-𝛽young = 0.04, p < 0.001). While an overall interaction effect between age and age-size group was observed (F(3) = 9.49, p < 0.001), there were no size-based differences in age-related increases in CFI scores within old (𝛽old, large-𝛽old, small = 0.00, p = 0.278) or young dogs (𝛽young, large-𝛽young, small = 0.00, p = 0.944) (Table 4). Pairwise comparisons of slopes were also calculated across age-size groups and can be found in Supplementary Table 6.

Fig. 4
figure 4

Relationship between Canine Frailty Index scores and age. Canine Frailty Index (CFI) scores in the different age-size groups are depicted in scatter plots with a fitted line. Red dots denote the small dogs < 11.3 kg (25 lbs) and blue dots denote the large dogs > 22.7 kg (50 lbs). Red and blue shading show the confidence intervals. Text in figure reports the estimated age effects (slope) for each age-size group along with their 95% confidence intervals and p values

Table 4 Linear contrast estimates comparing differences in age trends of Canine Frailty Index (CFI) and health-related quality of life (HRQL) between age and age-size groups

HRQL total and domain scores decrease with age

Wider distribution in scores in old versus young dogs

Older dogs had a wider distribution (higher IQR) of total and HRQL domain-specific scores than younger dogs, indicating more observed variability within the older population. Ranges, means, SDs, medians, and IQRs for total and all individual HRQL domain scores are presented in Table 3. For individual HRQL scores, the same pattern across age bins was observed in all domains except C/R. Older dogs showed lower median HRQL scores than younger dogs across all domains, except C/R, in which older dogs scored higher (Table 3).

Lower HRQL scores in old versus young dogs

Older dogs had significantly lower total HRQL scores compared to younger dogs. A Kruskal-Wallis test showed a significant difference in total HRQL scores between age-size groups (H(3) = 48.97, p < 0.001)) (Table 3). Pairwise Wilcoxon rank sum tests demonstrated that older dogs had significantly lower total HRQL scores, compared to younger dogs, in both large and small dogs (Fig. 5a). The same pattern was observed for each individual HRQL domain score, where a Kruskal-Wallis test showed a significant difference between age-size groups (E/E: H(3) = 91.44, p < 0.001, H/C: H(3) = 54.00, p < 0.001, C/R: H(3) = 10.75, p = 0.013, A/C: H(3) = 94.56, p < 0.001). Pairwise Wilcoxon rank sum tests demonstrated that younger dogs had significantly higher E/E, H/C, and A/C scores compared to older dogs, regardless of size. In contrast, a size effect was only seen within the C/R (calm/relaxed) domain where only old, large dogs showed significantly higher C/R scores than young, large dogs (p-adjusted = 0.007, Fig. 5b), whereas there was no difference between old, small and young, small dogs.

Fig. 5
figure 5

Total and individual HRQL scores amongst age-size groups. a Total health related quality of life (HRQL) scores are shown with box and whisker plots across age-size groups (young and small, old and small, young and large, old and large). Young dogs were ≥ 2 and < 6 years, old dogs were ≥ 7 years, small dogs were < 25 lbs, and large dogs were > 50 lbs. The median is the line in the center of the box, and the X represents the mean. The vertical line at the bottom of the box denotes the first quartile, and the vertical line extending from the top of the box denotes the third quartile. Pairwise differences between young and old groups of corresponding size were compared with Wilcoxon rank sum tests. **p < 0.01; ****p < 0.0001. b Individual domain HRQL scores (energetic/enthusiastic, happy/content, active/comfortable, and calm/relaxed) are shown with box and whisker plots across age-size groups (young and small, old and small, young and large, old and large). Young dogs were ≥ 2 and < 6 years, old dogs were ≥ 7 years, small dogs were < 11.3 kg (25 lbs), and large dogs were > 22.7 kg (50 lbs). The median is the line in the center of the box, and the X represents the mean. The vertical line at the bottom of the box denotes the first quartile, and the vertical line extending from the top of the box denotes the third quartile. Pairwise differences between young and old groups of corresponding size were compared with Wilcoxon rank sum tests. **p < 0.01; ****p < 0.0001

HRQL scores declined with age in old dogs but not in young dogs

Scatter plots showed trends of decreasing total and individual HRQL scores E/E, H/C, and A/C in old dogs, but not in young dogs (Fig. 6). Multiple linear regression model results confirmed that old dogs showed significant age-related declines in total HRQL scores (old and small: 𝛽 = − 1.5, p < 0.001, old and large: 𝛽 = − 1.3, p < 0.001) that were not observed in the young dogs. Individual HRQL domain scores, except for C/R, also showed age-related decline in old dogs but not in young dogs: E/E (old and small: 𝛽 = − 0.14, p < 0.001, old and large: 𝛽 = − 0.12, p < 0.001), H/C (old and small: 𝛽 = − 0.06, p < 0.001, old and large: 𝛽 = − 0.07, p < 0.001), and A/C (old and small: 𝛽 = − 0.13, p < 0.001, old and large: 𝛽 = − 0.20, p < 0.001) (Supplementary Table 5). Pairwise comparisons of slopes were also calculated across age-size groups and can be found in Supplementary Table 6.

Fig. 6
figure 6

Total and individual HRQL scores relationships with age. a Total health related quality of life (HRQL) scores in the different age and size groups are depicted in scatter plots with a fitted line for each domain. Red dots denote the small dogs < 11.3 kg (25 lbs), and blue dots denote the large dogs > 22.7 kg (50 lbs). Red and blue shading show 95% confidence intervals. Text in figure reports the estimated age effects (slope) for each age-size group along with their 95% confidence intervals and p values. b Individual domain health related quality of life (HRQL) scores in the different age and size groups are depicted in scatter plots with a fitted line. Red dots denote the small dogs < 11.3 kg (25 lbs), and blue dots denote the large dogs > 22.7 kg (50 lbs). Red and blue shading show 95% confidence intervals

Faster age-related decline in A/C scores in older, larger dogs

Likelihood ratio tests of multiple linear regression models indicated a significant interaction effect between age and age-size group for individual A/C HRQL scores only (p < 0.001), but not for other domains or total HRQL score (Supplementary Table 5). Linear contrast statements comparing age-related changes in HRQL between age-size groups (Table 4) revealed that old dogs had significantly faster age-related declines in A/C scores compared to that of young dogs (𝛽old-𝛽young = − 0.27, p = 0.006). Within old dogs, large dogs showed more rapid age-related decline in A/C scores compared to small dogs (𝛽old, large-𝛽old, small = − 0.07, p = 0.022). We further explored the effects of weight on age-related declines in A/C, by performing exploratory data analysis and visualization in this subdomain (Supplementary Figs. 3 and 4).

Clinically assessed CFI is inversely related to owner-assessed total HRQL and HRQL domain scores after adjusting for age

CFI score and total HRQL score were inversely related. Simple linear regression analyses demonstrated inverse linear relationships between CFI score and total HRQL score and all HRQL individual domains except for C/R scores (R2 < 0.01, 𝛽 = − 0.38, p = 0.344) (Fig. 7).

Fig. 7
figure 7

Relationships between total and individual HRQL scores and age. a Scatter plots of total health related quality of life (HRQL) scores on the y axis against Canine Frailty Index (CFI) scores on the x axis with a fitted line. R (correlation coefficient), R2 (proportion of variance explained), and p value indicate the magnitude and significance of associations. b Scatter plots of individual domain HRQL scores on the y axis plotted against CFI scores on the x axis with a fitted line. R (correlation coefficient), R2 (proportion of variance explained), and p value indicate the magnitude and significance of associations

Multivariate regression models accounting for age and age-size group and their interaction showed that CFI score was significantly inversely associated with all HRQL scores (total and individual domains, even after adjusting for the effect of age: total HRQL score (𝛽 = − 28.3, p < 0.001); E/E (𝛽 = − 2.64, p < 0.001); H/C (𝛽 = − 1.995 , p < 0.001); A/C (𝛽 = − 3.31, p < 0.001); C/R (𝛽 = − 1.22, p = 0.042) (Supplementary Table 7). Additionally, the previously observed finding of large dogs showing more rapid decline in A/C scores with age was still observed, even after adjusting for CFI score (𝛽old,large-𝛽old, small = − 0.06, p = 0.042). Thus, we aimed to further explore the reason for this size difference in declining A/C score with age.

Age-associated disease may mediate size differences in decline of A/C HRQL domain scores in old dogs

We evaluated clinical information collected from the CFI to see whether specific CFI clinical diseases or deficits might mediate the faster age-related decline in A/C scores in old, large dogs compared to old, small dogs. To do so, we calculated the relative prevalence across age-size groups, which yielded 12 CFI items with a total prevalence of at least 0.25. We found differences in relative prevalences in diseases between old, large and old, small dogs: oral disease was more common in small dogs, whereas osteoarthrosis (osteoarthritis), hair whitening, and decreased activity were more common in large dogs (Fig. 8). As outlined in the “Materials and methods” section, these were then individually tested as candidate mediators by examining the interaction effect of age and size group on these clinical deficits. Results of these logistic regression analyses indicated that size group (large or small) significantly interacted with age in explaining osteoarthritis (OA) prevalence, making it a candidate mediator (F(1) = 4.52, p = 0.034) (Supplementary Table 8). To complete our exploratory mediation analysis, OA was added into the full interaction model of A/C and was found to be significantly associated with reduced A/C scores (𝛽 = − 0.50, p < 0.001). Its inclusion did not eliminate the significant interaction effect of age and age-size group on A/C (F(3) = 3.52 p = 0.015). OA, as opposed to other CFI items, may be a partial mediator for the faster A/C decline in old, large dogs compared to old, small dogs (Supplementary Table 8).

Fig. 8
figure 8

Relative prevalence of Canine Frailty Index items. The most common Canine Frailty Index (CFI) items in each of the age-size groups are listed with the heat map indicating darker colors of higher relative prevalence in this study population

Discussion

As the companion dog emerges as a translational model for human aging and age-related disease, the development and validation of healthspan assessment tools in dogs is important for evaluating gerotherapeutics and advancing biogerontology research. This study aimed to assess two previously published instruments (Canine Frailty Index and NewMetrica’s HRQL) for measuring and relating two key components of healthspan in companion dogs: frailty and health-related quality of life. To assess the validity of the CFI and VetMetrica HRQL, we hypothesized that the CFI and VetMetrica HRQL instrument would show age group differences with increased frailty scores and decreased HRQL scores in old dogs compared to young dogs. We also hypothesized that owner-assessed HRQL scores would show an inverse relationship with clinician-assessed CFI scores. Finally, we used both the overall scores as well as more granular information collected from these two instruments to investigate relationships between frailty score, HRQL scores, age as a continuous measure, and body size. We expected that frailty scores would increase with age while HRQL scores would decline with age. We also expected that size may have an effect on CFI or HRQL scores, where larger body size would be associated with higher frailty score or lower HRQL scores.

As expected, results showed that CFI scores, a potential measure of clinically assessed frailty, were significantly higher in old (≥ 7 years) compared to young dogs (≤ 2 to < 6 years). Total HRQL domain scores as well as individual HRQL scores related to energy and enthusiasm (E/E), happiness and contentment (H/C), and physical activity and comfort (A/C) were significantly lower in old compared to young dogs. CFI and HRQL scores were inversely associated, even after adjusting for the effect of age. These results, demonstrating expected relationships between frailty score and HRQL scores in old and young dogs, show that the CFI and VetMetrica HRQL can be used to assess age-related changes in frailty score and health-related quality of life in companion dogs. Additionally, these tools allowed for the exploration of relationships between age, body size, owner-assessed HRQL scores, clinically assessed frailty score, and multimorbidity. Frailty score increased with age, and HRQL scores decreased with age in old dogs but not young dogs. While body size did not affect the age-associated increase in CFI score or decline in total HRQL score, our results demonstrated a more rapid owner-observed decline in A/C score in large, old dogs compared to small, old dogs. Because the CFI provided standardized information on clinically assessed disease and deficits, we were able to explore potential differences in age-associated morbidities, such as OA, in mediating this effect. In this way, the CFI and the VetMetrica HRQL can provide complementary readouts of age-associated declines and healthspan in companion dogs.

Relationships between CFI scores, age, and size

Consistent with previous findings [21], a significant positive relationship was found between CFI score and age. The present study also found that increases in CFI scores were significantly greater with every additional year of age in dogs ≥ 7 years aged compared to dogs between 2 and 6 years. This implies that frailty increases as dogs age and accumulates more rapidly in dogs ≥ 7 years age. The study population in Banzato et al., which evaluated CFI scores in association with age, had an overall higher disease burden, and dogs were enrolled from a single tertiary referral academic hospital in Italy. We found a similar rate of increase in CFI score per year (~ 0.02/year) in our study population of dogs presenting to predominantly primary care practices throughout the USA. This illustrates that the CFI may be a broadly applicable tool that effectively measures increases in frailty in dogs with varying diseases across a diverse set of veterinary investigators and clinical practices. As reported by Banzato and colleagues, CFI score may also be moderately predictive of short-term mortality based on previous findings [21], but survival was not assessed in this study. Assessing the relationship of CFI score with survival of dogs in this study through longitudinal follow up will be required to evaluate whether the CFI is a frailty tool predictive of mortality in dogs. While our analyses did not find site effects associated with CFI score, further studies demonstrating intra-rater and interrater reliability would be required for formal validation of this instrument.

When accounting for age, CFI scores were not significantly higher in large dogs compared to small dogs in this study, which suggests that overall multimorbidity may not be higher in large dogs despite consistent evidence of their shortened lifespans [28, 29, 43, 44]. Additionally, the slopes of increasing CFI score with age did not significantly differ between large and small dogs, suggesting that large dogs do not accumulate age-associated disease and functional deficits more rapidly. These results are consistent with findings from previous cross-sectional studies on size-related differences in canine multimorbidity, which showed that neither body weight nor breed had an observed effect on the number of comorbid diseases [32] or on cognitive aging, where the trajectory of cognitive decline did not differ due to size or breed-average lifespan [33]. Of course, it is possible that the cross-sectional differences between large and small dogs found here do not reflect longitudinal changes in individual dogs over time. Ongoing large-scale longitudinal cohort studies [15, 45] characterizing the natural course of canine aging will be invaluable for providing additional datasets to address whether size mediates any relationships between healthspan and lifespan in companion dogs.

Relationships between HRQL scores with CFI, age, and size

Measuring quality of life to integrate the human patient (or pet-owner) perspective in response to an intervention is an important efficacy readout implemented in human clinical practice and clinical trials [46,47,48]. Quality of life is a central component of the concept of healthspan, and measuring the number of “healthy” or “good quality” years lived could be operationalized by using these psychometrically validated HRQL instruments. While the VetMetrica HRQL instrument has been psychometrically validated [24, 25] and differentiates between healthy and ill dogs [40], this study is the first to assess age and size differences in HRQL in conjunction with a standardized assessment of frailty score, which we believe to be critical components of healthspan.

In the current study, total HRQL score and all individual HRQL domain scores (E/E, H/C, A/C, C/R) showed significant inverse correlations with clinician-assessed CFI scores after accounting for the effect of age. This provides evidence that frailty contributes to age-related decline in HRQL scores and that an owner assessment can capture the adverse impact of frailty on health-related quality of life. The association between CFI and HRQL score was weakest for the C/R domain perhaps because C/R domain scores are a better reflection of differences or changes in temperament rather than declining health status or frailty [25, 40]. In this study, we aggregated raw individual domain scores to a normalized total HRQL score, which could simplify interpretation and improve usability for clinicians and owners. Total HRQL score did not show differences between size groups nor interaction effects between age and age-size group, lending support to use total HRQL score as a global measure of HRQL in dogs regardless of body size. However, multivariate regression analyses demonstrated significant interaction effects between age and age-size group for A/C domain scores, specifically in older dogs. This suggests that individual domains such as A/C may be more useful for research purposes by providing more granular information on size/breed-dependent differences in age-related declines of physical activity and comfort.

Size-related differences in CFI deficit prevalence and the individual HRQL domain score A/C

The CFI not only demonstrated utility in standardizing and quantifying frailty, but also allowed us to explore qualitative differences in age-associated diseases between large and small dogs. Although this study did not observe any size-related differences in overall CFI scores, size-related differences in specific CFI diseases and deficits were observed between small and large dogs. The most prevalent CFI items overall were more common in old compared to young dogs. Oral disease was very common in all four age-size groups but was the most common in old, small dogs. Osteoarthrosis (osteoarthritis), hair whitening, decreased activity in the last year, weakness with exercise, and reduced vitality in the past year were more common in large compared to small dogs. These differences were consistent with studies showing that small dogs have increased periodontal disease [49] and that large size is a risk factor for osteoarthritis [50, 51]. Our exploratory mediation analysis allowed us to explore how size-related differences in age-associated disease may modulate the decline in A/C scores, introducing future hypotheses to formally test in future studies. As a clinically assessed frailty index, the CFI could be useful for global quantification of frailty applicable to any sized dog, while also uncovering breed and size differences in age-associated diseases.

Limitations and future directions

There are several limitations to consider in this study. Firstly, due to its cross-sectional design, this study does not provide information on the longitudinal performance of the HRQL and CFI.

In the above analyses, we did not control for sex, neuter status, or breed effects. Purebred dogs are documented to have shorter lifespans compared to mixed breeds and likely age differently from mixed breed dogs due to inbreeding depression [28, 29, 52]. Environmental and owner-influenced factors, such as exercise and diet, were not considered in this study, which are potential confounding factors.

This study did not assess dogs between 6 and 7 years and dogs between 25 and 50 lbs/11.3 and 22.7 kg. These age and weight cutoffs were chosen to differentiate between and compare groups but preclude full characterization of the relationship between size and age on HRQL and CFI.

Additional selection bias may have occurred due to challenges in recruiting old and frail dogs. For instance, owners of dogs with higher multimorbidity and frailty are less likely to enroll in an observational study. This could have biased the study population towards old dogs with better health status than the population at large. In general, geriatric dogs (> 12 years old) were not highly represented in this study population, and small-sized dogs are more highly represented at older ages.

Both HRQL and CFI are subject to measurement bias. Despite undergoing psychometric validation, the VetMetrica HRQL is ultimately a subjective measurement that may be sensitive to response style effects such as the owner’s tendency to select extremes on the rating scale. Clear ceiling effects are observed in the HRQL data, suggesting a limited range of sensitivity to detect differences in HRQL at young ages. Ideally, HRQL instruments should be complemented by objective measures of the functional impact of aging. Similarly, floor effects are observed in the CFI data, suggesting limited sensitivity and interpretability at young ages when health deficits and diseases are uncommon.

While the CFI is veterinarian assessed, there are items of the CFI that may be vulnerable to noise and interobserver variability if investigators are not trained on how to assess and score items. In the present study, interobserver reliability was not formally assessed. However, the CFI was robust enough to show higher frailty scores with increasing age, at a similar rate found in Banzato et al., in 2019, even though CFI was scored by a dozen study veterinarians and investigators across 11 different sites. Evaluating interobserver reliability in future studies will further strengthen the validity and utility of this tool.

Conclusion

This 451-dog cross-sectional pilot study demonstrated the ability of two different tools (the Canine Frailty Index and the VetMetrica HRQL instrument) to identify age-related differences across a US companion dog population. CFI scores increased with older age while total HRQL score, as well as individual domain scores for E/E, H/C, and A/C, decreased, confirming the expectation that frailty score increases and health-related quality of life declines as dogs age. These trends were largely consistent across size groups, suggesting that dogs experience similar age-related changes in frailty score regardless of body size. However, A/C scores decreased not only with age, but also with larger body size. Altogether, the results of this study lend support for the use of the Canine Frailty Index and VetMetrica HRQL in capturing two key components of healthspan: the increase in frailty score and deterioration in quality of life with age. Characterizing the ability of the CFI and VetMetrica HRQL tools to effectively detect age and size-related differences in frailty and HRQL scores amongst companion dogs paves the way to utilize these tools in clinical and research settings to operationalize healthspan assessment in dogs and ultimately evaluate novel gerotherapeutics that benefit both human and veterinary medicine.