Validity of utility measures for women with urge, stress, and mixed urinary incontinence




Objective


We sought to evaluate the construct validity of 3 health status classification system instruments–Health Utilities Index Mark 3 (HUI-3), EuroQol (EQ-5D), and Short Form 6D (SF-6D)–and a visual analog scale (VAS) for measuring utility scores in women with urge, stress, and mixed urinary incontinence.


Study Design


Utility scores were measured in 202 women with urinary incontinence. Pelvic floor symptom severity and quality of life were measured using the Pelvic Floor Distress Inventory and Pelvic Floor Impact Questionnaire, respectively. Construct, discriminant, and concurrent validity were evaluated.


Results


Significant correlations were noted between utility scores and the Pelvic Floor Distress Inventory (r = –0.22 to –0.42, P < .05) and the Pelvic Floor Impact Questionnaire (r = –0.32 to –0.50, P < .05). Mean utility scores were significantly lower for women with urge or mixed incontinence compared to stress incontinence for the EQ-5D (0.71 ± 0.23, 0.73 ± 0.26, and 0.81 ± 0.16, respectively, P = .02) and the SF-6D (0.76 ± 0.12, 0.74 ± 0.12, and 0.81 ± 0.11, respectively, P = .02) but not the HUI-3 or the VAS. There was a clinically important difference in utility scores (>0.03) between women with urge or mixed incontinence as compared to stress incontinence for the HUI-3, EQ-5D, and SF-6D but not the VAS. Utility preference scores were significantly lower for women with combined urinary and fecal incontinence (0.69-0.73) than urinary incontinence alone (0.77-0.84, P < .01).


Conclusion


The HUI-3, EQ-5D, and SF-6D, but not the VAS, provide valid measurements for utility scores in women with stress, urge, and mixed urinary incontinence.


Urinary incontinence is a common and costly condition. Over 10 million women in the United States suffer from urinary incontinence. Urinary incontinence severely affects health-related quality of life (HRQOL) and contributes to social isolation, depression, decreased activity, falls, fractures, and nursing home admission. The direct medical cost of urinary incontinence in the United States has been estimated at >$16 billion each year.


Utility preference scores are key elements in cost-effectiveness research. A utility preference score (or utility score) is a measure of patient preference for a given health state, a standardized generic HRQOL measure that summarizes morbidity on a scale from 0 (death) to 1 (optimum health). Utility scores are used to quantify the severity of a patient’s condition and burden of illness and allow comparison across a wide range of disease states, populations, and treatment modalities. Utility scores are also combined with life expectancy estimates to calculate quality adjusted life-years, a unit of measure for quantifying the benefits of an intervention.


Several condition-specific instruments to measure quality of life in women with urinary incontinence exist such as the Pelvic Floor Distress Inventory (PFDI) and the Pelvic Floor Impact Questionnaire (PFIQ). However, these instruments do not allow calculation of utility scores or comparison across different disease states. Several general scales have been developed to measure utility scores for a wide variety of disease conditions and populations. These include the single-item generic visual analog scales (VAS) and the widely used multiitem multiattribute health status classification system instruments, Health Utilities Index Mark 3 (HUI-3) ( http://www.healthutilities.com ; Health Utilities Inc, Hamilton, Ontario, Canada), EuroQol (EQ-5D) ( http://www.euroqol.org ; EuroQol Group, Rotterdam, the Netherlands), and Short Form 6D (SF-6D) ( http://www.qualitymetric.com ; QualityMetric Inc, Lincoln, RI). For the HUI-3, EQ-5D, and SF-6D, population norm data are available and widespread use facilitates the interpretation of results and permits comparisons of disease and treatment outcomes at the local, national, and international levels. Although the HUI-3, EQ-5D, and SF-6D have been used in women with urinary incontinence, the validity of scores on these instruments for measuring utilities in women with urinary incontinence has not been established. For example, a 2013 review of the psychometric performance of EQ-5D in urinary incontinence stated that the EQ-5D generally performed well on tests of construct validity, however, studies specifically designed to assess the psychometric properties of the EQ-5D are lacking. Additionally, the difference in utility scores for women with stress or urge or mixed urinary incontinence is not known.


The aim of the present study is to evaluate the construct validity of 3 multiattribute health status classification system instruments and the VAS for measuring utility preference scores in women with urge, stress, and mixed urinary incontinence within a population of women with pelvic floor disorders.


Materials and Methods


This is a prospective observational study of 286 consecutive new women presenting to the University of Pennsylvania urogynecology practice in the 24-month period from March 2008 through December 2010 with chief complaint of urinary incontinence or pelvic organ prolapse. Institutional review board approval was obtained from the University of Pennsylvania.


All women presenting for new visits were evaluated for eligibility. Women with urinary incontinence or pelvic organ prolapse stage ≥2 were invited to participate in the study. Additional inclusion criteria included ability to give consent and complete questionnaires in English. Exclusion criteria included age <18 years, pregnancy, chronic pain conditions, neurologic diseases, current or recurrent urinary tract infections, and pelvic surgery within the last 6 months.


After obtaining written informed consent, all women were asked to complete: (1) 4 general HRQOL questionnaires: the 3 multiattribute health status classification system instruments as well as a VAS; and (2) 2 condition-specific symptom and HRQOL questionnaires: the PFDI short form (PFDI-20) and the PFIQ short form (PFIQ-7) (details given below). Elements of their physical examination and medical history were obtained from the medical chart. Prolapse was staged using the pelvic organ prolapse quantification system.


Three common preference-based multiattribute health status classification system instruments were used to estimate utility preference scores: HUI-3, EQ-5D, and SF-6D. The HUI-3 classifies health status across 8 attributes (vision, hearing, speech, ambulation, dexterity, emotion, cognition, and pain) with 5-6 levels each for a possible 972,000 unique health states. The EQ-5D has 5 attributes (mobility, self-care, usual activities, pain/discomfort, and anxiety/depression) with 3 levels each for a possible 243 unique health states. The SF-6D is derived from 8 items of the Short Form 12 (SF-12) and has 6 attributes (physical functioning, role limitation, social functioning, pain, mental health, vitality) with 5-6 levels each for a possible 7500 unique health states. Women also completed a 100-point vertically oriented VAS with anchors of “best imaginable health state” and “worst imaginable health state.” VAS scores were divided by 100 prior to analysis to make comparable to the utility score 0-1 scale. Higher scores on the health status instruments and VAS indicate better quality of life.


All women also completed the PFDI-20, a validated, condition-specific questionnaire with 3 subscales, designed to evaluate distress caused by specific pelvic floor symptoms including bowel, urinary, and pelvic organ prolapse complaints. Items on the PFDI-20 first ask whether each symptom is experienced (yes or no response) and if “yes,” the degree of bother is assessed on a scale from 1 (not at all) to 4 (quite a bit). Pelvic floor–related quality of life was measured by the PFIQ-7, a validated condition-specific HRQOL questionnaire also with bladder, bowel, and pelvic organ prolapse subscales. Items on the PFIQ-7 assess the impact of symptoms on ability to do household chores, physical activities, entertainment activities, travel, social activities, emotional health, and feeling frustrated on a scale from 0 (not at all) to 3 (quite a bit). Scores on the PFDI-20 and PFIQ-7 range from 0–300, with higher scores indicating worse symptoms and worse quality of life. The diagnosis of urinary incontinence was based on the Questionnaire for Urinary Incontinence Diagnosis, a questionnaire validated for the diagnosis of urinary incontinence. Comorbid medical conditions were measured by the Charlson Comorbidity Index. All questionnaires were self-administered on the same day during the baseline evaluation. Order of questionnaire administration was varied each day to minimize order effect.


Based on the validated Questionnaire for Urinary Incontinence Diagnosis, women were divided into 3 groups: predominant urge incontinence (urge score ≥6 and stress score <4), predominant stress incontinence (urge score <6 and stress score ≥4), and mixed incontinence (urge score ≥6 and stress score ≥4). Fecal incontinence was defined as leakage of solid or liquid stool with at least “somewhat” bother on the PFDI-20.


For construct validity, we compared the utility scores among women with urge incontinence, stress incontinence, and mixed incontinence. Our hypothesis was that women with mixed incontinence would have worse (lower) utility scores than women with urge or stress incontinence and women with urge incontinence would have worse (lower) utility scores than women with stress incontinence, similar to the relationship of clinical symptoms and the impact on quality of life previously reported in other studies. Utility scores between groups were compared using Kruskal-Wallis test and linear regression was used to adjust for confounding risk factors such as age, comorbidities, presence of coexistent pelvic organ prolapse, and fecal incontinence. For discriminant validity, we examined the relationship of individual subscales of the utility instruments expected to be related to the severity of urinary incontinence (eg, depression, mobility) and also the relationship of individual subscales of the utility instruments not expected to be related to the severity of urinary incontinence (eg, speech, hearing). For concurrent validity, the relationship of the instrument to other similar evaluations, we measured the correlations of the utility scores of the different health status instruments and the VAS with each other. We also measured the correlations of the scores of the health status instruments and the VAS with condition-specific instruments, the bladder subscales of the PFDI-20 and PFIQ-7. Known group differences were explored by evaluating the effect of urinary incontinence with or without concomitant fecal incontinence.


We compared data among 3 groups: women with urge incontinence, stress incontinence, and mixed incontinence. Demographic data are presented as percentages, medians, or means ± SD. Categorical data were compared between groups using Pearson χ 2 and Fisher exact tests as appropriate and continuous variables were compared using analysis of variance or Kruskal-Wallis, as appropriate. Correlations were Spearman or Pearson, as appropriate. Observed significant correlations <0.3 were considered low, between 0.3-0.5 moderate, and ≥0.5 high. Sample size was based on achieving a >5:1 subject-to-item ratio for the longest utility instrument (SF-12 with 8 items contributing to the utility scare) resulting in a minimum of 40 women per group. All reported P values were 2-sided and P values < .05 were considered statistically significant. All statistical analysis was done using software (Stata, version 10.0; StataCorp, College Station, TX).




Results


Of the 286 women enrolled in this study, 202 (71%) had urinary incontinence, whereas 84 (29%) did not. Among women with urinary incontinence, 40 (20%) had urge incontinence, 54 (27%) had stress incontinence, and 108 (53%) had mixed urinary incontinence. Demographics and baseline characteristics of the 3 groups are presented in Table 1 . Women with urge incontinence were significantly more likely to be African American. There were no significant differences in age, parity, body mass index, comorbidities, stage ≥2 prolapse, or fecal incontinence between groups.



Table 1

Demographics and baseline characteristics




























































































































Demographic UUI (n = 40) MUI (n = 108) SUI (n = 54) P value
Age, y 56.5 ± 16.9 56.3 ± 11.1 54.5 ± 10.9 .62 a
Parity (median) 2 (0-9) 2 (0-9) 2 (0-9) .52 b
Body mass index, kg/m 2 28.7 ± 9.8 29.0 ± 6.3 28.4 ± 6.3 .83 a
Race
Caucasian 22 (55%) 79 (74%) 47 (89%) .001 d
African American 17 (43%) 20 (19%) 6 (11%)
Other 1 (3%) 8 (7%) 0
Postmenopausal 24 (60%) 75 (69%) 34 (63%) .49 c
Prior surgery
Hysterectomy 7 (18%) 24 (22%) 8 (15%) .51 c
Pelvic surgery 2 (5%) 11 (10%) 1 (2%) .29 d
Pelvic organ prolapse
Stage ≥2 28 (70%) 63 (58%) 30 (56%) .33 c
Fecal incontinence 8 (20%) 36 (33%) 14 (26%) .25 c
Charlson Comorbidity Index score 0.35 ± 0.69 0.44 ± 0.98 0.17 ± 0.47 .15 a
PFDI-20 score e 110.4 ± 49.4 113.9 ± 51.9 83.5 ± 48.5 .0017 a, g
Urinary subscore f 46.2 ± 24.1 57.3 ± 22.1 42.2 ± 22.1 .0002 a, g
PFIQ-7 score e 62.5 ± 57.3 74.6 ± 64.7 33.5 ± 37.0 .0003 a, g
Urinary subscore f 34.9 ± 28.8 40.6 ± 27.8 21.8 ± 21.0 .0002 a, g

MUI , mixed stress and urge incontinence; PFDI-20 , Pelvic Floor Distress Inventory short form; PFIQ-7 , Pelvic Floor Impact Questionnaire short form; SUI , stress incontinence; UUI , urge incontinence.

Harvie. Utility scores for urinary incontinence. Am J Obstet Gynecol 2014.

a Kruskal-Wallis comparing UUI, SUI, and MUI groups


b Equality of medians test


c Pearson χ 2


d Fisher exact test


e Score range from 0 (best) to 300 (worst)


f Score range from 0 (best) to 100 (worst)


g Remains P < .05 after adjusting for age, race, Charlson Comorbidity Index score, fecal incontinence, and pelvic organ prolapse.



We noted significant differences in the total and bladder subscale scores of the PFDI-20 and PFIQ-7 between groups; highest (worst) scores for mixed urinary incontinence followed by urge urinary incontinence and lowest (best) scores for stress urinary incontinence ( Table 1 ). These relationships persisted after adjusting for age, race, comorbidity index, fecal incontinence, and pelvic organ prolapse.


Construct validity


Mean utility preference scores for women with urge incontinence, stress incontinence, and mixed incontinence are summarized in Table 2 . The pattern of higher (better) utility scores for stress incontinence than urge or mixed incontinence was seen for all instruments. The utility scores of the urge and mixed groups were similar. We noted significant differences in utility scores among the 3 groups for the EQ-5D and SF-6D but not for HUI-3 or the VAS. For the EQ-5D and SF-6D, the differences in scores among the 3 groups persisted after adjusting for age, race, comorbidity index, fecal incontinence, and pelvic organ prolapse ( Table 2 ).



Table 2

Mean utility preference scores for women with urge incontinence, stress incontinence, and mixed incontinence


































Variable UUI (n = 40) MUI (n = 108) SUI (n = 54) P value a
HUI-3 0.78 ± 0.23 0.79 ± 0.24 0.86 ± 0.15 .29
EQ-5D 0.71 ± 0.23 0.73 ± 0.26 0.81 ± 0.16 .02 c
SF-6D 0.76 ± 0.12 0.74 ± 0.12 0.81 ± 0.11 .02 c
VAS b 0.78 ± 0.15 0.78 ± 0.16 0.80 ± 0.14 .63

EQ-5D , EuroQol; HUI-3 , Health Utilities Index Mark 3; MUI , mixed stress and urge incontinence; SF-6D , Short Form 6D; SUI , stress incontinence; UUI , urge incontinence; VAS , visual analog scales.

Harvie. Utility scores for urinary incontinence. Am J Obstet Gynecol 2014.

a Kruskal-Wallis comparing UUI, SUI, and MUI groups


b Scores were divided by 100 to enhance comparability


c Remains significant after controlling for age, race, Charlson Comorbidity Index score, fecal incontinence, and pelvic organ prolapse.

Only gold members can continue reading. Log In or Register to continue

Stay updated, free articles. Join our Telegram channel

May 11, 2017 | Posted by in GYNECOLOGY | Comments Off on Validity of utility measures for women with urge, stress, and mixed urinary incontinence

Full access? Get Clinical Tree

Get Clinical Tree app for offline access