Journal of Pediatric Psychology Advance Access published January 6, 2016

Journal of Pediatric Psychology, 2016, 1–20 doi: 10.1093/jpepsy/jsv161 Systematic Review

Systematic Review: Issues in Measuring Clinically Meaningful Change in Self-Reported Chronic Pediatric Pain Intensity John V. Lavigne,1,2,3 PHD, ABPP Department of Child and Adolescent Psychiatry, Ann & Robert H. Lurie Children’s Hospital of Chicago, 2Feinberg School of Medicine, Northwestern University, and 3Mary Ann and J. Milburn Smith Child Health Research Program, Children’s Hospital of Chicago Research Center All correspondence concerning this article should be addressed to John V. Lavigne, PHD, Ann & Robert H. Lurie Children’s Hospital of Chicago, 225 East Chicago Avenue, Chicago, IL 60611, USA. E-mail: [email protected] Received July 25, 2015; revisions received November 28, 2015; accepted December 1, 2015

Abstract Objective This report examined limitations in our ability to assess clinically significant change (CSC) in randomized controlled trials of treatments of self-reported pediatric chronic pain intensity. Methods The following were reviewed: (a) approaches to assessing CSC; (b) approaches to assessing CSC used in psychological treatment studies of self-reported pediatric chronic pain intensity included in a recent systematic review; (c) the role of test–retest reliability in distribution-based CSC measures; (d) the test–retest reliability of recommended chronic pain measures. Results and Conclusions Existing studies do not assess whether a CSC occurred or use procedures that did not account for measurement error and true score fluctuations unrelated to treatment, possibly resulting in overestimating CSCs. Distribution-based approaches to assessing CSCs that address these problems require knowing test–retest reliability of the chronic pain measure at appropriate intervals. Available information raises concern about our ability to estimate CSC reliably. Recommendations are made for future research.

Key words: clinical significance; chronic and recurrent pain; pain.

Pain is a complex phenomenon with distinct domains that include pain intensity, physical functioning, symptoms, and emotional functioning, among others (McGrath et al., 2008; Norman, Sloan, & Wyrwich, 2003; Palermo, 2009). Clinical trials of the treatment of chronic pediatric pain rely on selfreports of pain as a key outcome measure. Few areas of medicine rely as heavily on self-report for clinical diagnosis (Miro, 2009), and self-reported chronic pain intensity remains the single most common measure of intervention response in pediatric pain studies (Palermo, 2009; von Baeyer, 2009). Singleitem self-report measures of chronic pediatric pain intensity are commonly used (Stinson, Kavanaugh, Yamada, Gill, & Stevens, 2006), and are

recommended for use in measuring that construct (McGrath et al., 2008). While psychologists have long been using selfreports to assessing change in patient’s affect and behavior, in some areas of medicine, the emphasis on measuring treatment response using Patient Reported Outcomes (PROs) when biomarkers are not available has increased. This is true for measuring outcomes for all pediatric chronic illnesses, for example, quality of life (Bevans, Riley, Moon, & Forest, 2010), and certainly includes the measurement of pain in response to treatment (Food and Drug Administration, 2012). Accordingly, there has been increased interest in ways of assessing change in randomized controlled trials (RCTs) in which PROs are used.

C The Author 2016. Published by Oxford University Press on behalf of the Society of Pediatric Psychology. V

All rights reserved. For permissions, please e-mail: [email protected]

1

Downloaded from http://jpepsy.oxfordjournals.org/ at The University of British Colombia Library on March 14, 2016

1

2

Approaches to Measuring CSC While estimating the statistical significance and ES of an intervention occurs at the group level (treatment vs.

comparison group), determining whether a CSC has occurred involves patient-level analysis, that is, determining whether each study participant has achieved a CSC. Approaches to assessing CSC include anchorbased and distribution-based (Cella, Eton, Lai, Peterman, & Merkel, 2002; Copay, Subach, Glassman, Polly, & Schuler, 2007), percent-change (Dworkin et al., 2008), statistical (Jacobson & Truax, 1991), and combined approaches (Crosby, Kolotkin, & Williams, 2003). See Table I for a summary of these approaches.

Anchor-Based Approaches Anchor-based methods map a difference score on a continuous measure to a patient’s verbal “anchor” of “improved,” “better,” etc. (Cella et al., 2002). That difference score becomes the minimum clinically important difference (MCID), that is, the smallest change important to the patients (Copay et al., 2007). For example, if the mean score for pre- to posttreatment change on a visual analogue scale (VAS) associated with children’s ratings of “better” is 23.00 mm, that value is the MCID, and scores 23.00 mm are considered clinically significant. Presumably, that MCID value can then be used in similar situations to determine who has achieved a clinically meaningful change, but searching for a single, generalizable number may be unrealistic (Cella et al., 2002), especially when developmental factors, demographic differences, pain conditions, level of pain severity, etc. could influence the MCID. Pediatric studies of anchor-based MCIDs have been conducted (Bulloch & Tenenbein, 2002; Myrvik et al., 2013; Tsze et al., 2015). The anchor-based approach, relying on a global improvement scale (GIS) such as a self-report of better, appears face valid and is easy to interpret, but has considerable limitations. Norman, Stratford, and Regehr (1997) note that judging improvement requires that the individuals be able to quantify both their initial and present state, but adults have difficulty recalling their initial pain state. Similarly, children’s recollections of prior pain levels have been found to be poor (Chogle et al., 2012; van den Brink, BandellHoekstra, & Abu-Saad, 2001), raising questions about the child’s ability to accurately use such a GIS as a “better” rating. In addition, the reliability of rating “better” or “improved” is unknown and, in general, there is little evidence supporting strong test– retest reliability of global scales to measure change (Norman et al., 1997). If the anchor used to quantify the MCID may not be reliable, the MCID value for the continuous measure may not be reliable. Furthermore, if the GIS itself were reliable, quantifying the MCID would be unnecessary (McCarthy,

Downloaded from http://jpepsy.oxfordjournals.org/ at The University of British Colombia Library on March 14, 2016

In addition, while statistical significance was once the only criteria applied to determining whether research findings were meaningful, more emphasis is now being placed on the importance of assessing the magnitude of the observed effect (American Psychological Association, 2010). This can take the form of reporting an effect size (ES), but could also take the form of reporting a measure of clinical significance. Different approaches have been taken to assessing the clinical significance of treatment, and the psychometric properties of the measures used in assessing change can influence how clinically significant treatment appears to be. Along with the potential clinical implications, there are also policy implications associated with assessing clinically significant change (CSC). For clinical trials of treatment for the functional disorder of irritable bowel syndrome (IBS), for example, the FDA (Food and Drug Administration, 2012) has recommended using a measure of self-reported clinical significance based on the percent change in pain reported by the individual. RCTs of medications to treat this condition are expected to use a standard of 30% in pain symptoms as one outcome criteria for success; as we shall see, the reliability of findings based on that approach to measuring symptom change may be less reliable than alternatives. Presently, it is important to assess what we know about different approaches to assessing clinically meaningful treatment outcomes, how psychometric properties of chronic pain intensity measures influence the measurement of clinical significance, and which approaches are likely to lead to studies that identify reliable, clinically significant psychological and pharmacological approaches to treating pediatric pain. The purpose of this review is to examine whether existing measure of chronic pain intensity that have been recommended for use in systematic reviews (Cohen et al., 2008; Stinson et al., 2006) or by expert panels (McGrath et al., 2008), or used in RCTs of psychological treatments of chronic pain in children and adolescents (Fisher et al., 2014), are suitable for assessing CSC in future RCTs. This report will discuss the following in the given order: (a) the strengths and weaknesses of approaches to measuring CSC in RCTs in which chronic pediatric pain intensity is an outcome measure; (b) how CSC has been measured in RCTs of psychological interventions for chronic pediatric pain treatment; (c) based on a systematic review, what we know about the test–retest reliability of chronic pediatric pain intensity measures recommended for use in prior reports as well as those used in clinical trials; (d) future directions in measuring CSC in studies of pediatric pain.

Lavigne

Pain Report Clinical Significance

3

Table I. Characteristics of Approaches to Measuring Clinically Significant Change (CSC) Approach

Core element

Anchor based

Global rating of “better,” Appealing face validity; easy to “improved,” etc.; may include obtain; multiple approaches to quantification of rating on a calculating MCID value continuous scale to calculate the minimal clinically significant difference (MCID)

Percent change

Cutoff score to classify as improved versus not improved based on a percent change from individual’s baseline score

b. Standard error of mean (SEM) approach Statistical

Combined

Number needed to treat (NNT)

Unlikely to be a single anchor-based MCID across ages, disorders, etc.; poor retrospective recall of initial state raises concerns about meaningfulness of rating; reliability of global rating of “better,” etc. is not known Appealing face validity as a Magnitude of change varies to meet marker for a minimum change; cutoff based on initial pain level; Food and Drug Administration may identify some individuals as recommended for some condiimproved when there reports are tions; often used for some disornot reliable ders, allowing comparisons across studies

Uses Cohen’s ES to specify cutoff for improved

ES familiar to scientific community

CSC determined based on the SEM in the sample/population

Using 1.96 SEM, 95% confidence that rating of improved vs. not improved is reliable Cutoff score likely to be meaningful

Defines the CSC based on change to a target level of the normal population (e.g., 2 weeks. Thus, we know little about the test–retest reliability of the VAS for the intervals that are common for pre–post or pre-followup assessments in RCTs in studies of psychological interventions. Drug trials often cover the same intervals, so relevant information for use with such studies is also lacking. Although the focus of this report has been on the problems of ascertaining test–retest reliability estimates that can be used to examine CSC in treatment studies of chronic pediatric pain, the effects of our limited knowledge of test–retest reliability for pain intensity measures applies to other types of conditions and other types of PROs. For example, acute pain resulting from an injury or surgical procedure could last for days or weeks. Variability in patient-reported pain intensity could result from variability in the underlying latent “true” level of pain and not just measurement error. A study designed to determine whether, say, a psychological intervention reduced acute pain a week after a particular surgical procedure was conducted would need to know what the test–retest reliability for that condition was for the tested interval to determine in CSC resulted from treatment. It is less clear how problems with variability in latent “true” levels of pain might affect studies of treatments for procedural pain. The true level of pain associated with, say venipuncture, is likely to diminish over time, but the diminution in latent true pain intensity might be rapid enough that it adds little to variability above measurement error. As the studies in Table III indicate, we know no more about the test–retest reliability of these measures when they are used to assess acute or procedural pain that we do about chronic pain. How changes in latent true pain levels vary is, of course, a question that requires empirical verification. There is also an important pragmatic aspect to the use of test–retest reliabilities. Requiring that someone be considered a responder based on a combined statistically reliable change and CSC sets a high bar for deciding that a treatment is successful. Studies using that standard to determine treatment success may require larger sample sizes than studies assess whether treatment and control groups differ on a continuous measure in the manner typically used in RCTs. While such studies may be more difficult to conduct, their findings may be more meaningful and

Downloaded from http://jpepsy.oxfordjournals.org/ at The University of British Colombia Library on March 14, 2016

reliability can be used to establish the criterion for a reliable change with only a 5% chance that classification of responders is not statistically reliable. Methods combining the need to achieve a reliable change with a criterion for clinical significance (“better” using an anchor-based approach, a meaningful percent change criterion, or statistical criteria for clinical significance) provide the best way to ensure accurate classification as both reliable and clinically significant. The limitations of our understanding of the test– retest reliability of recommended measures of chronic pediatric pain intensity, however, make it difficult for researchers to assess reliable CSC in studies of pediatric pain intensity at this time. The limitations are both conceptual and empirical. Conceptually, it is known that the magnitude of test–retest reliability coefficients is reduced not only by measurement error but also by true score fluctuations that vary over time. It is clear from reports on the use of distribution-based measures of clinical significance that they were intended to be used with traits or symptoms showing such fluctuations in true scores, such as quality of life (Cella et al., 2002) and marital satisfaction (Jacobson & Truax, 1991). It is not uncommon for the magnitude of test–retest reliability coefficients to decline as the interval between measurement points increases. As a result, to estimate true score fluctuations, it seems appropriate that the test–retest reliability used to calculate a CSC should approximate the interval between test administrations in the RCT, that is, the interval between pre- and posttreatment assessments, and between pretreatment and follow-up. Using the reliability coefficient for a short interval will inflate the clinical significance of a treatment that takes 4–12 weeks to administer, while a coefficient from a longer time interval may lead to underestimating the intervention’s clinical significance. This issue has not been well-discussed and warrants further attention. Empirically, this review indicates that we lack sufficient information about the test–retest reliability of existing pediatric pain intensity measures to be able to calculate statistically reliable, clinically significant estimates of treatment effectiveness for psychological interventions for chronic pain as this time. In the studies reviewed by Fisher et al. (2014), the pre–post treatment interval varied widely, with most treatments occurring over 3–12 weeks, and the posttreatment to follow-up period from 3 to 12 months. Presently, we have no studies of the test–retest reliability that could be used in determining CSCs for any recommended measure of pediatric pain intensity for any intervals that are similar to those used in RCTs for psychological interventions for chronic pediatric pain. For the VAS, some data are available on short intervals that may be useful in studying pre- to posttreatment CSCs. Test–retest reliability for 1-week interval has been

Lavigne

Pain Report Clinical Significance

trial. Studies would be needed to determine whether test–retest reliabilities differ for treated versus untreated control groups. For disorders that occur with some regularity in the general population (e.g., abdominal pain, headache), estimates of test–retest reliability in general populations may be useful, but it is not known if they differ for a community, nonclinical sample from those with chronic conditions. There has been considerable interest in the use of electronic diaries for measure pain. Presently, attention has focused on the feasibility and acceptability of this technique (Palermo, Valenzuela, & Stork, 2004; Peters et al., 2000; Stinson, 2009), but attention to the reliability of such measures needs to be addressed, as well, if they are to play an important role in assessing reliable, clinically significant outcomes in future studies. Both paper and electronic diaries are often used for a week or so at each pretreatment, posttreatment, and follow-up assessment period in the course of the RCT. This may reduce the likelihood of recall error for that period compared with asking the child or adolescent what his or her usual or worst pain was during that interval. For the purpose of statistical analysis, those diaries are typically summarized into a single score for each assessment interval, and as the review shows, the reliability of those weekly summary scores over longer intervals is not established, and weekly fluctuations in the pain experience may be considerable. It is possible that the test–retest reliability coefficient for the control or comparison group in the RCT being conducted is appropriate for calculating the RCI to be used in calculating the CSC for that particular study. The interval between test administrations would be the same as that for the treatment group and would control for any effect of the repeated test administration. It would be of interest to examine how the test–retest reliability of the control/comparison group might differ from that for a group that only took the pre- and posttest in the manner of a wait-list control group. It is now well recognized that there are different domains to pain than just pain intensity, and changes in these other domains may be important to measure in studying treatment outcomes. Further review will be needed to determine whether adequate information is available on test–retest reliability to assess CSCs on measures in these other domains, and additional research on this psychometric property for measures in these other domains is likely to be needed. The absence of information about test–retest reliability has clinical and policy consequences. Based on the studies of psychological treatments of chronic pediatric pain included in the Fisher et al. (2014) systematic review, we have reason to believe that those

Downloaded from http://jpepsy.oxfordjournals.org/ at The University of British Colombia Library on March 14, 2016

more reliable. Presently, there is general concern over the difficulty in medicine and social science in the frequency with which study results cannot be replicated (Ioannidis, 2005, 2008), and the contributions of measurement error to this problem may be substantial (Ioannidis, 2008). Possibly, replication would be easier to achieve if outcome measures were more rigorous, with greater assurance that outcomes were reliable. This would occur using distribution or combined approaches compared with approaches not requiring reliable estimates of treatment response. Further research is needed to examine the test– retest reliability of measures of chronic pediatric pain intensity. At the practical level, more information about test–retest reliability of pain intensity is needed to be able to assess the clinical significance of treatments. It is important for future studies to assess the differences that occur in test–retest reliability at different time intervals for measures of pain intensity. Without a clearer understanding of the differences in that statistic that occur, there can be no empirical basis for deciding the magnitude of changes that occur and, say, whether an estimate of test–retest reliability for 8 weeks may be applicable to a treatment taking 10 weeks to complete. In addition, different approaches to self-reporting chronic pain intensity can be taken. At different intervals (e.g., pretreatment, posttreatment, follow-up), a study participant might be asked to report how intense her pain is now, what was the worst pain she experienced during a particular time interval, or what the usual pain level was for that time interval. Such ratings could be supplied for 1 day or 1 week, or daily for multiple days, with the average for that interval used as the measure of pain intensity. Given how few studies assessed test–retest reliability for chronic pain using the measures that were reviewed, the differences in test–retest reliability and their impact on assessing CSC could not be examined in this report. Nonetheless, the differences in these approaches to measuring pain intensity are potentially important, and attention to the effects of these different approaches on the test–retest reliability of chronic pain intensity reports need to be examined in future studies. Information about test–retest reliability of chronic pain intensity measures is also needed for different disorders and developmental levels. Given the paucity of information currently available, we do not know how different the test–retest reliabilities are for this difference disease  developmental level groups. If such information were available, researches would be able to draw on such information in estimating sample sizes for designing RCTs with clinical significance as an end point. Lacking such information, clinicians may need to gather test–retest data during the clinical

15

16

Lavigne

Recommendations These observations recommendations:

lead

to

the

following

1. To increase the likelihood that calculating CSC will result in reliable and clinically meaningful results, combined approaches to calculating CSC are preferable, particularly those that combine a distribution-based and statistical approach. 2. A nomological network of reports of test–retest reliability across different test administration intervals and with different ages and painful conditions needs to be developed. It will also be important to examine differences as a function of variations in test forms (e.g., different end point descriptors). This information is needed so that researchers can plan sample sizes for RCTs. 3. Until that network is developed, the best estimates of test–retest reliability that can be used in a particular study to calculate the CSCs will need to be based on those obtained for that particular study, using the test– retest reliabilities between administrations for the untreated control group for the intervals included in the study between assessment periods.

It is important to keep in mind that the empirical basis of support for the use of psychological interventions to treatment pediatric chronic pain has grown and is likely to continue to grow. As the field moves forward in its study of such interventions, better estimates of the clinical significance of such treatments will be important for the clinician and the consumer to know. Finding that the clinical significance of such treatments is good will be encouraging and, if the results are less reassuring, they may stimulate increased research leading to better treatments.

Supplementary Data Supplementary data can be found at: http://www.jpepsy. oxfordjournals.org/. Conflicts of interest: None declared.

References *Indicates studies in the systematic review of the test-retest reliability. **Indicates studies from the systematic review of treatments by Fisher et al. (2014). Abram, H. S., Buckloh, L. M., & Schilling, L. M. (2007). A randomized, controlled trial of a neurological and psychoeducational group appointemtn model for pediatric headaches. Children’s Healthcare, 36, 249–265. **Alfven, G., & Lindstrom, A. (2007). A new mehod for the treatment of recurrent abdominal pain of prolonged negative stress origin. Acta Paediatrica, 96, 76–81. doi:10.1111/j.1651-2227.2006.00028.x American Education Research Association, American Psychological Association, & National Council on Measurement in Education. (2014). Standards for educational and psychological testing. Washington, DC: American Psychological Association. American Psychological Association. (2010). Publication manual of the American Psychological Association (6th edn). Washington, DC: American Psychological Association. Andrasik, F., Burke, E. J., Attanasio, V., & Rosenblum, E. L. (1985). Child, parent, and physician reports of a child’s headache pain: Relationships prior to and following treatment. Headache, 25, 421–425. *Bailey, B., Bergeron, S., Gravel, J., & Daoust, R. (2007). Comparison of four pain scales in children with acute abdominal pain in a pediatric emergency department. Annals of Emergency Medicine, 50, 379–383. **Bailey, B., Daoust, R., Doyon-Trottier, E., DauphinPierre, S., & Gravel, J. (2010). Validation and properties of the verbal numeric scale in children with acute pain. Pain, 149, 216–221. doi:10.1016/j.pain.009.12.008 *Bailey, B., Gravel, J., & Daoust, R. (2012). Reliability of the visual analog scale in children with acute pain in the emergency department. Pain, 153, 839–842. doi:10.1016/ j.pain.2012.01.006 **Barry, J., & von Baeyer, C. L. (1997). Brief cognitivebehavioral group treatment for children’s headache. Clinical Journal of Pain, 13, 215–220. *Belter, R. W., McIntosh, J. A., Finch, A. J., & Saylor, C. F. (1988). Preschoolers’ ability to differentiate levels of pain: Relative efficacy of three self-report measures. Journal of Clinical Child Psychology, 17, 329–335. Berntson, L., & Svensson, E. (2001). Pain assessment in children with juvenile chronic arthritis: A matter of scaling and rater. Acta Paediatrica, 90, 1131–1136. Bevans, K. B., Riley, A. W., Moon, J. H., & Forest, C. B. (2010). Conceptual and methodological advances in childreported outcomes measurement. Expert Review of Pharmacoeconomics and Outcomes Research, 10, 385– 396. doi:10.1586/ER.10.52 *Bieri, D., Reeve, R. A., Champion, G. D., Addicoat, L., & Ziegler, J. B. (1990). The faces pain scale for the self-

Downloaded from http://jpepsy.oxfordjournals.org/ at The University of British Colombia Library on March 14, 2016

treatments can be of benefit and the magnitude of the ES is small/moderate. Because none of those studies calculated statistically reliable, clinically significant effects, however, we do not know just how clinically significant these treatments are (i.e., the percent of participants who achieved a clinically significant, statistically reliable effect in the treatment group), nor whether the treatments were more effective than the comparison groups in producing CSCs. Clearly, more attention is needed to determining the clinical significance of such treatments. At the policy level, the FDA and EMA recommended percent-change criteria for evaluating certain kinds of treatments, 30% and 50%, respectively. Consideration to change those standards to include a combined approach advocating statistically reliable and clinically significant standards has been suggested (Saps & Lavigne, 2015). Finally, while we have confined this discussion to pain intensity outcome measures, the same concerns apply to all PROs measures, with most such measures lacking information concerning test–retest reliability (Bevans et al., 2010) that limit the ability to reliably assess CSCs.

Pain Report Clinical Significance

Cognitive-behavioral family intervention. Journal of Pediatric Gastrenterology, 43, 59–64. Dworkin, R. H., Turk, D. C., Wyrwich, K. W., Beaton, D., Cleeland, C. S., Farrar, J. T., . . . Zavisic, S. (2008). Interpreting the clinical importance of treatment outcomes in chronic pain clinical trials: IMMPACT recommenations. Journal of Pain, 9, 105–121. doi:10.1016/ j.jpain.2007.09.005 Eccleston, C., Palermo, T. M., Williams, A. C., Lewandowski, A., Morley, S., Fisher, E., & Law, E. (2013). Psychological therapies for the management of chronic and recurrent pain in children and adolescents (Review). Cochrane Database of Systematic Reviews, 12, 1–83. doi:10.1002/14651858.CD003968.pub3 **Fichtel, A., & Larsson, B. (2001). Does relaxation treatment have differential effects on migraine and tension-type headache in adolescents? Headache, 41290–296. Fisher, E., Heathcote, L., Palermo, T., de C. Williams, A. C., Lau, J., & Eccleston, C. (2014). Systematic review and meta-analysis of psychological therapies for children with chronic pain. Journal of Pediatric Psychology, 39, 763– 782. doi:10.1093/jpepsy/jsu008 Food and Drug Administration. (2012). Guidance for industry: Irritable bowel syndrome-clinical evaluation of drugs for treatment. Retrieved from http://www.fda.gov/down loads/Drugs/GuidanceComplianceRegulatoryinformation/ Guidances/UCM205269.pdf Goodenough, B., Addicoat, L., Champion, G. D., McInerney, M., Young, B., Juniper, K., & Ziegler, J. B. (1997). Pain in 4- to 6-year-old children receiving intramuscular injections: A comparison of the Faces Pain Scale and other self-report and behavioral measures. The Clinical Journal of Pain, 13, 60–73. *Gragg, R. A., Rapoff, M. A., Danowsky, M. B., Lindsley, C. B., Varni, J. W., Waldron, S. A., & Bernstein, B. H. (1996). Assessing chronic mesculoskeleetal pain associated with rheumatic disease: Further validation of the pediatric pain questionnaire. Journal of Pediatric Psychology, 21, 237–250. **Griffiths, J. D., & Martin, P. R. (1996). Clinical-versus home-based treatment formats for children with chronic headaches. British Journal of Health Psychology, 1, 151– 166. Heise, D. R. (1969). Separating reliability and stability in test-retest correlation. American Sociological Review, 34, 93–101. Hester, N. K. (1979). The preoperation child’s reaction to immunization. Nursing Research, 28, 250–255. **Hicks, C. L., Von Baeyer, C. L., & McGrath, P. J. (2006). Online psychological treatment for pediatric recurrent pain: A randomized evaluation. Journal of Pediatric Psychology, 31, 724–736. doi:10.1093/jpepsy/jsj065 Hicks, C. L., Von Baeyer, C. L., Spafford, P. A., van Korlaar, I., & Goodenough, B. (2001). The faces pain scale-revised: Toward a common metric in pediatric pain measurement. Pain, 2001, 173–183. Hirschfeld, G., Wager, J., Schmidt, P., & Zernikow, B. (2014). Minimally clinically significant differences for adolescenets with chronic pain-variability of ROC-based cut points. Journal of Pain, 15, 32–39. doi:10.1016/ j.pain.2013.09.006

Downloaded from http://jpepsy.oxfordjournals.org/ at The University of British Colombia Library on March 14, 2016

assessment of the severity of pain experienced by children: Development, initial validation, and preliminary investigation for ratio scale properties. Pain and Headache, 41, 139–150. Birnie, K. A., McGrath, P. J., & Chambers, C. T. (2012). When does pain matter? Acknowledging the subjectivity of clinical significance. Pain, 153, 2311–2314. doi: 10.1016/j.pain.2012.07.033 Blanchard, E. B., & Andrasik, F. (1985). Management of chronic headaches: A psychological approach. New York, NY: Pergamon. Bulloch, B., & Tenenbein, M. (2002). Assessment of clinically significant changes in acute pain in children. Academic Emergency Medicine, 9, 199–202. **Bussone, G., Grazzi, L., D’Amico, D., Leone, M., & Andrasik, F. (1998). Biofeedback-assisted relaxation training for young adolescents with tension-type headache: A controlled study. Cephalalgia, 18, 463–467. >Cella, D., Eton, D. T., Lai, J. S., Peterman, A. H., & Merkel, D. E. (2002). Combining anchor and distribution-based methods to derive minimal clinically important differences on the Functional Assessment of Cancer Therapy (FACT) anemia and fatigue scales. Journal of Pain and Symptom Management, 24, 547– 561. Charry, C. L. E., dos Santos Piola, J., Linhares, M. B. M., & da Silva, J. A. (2014). Validity and reliability assessment of the Brazilian version of the faces pain scale-revised. Psychology and Neuroscience, 7, 55–59. doi:10.3922/ j.psns.2014.1.08 Chogle, A., Sztainberg, M., Bass, L., Youssef, N. N., Miranda, A., Nurko, S., . . . Saps, M. (2012). Accuracy of pain recall in children. Journal of Pediatric Gastroenterology and Nutrition, 55, 288–291. doi:10.1097/MPG.0b013e31824cf08a Cohen, L. L., Lemanek, K., Blount, R. L., Dahlquist, L. M., Lim, C. S., Palermo, T. M., . . . Weiss, K. E. (2008). Evidence-based assessment of pediatric pain. Journal of Pediatric Psychology, 33, 939–955. doi:10.1093/jpepsy/ jsm103 **Connelly, M. (2010). The verbal numeric rating scale in the pediatric emergency deparatment: What do the numbers really mean?. Pain, 149, 167–168. doi:10.1016/ j.pain.2010.02.031 Connelly, M., Rapoff, M. A., Thompson, N., & Connelly, W. (2006). Headstrong: A pilot study of a CD-ROM intervention for recurrent pediatric headache. Journal of Pediatric Psychology, 31, 737–747. doi:10.1093/jpepsy/ jsj003 Copay, A. G., Subach, B. R., Glassman, S. D., Polly, D. W., & Schuler, T. C. (2007). Understanding the minimum clincally important difference: A review of concepts and methods. The Spine Journal, 7, 541–546. doi:10.1016/ j.spine.2007.01.008 Crosby, R. D., Kolotkin, R. L., & Williams, R. (2003). Defining clinically meaningful change in health-related quality of life. Journal of Clinical Epidemiology, 56, 395– 407. **Duarte, M. A., Penna, F. J., Andrade, E. M. G., Cancela, C. S. P., Neto, J. C. A., & Barbosa, T. F. (2006). Treatment of nonorganic recurrent abdominal pain:

17

18

**Levy, R. L., Langer, S. L., Walker, L. S., Romano, J. M., Christie, D. L., Youssef, N., . . . Whitehead, W. E. (2010). Cognitive-behavioral therapy for children with functional abdominal pain and their parents decreases pain and other symptoms. American Journal of Gastroenterology, 105, 946–956. doi:10.1038/ajg.2010.106 **Levy, R. L., Langer, S. L., Walker, L. S., Romano, J. M., Christie, D. L., Youssef, N., . . . Whitehead, W. E. (2013). Twelve-month follow-up of cognitive behavioral therapy for children with functional abdominal pain. JAMA Pediatrics, 167, 178–184. doi:10.1001/2013.jamap ediatrics.282 *Luffy, R., & Grove, S. K. (2003). Examining the validty, reliabity, and preference of three pediatric pain measurement tools in African-American children. Pediatric Nursing, 29, 54–59. McCarthy, I. (2012). The minimum clinically important difference and its use in medical research. Retrieved from www.thisiscapstone.com/details/article/2041047/The_ Minimum_Clinically_Important_Difference_And_Its_ Use_In_Medical_Research.html McGrath, P. J., de Veber, L., & Hearn, M. (1985). Multidimensional pain assessment in children. Paper presented at the advances in pain research and therapy: Proceedings from the 4th World Congress on Pain, Seattle, Washington. **McGrath, P. J., Humphreys, P., Goodman, J. T., Keene, D., Firestone, P., Jacob, P., & Cunningham, S. J. (1988). Relaxation prophylaxis for childhood migraine: A randomized placebo-controlled trial. Developmental Medicine and Child Neurology, 30, 626–631. **McGrath, P. J., Humphreys, P., Keene, D., Goodman, J. T., Lascelles, M. A., Cunningham, S. J., & Firestone, P. (1992). The efficacy and efficiency of a self-administered treatment for adolescent migraine. Pain and Headache, 49, 312–324. McGrath, P. J., Walco, G. A., Turk, D. C., Dworkin, R. H., Brown, M. T., Davidson, K., . . . Zeltzer, L. (2008). Core outcome domains and measures for pediatric acute and chronic/recurrent pain clinical trials: PedIMMPACT recommendations. Journal of Pain, 9, 771–783. doi:10.1016/ j.pain.2008.04.007 Miro, J. (2009). Translational pain research: Achievements and challenges. Journal of Pain, 10, 1001–1011. doi:10.1016/j.jpain.2009.06.002 Miro, J., Castarlenas, E., & Huguet, A. (2009). Evidence for the use of a numerical rating scale to assess the intensity of pediatric pain. European Journal of Pain, 13, 1089–1095. doi:10.1016/j.ejpain.2009.07.002 *Miro, J., & Huguet, A. (2004). Evaluation of reliability, validity, and preference for a pediatric pain intensity scale: The Catalan version of the faces pain scale-revised. Pain, 111, 59–64. doi:10.1016/j.pain.2004.05.023 Moher, D., Liberati, A., Tetzlaff, J., Altman, D. G., & The PRISMA Group. (2009). Preferred reporting items for systematic reviws and meta-analyses: The PRISMA statement. PLoS Med, 6, e1000097. doi:10.1371/ jurnal.pmed1000097 Myrvik, M. P., Brandow, A. M., Drendel, A. L., Yan, K., Hoffman, R. G., & Panepinto, J. A. (2013). Clinically meaningful measurement of pain in children with sickle

Downloaded from http://jpepsy.oxfordjournals.org/ at The University of British Colombia Library on March 14, 2016

Humphreys, P. A., & Gevirtz, R. N. (2000). Treatment of recurrent abdominal pain: Components analysis of four treatment protocols. Journal of Pediatric Gastrenterology and Nutrition, 31, 47–51. *Hunter, M., McDowell, L., Hennessy, R., & Cassey, J. (2000). An evaluation of the faces pain scale with young children. Journal of Pain and Symptom Management, 20, 122–129. Ioannidis, J. P. A. (2005). Contradicted and initially stronger effects in highly cited clinical research. JAMA, 294, 218– 228. Ioannidis, J. P. A. (2008). Why most discovered true associations are inflated. Epidemiology, 19, 640–648. doi:10.1097/EDE.0b013e31818131e7 Jacobson, N., & Truax, P. (1991). Clinical significance: A statistical approach to defining meaningful change in psychotherapy research. Journal of Consulting and Clinical Psychology, 59, 12–19. Kashikar-Zuck, S., Sil, S., Lynch-Jordan, A. M., Tink, T. V., Peugh, J., Schikler, K. N., . . . Lovell, D. J. (2013). Changes in pain coping, catastrophizing, and coping efficacy after cognitive-behavioral therapy in children and adolescents with juveile fibromyalgia. Journal of Pain, 14, 492–501. **Kashikar-Zuck, S., Swain, N. F., Jones, B. A., & Graham, T. B. (2005). Efficacy of cognitive-behavioral intervention for juvenile primary fibromyalgia syndrome. Journal of Rheumatology, 32, 1594–1602. **Kashikar-Zuck, S., Ting, T. V., Arnold, L. M., Bean, J., Powers, S. W., Graham, T. B., . . . Lovell, D. J. (2012). Cognitive behavioral therapy for the treatment of juvenile fibromyalgia. Arthritis and Rheumatism, 64, 297–305. doi:10.1002/art.30644 **Kroener-Herwig, B., & Denecke, H. (2002). Cognitive-behavioral therapy of pediatric headache: Are there differences in efficacy between a therapist-administered group training and a self-help format? Journal of Psychosomatic Research, 531107–1114. >**Labbe, E. E. (1995). Treatment of childhood migraine with autogenic training and skin temperature biofeedback: A component analysis. Headache, 353, 10–13. **Labbe, E. E., & Williamson, D. A. (1984). Treatment of childhood migraine using autogenic feedback training. Journal of Consulting and Clinical Psychology, 52, 968– 976. **Larsson, B., & Carlsson, J. (1996). A school-based, nurseadministered relaxation training for children with chronic tension-type headache. Journal of Pediatric Psychology, 31, 603–614. **Larsson, B., Daleflod, B., Hakansson, L., & Melin, L. (1987). Therapist-assisted versus self-help relaxation treatment of chronic headaches in adolescents: A school-based intervention. Journal of Child Psychology and Psychiatry, 28, 127–136. **Larsson, B., & Doberl, A. (1990). Recurrent tension headache in adolescents treated with self-help relaxation and a muscle relaxant drug. Headache, 30, 667–671. **Larsson, B., Melin, L., Lamminen, M., & Ullstedt, F. (1987). A school-based treatment of chronic headaches in adolescents. Journal of Pediatric Psychology, 12, 553–566.

Lavigne

Pain Report Clinical Significance

*Sanchez-Rodriguez, E., Miro, J., & Castarlenas, E. (2012). A comparison of four self-report scales of pain intensity in 6- to 8- year-old children. Pain, 153, 1715–1719. doi:10.1016/j.pain.2012.05.007 **Sanders, M. R., Shepherd, R. W., Cleghorn, G., & Woolford, H. (1994). The treatment of recurrent abdominal pain in children: A controlled comparison of cognitivebehavioral family intervention and standard pediatric care. Journal of Consulting and Clinical Psychology, 63, 306–314. *Saps, M., & Lavigne, J. V. (2015). Abdominal pain endpoints currently recommended by the FDA and EMA for adult patients with irritable bowel syndrome may not be reliabile in children. Neurogastroenterology and Motility, 27, 849–855. doi:10.1111/nmo/12559 **Sartory, G., Muller, B., Metsch, J., & Pothmann, R. (1998). A comparison of psychological and pharmacological treatment of pediatric migraine. Behaviour Research and Therapy, 36, 1155–1170. Scharff, L., Marcus, D. A., & Masek, B. J. (2002). A controlled study of minimal-contact thermal biofeedback treatment in children with migraine. Journal of Pediatric Psychology, 27, 109–119. Stinson, J. N. (2009). Improving the assessment of pediatric chronic pain: Harnessing the potential of electronic diaries. Pain Research and Management, 14, 59–64. Stinson, J. N., Kavanaugh, T., Yamada, J., Gill, N., & Stevens, B. (2006). Systematic review of the psychometric properties, interpretability and feasibility of self-report pain intensity measures for use in clinical trials in children and adolescents. Pain, 125, 143–157. doi:10.1061/ j.pain.2006.05.006 **Stinson, J. N., McGrath, P. J., Hodnett, E. D., Feldman, B. M., Duffy, C. M., Huber, A. M., . . . White, M. E. (2010). An internet-based self-management program with telephone support for adolescents with arthritis: A pilot randomized controlled trial. Journal of Rheumatology, 37, 1944–1952. doi:10.3899/jrheum.091327 Thompson, K. L., Varni, J. W., & Hanson, V. (1987). Comprehensive assessment of pain in juvenile rheumatoid arthritis: An empirical model. Journal of Pediatric Psychology, 12, 241–255. Tomlinson, D., Von Baeyer, C. L., Stinson, J., & Sung, L. (2010). A systematic review of Faces Scales for the self-report of pain intensity in children. Pediatrics, 126, e1–e31. doi:10.1542/peds.2010-1609 **Trautman, E., & Kroner-Herwig, B. (2010). A randomized controlled trial of internet-based self-help training for recurrent headache in childhood and adolescence. Behaviour Research and Therapy, 48, 28–37. doi:19.1016/ j.brat.2009.09.004 *Tsze, D. S., Hirschfeld, G., Von Baeyer, C. L., Bulloch, B., Dayan, P. S., & Walthall, J. (2015). Clinically significant differences in acute pain measured on self-report pain scales in children. Academic Emergency Medicine, 22, 415–422. doi:10.1111/acem.12620 Tsze, D. S., Von Baeyer, C. L., Bulloch, B., & Dayan, P. S. (2013). Validation of self-report pain scales in children. Pediatrics, 132, e971–e979. doi:10.1542/peds.2013-1509 van den Brink, M., Bandell-Hoekstra, E. N. G., & Abu-Saad, H. H. (2001). The occurrence of recall bias in pediatric

Downloaded from http://jpepsy.oxfordjournals.org/ at The University of British Colombia Library on March 14, 2016

cell disease. Pediatric Blood Cancer, 6, 1689–1695. doi:10.1002/pbc.24624 Norman, G. R., Sloan, J. A., & Wyrwich, K. W. (2003). Interpretation of changes in health-related quality of life: The remarkable universality of half a standard deviation. Medical Care, 41, 582–592. Norman, G. R., Stratford, P., & Regehr, G. (1997). Methodological problems in the retrospective computation of responsiveness to change: The lesson of Cronbach. Journal of Clinical Epidemiology, 50, 869–879. *Osterhaus, S. O. L., Lange, A., Linssen, W. H. J. P., & Passchier, J. (1997). A behavioral treatment of young migrainous and nonmigrainous headache patients: Prediction of treatment success. International Journal of Behavioral Medicine, 4, 378–396. *Page, M. G., Katz, J., Stinson, J., Isaac, L., Martin-Pichora, A. L., & Campbell, F. (2012). Validation of the numerical rating scale for pain intensity and unpleasantness in pediatric acute postoperative pain: Sensitivity to change over time. Journal of Pain, 13, 359–369. doi:10.1016/ j.pain.2011.12.010 **Palermo, T. M. (2009). Assessment of chronic pain in children: Current status and emerging topics. Pain Research and Management: The Journal of the Canadian Pain Society, 141, 21–26. Palermo, T. M., Valenzuela, D., & Stork, P. P. (2004). A randomized trial of electronic versus paper pain diaries in children: Impact on compliance, accuracy, and acceptability. Pain and Headache, 107, 213–219. doi:10.1016/ j.pain.2003.10.005 Palermo, T. M., Wilson, A. C., Peters, M., Lewadowski, A., & Somhegyi, H. (2009). Randomized controlled trial of an internet-delivered family cognitive-behavioral therapy intervention for children and adolescents with chronic pain. Pain, 146, 205–213. doi:10.1016/j.pain.20009.07.034 **Passchier, J., van den Bree, M. B. M., Emmen, H. H., Osterhaus, S. O. L., Orlebeke, J. F., & Verhage, F. (1990). Relaxation training in school classes does not reduce headache complaints. Headache, 30, 660–664. Perrott, D. A., Goodenough, B., & Champion, G. D. (2004). Children’s ratings of the intensity and unpleasantness of post-operative pain using facial expression scales. European Journal of Pain, 8, 119–127. doi:10.1016/ S1090-3801(03)00087-9 Peters, M. L., Sorbi, M. J., Kruise, D. A., Kerssens, J. J., Verhaak, P. F. M., & Bensing, J. M. (2000). Electronic diary assessment of pain, disability and psychological adaptationi patients differing in duration of pain. Pain, 84, 181–192. Richardson, G. M., McGrath, P. J., Cunningham, S. J., & Humphreys, P. (1983). Validity of the headache diary for children. Headache, 23, 184–187. **Richter, I. L., McGrath, P. J., Humphreys, P. J., Goodman, J. T., Firestone, P., & Keene, D. (1986). Cognitive and relaxation treatment of pediatric migraine. Pain, 25, 195–203. **Robins, P. M., Smith, S. M., Glutting, J. J., & Bishop, C. T. (2005). A randomized controlled trial of a cognitive-behavioral family intervention of pediatric recurrent abdominal pain. Journal of Pediatric Psychology, 30, 397–408. doi:10.1093/jpepsy.jsi063

19

20

von Baeyer, C. L., Spagrud, L. J., McCormick, J. C., Choo, E., Neville, K., & Connelly, M. A. (2009). Three new datasets supporting use of the Numerical Rating Scale (NRS11) for children’s self-reports of pain intensity. Pain, 143, 223–227. doi:10.1016/j.pain.2009.03.002 Walco, G. A., & Dampier, C. D. (1990). Pain in children and adolescents with sickle cell disease: A descriptive study. Journal of Pediatric Psychology, 15, 643–658. **Wicksell, R. K., Melin, L., Lekander, M., & Olsson, G. L. (2009). Evaluating the effectiveness of expousre and acceptance strategies to improve functioning and quality of life in longstanding pediatric pain- a randomized controlled trial. Pain, 141, 248–257. doi:10.1016/j.pain.2008.11.006 Wyrwich, K. W., Nienaber, N. A., Tierneyh, W. M., & Wolinksy, F. D. (1999). Linking clinical relevance and statistical significance in evaluating intra-individual changes in health-related quality of life. Medical Care, 37, 469– 478. Wyrwich, K. W., Tierney, W. M., & Wolinsky, F. D. (1999). Further evidence supporting an SEM-based criterion of identifying meaningful intra-individual changes in healthrelate quality of life. Journal of Clinical Epidemiology, 52, 861–873.

Downloaded from http://jpepsy.oxfordjournals.org/ at The University of British Colombia Library on March 14, 2016

headache: A comparison of questionnaire and diary data. Headache, 41, 11–20. **van Tilburg, M. A. L., Chitkara, D. K., Palsson, O. S., Turner, M., Blois-Martin, N., Ulshen, M., & Whitehead, W. E. (2009). Audio-recorded guided imagery treatment reduces functional abdominal pain in children: A pilot study. Pediatrics, 124, e890–e897. doi:10.1542/peds. 2009-0028 **Vlieger, A. M., Menko-Frankenuis, C., Wolfkamp, S. C. S., Tromp, E., & Beninga, M. A. (2007). Hypnotherapy for children with functional abdominal pain or irritable bowel syndrome: A randomized controlled trial. Gastroenterology, 133, 1430–1436. doi:10.1053/ j.gastro.2007.08.072 Voepel-Lewis, T., Burke, C. N., Jeffreys, N., Malviya, S., & Tait, A. R. (2011). Do 0–10 numeric rating scores translate into clinically meaningful pain measures for children? Anesthesia and Analgesia, 112, 415–421. doi:10.1023/ ANA.0b013e318203f495 von Baeyer, C. L. (2009). Children’s self-report of pain intensity: What we know, where we are headed. Pain Research and Management: The Journal of the Canadian Pain Society, 14, 39–45.

Lavigne

Systematic Review: Issues in Measuring Clinically Meaningful Change in Self-Reported Chronic Pediatric Pain Intensity.

This report examined limitations in our ability to assess clinically significant change (CSC) in randomized controlled trials of treatments of self-re...
564B Sizes 0 Downloads 6 Views