Inter-rater reliability refers to the degree of agreement between two raters who are rating the same youth. Estimates of inter-rater reliability help todescribe levels of consistency between raters, typically indexed by a correlation coefficient (in this case, Pearson’s r; LeBreton & Senter, 2008). Twointer-rater studies were conducted with the Conners 4: (1) two raters of the same rater type rated the same youth (i.e., two parents or two teachers), and(2) two raters of different rater types rated the same youth (i.e., parent/teacher, parent/self-report, or teacher/self-report).
Inter-Rater Reliability Study 1. In the first inter-rater study, dyads of parent raters (N = 68) completed the Conners 4 Parent,and teacher dyads (N = 34) completed the Conners 4 Teacher. Parent dyads were comprised of either a mother and father rating the same child (88.2%),or a mother and a non-biological father (11.8%) rating the same child. For teacher dyads, the teachers in each pair were typically providing instruction indifferent class content (e.g., math and literacy, or science and writing). Ratings were obtained from the parent and teacher dyads on two separateadministrations over a 2- to 4-week interval (14 to 30 days). In appendix F, please see Table F.3 for demographic characteristics of the youth being rated and Table F.4 for demographic characteristics of the parent and teacher raters.
Scores on the Conners 4 scales for the paired raters were correlated. The reliability coefficients are Pearson’s correlations, ranging from -1 to 1,with higher values indicating greater consistency or agreement between raters. Correlation coefficients provide us with a statistical measure of the degreeof association between two variables. Although there are several approaches to interpretation, the correlation coefficients (r) are categorized herein asfollows: absolute values lower than .20 are classified as very weak; values of .20 to .39 are considered weak; values of .40 to .59 are moderate; values of.60 to .79 are strong; and absolute values greater than or equal to .80 are very strong (Evans, 1996).
The obtained inter-rater reliability coefficients and those corrected for range variation (Bryant & Gokhale, 1972), as well as the means, medians, and SDs for each dyad are provided in Table 8.15 for the Parent inter-rater study, and Table 8.16 for the Teacher inter-rater study. Results ofthe Parent inter-rater study were indicative of high levels of consistency between raters across all scales, with slightly lower corrected correlations forAnxious Thoughts and Family Life scales (evidence that parents of the same child can still have differing perspectives, i.e., corrected r rangedfrom .58 to .93, all p< .001). In Table 8.16, the inter-rater study results for teacher dyads showed more moderate to strong associations (i.e., corrected r rangedfrom .38 to .69); this pattern of results is a reminder of the value of obtaining ratings from multiple teachers and evaluating multiple sources ofinformation when conducting a comprehensive assessment.
The consistency between raters was further evaluated by calculating the difference between scores for parent or teacher dyads in the inter-ratersamples. Table 8.17 and 8.18 present the percentage of the sample displaying differences in scores of greater than, or equal to, 10 points (i.e., 1 SD or greater), as well as the absolute mean differences (i.e., the difference between ratings from pairs of parents or teachers averaged across thesample). The results suggest high consistency between parent ratings (scores on the Conners 4 scales were less than 1 SD apart for 79.4% to 100.0%of parent dyads) and moderate to high consistency between teacher ratings (scores on the Conners 4 scales were less than 1 SD apart for 61.8% to82.4% of teacher dyads). The absolute mean and SD differences were also close to zero (i.e., on average, there was close to no difference in scoresbetween the raters, and there was similar dispersion in scores), providing further evidence of inter-rater consistency, given the high degree of agreementbetween the same type of raters.
Table 8.15. Inter-Rater Reliability Study 1: Conners 4 Parent
Scale | Obtained | Corrected | p | Parent 1 | Parent 2 | |||||
M | Mdn | SD | M | Mdn | SD | |||||
Content Scales | Inattention/Executive Dysfunction | .82 | .84 | < .001 | 53.3 | 51 | 10.8 | 53.5 | 50 | 11.9 |
Hyperactivity | .84 | .86 | < .001 | 51.9 | 49 | 10.7 | 50.8 | 48 | 10.0 | |
Impulsivity | .75 | .85 | < .001 | 51.3 | 49 | 9.0 | 51.9 | 49 | 11.3 | |
Emotional Dysregulation | .76 | .78 | < .001 | 51.4 | 49 | 9.9 | 52.8 | 49 | 12.0 | |
Depressed Mood | .68 | .79 | < .001 | 50.4 | 47 | 9.1 | 51.3 | 48 | 8.8 | |
Anxious Thoughts | .78 | .58 | < .001 | 52.0 | 47 | 11.5 | 51.6 | 48 | 10.9 | |
Impairment & Functional | Schoolwork | .88 | .81 | < .001 | 51.5 | 50 | 10.8 | 51.7 | 47 | 11.7 |
Peer Interactions | .91 | .93 | < .001 | 50.8 | 48 | 9.7 | 51.4 | 48 | 9.9 | |
Family Life | .86 | .69 | < .001 | 52.4 | 48 | 11.3 | 52.5 | 48 | 12.0 | |
DSM | ADHD Inattentive Symptoms | .82 | .84 | < .001 | 53.3 | 51 | 10.3 | 53.2 | 49 | 11.7 |
ADHD Hyperactive/Impulsive Symptoms | .83 | .91 | < .001 | 51.8 | 50 | 9.9 | 51.4 | 48 | 10.2 | |
Total ADHD Symptoms | .84 | .89 | < .001 | 52.7 | 51 | 10.3 | 52.5 | 49 | 10.9 | |
Oppositional Defiant Disorder Symptoms | .80 | .82 | < .001 | 51.4 | 50 | 9.5 | 51.9 | 49 | 10.8 | |
Conduct Disorder Symptoms | .68 | .93 | < .001 | 49.4 | 47 | 7.9 | 48.7 | 45 | 6.2 |
Note. N = 68. Reported p-values are for corrected r. Guidelines for interpreting |r|: very weak < .20, weak = .20 to .39,moderate = .40 to .59, strong = .60 to .79, very strong ≥ .80.
Table 8.16. Inter-Rater Reliability Study1: Conners 4 Teacher
Scale | Obtained r | Corrected r | p | Teacher 1 | Teacher 2 | |||||
M | Mdn | SD | M | Mdn | SD | |||||
Content Scales | Inattention/Executive Dysfunction | .54 | .54 | .001 | 61.1 | 63 | 10.1 | 59.7 | 61 | 9.9 |
Hyperactivity | .65 | .52 | .001 | 56.2 | 56 | 12.6 | 55.5 | 53 | 11.0 | |
Impulsivity | .57 | .48 | .004 | 56.7 | 56 | 11.8 | 55.8 | 54 | 10.9 | |
Emotional Dysregulation | .79 | .50 | .003 | 57.3 | 50 | 14.7 | 57.3 | 50 | 15.2 | |
Depressed Mood | .64 | .58 | < .001 | 53.1 | 50 | 10.8 | 52.0 | 48 | 10.6 | |
Anxious Thoughts | .68 | .49 | .004 | 55.0 | 52 | 13.0 | 52.1 | 47 | 12.9 | |
Impairment & Functional Outcome Scales | Schoolwork | .31 | .38 | .028 | 61.0 | 59 | 9.5 | 57.7 | 57 | 8.4 |
Peer Interactions | .66 | .59 | <.001 | 62.2 | 62 | 11.2 | 60.9 | 56 | 10.7 | |
DSM | ADHD Inattentive Symptoms | .51 | .52 | .002 | 61.7 | 64 | 10.1 | 60.1 | 61 | 9.7 |
ADHD Hyperactive/Impulsive Symptoms | .65 | .55 | .001 | 56.5 | 55 | 12.1 | 55.9 | 54 | 10.7 | |
Total ADHD Symptoms | .60 | .57 | < .001 | 59.7 | 58 | 11.0 | 58.5 | 58 | 9.7 | |
Oppositional Defiant Disorder Symptoms | .69 | .48 | .004 | 56.7 | 51 | 13.7 | 54.8 | 50 | 12.7 | |
Conduct Disorder Symptoms | .76 | .69 | < .001 | 53.4 | 48 | 11.3 | 53.2 | 49 | 11.1 |
Note. N = 34. Reported p-values are for corrected r. Guidelines for interpreting |r|: very weak < .20, weak = .20 to .39,moderate = .40 to .59, strong = .60 to .79, very strong ≥ .80.
Table 8.17. Difference Between Rater T-scores Study 1: Conners 4 Parent
Scale | Percentage of Inter-Rater Sample | Absolute M Differences | Absolute SD Differences | |||
Parent 1 ≥ 1 | Scores | Parent 2 ≥ 1 | ||||
Content | Inattention/Executive Dysfunction | 2.9 | 91.2 | 5.9 | 0.3 | 1.1 |
Hyperactivity | 1.5 | 92.6 | 5.9 | 1.0 | 0.7 | |
Impulsivity | 8.8 | 85.3 | 5.9 | 0.6 | 2.3 | |
Emotional Dysregulation | 14.7 | 79.4 | 5.9 | 1.4 | 2.1 | |
Depressed Mood | 10.3 | 86.8 | 2.9 | 1.0 | 0.3 | |
Anxious Thoughts | 8.8 | 80.9 | 10.3 | 0.3 | 0.6 | |
Impairment & Functional Outcome Scales | Schoolwork | 2.9 | 89.7 | 7.4 | 0.2 | 0.9 |
Peer Interactions | 0.0 | 100.0 | 0.0 | 0.6 | 0.3 | |
Family Life | 2.9 | 89.7 | 7.4 | 0.1 | 0.8 | |
DSM | ADHD Inattentive Symptoms | 7.4 | 88.2 | 4.4 | 0.1 | 1.4 |
ADHD Hyperactive/Impulsive Symptoms | 4.4 | 91.2 | 4.4 | 0.4 | 0.3 | |
Total ADHD Symptoms | 4.4 | 91.2 | 4.4 | 0.2 | 0.6 | |
Oppositional Defiant Disorder Symptoms | 10.3 | 85.3 | 4.4 | 0.6 | 1.3 | |
Conduct Disorder Symptoms | 2.9 | 91.2 | 5.9 | 0.7 | 1.7 |
Note. N = 68. 1 SD is equivalent to 10 T-score points.
Table 8.18. Difference Between Rater T-scores Study 1: Conners 4 Teacher
Scale | Percentage of Inter-Rater Sample | Absolute | Absolute | |||
Teacher 1 ≥ 1 SD higher than Teacher 2 | Scores | Teacher 2 ≥ 1 SD higher than Teacher 1 | ||||
Content Scales | Inattention/Executive Dysfunction | 5.9 | 79.4 | 14.7 | 1.1 | 0.1 |
Hyperactivity | 11.8 | 76.5 | 11.8 | 1.6 | 0.3 | |
Impulsivity | 14.7 | 61.8 | 23.5 | 0.5 | 1.4 | |
Emotional Dysregulation | 14.7 | 64.7 | 20.6 | 1.6 | 1.0 | |
Depressed Mood | 5.9 | 79.4 | 14.7 | 0.2 | 0.2 | |
Anxious Thoughts | 8.8 | 76.5 | 14.7 | 0.3 | 1.1 | |
Impairment & Functional | Schoolwork | 8.8 | 70.6 | 20.6 | 0.2 | 1.0 |
Peer Interactions | 14.7 | 70.6 | 14.7 | 0.2 | 0.0 | |
DSM Symptom Scales | ADHD Inattentive Symptoms | 11.8 | 61.8 | 26.5 | 0.8 | 0.2 |
ADHD Hyperactive/Impulsive Symptoms | 14.7 | 64.7 | 20.6 | 1.2 | 1.0 | |
Total ADHD Symptoms | 17.6 | 64.7 | 17.6 | 1.1 | 0.8 | |
Oppositional Defiant Disorder Symptoms | 5.9 | 82.4 | 11.8 | 1.3 | 0.7 | |
Conduct Disorder Symptoms | 11.8 | 64.7 | 23.5 | 0.4 | 0.6 |
Note. N = 34. 1 SD is equivalent to 10 T-score points.
Inter-Rater Reliability Study 2. In the second inter-rater study, comparisons were made across different types of raters. As theConners 4 Parent, Teacher, and Self-Report all measure similar constructs, similarity in scores across the different types of raters would provide evidenceof the reliability of the test scores. Although some degree of similarity in ratings is expected between informants, given that they are rating the sameyouth on the same constructs, it is nonetheless expected that there be a certain degree of incongruence between their ratings, because different informantssee the youth in different contexts, and may have dissimilar perceptions of, or experiences with, the youth’s behavior.
For the Conners 4, correlation coefficients (Pearson’s r, LeBreton & Senter, 2008) were calculated between scores for the following pairs ofraters: (1) parent and teacher, (2) parent and self-report, and (3) teacher and self-report. To examine these relationships, parents, teachers, and youthprovided ratings of the same youth (N = 62; all ratings completed within a 30-day period). Refer to appendix F, Table F.4 for the demographic characteristics of the youth being rated (recall that in this studythe same youth is being rated by themselves, a parent, and a teacher), and of the parent and teacher raters.
The obtained correlation coefficients between different rater types, as well as those corrected for range restrictions (Bryant & Gokhale, 1972), areprovided in Table 8.19 (Parent/Teacher), Table 8.20, (Parent/Self-Report), and Table 8.21 (Teacher/Self-Report).
For the Parent/Teacher dyads, the majority of the corrected correlations were moderate in size (median r = .46; ranging from r = .28 to.79), with two (Depressed Mood r = .36, Anxious Thoughts r = .28) being weakly correlated, and one (DSM Conduct Disorder Symptoms) beingstrongly correlated (r = .79). This strong correspondence for the DSM Conduct Disorder Symptoms scale is likely due to infrequent occurrence, asmost raters do not report observing Conduct Disorder symptoms as they are severe behaviors. This similarity between raters in the infrequent endorsement ofbehaviors associated with Conduct Disorder is likely what accounted for the strong correlation between ratings on this scale.
The majority of corrected correlations were weak in the Parent/Self-Report dyads (median r = .30; ranging from r = .19 to .72; DSMConduct Disorder Symptoms r = .72), as well in the Teacher/Self-Report dyads (median r = .23; ranging from r = .16 to .61; DSM Conduct Disorder Symptoms r = .61). Notably, for both the Parent/Self-Report and Teacher/Self-Report dyads,Impulsivity scores were very weakly correlated (r = .16 and r = .11, respectively), as were scores for Peer Interactions (r = .12and r = .16, respectively). This low agreement between raters may be due to a variety of reasons, one of which is setting differences. Youth canreport on their own behaviors across multiple settings, whereas parents and teachers can only report on those behaviors observed in specificenvironments, such as home and school. Another reason for low agreement for Impulsivity may be the youth’s unique insight into their own impulsivebehaviors, especially in adolescence when these behaviors may be more inwardly felt than outwardly expressed, while parents and teachers might be lessaware of the effort needed to restrain such behaviors. Similarly, for Peer Interactions, youth may have different insights into their friendships andexperiences with their peers than what their parents or teachers would have access to observe.
Table 8.19. Inter-RaterReliability Study 2: Conners 4 Parent/Teacher
Scale | Obtained | Corrected | p | Parent | Teacher | |||||
M | Mdn | SD | M | Mdn | SD | |||||
Content Scales | Inattention/Executive Dysfunction | .54 | .51 | < .001 | 63.5 | 64 | 10.5 | 59.3 | 60 | 10.2 |
Hyperactivity | .47 | .40 | .002 | 59.0 | 57 | 12.4 | 53.8 | 52 | 10.0 | |
Impulsivity | .62 | .54 | < .001 | 59.8 | 60 | 12.6 | 54.1 | 53 | 9.8 | |
Emotional Dysregulation | .51 | .40 | .002 | 56.0 | 53 | 12.5 | 52.9 | 48 | 11.0 | |
Depressed Mood | .40 | .36 | .005 | 56.8 | 55 | 10.8 | 54.8 | 51 | 10.5 | |
Anxious Thoughts | .52 | .28 | .030 | 58.6 | 53 | 15.4 | 57.7 | 56 | 13.5 | |
Impairment & Functional | Schoolwork | .58 | .48 | < .001 | 62.2 | 61 | 11.2 | 58.2 | 59 | 11.7 |
Peer Interactions | .60 | .45 | < .001 | 57.3 | 55 | 13.2 | 54.7 | 54 | 11.2 | |
DSM Symptom Scales | ADHD Inattentive Symptoms | .51 | .48 | < .001 | 63.4 | 64 | 10.3 | 59.0 | 59 | 10.4 |
ADHD Hyperactive/Impulsive Symptoms | .53 | .43 | .001 | 59.7 | 58 | 12.8 | 53.8 | 53 | 10.2 | |
Total ADHD Symptoms | .51 | .46 | < .001 | 62.6 | 63 | 11.5 | 56.8 | 57 | 10.0 | |
Oppositional Defiant Disorder Symptoms | .50 | .44 | < .001 | 55.1 | 52 | 12.1 | 52.2 | 47 | 9.9 | |
Conduct Disorder Symptoms | .53 | .79 | < .001 | 50.4 | 48 | 7.8 | 49.0 | 45 | 6.3 |
Note. N = 62. Reported p-values are for corrected r. Guidelines for interpreting |r|: very weak < .20, weak = .20 to .39,moderate = .40 to .59, strong = .60 to .79, very strong ≥ .80.
Table 8.20. Inter-Rater Reliability Study 2: Conners 4 Parent/Self-Report
Scale | Obtained | Corrected | p | Parent | Self-Report | |||||
M | Mdn | SD | M | Mdn | SD | |||||
Content Scales | Inattention/Executive Dysfunction | .19 | .19 | .147 | 63.5 | 64 | 10.5 | 60.0 | 59 | 9.4 |
Hyperactivity | .50 | .40 | .002 | 59.0 | 57 | 12.4 | 58.7 | 58 | 10.7 | |
Impulsivity | .23 | .16 | .209 | 59.8 | 60 | 12.6 | 56.1 | 54 | 11.2 | |
Emotional Dysregulation | .38 | .30 | .018 | 56.0 | 53 | 12.5 | 54.2 | 53 | 10.2 | |
Depressed Mood | .39 | .35 | .007 | 56.8 | 55 | 10.8 | 54.8 | 52 | 10.7 | |
Anxious Thoughts | .34 | .20 | .122 | 58.6 | 53 | 15.4 | 55.3 | 52 | 11.3 | |
Impairment & Functional | Schoolwork | .23 | .20 | .121 | 62.2 | 61 | 11.2 | 57.5 | 56 | 10.3 |
Peer Interactions | .18 | .12 | .372 | 57.3 | 55 | 13.2 | 56.9 | 55 | 11.6 | |
Family Life | .23 | .23 | .081 | 56.5 | 51 | 12.8 | 53.3 | 52 | 7.9 | |
DSM Symptom Scales | ADHD Inattentive Symptoms | .23 | .23 | .081 | 63.4 | 64 | 10.3 | 59.3 | 58 | 9.8 |
ADHD Hyperactive/Impulsive Symptoms | .44 | .35 | .006 | 59.7 | 58 | 12.8 | 58.6 | 58 | 10.2 | |
Total ADHD Symptoms | .36 | .32 | .011 | 62.6 | 63 | 11.5 | 59.6 | 58 | 9.8 | |
Oppositional Defiant Disorder Symptoms | .53 | .52 | < .001 | 55.1 | 52 | 12.1 | 52.9 | 51 | 8.5 | |
Conduct Disorder Symptoms | .44 | .72 | < .001 | 50.4 | 48 | 7.8 | 47.8 | 46 | 6.1 |
Note. N = 62. Reported p-values are for corrected r. Guidelines for interpreting |r|: very weak < .20, weak = .20 to .39,moderate = .40 to .59, strong = .60 to .79, very strong ≥ .80.
Table 8.21. Inter-Rater Reliability Study 2:Conners 4 Teacher/Self-Report
Scale | Obtained | Corrected | p | Teacher | Self-Report | |||||
M | Mdn | SD | M | Mdn | SD | |||||
Content Scales | Inattention/Executive Dysfunction | .21 | .22 | .094 | 59.3 | 60 | 10.2 | 60.0 | 59 | 9.4 |
Hyperactivity | .32 | .30 | .021 | 53.8 | 52 | 10.0 | 58.7 | 58 | 10.7 | |
Impulsivity | .13 | .11 | .383 | 54.1 | 53 | 9.8 | 56.1 | 54 | 11.2 | |
Emotional Dysregulation | .22 | .20 | .125 | 52.9 | 48 | 11.0 | 54.2 | 53 | 10.2 | |
Depressed Mood | .41 | .37 | .003 | 54.8 | 51 | 10.5 | 54.8 | 52 | 10.7 | |
Anxious Thoughts | .51 | .36 | .005 | 57.7 | 56 | 13.5 | 55.3 | 52 | 11.3 | |
Impairment & Functional | Schoolwork | .28 | .23 | .072 | 58.2 | 59 | 11.7 | 57.5 | 56 | 10.3 |
Peer Interactions | .21 | .16 | .209 | 54.7 | 54 | 11.2 | 56.9 | 55 | 11.6 | |
DSM Symptom Scales | ADHD Inattentive Symptoms | .21 | .21 | .107 | 59.0 | 59 | 10.4 | 59.3 | 58 | 9.8 |
ADHD Hyperactive/Impulsive Symptoms | .27 | .26 | .047 | 53.8 | 53 | 10.2 | 58.6 | 58 | 10.2 | |
Total ADHD Symptoms | .22 | .23 | .082 | 56.8 | 57 | 10.0 | 59.6 | 58 | 9.8 | |
Oppositional Defiant Disorder Symptoms | .39 | .45 | < .001 | 52.2 | 47 | 9.9 | 52.9 | 51 | 8.5 | |
Conduct Disorder Symptoms | .28 | .61 | < .001 | 49.0 | 45 | 6.3 | 47.8 | 46 | 6.1 |
Note. N = 62. Reported p-values are for corrected r. Guidelines for interpreting |r|: very weak< .20, weak = .20 to .39, moderate = .40 to .59, strong = .60 to .79, very strong ≥ .80.
In addition, the congruence of scores was analyzed by examining percentages of the samples with similar scores. These results are summarized in Tables8.22 to 8.24.
As seen in Table 8.22, it was a frequent occurrence in the Parent/Teacher sample for there to be congruence in scores; that is, scores of ratings of theyouth often differed by less than 1 SD (ranging from 58.1% to 75.8% of the sample, across all scales). Notably, parents provided slightly higherratings than teachers (as evidenced by the small negative mean differences across all scales). The results suggest that scores on the Conners 4 scales havemoderate consistency across parent and teacher ratings, providing supporting evidence for inter-rater agreement, but also showing the importance of acomprehensive evaluation that engages multiple raters who draw upon different perspectives and experiences with the youth.
A similar pattern was seen in the comparisons between parent and self-reported ratings, as presented in Table 8.23. Parents and youth tended to providelargely similar ratings, yielding scores that are within 1 SD of each other for 56.5% to 87.0% of this sample. Additionally, the differences betweentheir scores were modest, ranging from -0.2 to -4.6 points, with higher ratings coming from the parent raters.
Similarly, for the Teacher/Self-Report comparisons, aligned scores that are within 1 SD of each other were observed for 51.6% to 87.1% of this sample (see Table 8.24), with slightly higher scores coming from Self-Report ratings.
Overall, these findings suggest moderate differences between the ratings of parents, teachers, and youth, with parents typically yielding the highestscores of the three rater types. Parents may have a unique opportunity for insight that teachers may be lacking, such as the youth’s home-life behaviors,while teachers or youth rating themselves have more opportunity for peer comparison, providing a different perspective. These findings demonstrate, whenlooking across different types of raters, moderate inter-rater reliability for the Conners 4 scales, capturing consistency in measurement across raters, aswell as the varying perspectives of raters with differing types of relationships to the youth being rated. Although a high degree of similarity isexpected between raters, perfect agreement is improbable; instead, discrepancies between raters’ scores, as seen in this study, emphasize the importance ofconsulting multiple sources to capture unique information that can reveal nuanced differences. The raters may be drawing upon their experiences with theyouth being rated in dissimilar contexts, which may lead to slightly different responses between rater types. Results serve to highlight theimportance of examining information from multiple sources when conducting a comprehensive assessment.
Table 8.22. Difference Between Rater T-scores Study 2: Conners 4 Parent/Teacher
Scale | Percentage of Inter-Rater Sample | M | SD | |||
Teacher ≥ 1 | Scores | Parent ≥ 1 | ||||
Content Scales | Inattention/Executive Dysfunction | 6.5 | 75.8 | 17.7 | −4.2 | −0.3 |
Hyperactivity | 4.8 | 64.5 | 30.6 | −5.2 | −2.4 | |
Impulsivity | 1.6 | 75.8 | 22.6 | −5.7 | −2.7 | |
Emotional Dysregulation | 11.3 | 64.5 | 24.2 | −3.0 | −1.5 | |
Depressed Mood | 11.3 | 66.1 | 22.6 | −2.0 | −0.2 | |
Anxious Thoughts | 14.5 | 61.3 | 24.2 | −0.8 | −1.9 | |
Impairment & Functional | Schoolwork | 11.3 | 58.1 | 30.6 | −4.0 | 0.5 |
Peer Interactions | 4.8 | 71.0 | 24.2 | −2.7 | −2.0 | |
DSM Symptom Scales | ADHD Inattentive Symptoms | 6.5 | 64.5 | 29.0 | −4.4 | 0.1 |
ADHD Hyperactive/Impulsive Symptoms | 3.2 | 66.1 | 30.6 | −5.9 | −2.5 | |
Total ADHD Symptoms | 3.2 | 66.1 | 30.6 | −5.8 | −1.5 | |
Oppositional Defiant Disorder Symptoms | 6.5 | 87.1 | 6.5 | −3.0 | −2.2 | |
Conduct Disorder Symptoms | 11.3 | 67.7 | 21.0 | −1.4 | −1.5 |
Note. N = 62. 1 SD is equivalent to 10 T-score points. A positive M difference indicates that the teacher's ratings are higher than the parent's ratings.
Table 8.23. Difference Between Rater T-scores Study 2:Conners 4 Parent/Self-Report
Scale | Percentage of Inter-Rater Sample | M | SD | |||
Self-Report ≥ 1 SD higher | Scores | Parent ≥ 1 | ||||
Content Scales | Inattention/Executive Dysfunction | 14.5 | 53.2 | 32.3 | −3.5 | −1.1 |
Hyperactivity | 17.7 | 66.1 | 16.1 | −0.2 | −1.7 | |
Impulsivity | 14.5 | 53.2 | 32.3 | −3.7 | −1.4 | |
Emotional Dysregulation | 17.7 | 58.1 | 24.2 | −1.8 | −2.2 | |
Depressed Mood | 11.3 | 64.5 | 24.2 | −2.0 | −0.1 | |
Anxious Thoughts | 14.5 | 51.6 | 33.9 | −3.3 | −4.1 | |
Impairment & Functional Outcome Scales | Schoolwork | 8.1 | 56.5 | 35.5 | −4.6 | −0.9 |
Peer Interactions | 27.4 | 51.6 | 21.0 | −0.4 | −1.6 | |
Family Life | 12.9 | 62.9 | 24.2 | −3.3 | −4.9 | |
DSM Symptom Scales | ADHD Inattentive Symptoms | 14.5 | 51.6 | 33.9 | −4.1 | −0.5 |
ADHD Hyperactive/Impulsive Symptoms | 16.1 | 59.7 | 24.2 | −1.1 | −2.5 | |
Total ADHD Symptoms | 8.1 | 64.5 | 27.4 | −3.0 | −1.7 | |
Oppositional Defiant Disorder Symptoms | 1.6 | 87.1 | 11.3 | −2.2 | −3.6 | |
Conduct Disorder Symptoms | 9.7 | 72.6 | 17.7 | −2.6 | −1.7 |
Note. N = 62. 1 SD is equivalent to 10 T-score points. A positive M difference indicates that the youth's self-report ratings are higher than the parent's ratings.
Table 8.24. Difference Between Rater T-scores Study 2:Conners 4 Teacher/Self-Report
Scale | Percentage of Inter-Rater Sample | M | SD | |||
Self-Report ≥ 1 | Scores | Teacher ≥ 1 | ||||
Content Scales | Inattention/Executive Dysfunction | 22.6 | 58.1 | 19.4 | 0.7 | −0.8 |
Hyperactivity | 6.5 | 58.1 | 35.5 | 5.0 | 0.7 | |
Impulsivity | 21.0 | 51.6 | 27.4 | 1.9 | 1.4 | |
Emotional Dysregulation | 17.7 | 54.8 | 27.4 | 1.2 | −0.7 | |
Depressed Mood | 19.4 | 59.7 | 21.0 | 0.0 | 0.2 | |
Anxious Thoughts | 25.8 | 61.3 | 12.9 | −2.4 | −2.2 | |
Impairment & Functional | Schoolwork | 29.0 | 58.1 | 12.9 | −0.6 | −1.4 |
Peer Interactions | 17.7 | 51.6 | 30.6 | 2.2 | 0.4 | |
DSM | ADHD Inattentive Symptoms | 22.6 | 56.5 | 21.0 | 0.4 | −0.6 |
ADHD Hyperactive/Impulsive Symptoms | 11.3 | 56.5 | 32.3 | 4.8 | 0.0 | |
Total ADHD Symptoms | 11.3 | 64.5 | 24.2 | 2.8 | −0.2 | |
Oppositional Defiant Disorder Symptoms | 9.7 | 87.1 | 3.2 | 0.8 | −1.4 | |
Conduct Disorder Symptoms | 12.9 | 69.4 | 17.7 | −1.2 | −0.2 |
Note. N = 62. 1 SD is equivalent to 10 T-score points. A positive M difference indicates that the youth's self-report ratings are higher than the teacher's ratings.
In addition to the differences already described regarding the Conners 4 scales, the Conners 4 Critical & Indicator Items were also evaluated fordifferences among Parent, Teacher, and Self-Report scores using the same inter-rater sample (note that the Severe Conduct Critical Items are not presentedas these items are subsumed in the DSM Conduct Disorder Symptoms scale). The frequency of endorsement of the Critical & Indicator Items was examined foreach rater group, with the expectation that varying perspectives and insight into sensitive topics may yield quite distinct patterns of responses. For theSelf-Harm Critical Items to be considered endorsed, responses had to be greater than or equal to 1, “Just a little true (Occasionally),” such thatany degree of endorsement flagged a Critical Item for inclusion in the frequency count. For the Sleep Problems Indicator, endorsement was defined asresponses greater than 2, with the exception of the appearing/feeling tired item which requires a rating of 3, “Completely true (Very often/Always)”(on Self-Report only), in accordance with how these behaviors are flagged within the Conners 4 reports (see chapter 4, Interpretation).
As seen in Table 8.25, for the Self-Harm Critical Items, there was greater endorsement on the Self-Report relative to Parent and Teacher raters. Mostnotably, the Critical Item about deliberate self-harm is endorsed more than twice as often in Self-Report ratings (19.4%) relative to Parent ratings(8.1%). These items reflect deeply sensitive and personal behaviors or attitudes, to the extent that a youth may not wish to disclose this information to aparent or teacher. The discrepancies between the reports of different rater types, specifically when inquiring about self-harm thoughts and behaviors,highlight the importance of collecting information from multiple informants.
Table 8.25. Endorsement of Critical Items &Indicator Items by Rater Type
Item Stem | Parent (%) | Teacher (%) | Self-Report (%) | |
Self-Harm Critical Items | Harming self deliberately | 8.1 | 1.6 | 19.4 |
Talking about suicide | 8.1 | 4.8 | — | |
Planning or attempting suicide | 4.8 | 3.2 | 14.5 | |
Thinking about harming self | — | — | 29.0 | |
Sleep Problems Indicator | Having trouble sleeping | 35.5 | — | 40.3 |
Appearing/Feeling tired | 6.5 | 8.1 | 21.0 |
Note. Self-Harm Critical Items are considered endorsed with an item response of ≥ 1, while Sleep Problems Indicator items are considered endorsed with an item response of ≥ 2 for trouble sleeping, an itemresponse of ≥ 2 for tiredness for Parent and Teacher, and an item response of 3 for tiredness for Self-Report.
ncG1vNJzZmilmKiwpbqNm6Ooml6YvLOxjbCgp5yfrMBvusStZqago2LEpq6Ohn%2BMZYekv6Wc0Z6qrGd8mq6zuo2Gf4xnfZa7tq3LrGacp56jsrO%2Fvm2Woaydoayurc2umKWXlqq5rXvPmqmtbF%2BYtXmryKernqqPp661sdFnn62lnA%3D%3D