{"id":2761,"date":"2022-01-26T11:44:29","date_gmt":"2022-01-26T11:44:29","guid":{"rendered":"https:\/\/www.psyctc.org\/psyctc\/?post_type=docs&#038;p=2761"},"modified":"2022-01-26T11:44:29","modified_gmt":"2022-01-26T11:44:29","password":"","slug":"inter-rater-agreement-reliability","status":"publish","type":"docs","link":"https:\/\/www.psyctc.org\/psyctc\/glossary2\/inter-rater-agreement-reliability\/","title":{"rendered":"Inter-rater agreement\/reliability"},"content":{"rendered":"\n<p>One way to assess how good, how usable, a rating system is is to get more than one person to use the system to do ratings of the same things where what is rated has a broad coverage of the possible ratings in the system.  Ratings may be undimensional and continuous, e.g. &#8220;how much warmth in this clip do you think this carer is expressing about &#8230;&#8221;, or categorical &#8220;into which of the categories would you put this AAI interview?  The index of inter-rater agreement you are most likely to see for categorical ratings is <a href=\"https:\/\/www.psyctc.org\/psyctc\/glossary2\/cohens-kappa\/\" data-type=\"docs\" data-id=\"2754\">Cohen&#8217;s kappa<\/a>, a <a href=\"https:\/\/www.psyctc.org\/psyctc\/glossary2\/correlation\/\" data-type=\"docs\" data-id=\"1877\">correlation<\/a> coefficient is better for continuous ratings.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\">Details<\/h4>\n\n\n\n<p>Inter-rater reliability\/agreement is an extremely general way of assessing a rating system.  Cohen&#8217;s kappa, the first really widely recognised and used description of the issues and how to use joint ratings in psychological situations, applied to just two raters rating a set of things but there are extensions and alternatives that can be used for more than two raters.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\">Try also<\/h4>\n\n\n\n<p>Reliability<br><a href=\"https:\/\/www.psyctc.org\/psyctc\/glossary2\/bias\/\" data-type=\"docs\" data-id=\"1845\">Bias<\/a><br><a href=\"https:\/\/www.psyctc.org\/psyctc\/glossary2\/cohens-kappa\/\" data-type=\"docs\" data-id=\"2754\">Cohen&#8217;s kappa<\/a><br>Weighted kappa<br>Interview measures<br>Rating scales<br>Classification<br><a href=\"https:\/\/www.psyctc.org\/psyctc\/glossary2\/categorical-nominal-data-scaling\/\" data-type=\"docs\" data-id=\"2140\">Nominal\/category scaling<\/a><br><a href=\"https:\/\/www.psyctc.org\/psyctc\/glossary2\/ordinal-scaling\/\" data-type=\"docs\" data-id=\"2319\">Ordinal scaling<br><\/a><a href=\"https:\/\/www.psyctc.org\/psyctc\/glossary2\/ratio-scaling\/\" data-type=\"docs\" data-id=\"2144\">Stevens&#8217; levels of measurement<\/a><br>AAI<\/p>\n\n\n\n<h4 class=\"wp-block-heading\">Chapters<\/h4>\n\n\n\n<p>Chapter 3.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\">History<\/h4>\n\n\n\n<p>Created 26\/1\/22.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>One way to assess how good, how usable, a rating system is is to get more than one person to use the system to do ratings of the same things where what is rated has a broad coverage of the possible ratings in the system. Ratings may be undimensional and continuous, e.g. &#8220;how much warmth &hellip; <a href=\"https:\/\/www.psyctc.org\/psyctc\/glossary2\/inter-rater-agreement-reliability\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Inter-rater agreement\/reliability<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","template":"","meta":{"footnotes":""},"doc_category":[18],"glossaries":[],"doc_tag":[],"knowledge_base":[],"class_list":["post-2761","docs","type-docs","status-publish","hentry","doc_category-om-book"],"year_month":"2026-04","word_count":183,"total_views":"1247","reactions":{"happy":"0","normal":"0","sad":"0"},"author_info":{"name":"chris","author_nicename":"chris","author_url":"https:\/\/www.psyctc.org\/psyctc\/author\/chris\/"},"doc_category_info":[{"term_name":"All OM book glossary entries","term_url":"https:\/\/www.psyctc.org\/psyctc\/glossary\/non-knowledgebase\/om-book\/"}],"doc_tag_info":[],"knowledge_base_info":[],"knowledge_base_slug":[],"_links":{"self":[{"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/docs\/2761","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/docs"}],"about":[{"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/types\/docs"}],"author":[{"embeddable":true,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/comments?post=2761"}],"version-history":[{"count":1,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/docs\/2761\/revisions"}],"predecessor-version":[{"id":2762,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/docs\/2761\/revisions\/2762"}],"wp:attachment":[{"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/media?parent=2761"}],"wp:term":[{"taxonomy":"doc_category","embeddable":true,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/doc_category?post=2761"},{"taxonomy":"glossaries","embeddable":true,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/glossaries?post=2761"},{"taxonomy":"doc_tag","embeddable":true,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/doc_tag?post=2761"},{"taxonomy":"knowledge_base","embeddable":true,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/knowledge_base?post=2761"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}