{"id":2699,"date":"2021-11-16T12:22:57","date_gmt":"2021-11-16T12:22:57","guid":{"rendered":"https:\/\/www.psyctc.org\/psyctc\/?post_type=docs&#038;p=2699"},"modified":"2021-11-16T14:17:15","modified_gmt":"2021-11-16T14:17:15","password":"","slug":"overprinting","status":"publish","type":"docs","link":"https:\/\/www.psyctc.org\/psyctc\/glossary2\/overprinting\/","title":{"rendered":"Overprinting"},"content":{"rendered":"\n<p>Overprinting is the name for a challenge when plotting large datasets in a <a href=\"https:\/\/www.psyctc.org\/psyctc\/glossary2\/scatterplot-scattergram\/\" data-type=\"docs\" data-id=\"2695\">scatterplot<\/a>.  <\/p>\n\n\n\n<h4 class=\"wp-block-heading\">Details<\/h4>\n\n\n\n<p>The problem is that when you have a large dataset it becomes likely that two or more people will have the same values for both variables (or values so close that the points can&#8217;t be distinguished in the plot).  That starts to distort the impression conveyed by  the plot as pairs of scores that occurred frequently will have the same visual impact as those pairs that only occurred once.  Here&#8217;s an example.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter1point-1024x1024.png\" alt=\"\" class=\"wp-image-2700\" srcset=\"https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter1point-1024x1024.png 1024w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter1point-300x300.png 300w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter1point-150x150.png 150w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter1point-768x768.png 768w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter1point-1536x1536.png 1536w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter1point.png 1700w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>That&#8217;s a plot of 883 pairs of scores (from a non-help-seeking sample of young adults in Quito, Ecuador).  However, were to be able to count the points you see there you would only find 346 as the remaining points pairs happened more than once.  In fact one pair of scores (.643 on the non-risk scale and zero on the risk scale) occurred 19 times.  This plot is giving a misleading picture.  <\/p>\n\n\n\n<p>Here&#8217;s an even more extreme example from the same dataset: n = 878 with  complete data on two items from the CORE-OM: items 6 and 27.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/items27and6_plot1-1024x1024.png\" alt=\"\" class=\"wp-image-2708\" srcset=\"https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/items27and6_plot1-1024x1024.png 1024w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/items27and6_plot1-300x300.png 300w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/items27and6_plot1-150x150.png 150w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/items27and6_plot1-768x768.png 768w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/items27and6_plot1-1536x1536.png 1536w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/items27and6_plot1.png 1700w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>This shows us that at least one person of the 878 has had 24 of the 25 possible score pairs (0 and 0 through to 6 and 6) with only the pairing of 3 on item 27 and 4 on item 6 missing. Apart from that it tells us nothing!<\/p>\n\n\n\n<h5 class=\"wp-block-heading\">Solutions for overprinting<\/h5>\n\n\n\n<p>One way around this is transparency: to us a less than opaque fill for the points so that the overprinted points are darker.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter1pointAlph-1024x1024.png\" alt=\"\" class=\"wp-image-2701\" srcset=\"https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter1pointAlph-1024x1024.png 1024w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter1pointAlph-300x300.png 300w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter1pointAlph-150x150.png 150w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter1pointAlph-768x768.png 768w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter1pointAlph-1536x1536.png 1536w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter1pointAlph.png 1700w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>Another option is to use the area of the point to show how many times it occurred.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter2count-1024x1024.png\" alt=\"\" class=\"wp-image-2702\" srcset=\"https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter2count-1024x1024.png 1024w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter2count-300x300.png 300w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter2count-150x150.png 150w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter2count-768x768.png 768w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter2count-1536x1536.png 1536w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter2count.png 1700w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>That certainly conveys a sense of the frequency and location of the repeated pairs of scores but there is still an overprinting challenge.  Perhaps combinging transparency and point size works.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter3count-1024x1024.png\" alt=\"\" class=\"wp-image-2703\" srcset=\"https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter3count-1024x1024.png 1024w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter3count-300x300.png 300w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter3count-150x150.png 150w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter3count-768x768.png 768w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter3count-1536x1536.png 1536w, https:\/\/www.psyctc.org\/psyctc\/wp-content\/uploads\/2021\/11\/scatter3count.png 1700w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>Another option, the last that I know, is jittering.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\">Try also<\/h4>\n\n\n\n<p><a href=\"http:\/\/jitt\">Jittering<\/a><br><a href=\"https:\/\/www.psyctc.org\/psyctc\/glossary2\/scatterplot-scattergram\/\" data-type=\"docs\" data-id=\"2695\">Scatterplots<\/a><br>Regression<\/p>\n\n\n\n<h4 class=\"wp-block-heading\">Chapters<\/h4>\n\n\n\n<p>Chapter 5.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\">Local links<\/h4>\n\n\n\n<p>For those using R my &#8220;Rblog&#8221; entry is probably useful (and if you don&#8217;t know R, dipping into it my Rblog <em>might<\/em> tempt you toward it!)<\/p>\n\n\n\n<p><a href=\"https:\/\/www.psyctc.org\/Rblog\/posts\/2021-01-27-handling-overprinting\/\">Handling overprinting<\/a><\/p>\n\n\n\n<h4 class=\"wp-block-heading\">Dates<\/h4>\n\n\n\n<p>Created 16\/11\/21.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Overprinting is the name for a challenge when plotting large datasets in a scatterplot. Details The problem is that when you have a large dataset it becomes likely that two or more people will have the same values for both variables (or values so close that the points can&#8217;t be distinguished in the plot). That &hellip; <a href=\"https:\/\/www.psyctc.org\/psyctc\/glossary2\/overprinting\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Overprinting<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","template":"","meta":{"footnotes":""},"doc_category":[18],"glossaries":[],"doc_tag":[],"knowledge_base":[],"class_list":["post-2699","docs","type-docs","status-publish","hentry","doc_category-om-book"],"year_month":"2026-04","word_count":334,"total_views":"1309","reactions":{"happy":"0","normal":"0","sad":"0"},"author_info":{"name":"chris","author_nicename":"chris","author_url":"https:\/\/www.psyctc.org\/psyctc\/author\/chris\/"},"doc_category_info":[{"term_name":"All OM book glossary entries","term_url":"https:\/\/www.psyctc.org\/psyctc\/glossary\/non-knowledgebase\/om-book\/"}],"doc_tag_info":[],"knowledge_base_info":[],"knowledge_base_slug":[],"_links":{"self":[{"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/docs\/2699","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/docs"}],"about":[{"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/types\/docs"}],"author":[{"embeddable":true,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/comments?post=2699"}],"version-history":[{"count":5,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/docs\/2699\/revisions"}],"predecessor-version":[{"id":2718,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/docs\/2699\/revisions\/2718"}],"wp:attachment":[{"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/media?parent=2699"}],"wp:term":[{"taxonomy":"doc_category","embeddable":true,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/doc_category?post=2699"},{"taxonomy":"glossaries","embeddable":true,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/glossaries?post=2699"},{"taxonomy":"doc_tag","embeddable":true,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/doc_tag?post=2699"},{"taxonomy":"knowledge_base","embeddable":true,"href":"https:\/\/www.psyctc.org\/psyctc\/wp-json\/wp\/v2\/knowledge_base?post=2699"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}