{"id":350,"date":"2020-09-14T09:00:00","date_gmt":"2020-09-14T13:00:00","guid":{"rendered":"https:\/\/www.macloo.com\/ai\/?p=350"},"modified":"2020-09-14T10:12:01","modified_gmt":"2020-09-14T14:12:01","slug":"using-machine-learning-to-uncover-racist-laws","status":"publish","type":"post","link":"https:\/\/www.macloo.com\/ai\/2020\/09\/14\/using-machine-learning-to-uncover-racist-laws\/","title":{"rendered":"Using machine learning to uncover racist laws"},"content":{"rendered":"\n<p>A common use of machine learning is to train a model to identify a particular kind of document, or a particular characteristic in a document \u2014 and then <strong>sort<\/strong> a gigantic set of documents. This produces a much-reduced subset of all documents that match the desired criteria. There might be some false positives in the subset, but it still gives researchers or journalists <em>a big jump forward<\/em> by eliminating thousands of unwanted documents.<\/p>\n\n\n\n<p>This kind of sorting goes well beyond a simple search for keywords.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"456\" src=\"https:\/\/www.macloo.com\/ai\/wp-content\/uploads\/2020\/09\/onthebooks_photo.jpg\" alt=\"\" class=\"wp-image-359\" srcset=\"https:\/\/www.macloo.com\/ai\/wp-content\/uploads\/2020\/09\/onthebooks_photo.jpg 1024w, https:\/\/www.macloo.com\/ai\/wp-content\/uploads\/2020\/09\/onthebooks_photo-300x134.jpg 300w, https:\/\/www.macloo.com\/ai\/wp-content\/uploads\/2020\/09\/onthebooks_photo-768x342.jpg 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption><em>Above: Screenshot from On the Books at lib.unc.edu<\/em><\/figcaption><\/figure><\/div>\n\n\n\n<p>A great example has emerged from the University of North Carolina at Chapel Hill. <a rel=\"noreferrer noopener\" href=\"https:\/\/onthebooks.lib.unc.edu\/\" target=\"_blank\">On the Books: Jim Crow and Algorithms of Resistance<\/a> is a project that includes <a rel=\"noreferrer noopener\" href=\"https:\/\/cdr.lib.unc.edu\/collections\/f4752p47h?locale=en\" target=\"_blank\">a public plain-text collection<\/a> of North Carolina laws (1866\u20131967) likely to be Jim Crow laws.<\/p>\n\n\n\n<p>There is <a rel=\"noreferrer noopener\" href=\"https:\/\/github.com\/UNC-Libraries-data\/OnTheBooks\" target=\"_blank\">a public GitHub repo<\/a> of the <strong>code<\/strong> used in this project. It includes a full walkthrough of <a rel=\"noreferrer noopener\" href=\"https:\/\/github.com\/UNC-Libraries-data\/OnTheBooks\/blob\/master\/workflow.md\" target=\"_blank\">the project&#8217;s workflow<\/a> \u2014 data acquisition and cleaning, OCR, unsupervised and supervised classification, etc.<\/p>\n\n\n\n<p>The base document set (the main corpus) consists of 96 volumes, with 53,515 chapters, having 297,790 sections (<a rel=\"noreferrer noopener\" href=\"https:\/\/onthebooks.lib.unc.edu\/otb-research\/project-outcomes\/\" target=\"_blank\">source<\/a>).<\/p>\n\n\n\n<p>The project&#8217;s title gives homage to Safiya Noble&#8217;s 2018 book <a rel=\"noreferrer noopener\" href=\"http:\/\/algorithmsofoppression.com\/\" target=\"_blank\">Algorithms of Oppression: How Search Engines Reinforce Racism<\/a>.<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\"><p>&#8220;State-based racial segregation laws were incredibly inconvenient, irregular, and, most importantly, unconstitutional.&#8221;<\/p><cite>\u2014William Sturkey, Ph.D.<\/cite><\/blockquote>\n\n\n\n<p>A historical perspective on this data collection was provided by William Sturkey, a history professor at UNC, in <a rel=\"noreferrer noopener\" href=\"https:\/\/www.aaihs.org\/on-the-books-machine-learning-jim-crow\/\" target=\"_blank\">&#8220;On the Books&#8221;: Machine Learning Jim Crow<\/a> (September 2020). He says <strong>On the Books<\/strong> is &#8220;the first and most complete collection of all Jim Crow laws from a single American state.&#8221; He points to the difficulty of cataloging and studying all Jim Crow laws from any state &#8220;because there were just so many.&#8221;<\/p>\n\n\n\n<p><a rel=\"license\" href=\"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/\"><img decoding=\"async\" alt=\"Creative Commons License\" style=\"border-width:0\" src=\"https:\/\/i.creativecommons.org\/l\/by-nc-nd\/4.0\/88x31.png\"><\/a><br>\n<small><span xmlns:dct=\"http:\/\/purl.org\/dc\/terms\/\" property=\"dct:title\"><strong>AI in Media and Society<\/strong><\/span> by <span xmlns:cc=\"http:\/\/creativecommons.org\/ns#\" property=\"cc:attributionName\">Mindy McAdams<\/span> is licensed under a <a rel=\"license\" href=\"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/\">Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License<\/a>.<br>\nInclude the author&#8217;s name (Mindy McAdams) and a link to the original post in any reuse of this content.<\/small><\/p>\n\n\n\n<p>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>A common use of machine learning is to train a model to identify a particular kind of document, or a particular characteristic in a document \u2014 and then sort a gigantic set of documents. This produces a much-reduced subset of all documents that match the desired criteria. There might be some false positives in the&hellip; <a class=\"more-link\" href=\"https:\/\/www.macloo.com\/ai\/2020\/09\/14\/using-machine-learning-to-uncover-racist-laws\/\">Continue reading <span class=\"screen-reader-text\">Using machine learning to uncover racist laws<\/span> <span class=\"meta-nav\" aria-hidden=\"true\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[2],"tags":[69,59,67,22,68],"class_list":["post-350","post","type-post","status-publish","format-standard","hentry","category-nlp","tag-history","tag-law","tag-ocr","tag-race","tag-reading"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/posts\/350","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/comments?post=350"}],"version-history":[{"count":10,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/posts\/350\/revisions"}],"predecessor-version":[{"id":381,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/posts\/350\/revisions\/381"}],"wp:attachment":[{"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/media?parent=350"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/categories?post=350"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/tags?post=350"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}