{"id":382,"date":"2020-09-16T13:24:41","date_gmt":"2020-09-16T17:24:41","guid":{"rendered":"https:\/\/www.macloo.com\/ai\/?p=382"},"modified":"2020-09-18T12:01:03","modified_gmt":"2020-09-18T16:01:03","slug":"whats-the-use-of-machine-learning","status":"publish","type":"post","link":"https:\/\/www.macloo.com\/ai\/2020\/09\/16\/whats-the-use-of-machine-learning\/","title":{"rendered":"What\u2019s the use of machine learning?"},"content":{"rendered":"\n<p>I&#8217;m interested in applications of machine learning in <strong>journalism<\/strong>. This is natural, as my field is journalism. In the field of computer science, however, accolades and honors tend to favor research on new algorithms or procedures, or new network architectures. <strong>Applications<\/strong> are practical uses of algorithms, networks, etc., to solve real-world problems \u2014 and developing them often doesn&#8217;t garner the acclaim that researchers need to advance their careers.<\/p>\n\n\n\n<p><a rel=\"noreferrer noopener\" href=\"https:\/\/hannah-rae.github.io\/\" target=\"_blank\">Hannah Kerner<\/a>, a professor and machine learning researcher at the University of Maryland, wrote about this in the <em>MIT Technology Review<\/em>. Her essay is aptly titled &#8220;<a rel=\"noreferrer noopener\" href=\"https:\/\/www.technologyreview.com\/2020\/08\/18\/1007196\/ai-research-machine-learning-applications-problems-opinion\/\" target=\"_blank\">Too many AI researchers think real-world problems are not relevant<\/a>.&#8221; <\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\"><p>&#8220;The first image of a black hole was produced using machine learning. The most accurate predictions of protein structures, an important step for drug discovery, are made using machine learning.&#8221;<\/p><cite>\u2014Hannah Kerner<\/cite><\/blockquote>\n\n\n\n<p>Noting that <em>applications<\/em> of machine learning are making real contributions to science in fields <em>outside<\/em> computer science, Kerner (who works on machine learning solutions for NASA&#8217;s food security and agriculture program) asks how much is lost because of the priorities set by the journals and conferences in the machine learning field.<\/p>\n\n\n\n<p>She also ties this focus on <em>ML research for the sake of advancing ML<\/em> to the seepage of bias out from widely used datasets into the mainstream \u2014 the most famous cases being in face recognition, with systems (machine learning models) built on flawed datasets that <a href=\"https:\/\/www.macloo.com\/ai\/2020\/08\/13\/racial-and-gender-bias-in-ai\/\">disproportionately skew toward white and male faces<\/a>. <\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\"><p>&#8220;When studies on real-world applications of machine learning are excluded from the mainstream, it\u2019s difficult for researchers to see the impact of their biased models, making it far less likely that they will work to solve these problems.&#8221;<\/p><cite>\u2014Hannah Kerner<\/cite><\/blockquote>\n\n\n\n<p>Machine learning is rarely plug-and-play. In creating an application that will be used to perform useful work \u2014 to make new discoveries, perhaps, or to make medical diagnoses more accurate \u2014 the machine learning researchers will do substantial new work, even when they use existing models. Just think, for a moment, about <a rel=\"noreferrer noopener\" href=\"https:\/\/techcrunch.com\/2019\/04\/10\/the-creation-of-the-algorithm-that-made-the-first-black-hole-image-possible-was-led-by-mit-grad-student-katie-bouman\/\" target=\"_blank\">the data needed to produce an image of a black hole<\/a>. Then think about <strong>the data<\/strong> needed to make predictions of protein structures. You&#8217;re <em>not<\/em> going to handle those in exactly the same way.<\/p>\n\n\n\n<p>I imagine the work is quite demanding when a number of non\u2013ML experts (say, the biologists who work on protein structures) get together with a bunch of ML experts. But either group working separately from the other is unlikely to come up with a robust new ML application. Kerner linked to <a rel=\"noreferrer noopener\" href=\"https:\/\/gizmodo.com\/ibm-watson-reportedly-recommended-cancer-treatments-tha-1827868882\" target=\"_blank\">this 2018 news report about a flawed cancer-detection system<\/a> \u2014 leaked documents said that &#8220;instead of feeding real patient data into the software,&#8221; the system was trained on data about <em>hypothetical<\/em> patients. (OMG, I thought \u2014 <em>you can&#8217;t train a system on fake data and then use it on real people!<\/em>)<\/p>\n\n\n\n<p>Judging from what Kerner has written, machine learning researchers might be caught in a loop, where they work on pristine and long-used datasets (instead of dirty, chaotic real-world data) to perfect speed and efficiency of algorithms that perhaps become less adaptable in the process.<\/p>\n\n\n\n<p>It&#8217;s not that applications aren&#8217;t getting made \u2014 they are. The difficulty lies in the priorities for research, which might dissuade early-career ML researchers in particular from work on solving interesting and even vital real-world problems \u2014 and wrestling with the problems posed by messy real-world data.<\/p>\n\n\n\n<p>I was reminded of something I&#8217;ve often heard from <strong>data journalists<\/strong>: If you&#8217;re taught by a statistics professor, you&#8217;ll be given pre-cleaned datasets to work with. (The reason being: She just wants you to learn statistics.) If you&#8217;re taught by a journalist, you&#8217;ll be given real dirty data, and the first step will be learning how to clean it properly \u2014 because that&#8217;s what you have to do with real data and a real problem.<\/p>\n\n\n\n<p>So the next time you read about some breakthrough in machine learning, consider whether it is part of a practical application, or instead, more of a laboratory experiment performed in isolation, using a tried-and-true dataset instead of wild data. <\/p>\n\n\n\n<p><a rel=\"license\" href=\"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/\"><img decoding=\"async\" alt=\"Creative Commons License\" style=\"border-width:0\" src=\"https:\/\/i.creativecommons.org\/l\/by-nc-nd\/4.0\/88x31.png\"><\/a><br>\n<small><span xmlns:dct=\"http:\/\/purl.org\/dc\/terms\/\" property=\"dct:title\"><strong>AI in Media and Society<\/strong><\/span> by <span xmlns:cc=\"http:\/\/creativecommons.org\/ns#\" property=\"cc:attributionName\">Mindy McAdams<\/span> is licensed under a <a rel=\"license\" href=\"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/\">Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License<\/a>.<br>\nInclude the author&#8217;s name (Mindy McAdams) and a link to the original post in any reuse of this content.<\/small><\/p>\n\n\n\n<p>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I&#8217;m interested in applications of machine learning in journalism. This is natural, as my field is journalism. In the field of computer science, however, accolades and honors tend to favor research on new algorithms or procedures, or new network architectures. Applications are practical uses of algorithms, networks, etc., to solve real-world problems \u2014 and developing&hellip; <a class=\"more-link\" href=\"https:\/\/www.macloo.com\/ai\/2020\/09\/16\/whats-the-use-of-machine-learning\/\">Continue reading <span class=\"screen-reader-text\">What\u2019s the use of machine learning?<\/span> <span class=\"meta-nav\" aria-hidden=\"true\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[81,6,5],"tags":[49,48,74],"class_list":["post-382","post","type-post","status-publish","format-standard","hentry","category-applications","category-ethics-and-bias","category-machine-learning","tag-applications","tag-research","tag-science"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/posts\/382","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/comments?post=382"}],"version-history":[{"count":10,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/posts\/382\/revisions"}],"predecessor-version":[{"id":397,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/posts\/382\/revisions\/397"}],"wp:attachment":[{"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/media?parent=382"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/categories?post=382"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/tags?post=382"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}