{"id":1379,"date":"2024-03-11T13:09:15","date_gmt":"2024-03-11T17:09:15","guid":{"rendered":"https:\/\/www.macloo.com\/ai\/?p=1379"},"modified":"2024-03-11T13:09:15","modified_gmt":"2024-03-11T17:09:15","slug":"the-bitter-lesson-of-ai-is-not-too-bitter","status":"publish","type":"post","link":"https:\/\/www.macloo.com\/ai\/2024\/03\/11\/the-bitter-lesson-of-ai-is-not-too-bitter\/","title":{"rendered":"\u2018The Bitter Lesson\u2019 of AI is not too bitter"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">I found an old bookmark today and it led me to <a href=\"http:\/\/www.incompleteideas.net\/IncIdeas\/BitterLesson.html\" data-type=\"link\" data-id=\"http:\/\/www.incompleteideas.net\/IncIdeas\/BitterLesson.html\" target=\"_blank\" rel=\"noreferrer noopener\">The Bitter Lesson<\/a>, a 2019 essay by Rich Sutton, a computer science professor and research scientist based in Alberta, Canada. Apparently OpenAI engineers were instructed to <em>memorize<\/em> the article. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>tl;dr: <\/strong>&#8220;Leveraging human knowledge&#8221; has not been proven effective in significantly advancing artificial intelligence systems. Instead, leveraging computation (<strong>computational power<\/strong>, speed, operations per second, parallelization) is the only thing that works. &#8220;These two need not run counter to each other, but in practice they tend to. Time spent on one is time not spent on the other.&#8221; Thus all efforts to encode <strong>how we think<\/strong> and <strong>what we know<\/strong> are just <em>wasting precious time, damn it!<\/em> This is the bitter lesson.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Sutton has worked extensively on reinforcement learning, so it&#8217;s not surprising that he mentions examples of AI systems <strong>playing games<\/strong>. Systems that &#8220;learn&#8221; by <strong>self-play<\/strong> \u2014 that is, one copy of the program playing another copy of the same program \u2014 leverage computational power, not human knowledge. DeepMind&#8217;s AlphaZero demonstrated that self-play can enable a program\/system to learn to play not just one game but multiple games (although one copy only &#8220;knows&#8221; one game type).<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\">&#8220;Search and learning are the two most important classes of techniques for utilizing massive amounts of computation in AI research.&#8221;<\/p>\n<cite>\u2014Richard S. Sutton<\/cite><\/blockquote>\n\n\n\n<p class=\"wp-block-paragraph\">Games, of course, are not the only domain in which we can see the advantages of computational power. Sutton notes that breakthroughs in <strong>speech recognition<\/strong> and <strong>image recognition<\/strong> came from application of statistical methods to huge training datasets.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Trying to make systems &#8220;that worked the way the researchers thought their own minds worked&#8221; was a waste of time, Sutton wrote \u2014 although I think today&#8217;s systems are still using layers of units (sometimes called &#8220;artificial neurons&#8221;) that connect to multiple units in other layers, and that architecture was inspired by what we <em>do<\/em> know about how brains work. Modifications (governed by algorithms, not adjusted by humans) to the connections between units constitute the &#8220;learning&#8221; that has proved to be so successful.<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\">&#8220;[B]reakthrough progress eventually arrives by an opposing approach based on scaling computation by search and learning.&#8221;<\/p>\n<cite>\u2014Richard S. Sutton<\/cite><\/blockquote>\n\n\n\n<p class=\"wp-block-paragraph\">I don&#8217;t think this lesson is actually bitter, because what Sutton is saying is that human brains (and human minds, and human thinking, and human creativity) are <em>really, really complex,<\/em> and so we <em>can&#8217;t<\/em> figure out how to make the same things happen in, or with, a machine. We can produce better and more useful <em>outputs<\/em> thanks to improved computational methods, but we can&#8217;t make the machines better by sharing with them what we know \u2014 or trying to teach them how we may think.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><a rel=\"license\" href=\"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/\"><img decoding=\"async\" alt=\"Creative Commons License\" style=\"border-width:0\" src=\"https:\/\/i.creativecommons.org\/l\/by-nc-nd\/4.0\/88x31.png\"><\/a><br><small><span xmlns:dct=\"http:\/\/purl.org\/dc\/terms\/\" property=\"dct:title\"><strong>AI in Media and Society<\/strong><\/span> by <span xmlns:cc=\"http:\/\/creativecommons.org\/ns#\" property=\"cc:attributionName\">Mindy McAdams<\/span> is licensed under a <a rel=\"license\" href=\"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/\">Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License<\/a>.<br>Include the author&#8217;s name (Mindy McAdams) and a link to the original post in any reuse of this content.<\/small><\/p>\n","protected":false},"excerpt":{"rendered":"<p>I found an old bookmark today and it led me to The Bitter Lesson, a 2019 essay by Rich Sutton, a computer science professor and research scientist based in Alberta, Canada. Apparently OpenAI engineers were instructed to memorize the article. tl;dr: &#8220;Leveraging human knowledge&#8221; has not been proven effective in significantly advancing artificial intelligence systems.&hellip; <a class=\"more-link\" href=\"https:\/\/www.macloo.com\/ai\/2024\/03\/11\/the-bitter-lesson-of-ai-is-not-too-bitter\/\">Continue reading <span class=\"screen-reader-text\">\u2018The Bitter Lesson\u2019 of AI is not too bitter<\/span> <span class=\"meta-nav\" aria-hidden=\"true\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[7,5],"tags":[96,248,247,249],"class_list":["post-1379","post","type-post","status-publish","format-standard","hentry","category-games","category-machine-learning","tag-reinforcement_learning","tag-self-play","tag-sutton","tag-thinking"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/posts\/1379","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/comments?post=1379"}],"version-history":[{"count":8,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/posts\/1379\/revisions"}],"predecessor-version":[{"id":1387,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/posts\/1379\/revisions\/1387"}],"wp:attachment":[{"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/media?parent=1379"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/categories?post=1379"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.macloo.com\/ai\/wp-json\/wp\/v2\/tags?post=1379"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}