<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>depth first search &#187; ml</title>
	<atom:link href="http://www.depthfirstsearch.net/blog/tag/ml/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.depthfirstsearch.net/blog</link>
	<description>“We can only see a short distance ahead, but we can see plenty there that needs to be done.”</description>
	<lastBuildDate>Sun, 05 Feb 2012 13:00:00 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Where do labels come from?</title>
		<link>http://www.depthfirstsearch.net/blog/2008/08/28/where-do-labels-come-from/</link>
		<comments>http://www.depthfirstsearch.net/blog/2008/08/28/where-do-labels-come-from/#comments</comments>
		<pubDate>Thu, 28 Aug 2008 15:27:54 +0000</pubDate>
		<dc:creator>JS</dc:creator>
				<category><![CDATA[computer science]]></category>
		<category><![CDATA[ml]]></category>

		<guid isPermaLink="false">http://www.depthfirstsearch.net/blog/?p=532</guid>
		<description><![CDATA[Andrew Ng&#8217;s keynote at ICDL 2008 was quite enjoyable. If nothing else, he has killer demos. One of the themes of the keynote was a particular view that machine learning is moving towards solving learning problems with a paucity of available labels. He quoted Geoff Hinton&#8217;s work on deep belief nets as an example of [...]]]></description>
			<content:encoded><![CDATA[<p>Andrew Ng&#8217;s keynote at ICDL 2008 was quite enjoyable. If nothing else, he has <a href="http://make3d.stanford.edu/">killer</a> <a href="http://heli.stanford.edu/">demos</a>. One of the themes of the keynote was a particular view that machine learning is moving towards solving learning problems with a paucity of available labels. He quoted Geoff Hinton&#8217;s work on <a href="http://www.youtube.com/watch?v=AyzOUbkUf3M">deep belief nets</a> as an example of a learning algorithm that learns through experience, as opposed to learning exclusively through labels. Hinton&#8217;s argument is that there are more connections in the brain than could possibly be trained through labels alone.</p>
<p>As interesting as this trend is, it does raise the question: Where do labels come from? Now this isn&#8217;t an issue for many practical applications. The labels come from graduate students. But if we change the problem and look at it from the perspective of an AI agent, labels take on a somewhat mysterious quality.</p>
<p>Imagine you&#8217;re a robot waking up in the world. You&#8217;ve got a constant stream of sensory values at your disposal, a motor apparatus that you could use to interact with the world, and perhaps various primitive behaviors to get started learning who you are and what you can do.</p>
<p>This may seem like a contrived scenario, but I would argue this is precisely the scenario that all intelligent agents that we know of solve during the first years of life. It&#8217;s a scenario with some interesting properties. For one, even if labels are provided to the agent, how does the agent come to understand that the labels are, well, labels? These sorts of pernicious grounding problems abound.</p>
<p>So, if you&#8217;re like Geoff Hinton and you want to present your model as an account of the way the brain really works, I think you&#8217;d better go to considerable effort to justify the use of labels anywhere in the learning process.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.depthfirstsearch.net/blog/2008/08/28/where-do-labels-come-from/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Applications of Machine Learning</title>
		<link>http://www.depthfirstsearch.net/blog/2007/12/08/applications-of-machine-learning/</link>
		<comments>http://www.depthfirstsearch.net/blog/2007/12/08/applications-of-machine-learning/#comments</comments>
		<pubDate>Sun, 09 Dec 2007 01:31:26 +0000</pubDate>
		<dc:creator>JS</dc:creator>
				<category><![CDATA[computer science]]></category>
		<category><![CDATA[ml]]></category>
		<category><![CDATA[optimization]]></category>
		<category><![CDATA[rl]]></category>

		<guid isPermaLink="false">http://www.depthfirstsearch.net/2007/12/08/applications-of-machine-learning/</guid>
		<description><![CDATA[It turns out that optimizing warehouse tasks is hard.]]></description>
			<content:encoded><![CDATA[<p>It turns out that <a href="http://pasquinade.blogspot.com/2007/10/optimization.html">optimizing warehouse tasks</a> is <a href="http://www.cs.ualberta.ca/~mgh/PUBLICATIONS/icml01.pdf">hard</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.depthfirstsearch.net/blog/2007/12/08/applications-of-machine-learning/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Debugging and Machine Learning</title>
		<link>http://www.depthfirstsearch.net/blog/2007/12/08/debugging-and-machine-learning/</link>
		<comments>http://www.depthfirstsearch.net/blog/2007/12/08/debugging-and-machine-learning/#comments</comments>
		<pubDate>Sat, 08 Dec 2007 17:00:25 +0000</pubDate>
		<dc:creator>JS</dc:creator>
				<category><![CDATA[computer science]]></category>
		<category><![CDATA[ml]]></category>
		<category><![CDATA[rl]]></category>
		<category><![CDATA[robots]]></category>

		<guid isPermaLink="false">http://www.depthfirstsearch.net/2007/12/08/debugging-and-machine-learning/</guid>
		<description><![CDATA[As I neared completion of my final project for a course on reinforcement learning, I came across the following on Sutton&#8217;s page on tile coding: With the code described so far, there is a small probability that unrelated inputs will hash into some of the same tiles. In a group of tilings, usually there will [...]]]></description>
			<content:encoded><![CDATA[<p>As I neared completion of my final project for a course on reinforcement learning, I came across the following on Sutton&#8217;s page on <a href="http://www.cs.ualberta.ca/~sutton/tiles2.html">tile coding</a>:</p>
<blockquote><p>With the code described so far, there is a small probability that unrelated inputs will hash into some of the same tiles. In a group of tilings, usually there will be no more than one such &#8220;collision&#8221;, so that it is not a big problem; the learning process will sort it out. There will not be a big effect on performance unless the memory is too small or the hash functions are poorly designed. Nevertheless, the possibility of such a problem is annoying. When one&#8217;s program doesn&#8217;t work, there is a tendency, deserved or not, to suspect a failure of the hashing function.</p></blockquote>
<p>I did, in fact, discover that my memory size was too small, resulting in a number of collisions. That was not the only problem with my agent, but one of many.</p>
<p>On a related and recent note, the UTCS/ART autonomous vehicle team did not make the finals in the Urban Challenge. One of the technical problems the team faced was a bad Ethernet cable that delayed critical sensor readings by as much as five seconds. The common thread is that debugging (in the classic sense of a programmer&#8217;s art) does not apply easily to systems that exhibit degrees of homeostasis or non-determinism (e.g. the Ethernet protocol, TD-learning).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.depthfirstsearch.net/blog/2007/12/08/debugging-and-machine-learning/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

