<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
 
 <title>Jeremy Barnes' Blog</title>
 <link href="http://www.barneso.com/atom.xml" rel="self"/>
 <link href="http://www.barneso.com/"/>
 <updated>2010-03-09T07:12:22-08:00</updated>
 <id>http://www.barneso.com/</id>
 <author>
   <name>Jeremy Barnes</name>
   <email>barnesomail-atom@yahoo.com.au</email>
 </author>
 
 
 <entry>
   <title>Recoset-Starting a Company</title>
   <link href="http://www.barneso.com/2010/03/09/Recoset.html"/>
   <updated>2010-03-09T00:00:00-08:00</updated>
   <id>http://www.barneso.com/2010/03/09/Recoset</id>
   <content type="html">&lt;p&gt;It&amp;#8217;s been official for a couple of weeks now: I&amp;#8217;m starting a company.  Well, we&amp;#8217;re starting a company, &lt;a href=&quot;http://twitter.com/Chebuctonian&quot;&gt;Daniel&lt;/a&gt; and I.  We&amp;#8217;ve even got share certificates on official-looking paper.&lt;/p&gt;
&lt;p&gt;The company is called &lt;a href=&quot;http://www.recoset.com&quot;&gt;Recoset&lt;/a&gt; (web site isn&amp;#8217;t up yet).  The name is designed to invoke &amp;#8220;recommendation&amp;#8221;, as in recommendation engines, most famously used at &lt;a href=&quot;http://www.netflixprize.com/&quot;&gt;Netflix&lt;/a&gt;.  Apart from that there&amp;#8217;s no meaning in it: it&amp;#8217;s a name for which we could obtain the .com and twitter accounts; which means nothing offensive in any major language; and which is not too hard to remember or spell.&lt;/p&gt;
&lt;p&gt;So what are we doing?  From my point of view, the essence of the business is:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;To give small e-commerce businesses access to the tools that the big guys use&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;You see, there are plenty of places that have lots of historical sales data piled up, but can&amp;#8217;t afford to buy software that comes complete with teams of strangely accented guys in blue suits to install it.&lt;br /&gt;
Our expertise is in dealing with difficult data: when there&amp;#8217;s not much of it, when it&amp;#8217;s noisy, when the structure is hard to tease out.  So we&amp;#8217;re going to take our expertise and use it to let smaller e-commerce businesses make the most out of their data.&lt;/p&gt;
&lt;p&gt;Initially, we&amp;#8217;re focusing on recommendations, based upon a scaling-down of the traditional algorithms to small amounts of data (this is much harder to do than scaling up, by the way, especially because you can&amp;#8217;t just coast along in the wake of Moore&amp;#8217;s law).  By restricting ourselves to just e-commerce and not trying to make a general recommendation tool, we can dig really deeply into the problem domain and bring a lot of domain-specific knowledge to bear.  And a lot of what a recommendation engine needs to know to do its job is, when presented well, very useful to the people running the business, so we&amp;#8217;re going to provide access to that.&lt;/p&gt;
&lt;p&gt;Of course, we have bigger plans for the future.  But they might not be big in the &lt;a href=&quot;http://www.amazon.com/tag/netflix/products/ref=tag_dpp_lp_istp_in_f&quot;&gt;way that people might expect&lt;/a&gt;.&lt;/p&gt;</content>
 </entry>
 
 <entry>
   <title>3rd Place in AUSDM Competition</title>
   <link href="http://www.barneso.com/2009/12/05/3rd_Place_in_AUSDM_Competition.html"/>
   <updated>2009-12-05T00:00:00-08:00</updated>
   <id>http://www.barneso.com/2009/12/05/3rd_Place_in_AUSDM_Competition</id>
   <content type="html">&lt;p&gt;I ended up in &lt;a href=&quot;http://www.tiberius.biz/ausdm09/results.html&quot;&gt;3rd place&lt;/a&gt; in the &lt;a href=&quot;http://http://www.tiberius.biz/ausdm09/index.html&quot;&gt;&lt;span class=&quot;caps&quot;&gt;AUSDM&lt;/span&gt; Competition&lt;/a&gt;.  My report, which is probably the most detailed of those submitted, is available &lt;a href=&quot;/files/ausdm-report.pdf&quot;&gt;here&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;A bit of background: this was a competition to blend together predictors that had been created as part of the &lt;a href=&quot;http://www.netflixprize.com/&quot;&gt;Netflix prize&lt;/a&gt;.  There were two tasks (&lt;span class=&quot;caps&quot;&gt;RMSE&lt;/span&gt;, with the same goal as in Netflix, and &lt;span class=&quot;caps&quot;&gt;AUC&lt;/span&gt;, with the goal to predict a binary-valued attribute rather than regress).  There were three dataset sizes: small, medium and large.  The competition was decided on the average rank of the medium and large &lt;span class=&quot;caps&quot;&gt;AUC&lt;/span&gt; and &lt;span class=&quot;caps&quot;&gt;RMSE&lt;/span&gt;, where the large counted for twice the small.&lt;/p&gt;
&lt;p&gt;This was a better result than I had anticipated.  Partly this is because some of the stronger entries over the small dataset didn&amp;#8217;t end up submitting anything for the final competition (due either to then knowingly overfitting in a way that couldn&amp;#8217;t be generalized, or using techniques that didn&amp;#8217;t scale).  I also placed much better than I had expected in the &lt;span class=&quot;caps&quot;&gt;RMSE&lt;/span&gt; sets: second in the large &lt;span class=&quot;caps&quot;&gt;RMSE&lt;/span&gt; and eigth in the medium &lt;span class=&quot;caps&quot;&gt;RMSE&lt;/span&gt;.  On the other hand, my &lt;span class=&quot;caps&quot;&gt;AUC&lt;/span&gt; performance was about what than I expected: 3rd for medium and 4th for large.&lt;/p&gt;
&lt;p&gt;Due to the improvement of the rankings of my models from the small (where I was about 15th to 20th) to medium to large datasets, it appears that other teams either overfit the small dataset or used models that were efficient on constrained data but sub-optimal with more abundant data.&lt;/p&gt;
&lt;p&gt;Phil Brierly, who ran the competition, put together a &lt;a href=&quot;http://www.tiberius.biz/ausdm09/AusDM09EnsemblingChallenge.pdf&quot;&gt;report&lt;/a&gt; containing the reports of all of the teams (though, unfortunately, there was no analysis).  As I wasn&amp;#8217;t present at the conference, I didn&amp;#8217;t hear about what kinds of discussions were had; it would be interesting to hear from anyone who had a summary of what was said.&lt;/p&gt;
&lt;p&gt;Looking quickly through the reports from the other teams, we see that:&lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;Andrzej Janusz from the University of Warsaw  was easily the winner of the contest: he was first on the medium &lt;span class=&quot;caps&quot;&gt;AUC&lt;/span&gt; and large &lt;span class=&quot;caps&quot;&gt;RMSE&lt;/span&gt;, and second on the others.  He used a &lt;strong&gt;lot&lt;/strong&gt; of models (neural nets, regressions, &amp;#8230;) and combined these together using a genetic algorithm to learn a very sparse representation.&lt;/li&gt;
	&lt;li&gt;Vladimir Nikulin from the University of Queensland came in second.  He used random models which were combined with &lt;a href=&quot;http://clopinet.com/isabelle/Projects/KDDcup09/Papers/Nikulin-paper5.pdf&quot;&gt;another sparse technique&lt;/a&gt; which looked at the &lt;em&gt;stability&lt;/em&gt; of the influence of each model and excluded those whose influence was unstable.&lt;/li&gt;
	&lt;li&gt;Tom Au, Rong Duan, Guangqin Ma, and Rensheng Wang from AT&amp;amp;T Labs created a model that was significantly better than the others on the large &lt;span class=&quot;caps&quot;&gt;AUC&lt;/span&gt; task.  They used a lot of extra features to describe the statistical properties of each of the examples (for example, they fitted a beta distribution to the distribution of model outputs for each example), and used a simple boosted logistic regression to combine them.  Their techniques are interesting in that they didn&amp;#8217;t attempt to obtain a sparse representation.&lt;/li&gt;
	&lt;li&gt;C. Balakarmekan and R. Boobesh from team LatentView were the highest placed in the medium &lt;span class=&quot;caps&quot;&gt;RMSE&lt;/span&gt; task.  However, they just used two kinds of regression trees averaged together.  It seems to me like there was probably a substantial amount of luck involved in their model.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;For me, the conclusions are:&lt;/p&gt;
&lt;p&gt;1.  The &lt;span class=&quot;caps&quot;&gt;RMSE&lt;/span&gt; metric is not useful for measuring progress on this kind of a noisy task, as the noise present and its distribution mean that the results are necessarily a lottery;&lt;br /&gt;
2.  I am missing from my &amp;#8220;toolbox&amp;#8221; a means of performing feature selection.&lt;/p&gt;</content>
 </entry>
 
 <entry>
   <title>Welcome</title>
   <link href="http://www.barneso.com/2009/11/29/Welcome.html"/>
   <updated>2009-11-29T00:00:00-08:00</updated>
   <id>http://www.barneso.com/2009/11/29/Welcome</id>
   <content type="html">&lt;p&gt;Welcome to my blog!&lt;/p&gt;
&lt;p&gt;The goal of this blog is to present my thoughts on Machine Learning and everything associated with it, as well as write about my experiences as a consultant.&lt;/p&gt;
&lt;p&gt;The content is going to be technical, but I will try to keep it to a level where interested lay-persons can understand it.&lt;/p&gt;
&lt;p&gt;I&amp;#8217;m yet to see how often I will update it.&lt;/p&gt;</content>
 </entry>
 
 
</feed>
