<?xml version="1.0" encoding="UTF-8"?>
<!--Generated by Squarespace V5 Site Server v5.13.156 (http://www.squarespace.com) on Mon, 20 May 2013 06:51:16 GMT--><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0"><channel><title>Blog</title><link>http://www.dotmaths.com/blog/</link><description></description><lastBuildDate>Wed, 13 Feb 2013 13:11:21 +0000</lastBuildDate><copyright></copyright><language>en-GB</language><generator>Squarespace V5 Site Server v5.13.156 (http://www.squarespace.com)</generator><item><title>Zeno’s Facebook page</title><dc:creator>Matthew Handy</dc:creator><pubDate>Wed, 05 Sep 2012 11:44:29 +0000</pubDate><link>http://www.dotmaths.com/blog/2012/9/5/zenos-facebook-page.html</link><guid isPermaLink="false">545291:6273458:27584908</guid><description><![CDATA[<p><span class="full-image-block ssNonEditable"><span><img src="http://www.dotmaths.com/storage/Screen Shot 2012-09-05 at 13.29.09.png?__SQUARESPACE_CACHEVERSION=1346848213937" alt="" /></span></span></p>
<p>The Economist recently published <a title="Six degrees of mobilisation" href="http://www.economist.com/node/21560977" target="_blank">a story</a> about a competition that tested Stanley Milgram's famous "<a title="Wikipedia" href="Wikipedia" target="_blank">six degrees of separation</a>" claim.</p>
<p>DARPA, the research arm of the Department of Defense in the US, staged the Red Balloon Challenge in 2009. Competitors had to locate ten red weather balloons that had been tethered at random locations across the US.</p>
<p>The intention was not that one person drive around the country with a pair of binoculars. Rather, I might ask all my friends on Facebook to look out for a red balloon and tell me if they saw one. They might then ask all their friends, and so on.</p>
<p>The winning team from MIT found all of the balloons in just nine hours using this type of strategy. But to encourage participation they offered $2,000 to the first person to send them the co-ordinates of a balloon. On its own this may not have been very efficient. So, crucially, they also offered $1,000 to whomever recruited <em>that</em> person to the challenge, and $500 to whomever recruited&nbsp;<em>that</em> person to the challenge, and so on.</p>
<p>One interesting question mathematically is how much money did the MIT team stand to lose? The Red Balloon Challenge offered a prize of $40,000.</p>
<p>In principle, the sender of a winning set of co-ordinates might have been been at the top of a long line of recruiters. Doesn't this mean that the MIT team risked an enormous payout?</p>
<p>Well, no. Consider a recruitment chain of seven people. The total payout would be:</p>
<p>$2,000+$1,000+$500+$250+$125+$62.50+$31.25 = $3,968.75</p>
<p>The seventh payment of $31.25 is pretty small. If there were more people in the chain, their payments would be even smaller. Nonetheless, lots of small amounts can quickly add up to a large amount.</p>
<p>Suppose there were 17 people in the chain. The total payout would be $3,999.97 (to the nearest cent). The seventeenth person would have got 3<span>&cent;.</span></p>
<p>Even so, there could have been many more people in the chain, and might not the total have slowly grown to an unaffordable amount?</p>
<p>Suppose the total amount payable is T and that there are infinitely many people in the chain. Then,</p>
<p>T = 2000 + 1000 + 500 + 250 + ...</p>
<p>Now multiply both sides of this by 2:</p>
<p>2T = 4000 + 2000 + 1000 + 500 + ...</p>
<p>Finally subtract the first equation from the second:</p>
<p>2T &ndash; T = 4000 + 2000 + 1000 + 500 + ... &ndash; (2000 + 1000 + 500 + 250 + ...)</p>
<p>Which gives:</p>
<p>T=4000</p>
<p>So even if there had been infinitely many people in the chain, the total payout would have been $4,000. Since there were ten balloons that gives a grand total of $40,000 which was the value of the prize. MIT were certain to make at least a little profit, provided they actually won the prize. Since the kudos of winning was worth more than the prize, their investment was well worth the risk.</p>
<p>Series such as this one are called <strong>geometric series</strong>. One of the earliest examples of them was <a title="Wikipedia" href="http://en.wikipedia.org/wiki/Zeno's_paradoxes" target="_blank">Zeno&rsquo;s dichotomy paradox</a>. For me to walk 4000 metres, I first have to walk 2000m, i.e. half the total distance. I then have to walk 1000m, i.e. half the remaining distance. Then 500m. Then 250m. And so on. No matter how far I've travelled there's always half the remaining distance left to go. So I have infinitely many stages to complete and will therefore never get to the end of them.</p>
<p>The flaw in the argument is that last sentence. It assumes that the infinitely many stages of the journey will take infinitely long to get through. But they won't.</p>
<p>Suppose I walk at 1m/s. Then the first stage will take me 2000s. The second will take 1000s. The third 500s, and so on. So the total time taken will be T = 2000 + 1000 + 500 + ... We now know that this adds up to 4000. So I can finish the journey in a finite amount of time. (Indeed the 4000 seconds you would expect me to need to walk 4000m at 1m/s.)</p>]]></description><wfw:commentRss>http://www.dotmaths.com/blog/rss-comments-entry-27584908.xml</wfw:commentRss></item><item><title>We're losing the battle: let's fight a different war</title><dc:creator>Matthew Handy</dc:creator><pubDate>Thu, 05 Apr 2012 11:34:11 +0000</pubDate><link>http://www.dotmaths.com/blog/2012/4/5/were-losing-the-battle-lets-fight-a-different-war.html</link><guid isPermaLink="false">545291:6273458:15731028</guid><description><![CDATA[<p><span class="thumbnail-image-block ssNonEditable"><span><a href="javascript:showFullImage('/display/ShowImage?imageUrl=%2Fstorage%2FSCAN0112.JPG%3F__SQUARESPACE_CACHEVERSION%3D1333625745552',2719,1174);"><img src="http://www.dotmaths.com/storage/thumbnails/6269103-17499175-thumbnail.jpg?__SQUARESPACE_CACHEVERSION=1333625745553" alt="" /></a></span></span></p>
<p>The above letter was published in today's <em>Times Educational Supplement</em>. The full text of the letter was:</p>
<p>Conrad Wolfram is right to argue that we need to rethink our priorities for mathematics education ("Computers do it better", TES, 30 March 2012). It is bizarre, for example, that we still teach students to use tables of probabilities three decades after the calculator ousted the books of logarithms so hated by our parents. Worse, exam boards often ask questions that specifically test the ability to use the tables, rather than focussing on candidates' understanding of what the probabilities really mean.</p>
<p>While such approaches rob the subject of its interest and practicality, resulting in fewer students pursuing it at higher levels, there is even more at stake. Dr Keith Devlin of Stanford University -- in an article titled&nbsp;<a href="http://www.huffingtonpost.com/dr-keith-devlin/all-the-math-taught-at-un_b_1371977.html" target="_blank">"All the math taught at university can be outsourced. What now?"</a> -- argues that the West's competitive advantage must come from mathematical creativity; countries like India are already our technical equals and charge a lot less for their skills.</p>
<p>Indeed, Hal Varian, chief economist at Google, has said that <a href="http://flowingdata.com/2009/02/25/googles-chief-economist-hal-varian-on-statistics-and-data/">"<span>the&nbsp;</span><span>sexy</span></a><span><a href="http://flowingdata.com/2009/02/25/googles-chief-economist-hal-varian-on-statistics-and-data/">&nbsp;job in the next ten years"</a> will be that of statistician. Andreas Weigend, formerly chief scientist at Amazon, agrees: <a href="http://mashable.com/2012/01/13/career-of-the-future-data-scientist-infographic/" target="_blank">"Data is the new oil"</a>. Company surveys suggest there is likely to be a serious shortage of suitably qualified data analysts in the years ahead, yet we still seem to expect our bright students to become engineers.</span></p>
<p>If schools are serious when they claim to be preparing young people for the challenges of the 21st Century, they need to understand that forcing another generation to spend hours solving trigonometric equations and reading books of tables will only result in even more people telling me cheerfully how much they hated maths when they were at school.</p>]]></description><wfw:commentRss>http://www.dotmaths.com/blog/rss-comments-entry-15731028.xml</wfw:commentRss></item><item><title>Is Premier League defending getting worse?</title><dc:creator>Matthew Handy</dc:creator><pubDate>Mon, 02 Jan 2012 09:36:30 +0000</pubDate><link>http://www.dotmaths.com/blog/2012/1/2/is-premier-league-defending-getting-worse.html</link><guid isPermaLink="false">545291:6273458:14407306</guid><description><![CDATA[<p><span class="thumbnail-image-block ssNonEditable"><span><a href="javascript:showFullImage('/display/ShowImage?imageUrl=%2Fstorage%2FPL-Defence---31-Oct-2011.gif%3F__SQUARESPACE_CACHEVERSION%3D1325497679565',429,595);"><img src="http://www.dotmaths.com/storage/thumbnails/6269103-15825869-thumbnail.jpg?__SQUARESPACE_CACHEVERSION=1325497679566" alt="" /></a></span></span></p>
<p>In Radio 4's <em><a title="Story starts 26 minutes in" href="http://www.bbc.co.uk/iplayer/console/b018gzqx" target="_blank">More or Less</a></em>, sports statistician Rob Mastrodomenico took a look at a <a href="http://www.bbc.co.uk/blogs/thefootballtacticsblog/2011/10/is_premier_league_defending_ge.html" target="_blank">BBC blog that discusses whether premier league defending is getting worse</a>. He focusses on the fact that the number of goals per game has risen from 2.59 in 2010/11 to 2.97 in 2011/12.</p>
<p>After discussing the football and the pundits, he turns to the statistics. It is unarguable that 2.97 is bigger than 2.59, but he asks whether the difference is statistically significant. He concludes that it is not and calculates how great the difference would need to be for it to be statistically significant. (I addressed the meaning of statistical significance in my blog posting <a href="http://www.dotmaths.com/blog/2011/10/21/significance-isnt-always-significant.html">Significance isn't always significant</a>.)</p>
<p>Now the question of statistical significance arises when we have the necessarily limited information that we gain from a <strong>sample</strong> and we want to know what that tells us about a <strong>population</strong>. Because our sample is an <em>incomplete</em> picture of the population, we are met with<em> uncertainty</em> &ndash;- and that's where probability comes in.&nbsp;But if we have <em>all </em>of the data, there is no uncertainty and no role for statistical inference. If there is a difference between two things then we must conclude that, well, there is a difference between the two things.</p>
<p>Mastrodomenico's analysis of the BBC's football data treats the games played during the first half of each season as a sample of all the games that could theoretically be played. He sees each season as having an unknown, underlying goals-per-game average and the figures of 2.97 and 2.59 as estimates of their respective averages. It is thus reasonable to carry out standard statistical inference procedures.</p>
<p>It's rather like tossing a coin. We can suppose that any given coin has a particular propensity to come down heads. If we were to toss it many, many times we might end up with a very good idea of what that propensity is. It might be 50:50 heads:tails or, in the case of a biased coin, it might be 60:40. In practice we wouldn't toss the coin millions of times, say, but a couple of hundred times isn't too arduous and the resulting proportion of heads would give us an insight into the coin's underlying propensity towards heads. If we obtained 110 heads out of 200, we might reasonably argue that this is not inconsistent with a long-term propensity of 50:50 heads:tails -- although we would expect 100 heads out of 200 tosses, the process is a random one and a small deviation is not unlikely.</p>
<p>The problem for me in this approach is that I don't believe football matches are like coin tosses. Every coin toss is essentially the same as every other coin toss: though the outcome varies, the circumstances surrounding it are the same. And you can, in principle, repeatedly toss the coin as often as you want. But football matches are subject to many different variables: no two matches are the same. And you cannot endlessly repeat matches in an attempt to discern an underlying goals-per-game average.</p>
<p>In some respects probability and statistics are not like other mathematical disciplines: there are differing schools of thought. There isn't universal agreement, for example, on what a probability actually is and how you can calculate it. Some people believe that a probability is inherent to a situation so that all observers would agree on what that probability is. Others argue that if you have different information to someone else you can reasonably come to a different assessment of the probability of a particular event happening.</p>
<p>Mastrodomenico believes that any given football season has an underlying goals-per-game average and that we can gain insight into what that average is by looking at the actual games played. I look at it differently. I believe we can only go by those actual games. I think football matches are fundamentally different to coin tosses and cannot be treated in the same way. He concludes that we have insufficient evidence of a difference between last season and this season; I say there is a difference. Of course, whether that difference is meaningful and what caused it are completely different questions . . .</p>]]></description><wfw:commentRss>http://www.dotmaths.com/blog/rss-comments-entry-14407306.xml</wfw:commentRss></item><item><title>Is Lord Sugar a psychic octopus?</title><dc:creator>Matthew Handy</dc:creator><pubDate>Mon, 28 Nov 2011 22:37:22 +0000</pubDate><link>http://www.dotmaths.com/blog/2011/11/28/is-lord-sugar-a-psychic-octopus.html</link><guid isPermaLink="false">545291:6273458:13895580</guid><description><![CDATA[<p><span class="thumbnail-image-block ssNonEditable"><span><a href="javascript:showFullImage('/display/ShowImage?imageUrl=%2Fstorage%2FLord-Sugar-with-the-Young-007.jpg%3F__SQUARESPACE_CACHEVERSION%3D1322519907327',276,460);"><img src="http://www.dotmaths.com/storage/thumbnails/6269103-15330081-thumbnail.jpg?__SQUARESPACE_CACHEVERSION=1322519907328" alt="" /></a></span></span></p>
<p>During the World Cup last year, a <a href="http://www.dotmaths.com/blog/2010/7/12/psychic-octopodes.html">psychic octopus</a> rose to fame by correctly predicting the outcome of eight football matches. Despite the relatively low probability of this happening by chance, most people are content to believe that the octopus was just guessing.</p>
<p>Does <em>Young Apprentice</em> work in the same way? Harry M has now been in the boardroom on the losing team six times out of six. Is it fair to conclude that he's the weakest contestant?</p>
<p>Perhaps he is, though he's had some notable and impressive moments. Yet given that the teams are regularly mixed up, is it not inevitable &ndash; or, at least, not unlikely &ndash; that one person would find themselves on the losing team every week simply by chance?</p>
<p>And does the winning team always win because they're the best? What role does randomness play? There seems to be a lot of <a href="http://en.wikipedia.org/wiki/Post_hoc_ergo_propter_hoc" target="_blank">post hoc ergo propter hoc</a> analysis in the boardroom. James made great play of the fact that his team found a pocket watch at a considerably cheaper price than the opposition. But might that not have been more luck than judgement? As Zara countered, the watch she bought was the cheapest of the ones she saw; indeed it was cheapest by a large margin.</p>
<p>If the World Cup's octopus was lucky rather than psychic, might not Harry M be unlucky rather than a weak contestant?</p>
<p><a href="http://articles.businessinsider.com/2009-06-09/wall_street/30047033_1_manager-dimensional-fund-advisors-skill" target="_blank">More formal analyses</a> on similar lines to the ones I'm suggesting here have been done in the sphere, for example, of investment management. There is evidence to suggest that many apparently successful fund managers may merely be just consistently lucky.</p>]]></description><wfw:commentRss>http://www.dotmaths.com/blog/rss-comments-entry-13895580.xml</wfw:commentRss></item><item><title>Significance isn't always significant</title><dc:creator>Matthew Handy</dc:creator><pubDate>Fri, 21 Oct 2011 09:05:24 +0000</pubDate><link>http://www.dotmaths.com/blog/2011/10/21/significance-isnt-always-significant.html</link><guid isPermaLink="false">545291:6273458:13401643</guid><description><![CDATA[<p><span class="thumbnail-image-block ssNonEditable"><span><a href="javascript:showFullImage('/display/ShowImage?imageUrl=%2Fstorage%2F_56174211_climate_change_624gr.gif%3F__SQUARESPACE_CACHEVERSION%3D1319187981005',430,624);"><img src="http://www.dotmaths.com/storage/thumbnails/6269103-14744686-thumbnail.jpg?__SQUARESPACE_CACHEVERSION=1319187981006" alt="" /></a></span></span></p>
<p>The BBC report the new climate change data with the headline <strong><a href="http://www.bbc.co.uk/news/science-environment-13719510" target="_blank">Global warming since 1995 "now significant"</a></strong>. While there is an attempt to explain what this means, I think the headline is deceptive and the explanation is unclear. Let me try to do better.</p>
<p>Suppose I have a coin which I suspect is biased in favour of heads. I toss the coin ten times and it comes down heads six times. What conclusions can I draw from this? Well, if the coin is not biased I would have expected it to come down heads five times out of ten. But, crucially, coin tosses are random: if you repeatedly toss a fair coin ten times it doesn't come down heads exactly five times out of ten every single time. Sometimes you get six heads; sometimes you get four. So the fact that I got heads six times is entirely consistent with the coin being a fair coin: for an unbiased coin to come down heads six times out of ten is not that unusual. This evidence is not, therefore, sufficient, to support my suspicion that the coin is biased.</p>
<p>On the other hand, if my coin had come down heads nine times out of ten, I may start to feel justified in my belief that it is biased. Fair coins do sometimes come down heads nine times out of ten, but only quite rarely. If my coin did so I may prefer to believe that it is biased, since coming down heads nine times out of ten is not that unusual for a biased coin.</p>
<p>An outcome is said to be <strong>statistically significant</strong> if the probability of it occurring by chance is very low. How low is low is entirely subjective. A benchmark that is often used is 5%, though the researchers at CERN who are looking for the Higgs boson use a far lower probability, as did the physicists who announced they may have found particles travelling at a speed faster than that of light.</p>
<p>But what statisticians mean by "significant" is simply that there is evidence to suggest, for example, that the coin is biased. The term says nothing about whether the bias <em>is meaningful in practical terms: </em><strong>practical significance</strong>.&nbsp;A conclusion of statistical significance does not distinguish between a coin that is biased in such a way that it comes down heads 50.1% of the time and a coin that comes down heads 90% of the time.</p>
<p>The latest climate change data does offer evidence that the temperature has increased. But there are two reasons to be cautious. First, it remains possible (though unlikely) that the increase is simply a random variation, equivalent to a fair coin coming up heads nine times out of ten. Second, even if the increase is real, it may not be significant in practical terms.</p>]]></description><wfw:commentRss>http://www.dotmaths.com/blog/rss-comments-entry-13401643.xml</wfw:commentRss></item><item><title>What’s the point of Fermat’s last theorem?</title><dc:creator>Matthew Handy</dc:creator><pubDate>Wed, 17 Aug 2011 08:17:16 +0000</pubDate><link>http://www.dotmaths.com/blog/2011/8/17/whats-the-point-of-fermats-last-theorem.html</link><guid isPermaLink="false">545291:6273458:12539293</guid><description><![CDATA[<p><span class="full-image-block ssNonEditable"><span><img src="http://www.dotmaths.com/storage/pierre_de_fermat-2011-hp.jpg?__SQUARESPACE_CACHEVERSION=1313581798606" alt="" /></span></span></p>
<p>Today Google celebrates Pierre de Fermat&rsquo;s 410th birthday. He is most famously known for his &ldquo;last theorem&rdquo;, the subject of Simon Singh&rsquo;s <a href="http://www.amazon.co.uk/Fermats-Last-Theorem-confounded-greatest/dp/1841157910/ref=sr_1_1?ie=UTF8&amp;qid=1313569194&amp;sr=8-1" target="_blank">excellent book</a>. (If you know about Fermat&rsquo;s last theorem, jump down to "Molina&rsquo;s Urns" below.)</p>
<p>The theorem is a generalisation of Pythagoras&rsquo;s theorem. Every schoolchild knows that, for a right-angled triangle,</p>
<p><span class="full-image-block ssNonEditable"><span><img src="http://www.dotmaths.com/storage/CodeCogsEqn-11.gif?__SQUARESPACE_CACHEVERSION=1313597751455" alt="" /></span></span></p>
<p>where x, y and z are the lengths of the three sides of the triangle, with z the length of the longest side.</p>
<p>Some right-angled triangles have sides whose lengths are whole numbers. For example, a triangle whose sides have lengths 3, 4 and 5 is right-angled because</p>
<p><span class="full-image-block ssNonEditable"><span><img src="http://www.dotmaths.com/storage/CodeCogsEqn.gif?__SQUARESPACE_CACHEVERSION=1313569572303" alt="" /></span></span></p>
<p>Fermat was considering the more general equation</p>
<p><span class="full-image-block ssNonEditable"><span><img src="http://www.dotmaths.com/storage/CodeCogsEqn-1.gif?__SQUARESPACE_CACHEVERSION=1313569643200" alt="" /></span></span></p>
<p>where n is a positive whole number. He wondered whether positive whole numbers x, y and z could be found that satisfied this equation if n was larger than 2.</p>
<p>He claimed that no such values could be found and he noted the fact in the margin of an algebra textbook he was reading. He wrote, &ldquo;I have discovered a truly marvelous demonstration of this proposition that this margin is too narrow to contain.&rdquo;</p>
<p>Well, that was in 1637. It wasn't until 1995 that Andrew Wiles and Richard Taylor published a proof of the theorem, some 358 year later.</p>
<p>(Incidentally, Fermat was both a lawyer and a mathematician. As am I :P)</p>
<p><strong>Molina&rsquo;s Urns</strong></p>
<p>Some time before Fermat&rsquo;s last theorem was proved, E. C. Molina invented a problem whose solution relied upon upon it. The problem would only have a solution if Fermat&rsquo;s last theorem was proved to be false.</p>
<p>There are two urns each containing the same number of balls. Each ball is either black or white.</p>
<p>From each urn the same number of balls is selected. Each time a ball is selected from an urn its colour is noted and it is then put back into the urn before the next ball is selected.</p>
<p>The problem is this. Can the distributions of black and white balls in each urn be chosen such that the following condition is satisfied: the probability that the balls drawn from the first urn are all white must equal the probability that the balls drawn from the second urn are either all white or all black.</p>
<p>Let&rsquo;s analyse the problem.</p>
<p>We define the following variables:</p>
<p>x = the number of white balls in the second urn</p>
<p>y = the number of black balls in the second urn</p>
<p>z = the number of white balls in the first urn</p>
<p>n = the number of balls selected from each urn</p>
<p>Since the total number of balls in each urn is the same, there are x + y balls in each urn.</p>
<p>The probability that a white ball is selected from the first urn is</p>
<p><span class="full-image-block ssNonEditable"><span><img src="http://www.dotmaths.com/storage/CodeCogsEqn-2.gif?__SQUARESPACE_CACHEVERSION=1313580640559" alt="" /></span></span></p>
<p>Therefore the probability that all of the balls drawn from the first urn are white is</p>
<p><span class="full-image-block ssNonEditable"><span><img src="http://www.dotmaths.com/storage/CodeCogsEqn-3.gif?__SQUARESPACE_CACHEVERSION=1313580754389" alt="" /></span></span></p>
<p>Similarly, the probability that all of the balls drawn from the second urn are white is</p>
<p><span class="full-image-block ssNonEditable"><span><img src="http://www.dotmaths.com/storage/CodeCogsEqn-4.gif?__SQUARESPACE_CACHEVERSION=1313580811244" alt="" /></span></span></p>
<p>and the probability that all of the balls drawn from the second urn are black is</p>
<p><span class="full-image-block ssNonEditable"><span><img src="http://www.dotmaths.com/storage/CodeCogsEqn-5.gif?__SQUARESPACE_CACHEVERSION=1313580852343" alt="" /></span></span></p>
<p>We require the probability that all of the balls drawn from the first urn are white to be equal to the probability that the balls drawn from the second urn are either all white or all black. In other words</p>
<p><span class="full-image-block ssNonEditable"><span><img src="http://www.dotmaths.com/storage/CodeCogsEqn-6.gif?__SQUARESPACE_CACHEVERSION=1313581020903" alt="" /></span></span></p>
<p>which reduces to&nbsp;</p>
<p><span class="full-image-block ssNonEditable"><span><img src="http://www.dotmaths.com/storage/CodeCogsEqn-1.gif?__SQUARESPACE_CACHEVERSION=1313581065770" alt="" /></span></span></p>
<p>So in order to satisfy the requirement of the problem we must be able to find whole numbers which satisfy this equation.</p>
<p>We can certainly do this if n = 2, since the equation becomes Pythagoras&rsquo;s theorem. Thus we can have 5 white balls and 2 black balls in the first urn, and three white balls and four black balls in the second urn.</p>
<p>If we draw two balls from each urn, the probability that both balls drawn from the first urn are white is</p>
<p><span class="full-image-block ssNonEditable"><span><img src="http://www.dotmaths.com/storage/CodeCogsEqn-7.gif?__SQUARESPACE_CACHEVERSION=1313581361074" alt="" /></span></span></p>
<p>and the probability that the two balls drawn from the second urn are either both white or both black &nbsp;is</p>
<p><span class="full-image-block ssNonEditable"><span><img src="http://www.dotmaths.com/storage/CodeCogsEqn-8.gif?__SQUARESPACE_CACHEVERSION=1313581454745" alt="" /></span></span></p>
<p>which satisfies the requirement of the problem.</p>
<p>If, however, we choose more than two balls from each urn, Fermat&rsquo;s last theorem tells us that there are no mixtures of black and white balls in each urn that will satisfy the requirement of the problem.</p>
<p>(I found this interesting problem in <a href="http://www.amazon.co.uk/Fifty-Challenging-Problems-Probability-Solutions/dp/0486653552" target="_blank">Fifty Challenging Problems in Probability</a>, by Frederick Mosteller.)</p>]]></description><wfw:commentRss>http://www.dotmaths.com/blog/rss-comments-entry-12539293.xml</wfw:commentRss></item><item><title>Why I (mostly) hate league tables</title><dc:creator>Matthew Handy</dc:creator><pubDate>Mon, 04 Jul 2011 15:22:48 +0000</pubDate><link>http://www.dotmaths.com/blog/2011/7/4/why-i-mostly-hate-league-tables.html</link><guid isPermaLink="false">545291:6273458:12003251</guid><description><![CDATA[<p><span class="thumbnail-image-block ssNonEditable"><span><a href="javascript:showFullImage('/display/ShowImage?imageUrl=%2Fstorage%2FTop%20research%20institutions%20in%20plant%20sciences%20and%20mycology.png%3F__SQUARESPACE_CACHEVERSION%3D1309793176820',1458,1248);"><img src="http://www.dotmaths.com/storage/thumbnails/6269103-13032849-thumbnail.jpg?__SQUARESPACE_CACHEVERSION=1309793176820" alt="" /></a></span></span></p>
<p>There are many reasons to object to league tables, and the table above illustrates one of them very well: a tiny change in what you measure results in a big change in league table position.</p>
<p>The three columns rank research institutes according to three different, but similar, metrics. Yet the resulting rankings give very different results. For example, the Max Planck Society tops the first two tables, yet falls to 20th place in the third.</p>
<p>That 20th place ranking illustrates another objection I have to tables such as these. The metric on which the institutes are judged in third column is measured to two decimal places. This is bizarrely specific for something that has a number of subjective features. The definition of a "highly cited paper" is arbitrary, and being cited is itself somewhat arbitrary (and, indeed, may not be evidence that the paper being cited is good: the citation may be a criticism, contradiction or refutation).</p>
<p>Of course, there is an exception to every rule. The table below, in which the University of Cambridge is ranked number one, is entirely beyond any criticism.</p>
<p><span class="thumbnail-image-block ssNonEditable"><span><a href="javascript:showFullImage('/display/ShowImage?imageUrl=%2Fstorage%2Ftable_p11.jpg%3F__SQUARESPACE_CACHEVERSION%3D1309793712653',862,555);"><img src="http://www.dotmaths.com/storage/thumbnails/6269103-13032983-thumbnail.jpg?__SQUARESPACE_CACHEVERSION=1309793712654" alt="" /></a></span></span></p>
<p>(As is this year&rsquo;s <strong><a href="http://www.independent.co.uk/news/education/education-news/cambridge-results-2011--tompkins-table-2315322.html" target="_blank">Tompkins table</a></strong>, which ranks Cambridge colleges by exam success. Trinity College comes top. By happy coincidence, I was at Trinity :)</p>]]></description><wfw:commentRss>http://www.dotmaths.com/blog/rss-comments-entry-12003251.xml</wfw:commentRss></item><item><title>The US in rice</title><dc:creator>Matthew Handy</dc:creator><pubDate>Fri, 25 Feb 2011 19:49:12 +0000</pubDate><link>http://www.dotmaths.com/blog/2011/2/25/the-us-in-rice.html</link><guid isPermaLink="false">545291:6273458:10605810</guid><description><![CDATA[<p><span class="full-image-block ssNonEditable"><span><img src="http://www.dotmaths.com/storage/usa_population.jpg?__SQUARESPACE_CACHEVERSION=1298663421123" alt="" /></span></span></p>
<p>I'm becoming increasingly interested in visualisations of data and this one particularly caught my imagination. It's a mound of rice, with one grain for every person in the US. It's part of a larger installation called <a href="http://stanscafe.co.uk/ofallthepeople/index.html" target="_blank">Of All The People In The World</a>.</p>]]></description><wfw:commentRss>http://www.dotmaths.com/blog/rss-comments-entry-10605810.xml</wfw:commentRss></item><item><title>The central limit theorem</title><dc:creator>Matthew Handy</dc:creator><pubDate>Sun, 20 Feb 2011 10:23:41 +0000</pubDate><link>http://www.dotmaths.com/blog/2011/2/20/the-central-limit-theorem.html</link><guid isPermaLink="false">545291:6273458:10542251</guid><description><![CDATA[<p><span class="thumbnail-image-block ssNonEditable"><span><a href="javascript:showFullImage('/display/ShowImage?imageUrl=%2Fstorage%2F34812strip.gif%3F__SQUARESPACE_CACHEVERSION%3D1298197757025',199,640);"><img src="http://www.dotmaths.com/storage/thumbnails/6269103-10836572-thumbnail.jpg?__SQUARESPACE_CACHEVERSION=1298197757025" alt="" /></a></span></span></p>
<p><em>Another in my occasional series of more technical blog posts, the following is a review of the most important result in statistics, the central limit theorem.</em></p>
<p><strong>The central limit theorem</strong></p>
<p>Strictly speaking the central limit theorem is not a theorem about means at all: it's more general than that. It says that the distribution of &nbsp;the sum of n observations will be approximately Normally and this approximation gets better as n increases. In other words, &sum;X~N, approximately.</p>
<p>You can try this out for yourself by running&nbsp;<a title="central limit theorem demonstrator" href="http://www.stat.sc.edu/~west/javahtml/CLT.html" target="_blank">this java applet</a>. It will allow you to roll one die a thousand times and it will draw a corresponding histogram of the results. That histogram will look pretty much rectangular. This is because you have only taken one observation each time: n=1. If you roll the die twice and add up the total score, and then do that a thousand times, the histogram will look triangular. This is because the middle values (6,7,8) are more likely than the low values (2,3,4) and the high values (10,11,12). So this time you've taken two observations (n=2) and added them together and the result is a little bit Normal. But if you roll the die five times (n=5) and add up the total score, and then do this a thousand times the resulting histogram looks like a Normal distribution &ndash; and that's the central limit theorem kicking in even though you're only making five observations each time and adding them up. (Note that the fact we repeat each set of die rolls a thousand times is arbitrary. The value of n is the number of die rolls we make each time, not how many times we repeat this. The large number of repetitions is so that we can build up a clear picture of the distribution of the total score.)</p>
<p><strong>Approximating distributions</strong></p>
<p>The central limit theorem can be used in distributional approximations. For example a binomial B(50,0.5) can be thought of as the sum of fifty Bernoulli distributions. So B(50,0.5)=&sum;B(1,0.5). Since we are adding up observations, the central limit theorem applies and tells us that B(50,0.5)~N, approximately.</p>
<p>In general B(n,p)~N, approximately, since B(n,p)=&sum;B(1,p). The closeness of the approximation depends, of course, on the values of n and p. If p is close to 0.5, n can be quite small: 10, say. As p gets smaller, n has to get larger for the approximation to be reasonable.</p>
<p>Similarly, a Poisson Po(50) can be considered to be a sum of fifty Po(1). So Po(50)=&sum;Po(1) and again the central limit theorem applies and gives us Po(50)~N, approximately.</p>
<p><strong>The underlying distribution and the sampling distribution of the mean</strong></p>
<p>Before I discuss hypothesis tests we need to be clear about two things. The population we take our sample from is called the&nbsp;<em>underlying population</em>. It will have a distribution which we may or may not know and it may or may not be Normal. Having taken our sample we will calculate its mean. If we repeated this procedure many times we could build up a whole new distribution: the<em>&nbsp;sampling distribution of the mean</em>. The central limit theorem tells us about that sampling distribution: it says that -- whatever the underlying distribution -- the sampling distribution of the mean will be approximately Normal, and this approximation will be good for means based on large samples.</p>
<p><strong>Hypothesis testing for the mean</strong></p>
<p>So far as testing means is concerned there are two distinct situations: where we know the population variance and where we don't.</p>
<p>If we&nbsp;<em>don't</em>&nbsp;know the population variance then<em>&nbsp;the central limit theorem doesn't come into it at all</em>, strictly speaking. The appropriate tests (<strong>t-tests</strong>) are based on the t-distribution and that is a completely different thing. Because we don't know the population variance we have to give the underlying mathematics something to work with and the requirement is that the&nbsp;<em>underlying population</em>&nbsp;from which we're sampling is Normally distributed. That being said, the t-tests are quite robust to deviations from this requirement.</p>
<p>If we&nbsp;<em>do</em>&nbsp;know the population variance we'll be using a&nbsp;<strong>z-test</strong>&nbsp;and we&nbsp;<em>will</em>&nbsp;be using the central limit theorem but<em>only if the underlying distribution is not Normal</em>&nbsp;(or is not known). If the underlying distribution&nbsp;<em>is</em>&nbsp;Normal then the distribution of the mean is&nbsp;<em>exactly</em>&nbsp;Normal no matter what the size of the sample and the central limit theorem is unnecessary. If the underlying distribution is not Normal, then the central limit theorem tells us that the mean will nonetheless be approximately Normal and that this approximation will be good for "large" sample sizes. How large is large? It depends on how far from Normal the underlying distribution is. If the underlying distribution is a chi-squared with 20-degrees of freedom then the sample size need not be that large. Why? Because a chi-squared distribution with 20-degrees of freedom looks a lot like a Normal distribution. (Why? Because it's the sum of twenty chi-squared distributions with one degree of freedom and so the central limit theorem tells us it will be approximately Normal! Note that the central limit theorem is being used here to make an inference about the underlying distribution, not the sampling distribution of the mean.)</p>
<p>Finally, if the sample size is large but we don't know the population variance, we may still use a z-test rather than a t-test because we may consider our unbiased estimate of the population variance (based on our large sample) as being very reliable so that really we "know" the population variance. And the critical values from the t-distribution are very similar to those from the Normal distribution if the sample size is large.</p>]]></description><wfw:commentRss>http://www.dotmaths.com/blog/rss-comments-entry-10542251.xml</wfw:commentRss></item><item><title>I'm not satisfied with student satisfaction</title><dc:creator>Matthew Handy</dc:creator><pubDate>Fri, 18 Feb 2011 19:17:35 +0000</pubDate><link>http://www.dotmaths.com/blog/2011/2/18/im-not-satisfied-with-student-satisfaction.html</link><guid isPermaLink="false">545291:6273458:10524451</guid><description><![CDATA[<p><span class="full-image-block ssNonEditable"><span><img src="http://www.dotmaths.com/storage/THE_cover_170211.jpg?__SQUARESPACE_CACHEVERSION=1298057199573" alt="" /></span></span></p>
<p><em>Times Higher Education</em> has today published the results of its annual <a title="Times Higher Education" href="http://www.timeshighereducation.co.uk/story.asp?sectioncode=26&amp;storycode=415180&amp;c=2" target="_blank"><strong>Student Experience Survey</strong></a>. More than 13,000 undergraduates rated &nbsp;their universities under 21 headings on a seven point scale. The points were given weightings, tallied up and then averaged out so that a league table of universities could be produced. Top of the poll was the Loughborough University, for the fifth year in a row.</p>
<p>As a statistician, my interest is in whether league tables such as these have any real meaning. I have a number of concerns.</p>
<p><strong>Who is doing the rating?</strong> A sample of 13,000 seems like a lot. But this is spread amongst 113 universities. That's around 115 per university, though the numbers varied from 30 to 257. A typical university has around 10-15,000 undergraduates so each university's score is based on a tiny sample. When opinion polls are carried out these are typically taken from about 1,000 people. In those circumstances, and assuming that the people were selected at random from the entire population and asked a yes/no question, there is a margin of error of 3%. These samples are nowhere near that large and the consequent margin of error will be very much greater. Worse, the sample in this case is not random (it comprises voluntary responses from those invited to participate), the questions were not yes/no but 21 ratings on a subjective seven point scale. And what of those voluntary responses? What kind of person doesn't respond to such a survey? What kind of person does? Some universities strongly encourage students to respond to the survey: why would they do that?</p>
<p><strong>How do you rate something on a seven point scale?</strong> What is your frame of reference? How can you ensure that you are consistent in your ratings across the different headings? How can you be sure that your understanding of the points on the scale is consistent with that of other people? When I taught in the US, I had to grade every student on a 12 point scale. One year a student told me that if I graded him A&ndash; in my class he would get admission to his favoured university. I had provisionally graded him B+. But I had no confidence that I had any understanding of the difference between those two grades. I gave him the A&ndash;.</p>
<p>Physical scientists use scales that are very carefully defined and calibrated: there is very precise agreement on what one metre is, for example. And length is such that if you double the value you get double the length. But social scientists don't use scales like this. On a ten point scale, I'd say my happiness today is a five. But what does that mean? If tomorrow my happiness is ten, does that mean I'm twice as happy? If you are feeling five-happy today are you just as happy as I am? If I'm feeling five-happy this time next week will I be just as happy as I am today?</p>
<p>(Happiness may not be easy to measure but beauty is. Helen of Troy was said to have beauty sufficient to launch a thousand ships. If I flatter myself, my own beauty might be sufficient to launch a single, albeit very small, ship, so my beauty is one millihelen.)</p>
<p><strong>How do you combine 21 scores?</strong> If I think my university library is worth six out of seven and the university canteen is worth five, does that mean that the two together are worth 11? Scores such as these are called ordinal: which essentially means that you can put them in an order. If I think my library is worth six points but your library is worth five, we are entitled to conclude that I think my library is better than yours. (Though what basis I have for that judgement and whether anyone would agree is entirely moot.) The problem with ordinal data is that you cannot meaningfully add it up. If you booked three nights at a four star hotel but arrived to find they'd overbooked, would you be happy to be given six nights in a two star hotel? After all, 3x4=6x2. Obviously this is nonsense, yet the Student Experience Survey adds up the scores under the various headings. In fact it weights the scores, doubling some scores for headings that are considered more important. Such judgements are, however, necessarily arbitrary.</p>
<p><strong>How do you compare totals?</strong> The final results of the Student Experience Survey are quoted to two decimal places. This is presumably done so that universities can be separated and ranked. If you look at the data you'll see that there would be a lot of ties if the scores were only quoted to one decimal place. Yet this is an extraordinary level of precision for data that is so subjective. If I can't be sure that even one of my 21 scores has the same meaning as one of yours, how can we combine those scores and report a result to two decimal places? In fact, there was remarkably little difference in the final scores. The top 50 universities were separated by less than 10 points: a one point improvement would see the University of Plymouth jump 10 places up the table to replace the University of Edinburgh at 29th.</p>
<p><strong>So does the survey tell us anything?</strong> <em>Times Higher Education</em> defends the survey in a number of ways, not least by pointing to the relative consistency of table position from year to year. It must mean something that Loughborough has come out top five years in a row. But what can we conclude from the fact that the University of York came 32 with a score of 77.63 whereas York St John University came 54th with a score of 75.12? Or that the former jumped 15 places up the table this year compared with last while the latter jumped 18? (Maybe York itself has become a more conducive place to study?)</p>
<p>Ironically, this week's issue of <em>Times Higher Education</em>&nbsp;<a title="Times Higher Education" href="http://www.timeshighereducation.co.uk/story.asp?sectioncode=26&amp;storycode=415209" target="_blank">reports</a> the publication of a paper critical of the grade-point average system used by many universities. Dr Soh Kay Cheng's critique of the system of grading students is not a million miles away from my own of the ranking of universities themselves.</p>
<p>[For a more technical analysis of league tables and their usefulness, see <a href="http://www.dotmaths.com//jtcs.ctsnetjournals.org/cgi/content/full/125/5/1171-a" target="_blank">here</a> and <a href="http://www.ncbi.nlm.nih.gov/sites/ppmc/articles/PMC28588/" target="_blank">here</a>.]</p>]]></description><wfw:commentRss>http://www.dotmaths.com/blog/rss-comments-entry-10524451.xml</wfw:commentRss></item></channel></rss>