How juries are fooled by statistics陪审团是如何被数据愚弄的 [复制链接]

上一主题下一主题查看指定楼层

幸福大叔

舞台策划

只看楼主倒序阅读使用道具楼主发表于: 2022-07-27

How juries are fooled by statistics

1,371,959 views | Peter Donnelly • TEDGlobal 2005

Peter Peter Donnelly
Mathematician; statistician

Peter Donnelly is an expert in probability theory who applies statistical methods to genetic data -- spurring advances in disease treatment and insight on our evolution. He's also an expert on DNA analysis, and an advocate for sensible statistical analysis in the courtroom.

00:00
As other speakers have said, it's a rather daunting experience -- a particularly daunting experience -- to be speaking in front of this audience. But unlike the other speakers, I'm not going to tell you about the mysteries of the universe, or the wonders of evolution, or the really clever, innovative ways people are attacking the major inequalities in our world. Or even the challenges of nation-states in the modern global economy. My brief, as you've just heard, is to tell you about statistics -- and, to be more precise, to tell you some exciting things about statistics. And that's -- (Laughter) -- that's rather more challenging than all the speakers before me and all the ones coming after me. (Laughter) One of my senior colleagues told me, when I was a youngster in this profession, rather proudly, that statisticians were people who liked figures but didn't have the personality skills to become accountants. (Laughter) And there's another in-joke among statisticians, and that's, "How do you tell the introverted statistician from the extroverted statistician?" To which the answer is, "The extroverted statistician's the one who looks at the other person's shoes." (Laughter) But I want to tell you something useful -- and here it is, so concentrate now. This evening, there's a reception in the University's Museum of Natural History. And it's a wonderful setting, as I hope you'll find, and a great icon to the best of the Victorian tradition. It's very unlikely -- in this special setting, and this collection of people -- but you might just find yourself talking to someone you'd rather wish that you weren't. So here's what you do. When they say to you, "What do you do?" -- you say, "I'm a statistician." (Laughter) Well, except they've been pre-warned now, and they'll know you're making it up. And then one of two things will happen. They'll either discover their long-lost cousin in the other corner of the room and run over and talk to them. Or they'll suddenly become parched and/or hungry -- and often both -- and sprint off for a drink and some food. And you'll be left in peace to talk to the person you really want to talk to.

01:55
It's one of the challenges in our profession to try and explain what we do. We're not top on people's lists for dinner party guests and conversations and so on. And it's something I've never really found a good way of doing. But my wife -- who was then my girlfriend -- managed it much better than I've ever been able to. Many years ago, when we first started going out, she was working for the BBC in Britain, and I was, at that stage, working in America. I was coming back to visit her. She told this to one of her colleagues, who said, "Well, what does your boyfriend do?" Sarah thought quite hard about the things I'd explained -- and she concentrated, in those days, on listening. (Laughter) Don't tell her I said that. And she was thinking about the work I did developing mathematical models for understanding evolution and modern genetics. So when her colleague said, "What does he do?" She paused and said, "He models things." (Laughter) Well, her colleague suddenly got much more interested than I had any right to expect and went on and said, "What does he model?" Well, Sarah thought a little bit more about my work and said, "Genes." (Laughter) "He models genes."

03:06
That is my first love, and that's what I'll tell you a little bit about. What I want to do more generally is to get you thinking about the place of uncertainty and randomness and chance in our world, and how we react to that, and how well we do or don't think about it. So you've had a pretty easy time up till now -- a few laughs, and all that kind of thing -- in the talks to date. You've got to think, and I'm going to ask you some questions. So here's the scene for the first question I'm going to ask you. Can you imagine tossing a coin successively? And for some reason -- which shall remain rather vague -- we're interested in a particular pattern. Here's one -- a head, followed by a tail, followed by a tail.

03:42
So suppose we toss a coin repeatedly. Then the pattern, head-tail-tail, that we've suddenly become fixated with happens here. And you can count: one, two, three, four, five, six, seven, eight, nine, 10 -- it happens after the 10th toss. So you might think there are more interesting things to do, but humor me for the moment. Imagine this half of the audience each get out coins, and they toss them until they first see the pattern head-tail-tail. The first time they do it, maybe it happens after the 10th toss, as here. The second time, maybe it's after the fourth toss. The next time, after the 15th toss. So you do that lots and lots of times, and you average those numbers. That's what I want this side to think about.

04:18
The other half of the audience doesn't like head-tail-tail -- they think, for deep cultural reasons, that's boring -- and they're much more interested in a different pattern -- head-tail-head. So, on this side, you get out your coins, and you toss and toss and toss. And you count the number of times until the pattern head-tail-head appears and you average them. OK? So on this side, you've got a number -- you've done it lots of times, so you get it accurately -- which is the average number of tosses until head-tail-tail. On this side, you've got a number -- the average number of tosses until head-tail-head.

04:46
So here's a deep mathematical fact -- if you've got two numbers, one of three things must be true. Either they're the same, or this one's bigger than this one, or this one's bigger than that one. So what's going on here? So you've all got to think about this, and you've all got to vote -- and we're not moving on. And I don't want to end up in the two-minute silence to give you more time to think about it, until everyone's expressed a view. OK. So what you want to do is compare the average number of tosses until we first see head-tail-head with the average number of tosses until we first see head-tail-tail.

05:16
Who thinks that A is true -- that, on average, it'll take longer to see head-tail-head than head-tail-tail? Who thinks that B is true -- that on average, they're the same? Who thinks that C is true -- that, on average, it'll take less time to see head-tail-head than head-tail-tail? OK, who hasn't voted yet? Because that's really naughty -- I said you had to. (Laughter) OK. So most people think B is true. And you might be relieved to know even rather distinguished mathematicians think that. It's not. A is true here. It takes longer, on average. In fact, the average number of tosses till head-tail-head is 10 and the average number of tosses until head-tail-tail is eight. How could that be? Anything different about the two patterns? There is. Head-tail-head overlaps itself. If you went head-tail-head-tail-head, you can cunningly get two occurrences of the pattern in only five tosses. You can't do that with head-tail-tail. That turns out to be important.

06:21
There are two ways of thinking about this. I'll give you one of them. So imagine -- let's suppose we're doing it. On this side -- remember, you're excited about head-tail-tail; you're excited about head-tail-head. We start tossing a coin, and we get a head -- and you start sitting on the edge of your seat because something great and wonderful, or awesome, might be about to happen. The next toss is a tail -- you get really excited. The champagne's on ice just next to you; you've got the glasses chilled to celebrate. You're waiting with bated breath for the final toss. And if it comes down a head, that's great. You're done, and you celebrate. If it's a tail -- well, rather disappointedly, you put the glasses away and put the champagne back. And you keep tossing, to wait for the next head, to get excited.

07:00
On this side, there's a different experience. It's the same for the first two parts of the sequence. You're a little bit excited with the first head -- you get rather more excited with the next tail. Then you toss the coin. If it's a tail, you crack open the champagne. If it's a head you're disappointed, but you're still a third of the way to your pattern again. And that's an informal way of presenting it -- that's why there's a difference. Another way of thinking about it -- if we tossed a coin eight million times, then we'd expect a million head-tail-heads and a million head-tail-tails -- but the head-tail-heads could occur in clumps. So if you want to put a million things down amongst eight million positions and you can have some of them overlapping, the clumps will be further apart. It's another way of getting the intuition.

07:45
What's the point I want to make? It's a very, very simple example, an easily stated question in probability, which every -- you're in good company -- everybody gets wrong. This is my little diversion into my real passion, which is genetics. There's a connection between head-tail-heads and head-tail-tails in genetics, and it's the following. When you toss a coin, you get a sequence of heads and tails. When you look at DNA, there's a sequence of not two things -- heads and tails -- but four letters -- As, Gs, Cs and Ts. And there are little chemical scissors, called restriction enzymes which cut DNA whenever they see particular patterns. And they're an enormously useful tool in modern molecular biology. And instead of asking the question, "How long until I see a head-tail-head?" -- you can ask, "How big will the chunks be when I use a restriction enzyme which cuts whenever it sees G-A-A-G, for example? How long will those chunks be?"

08:35
That's a rather trivial connection between probability and genetics. There's a much deeper connection, which I don't have time to go into and that is that modern genetics is a really exciting area of science. And we'll hear some talks later in the conference specifically about that. But it turns out that unlocking the secrets in the information generated by modern experimental technologies, a key part of that has to do with fairly sophisticated -- you'll be relieved to know that I do something useful in my day job, rather more sophisticated than the head-tail-head story -- but quite sophisticated computer modelings and mathematical modelings and modern statistical techniques. And I will give you two little snippets -- two examples -- of projects we're involved in in my group in Oxford, both of which I think are rather exciting. You know about the Human Genome Project. That was a project which aimed to read one copy of the human genome. The natural thing to do after you've done that -- and that's what this project, the International HapMap Project, which is a collaboration between labs in five or six different countries. Think of the Human Genome Project as learning what we've got in common, and the HapMap Project is trying to understand where there are differences between different people.

09:43
Why do we care about that? Well, there are lots of reasons. The most pressing one is that we want to understand how some differences make some people susceptible to one disease -- type-2 diabetes, for example -- and other differences make people more susceptible to heart disease, or stroke, or autism and so on. That's one big project. There's a second big project, recently funded by the Wellcome Trust in this country, involving very large studies -- thousands of individuals, with each of eight different diseases, common diseases like type-1 and type-2 diabetes, and coronary heart disease, bipolar disease and so on -- to try and understand the genetics. To try and understand what it is about genetic differences that causes the diseases. Why do we want to do that? Because we understand very little about most human diseases. We don't know what causes them. And if we can get in at the bottom and understand the genetics, we'll have a window on the way the disease works, and a whole new way about thinking about disease therapies and preventative treatment and so on. So that's, as I said, the little diversion on my main love.

10:44
Back to some of the more mundane issues of thinking about uncertainty. Here's another quiz for you -- now suppose we've got a test for a disease which isn't infallible, but it's pretty good. It gets it right 99 percent of the time. And I take one of you, or I take someone off the street, and I test them for the disease in question. Let's suppose there's a test for HIV -- the virus that causes AIDS -- and the test says the person has the disease. What's the chance that they do? The test gets it right 99 percent of the time. So a natural answer is 99 percent. Who likes that answer? Come on -- everyone's got to get involved. Don't think you don't trust me anymore. (Laughter) Well, you're right to be a bit skeptical, because that's not the answer. That's what you might think. It's not the answer, and it's not because it's only part of the story. It actually depends on how common or how rare the disease is. So let me try and illustrate that. Here's a little caricature of a million individuals. So let's think about a disease that affects -- it's pretty rare, it affects one person in 10,000. Amongst these million individuals, most of them are healthy and some of them will have the disease. And in fact, if this is the prevalence of the disease, about 100 will have the disease and the rest won't. So now suppose we test them all. What happens? Well, amongst the 100 who do have the disease, the test will get it right 99 percent of the time, and 99 will test positive. Amongst all these other people who don't have the disease, the test will get it right 99 percent of the time. It'll only get it wrong one percent of the time. But there are so many of them that there'll be an enormous number of false positives. Put that another way -- of all of them who test positive -- so here they are, the individuals involved -- less than one in 100 actually have the disease. So even though we think the test is accurate, the important part of the story is there's another bit of information we need.

12:39
Here's the key intuition. What we have to do, once we know the test is positive, is to weigh up the plausibility, or the likelihood, of two competing explanations. Each of those explanations has a likely bit and an unlikely bit. One explanation is that the person doesn't have the disease -- that's overwhelmingly likely, if you pick someone at random -- but the test gets it wrong, which is unlikely. The other explanation is that the person does have the disease -- that's unlikely -- but the test gets it right, which is likely. And the number we end up with -- that number which is a little bit less than one in 100 -- is to do with how likely one of those explanations is relative to the other. Each of them taken together is unlikely.

13:24
Here's a more topical example of exactly the same thing. Those of you in Britain will know about what's become rather a celebrated case of a woman called Sally Clark, who had two babies who died suddenly. And initially, it was thought that they died of what's known informally as "cot death," and more formally as "Sudden Infant Death Syndrome." For various reasons, she was later charged with murder. And at the trial, her trial, a very distinguished pediatrician gave evidence that the chance of two cot deaths, innocent deaths, in a family like hers -- which was professional and non-smoking -- was one in 73 million. To cut a long story short, she was convicted at the time. Later, and fairly recently, acquitted on appeal -- in fact, on the second appeal. And just to set it in context, you can imagine how awful it is for someone to have lost one child, and then two, if they're innocent, to be convicted of murdering them. To be put through the stress of the trial, convicted of murdering them -- and to spend time in a women's prison, where all the other prisoners think you killed your children -- is a really awful thing to happen to someone. And it happened in large part here because the expert got the statistics horribly wrong, in two different ways.

14:36
So where did he get the one in 73 million number? He looked at some research, which said the chance of one cot death in a family like Sally Clark's is about one in 8,500. So he said, "I'll assume that if you have one cot death in a family, the chance of a second child dying from cot death aren't changed." So that's what statisticians would call an assumption of independence. It's like saying, "If you toss a coin and get a head the first time, that won't affect the chance of getting a head the second time." So if you toss a coin twice, the chance of getting a head twice are a half -- that's the chance the first time -- times a half -- the chance a second time. So he said, "Here, I'll assume that these events are independent. When you multiply 8,500 together twice, you get about 73 million." And none of this was stated to the court as an assumption or presented to the jury that way. Unfortunately here -- and, really, regrettably -- first of all, in a situation like this you'd have to verify it empirically. And secondly, it's palpably false. There are lots and lots of things that we don't know about sudden infant deaths. It might well be that there are environmental factors that we're not aware of, and it's pretty likely to be the case that there are genetic factors we're not aware of. So if a family suffers from one cot death, you'd put them in a high-risk group. They've probably got these environmental risk factors and/or genetic risk factors we don't know about. And to argue, then, that the chance of a second death is as if you didn't know that information is really silly. It's worse than silly -- it's really bad science. Nonetheless, that's how it was presented, and at trial nobody even argued it. That's the first problem. The second problem is, what does the number of one in 73 million mean? So after Sally Clark was convicted -- you can imagine, it made rather a splash in the press -- one of the journalists from one of Britain's more reputable newspapers wrote that what the expert had said was, "The chance that she was innocent was one in 73 million." Now, that's a logical error. It's exactly the same logical error as the logical error of thinking that after the disease test, which is 99 percent accurate, the chance of having the disease is 99 percent. In the disease example, we had to bear in mind two things, one of which was the possibility that the test got it right or not. And the other one was the chance, a priori, that the person had the disease or not. It's exactly the same in this context. There are two things involved -- two parts to the explanation. We want to know how likely, or relatively how likely, two different explanations are. One of them is that Sally Clark was innocent -- which is, a priori, overwhelmingly likely -- most mothers don't kill their children. And the second part of the explanation is that she suffered an incredibly unlikely event. Not as unlikely as one in 73 million, but nonetheless rather unlikely. The other explanation is that she was guilty. Now, we probably think a priori that's unlikely. And we certainly should think in the context of a criminal trial that that's unlikely, because of the presumption of innocence. And then if she were trying to kill the children, she succeeded. So the chance that she's innocent isn't one in 73 million. We don't know what it is. It has to do with weighing up the strength of the other evidence against her and the statistical evidence. We know the children died. What matters is how likely or unlikely, relative to each other, the two explanations are. And they're both implausible. There's a situation where errors in statistics had really profound and really unfortunate consequences. In fact, there are two other women who were convicted on the basis of the evidence of this pediatrician, who have subsequently been released on appeal. Many cases were reviewed. And it's particularly topical because he's currently facing a disrepute charge at Britain's General Medical Council.

18:28
So just to conclude -- what are the take-home messages from this? Well, we know that randomness and uncertainty and chance are very much a part of our everyday life. It's also true -- and, although, you, as a collective, are very special in many ways, you're completely typical in not getting the examples I gave right. It's very well documented that people get things wrong. They make errors of logic in reasoning with uncertainty. We can cope with the subtleties of language brilliantly -- and there are interesting evolutionary questions about how we got here. We are not good at reasoning with uncertainty. That's an issue in our everyday lives. As you've heard from many of the talks, statistics underpins an enormous amount of research in science -- in social science, in medicine and indeed, quite a lot of industry. All of quality control, which has had a major impact on industrial processing, is underpinned by statistics. It's something we're bad at doing. At the very least, we should recognize that, and we tend not to. To go back to the legal context, at the Sally Clark trial all of the lawyers just accepted what the expert said. So if a pediatrician had come out and said to a jury, "I know how to build bridges. I've built one down the road. Please drive your car home over it," they would have said, "Well, pediatricians don't know how to build bridges. That's what engineers do." On the other hand, he came out and effectively said, or implied, "I know how to reason with uncertainty. I know how to do statistics." And everyone said, "Well, that's fine. He's an expert." So we need to understand where our competence is and isn't. Exactly the same kinds of issues arose in the early days of DNA profiling, when scientists, and lawyers and in some cases judges, routinely misrepresented evidence. Usually -- one hopes -- innocently, but misrepresented evidence. Forensic scientists said, "The chance that this guy's innocent is one in three million." Even if you believe the number, just like the 73 million to one, that's not what it meant. And there have been celebrated appeal cases in Britain and elsewhere because of that.

20:23
And just to finish in the context of the legal system. It's all very well to say, "Let's do our best to present the evidence." But more and more, in cases of DNA profiling -- this is another one -- we expect juries, who are ordinary people -- and it's documented they're very bad at this -- we expect juries to be able to cope with the sorts of reasoning that goes on. In other spheres of life, if people argued -- well, except possibly for politics -- but in other spheres of life, if people argued illogically, we'd say that's not a good thing. We sort of expect it of politicians and don't hope for much more. In the case of uncertainty, we get it wrong all the time -- and at the very least, we should be aware of that, and ideally, we might try and do something about it. Thanks very much.
Xiaofei Zhang, Translator
Zhu Jie, Reviewer

00:00
正如一些演讲者所说在这里的观众面前演讲是一次令人畏缩的经历--相当令人恐慌不过与其他演讲者不同我不会给大家讲宇宙的迷团也不会讲进化的奥妙抑或是人们用来对抗世界上主要的不平等现象的那些着实非常奇妙新颖的办法更不会讲现代全球经济下国家之间的挑战就像你们刚才听到的概括来说我讲的内容是统计学-- 更确切地说是一些统计学中很有趣的事情而这-- （笑） --相对所有在我之前以及之后的演讲者而言具有空前绝后的挑战性（笑）当我在统计学这个领域还是新人的时候一个资深同事相当自豪地告诉我统计学家是那些喜欢数字但性格上不适合做会计的人（笑）还有一个统计学的笑话 “怎样看出统计学家是内向还是外向呢？” 答案就是 “外向的统计学家会看别人的鞋” （笑）不过其实我想讲一些有用的--所以请注意今晚在学校的自然历史博物馆里有一个招待会希望你能发现这是一个绝妙的场合也是维多利亚优秀传统中的表现在这样的场合这样的人群中虽然有点不大可能但你也许仍然发现你在跟一些你并不想聊天的人交谈这时候你就可以这么做当他们问：“你的工作是？”--你就说：“我是统计学家” （笑）除非他们事先得到提醒知道这是你编的一般出现的情形都不过以下两种他们会突然在屋子另一角发现了失散多年的表亲然后赶去跟他们说话或者他们会突然很渴或者很饿--通常是饥渴交迫-- 然后奔向食物和饮料这是你就能一个人静下来跟你想聊天的人交谈

01:55
解释我们到底是做什么的是我们这个领域的一个挑战我们并不是晚宴的贵宾也不是理想的交谈对象对此我也一直没能找到什么好的解决办法但我的妻子--当时是我的女朋友在这件事上就比我出色的多多年前那时我们刚开始约会她在英国BBC工作而我当时在美国我回英国看她的时候她跟一个同事说起这事那个同事问：“你男朋友是做什么的？” 她苦苦思索着我刚才解释过的工作于是那段时间她一直是一个专心的倾听者（笑）别告诉她我跟说过这事她当时想我的工作是建立数模来加深对进化和现代基因学的了解所以当同事问：“他是干什么的？” 她就停顿一下然后说：“他做模型。” （笑）当然她的同事立即就对我产生了出乎我意料的兴趣并继续问：“他做什么模型？” 然后萨拉又想了想我的工作然后答：“基因。” （笑） “他建立基因模型。”

03:06
这就是我的初恋题外话了总的来说我要给大家讲一些不确定性、随机性和概率在生活中的影响我们对此的反应是怎样的以及我们了解他们的程度到现在为止大家听得都很轻松到现在为止都是听听笑笑现在大家要开始思考了我会提几个问题下面这个场景就是我开始问第一个问题想象连续掷硬币的情形由于某种原因--我就暂时不做过多的解释了-- 我们很喜欢某种特定的情形比如这个--正面、反面、正面

03:42
假设我们连续掷硬币然后我们设定这样一个情形正反反数着掷十次：一二三四五六七八九十然后看结果怎么样你可能觉得还有更有趣的事可以做不过这次先迁就我一下假设这半边观众都拿出硬币开始投掷直到他们看到正反反现象为止第一回投硬币也许十次以后才能看到第二回也许第四次就能看到再下一回也许比15次还多做过很多遍这个实验后将每遍的次数平均这就是我想让这半边思考的情况

04:18
那半边观众不喜欢正反反出于某些深刻的文化因素他们觉得这很无聊-- 他们跟更喜欢另一种情形--正反正所以这半边的观众拿出硬币反复投掷然后记下看到正反正情形出现时掷硬币的次数然后将所有的次数平均那么这半边的观众得出了一个平均数因为做了很多次所以这个数字是准确的就是正反反情形出现时投掷硬币次数的平均而这半边的观众大家也得出了一个数字--正反正情形的平均

04:46
那么就有了这样一个数学问题两个数之间只能有三种情形他们或者相等或者这个比那个大或者那个比这个大那么在我们这两种情形下这两个数相比会怎样呢大家来思考一下然后投个票现在给大家一些时间不过我不想因为给大家更多的时间思考直到每个人都立场明确而最后以两分钟沉默告终所以你们要做的只是比较这两种情形下平均数的大小

05:16
哪些认为A是对的-- 即平均来看出现正反正的情形要晚于正反反情形？哪些认为B是对的--即平均来看次数相同？哪些认为C是对的--即平均来看出现正反正情形的次数要少于正反反的情形？好谁没有投票？那真是很调皮--我说过你们要选择一个（笑）好的那么大多数人认为B是正确的也许当听到甚至非常优秀的数学家也是这么想的你会放下心来 B不正确答案是A 实际上平均起来正反正情形下掷硬币的次数是10次而正反反情形的次数是8次怎么会这样呢这两种情形有什么不同吗二者的确不同正反正情形会自我重叠如果你掷出正-反-正-反-正你能在这五次中看到两次正反正的情形而这在正反反的情形下无法实现这一点变得很重要

06:21
有两种方法可以来想这个问题我提供其中之一假设我们正在进行这个实验这半边观众--记住你们希望看到正反反而你们希望看到正反正我们开始投硬币第一次是正大家都开始暗自激动因为一个美妙绝伦的事情要发生了第二次是反--大家都很激动手边的香槟已经冰好大家都拿着杯子开始准备庆祝大家都屏气凝神观望最后一掷如果是正那么非常好你们完了而你们可以庆祝了如果这是反--那么有些遗憾你们要把杯子移开然后把香槟放回去接着掷硬币等着下一个正然后开始激动

07:00
而这半边则完全不同这个序列中前两步都是相同的大家因第一个是正有点兴奋当第二个是反的时候变得更加激动然后再掷硬币如果是反你们就可以打开香槟了如果是正你们会感到失望但你们仍旧已经完成了这个模式的三分之一这就是一种不大正式的解释--这就是出现不同的原因另外一种思考的方法就是-- 如果我们掷八百万次硬币我们可能会预计有一百万正反正情形和一百万次正反反情形的出现--但正反正的情形可能接连出现所以如果你想在八百万个位置中得到一百万个固定的模式可能会有一些是重叠的重叠的部分会很长这就是另外一种思考方法

07:45
那么这说明什么问题呢？这是一个非常简单的例子一个很简单明了的问题-- 有很多人跟你们一样--这个问题几乎没有人答对这是一个小小的题外话我很想讲的是基因学在基因学中正反正和正反反两种情形间存在某种联系这个联系是这样的掷硬币的时候你会得到一个正和反组成的序列而当观察DNA时会发现这不是两个元素组成的序列--正反正-- 而是四个字母--A G C T 有一些小小的化学剪刀叫做限制性内切酶当它们遇到特定的情形时就会剪断DNA 在现代分子生物学中它们是非常有用的工具在基因学中我们不问“什么时候能看到正反正的情形？” 你可以问比如说 “如果用限制性内切酶来剪断任何它遇到的GAAG排列剪下来的基因部分会有多大?” 那些基因部分会有多长？

08:35
这是概率和基因之间的一个相当细微的联系他们之间还有一个更深的联系这里我没有时间多讲那就是现代基因学是一个很令人激动的科学领域以后我们可能会在某些大会的演讲中听到这个部分但是若把现代实验技术中发现的秘密公开，关键就是那必须与一些相当复杂的-- 当听到我的工作是多有用的时候你们会倍感释然比正反正的试验要复杂地多-- 但是相当复杂的计算机建模数学建模以及现代统计技术我会举在牛津我们团队正在研究的项目中的两个小例子我认为这两个例子都很有趣大家都了解人类基因组计划那是一个项目目的在于构建人类基因组遗传图谱当完成那个项目后下一步自然是-- --就是这个计划国际人类基因组单体型图计划目前有五六个不同个国家的实验室在合作研究把人类基因遗传图谱看做是对我们共同点的了解而国际人类基因组单体型图计划就是试着了解人类之间的不同

09:43
为什么要这么关注这些呢？这有很多原因最紧迫的一个就是我们想了解其中一些不同是怎样让一些人容易患一种病的--比如说二型糖尿病-- 而另一些不同使人更容易得心脏病或中风自闭症等等其它病症这是一个宏大的项目最近英国威康信托基金会资助了一个项目其规模仅次于上一个项目它包括了很多大型的研究-- 成千上万的人各负责八种不同的疾病有一些比较常见的疾病比如一型糖尿病二型糖尿病和冠心病躁狂抑郁症等等--来试着了解基因着这了解那些导致疾病的基因的不同之处为什么我们想做这些呢？因为我们对大多数人类疾病都了解甚微我们不知道病因是什么如果我们从根本入手并了解基因这边开启了一个通向疾病病理的窗口也开辟了思考疾病治疗方法和预防措施的新路径所以就像我之前说过的那样这是我主要兴趣的一个小分支

10:44
回到一些关于随机性的平凡的问题上来这是给你们的另一个测试-- 现在假设我们拿到了一个疾病的检测这个检测并不是完全准确的但准确性很高这个检测的准确性高达99% 现在我让你们中的一个人或从街上拉来一个人然后检测他患病的几率假设这是一个艾滋病毒的测试--一个导致艾滋病的病毒-- 而测试表明这个人患病那么他患病的几率是多少呢这个测试准确性是99% 所以自然而然会得出99%这个答案谁喜欢这个答案？别这样--每个人都参与进来不要觉得你不再相信我了（笑）不过你们的怀疑是正确的因为这不是正确答案你们可能是这么想的这不是正确答案并不是因为这只是故事的一部分而实际上它取决于这种病是常见的还是罕见的现在我来试着说明一下这个图代表一百万人我们来考虑一种疾病的感染率-- 它非常罕见在一万人中仅一人患病在这一百万人中大部分人都是健康的而一些人会患病实际上如果这是疾病的流行程度那么约一百人会患病而其余人不会现在假设我们给所有人做了测试会出现什么情况呢在100个患有该疾病的人中这个测试会有99%的正确性所以99个人会检测出患病在那些没有患病的人中这个测试仍然有99%的正确率只有1%是错误的但是没有患病的人太多了所以错误的患病检测会非常多换种方法说-- 在所有结果是患病的检测中--就是这些人-- 真正患病的几率小于1% 所以即便我们认为这个测试是准确的这个例子重要的部分在于我们还需要一些信息

12:39
这就是关键当知道测试结果为患病时我们要做的就是权衡下面两种解释的概率或可能性每种解释都有一定的可能性一种解释是这个人不患病-- 这种可能性比较大如果你随机选人的话-- 但是测试结果错了这种情况很罕见另一种解释就是这个人不患病--这很少见-- 但测试结果正确这可能性很大而我们最后得到的数字-- 就是略少于100的数字-- 与这几种解释之间的关联性有关每个解释合起来都不大可能

13:24
这是另一个说明同样道理的例子更加切题在英国的听众知道这是一个很有名的案子一个女人叫做萨里•克拉克她有两个孩子都突然去世很自然人们以为这属于婴儿猝死更正式的说法是婴儿猝死综合征由于多种原因萨里后来以谋杀罪被逮捕在法庭上一个非常著名的小儿科医师作证两个婴儿猝死在一个像萨里的家里-- 有经验并不吸烟的--概率为七千三百万分之一长话短说她最后被判有罪后来最近她在上诉中无罪释放了当置于实际情境中大家就能想象一个人失去了一个孩子然后又失去了另一个然后又被诬为凶手这是多么可怕的事情要被迫承受审判的压力并判有罪-- 在女监里熬过一段日子那里所有的囚犯都认为是你杀了孩子--这件事发生在一个人身上真是太可怕了而这些事的发生很大程度上是因为那个专家得出的数据是错误的错误出在两方面

14:36
那么他是怎样得出七千三百万分之一这个数字的呢他看了一些研究那些研究上说一个家庭里一个婴儿猝死的概率就像萨里•克拉克家这概率是八千五百分之一所以他说：“我假设如果一个家庭中出现了一个婴儿猝死那么第二个婴儿发生猝死的概率也不会变。” 这被统计学家们称为独立事件这就像是在说：“如果你掷硬币第一次是正这并不会影响第二次投掷得到正的概率。” 所以如果你扔两次硬币第一次正的几率是二分之一第二次正的几率也是二分之一所以他说：“我们来假设假设这些事件是独立的当你将八千五百分之一相乘你就会得到七千三百分之一而上面这些并没有在法庭上向陪审团展示作为前提不幸的是--确实很令人遗憾-- 首先在这种情况下要先以经验判断第二这可能是错的我们对婴儿猝死综合症有太多不了解很可能有一些我们并不知道的环境因素也很可能是有一些我们并不了解的基因因素所以如果一个家庭出现一个婴儿猝死你就要把他们放到高概率组他们很可能有这些环境因素和/或基因因素而我们对这些并不知情而就像不知道上面得出的信息一样确定第二个死亡的概率是非常愚蠢的这比愚蠢还糟--这是坏科学但是这推论就这样呈现在法庭上而几乎没有人质疑这是第一个问题第二个问题是七千三百万分之一这个数字意味着什么在萨里•克拉克被定罪后-- 可以想象这在媒体中引起轩然大波-- 一个英国相当有名望的报社记者写到这个专家说 “她无罪的几率是七千三百万分之一” 这是一个逻辑上的错误这个错误相当于认为在准确率99%的疾病测试后患病的几率是99% 在疾病的例子中我们要注意两点一个是这个测试得出的可能性是否正确另一个就是这个人本身是否患病这个情形是完全相同的这个解释包括两个部分我们想知道这两种不同解释发生的可能性或相对的可能性一个是萨里•克拉克是清白的-- 也就是一个先验极为可能-- 大多母亲不会杀自己的孩子这个解释的第二部分就是她遭遇了一个可能性极小的时间不像七千三百万分之一那样小但也同样不可能另一个解释就是我们可能认为一个先验是不大可能然后我们当然应该认为在刑事审判的情形下这是不大可能的因为我们以无罪为前提如果她那时试着杀害孩子那么她成功了所以她无罪的机率并不是七千三百万分之一我们不知道这个个机率是多少这同衡量其它对她不利的证据和数据型证据有关我们知道孩子死了重要的是这两种解释相对发生的机率他们都令人难以置信在这种情形下错误的数据产生了很重大而且不幸的结果事实上还有其他两个女人因这个小儿科医师的作证而被定罪而她们在上诉中都被无罪释放了很多案子都因此而重审这引起了很高的关注因为他正面临着英国综合医学委员会的名誉调查

18:28
总结一下我们应该得到什么警示呢我们知道随机性、不确定性和概率在生活中影响重大并且大家作为一个集体在很多方面都很特别大家没有回答正确我给出的例子是完全正常并具有代表性的有很多人们理解错误的记录他们在不确定性方面犯逻辑错误我们可以很好地解决语言的细微差别还有有趣的进化方面的问题如我们是怎么来到这里的我们并不擅长不确定性这是我们生活中的一个问题像你们听过的很多演讲数据是很多科学研究中的基础--社会科学医学确实很多行业所有的质量控制这些对工业过程的影响极其重要这些都以数据为基础而这方面我们并不擅长至少我们应该意识到这一点并尽力防止错误发生回到法律方面在萨里•克拉克的案子中所有律师都接受了专家的证词如果一个小儿科医师出来对陪审团作证我不知道怎样建造桥梁我在路那边建了一个开车回家的时候请放心过桥他们会说小儿科医师不懂怎样建造桥梁那是工程师的工作而另一方面他们站出来说或暗示我知道怎样运用不确定性我知道怎样处理数据然后大家都说这没问题他是专家所以我们应该明白我们的什么是我们的强项什么不是完全相同类型的问题每天都出现在DNA的测绘中科学家律师有些情况下甚至法官都会错误地解释证据通常--大家希望--结果是无罪只是错误地解释了证据法庭上的科学家说这个人无罪的机率是三百万分之一即使你相信这个数据就像七千三百万分之一这也并不是它真正的含义因为这个在英国和其他地方有很多上诉案件

20:23
这就是在法律层面上我们要考虑的问题说“我们尽量给予证据更好的解释”固然很好但越来越的地在DNA测绘中--这也很重要-- 我们希望陪审团那些普通人-- 记录表明他们非常不擅此类-- 我们希望陪审团能够处理好这些推理在生活的其它方面如果人们在争辩的时候--当然也许不包括政治但是在生活的其他方面如果人们争辩地并不合逻辑我们认为这不是好现象在不确定性方面我们也从某种程度上对政客抱有希望但并不奢求什么我们一直都没对过至少我们应该认识到这一点并且希望我们能试着做什么去改变这一点谢谢大家
https://www.ted.com/talks/peter_donnelly_how_juries_are_fooled_by_statistics/transcript?referrer=playlist-our_brains_predictably_irrati&autoplay=true&subtitle=en

分享到 淘江湖新浪 QQ微博 QQ空间开心人人豆瓣网易微博百度鲜果白社会飞信

发帖回复

返回列表


	关闭您还没有登录，快捷通道只有在登录后才能使用。立即登录还没有帐号？赶紧注册一个


	关闭选中1篇全选

帖子

How juries are fooled by statistics陪审团是如何被数据愚弄的 [复制链接]