Lancet roundup and literature review

by Daniel on November 11, 2004

Well, the Lancet study has been out for a while now, and it seems as good a time as any to take stock of the state of the debate and wrap up a few comments which have hitherto been buried in comments threads. Lots of heavy lifting here has been done by Tim Lambert and Chris Lightfoot; I thoroughly recommend both posts, and while I’m recommending things, I also recommend a short statistics course as a useful way to spend one’s evenings (sorry); it really is satisfying to be able to take part in these debates as a participant and I would imagine, pretty embarrassing and frustrating not to be able to. As Tim Lambert commented, this study has been “like flypaper for innumerates”; people have been lining up to take a pop at it despite being manifestly not in possession of the baseline level of knowledge needed to understand what they’re talking about. (Being slightly more cynical, I suggested to Tim that it was more like “litmus paper for hacks”; it’s up to each individual to decide for themselves whether they think a particular argument is an innocent mistake or not). Below the fold, I summarise the various lines of criticism and whether they’re valid or (mostly) not.

Starting with what I will describe as “Hack critiques”, without prejudice that they might in isolated individual cases be innocent mistakes. These are arguments which are purely and simply wrong and should not be made because they are, quite simply, slanders on the integrity of the scientists who wrote the paper. I’ll start with the most widespread one.

The Kaplan “dartboard” confidence interval critique

I think I pretty much slaughtered this one in my original Lancet post, but it still spread; apparently not everybody reads CT (bastards). To recap; Fred Kaplan of Slate suggested that because the confidence interval was very wide, the Lancet paper was worthless and we should believe something else like the IBC total.

This argument is wrong for three reasons.

1)The confidence interval describes a range of values which are “consistent” with the model[1]. But it doesn’t mean that all values within the confidence interval are equally likely, so you can just pick one. In particular, the most likely values are the ones in the centre of a symmetrical confidence interval. The single most likely value is, in fact, the central estimate of 98,000 excess deaths. Furthermore, as I pointed out in my original CT post, the truly shocking thing is that, wide as the confidence interval is, it does not include zero. You would expect to get a sample like this fewer than 2.5 times out of a hundred if the true number of excess deaths was less than zero (that is, if the war had made things better rather than worse).

2)As the authors themselves pointed out in correspondence with the management of Lenin’s Tomb,

“Research is more than summarizing data, it is also interpretation. If we had just visited the 32 neighborhoods without Falluja and did not look at the data or think about them, we would have reported 98,000 deaths, and said the measure was so imprecise that there was a 2.5% chance that there had been less than 8,000 deaths, a 10% chance that there had been less than about 45,000 deaths,….all of those assumptions that go with normal distributions. But we had two other pieces of information. First, violence accounted for only 2% of deaths before the war and was the main cause of death after the invasion. That is something new, consistent with the dramatic rise in mortality and reduces the likelihood that the true number was at the lower end of the confidence range. Secondly, there is the Falluja data, which imply that there are pockets of Anbar, or other communities like Falluja, experiencing intense conflict, that have far more deaths than the rest of the country. We set that aside these data in statistical analysis because the result in this cluster was such an outlier, but it tells us that the true death toll is far more likely to be on the high-side of our point estimate than on the low side.”

That is, the sample contains important information which is not summarised in the confidence interval, but which tells you that the central estimate is not likely to be a massive overestimate. The idea that the central 98,000 number might be an underestimate seemed to have blown the mind of a lot of commentators; they all just seemed to act like it Did Not Compute.

3. This gave rise to what might be called the use of “asymmetric rhetoric about a symmetric confidence interval”, but which I will give the more catchy name of “Kaplan’s Fallacy”. If your critique of an estimate is that the range is too wide, then that is one critique you can make. However, if this is all you are saying (“this isn’t an estimate, it’s a dartboard”), then intellectual honesty demands that you refer to the whole range when using this critique, not just the half of it that you want to think about. In other words, it is dishonest to title your essay “100,000 dead – or 8,000?” when all you actually have arguments to support is “100,000 dead – or 8,000 – or 194,000?”. This is actually quite a common way to mislead with statistics; say in paragraph 1 “it could be more, it could be less” and then talk for the rest of the piece as if you’ve established “it’s probably less”.

The Kaplan piece was really very bad; as well as the confidence interval fallacy, there are the germs of several of the other fallacious arguments discussed below. It really looks to me as if Kaplan had decided he didn’t want to believe the Lancet number and so started looking around for ways to rubbish it, in the erroneous belief that this would make him look hard-headed and scientific and would add credibility to his endorsement of the IBC number. I would hazard a guess that anyone looking for more Real Problems For The Left would do well to lift their head up from the Bible for a few seconds and ponder what strange misplaced and hypertrophied sense of intellectual charity it was that made Kaplan, an antiwar Democrat, decide to engage in hackish critiques of a piece of good science that supported his point of view.

The cluster sampling critique

There are shreds of this in the Kaplan article, but it reached its fullest and most widely-cited form in a version by Shannon Love on the Chicago Boyz website. The idea here is that the cluster sampling methodology used by the Lancet team (for reasons of economy, and of reducing the very significant personal risks for the field team) reduces the power of the statistical tests and makes the results harder to interpret. It was backed up (wayyyyy down in comments threads) by people who had gained access to a textbook on survey design; most good textbooks on the subject do indeed suggest that it is not a good idea to use cluster sampling when one is trying to measure rare effects (like violent death) in a population which has been exposed to heterogeneous risks of those rare events (ie; some places were bombed a lot, some a little and some not at all).

There are two big problems with the cluster sampling critique, and I think that they are both so serious that this argument is now a true litmus test for hacks; anyone repeating it either does not understand what they are saying (in which case they shouldn’t be making the critique) or does understand cluster sampling and thus knows that the argument is fallacious. The problems are:

1)Although sampling textbooks warn against the cluster methodology in cases like this, they are very clear about the fact that the reason why it is risky is that it carries a very significant danger of underestimating the rare effects, not overestimating them. This can be seen with a simple intuitive illustration; imagine that you have been given the job of checking out a suspected minefield by throwing rocks into it.

This is roughly equivalent to cluster sampling a heterogeneous population; the dangerous bits are a fairly small proportion of the total field, and they’re clumped together (the mines). Furthermore, the stones that you’re throwing (your “clusters”) only sample a small bit of the field at a time. The larger each individual stone, the better, obviously, but equally obviously it’s the number of stones that you have that is really going to drive the precision of your estimate, not their size. So, let’s say that you chuck 33 stones into the field. There are three things that could happen:

a)By bad luck, all of your stones could land in the spaces between mines. This would cause you to conclude that the field was safer than it actually was.
b)By good luck, you could get a situation where most of your stones fell in the spaces between mines, but some of them hit mines. This would give you an estimate that was about right regarding the danger of the field.
c)By extraordinary chance, every single one of your stones (or a large proportion of them) might chance to hit mines, causing you to conclude that the field was much more dangerous than it actually was.

How likely is the third of these possibilities (analogous to an overestimate of the excess deaths) relative to the other two? Not very likely at all. Cluster sampling tends to underestimate rare effects, not overestimate them[2].

And 2), this problem, and other issues with cluster sampling (basically, it reduces your effective sample size to something closer to the number of clusters than the number of individuals sampled) are dealt with at length in the sampling literature. Cluster sampling ain’t ideal, but needs must and it is frequently used in bog-standard epidemiological surveys outside war zones. The effects of clustering on standard results of sampling theory are known, and there are standard pieces of software that can be used to adjust (widen) one’s confidence interval to take account of these design effects. The Lancet team used one of these procedures, which is why their confidence intervals are so wide (although, to repeat, not wide enough to include zero). I have not seen anybody making the clustering critique who as any argument at all from theory or data which might give a reason to believe that the normal procedures are wrong for use in this case. As Richard Garfield, one of the authors, said in a press interview, epidemics are often pretty heterogeneously distributed too.

There is a variant of this critique which is darkly hinted at by both Kaplan and Love, but neither of them appears to have the nerve to say it in so many words[3]. This would be the critique that there is something much nastier about the sample; that it is not a random sample, but is cherry-picked in some way. In order to believe this, if you have read the paper, you have to be prepared to accuse the authors of telling a disgusting barefaced lie, and presumably to accept the legal consequences of doing so. They picked the clusters by the use of random numbers selected from a GPS grid. In the few cases in which this was logistically difficult (read: insanely dangerous), they picked locations off a map and walked to the nearest household). There is no realistic way in which a critique of this sort can get off the ground; in any case, it affected only a small minority of clusters.

The argument from the UNICEF infant mortality figures

I think that the source for this is Heiko Gerhauser, in various weblog comments threads, but again it can be traced back to a slightly different argument about death rates in the Kaplan piece. The idea here is that the Lancet study finds a prewar infant mortality rate of 29 per 1000 live births and a postwar infant mortality rate of 54 per 1000 live births. Since the prewar infant mortality rate was estimated by UNICEF to be over 100, this (it is argued) suggests that the study is giving junk numbers and all of its conclusions should be rejected.

This argument was difficult to track down to its lair, but I think we have managed it. One weakness is similar to the point I’ve made above; if you believe that the study has structurally underestimated infant mortality, then isn’t it also likely to have underestimated adult mortality? The authors discuss a few reasons why the movement in infant mortality might be exaggerated (mainly, issues of poor recall by the interview subjects), though, and it is good form to look very closely at any anomalies in data.

Which is what Chris Lightfoot did.

Basically, the UNICEF estimate is quoted as a 2002 number, but it is actually based on detailed, comprehensive, on-the-ground work carried out between 1995 and 1999 and extrapolated forward. The method of extrapolation is not one which would take into account the fact that 1999 was the year in which the oil-for-food program began to have significant effects on child malnutrition in Iraq. No detailed on-the-ground survey has been carried out since 1999, and there is certainly no systematic data-gathering apparatus in Iraq which could give any more solid number. The authors of the study believe that the infant mortality rates in neighbouring countries are a better comparator than pre-oil for food Iraq, and since one of them is Richard Garfield, who was acknowledged as the pre-eminent expert on sanctions-related child deaths in the 1990s, there is no reason to gainsay them.

I’d add to Chris’ work a theory of my own, based on the cluster sampling issue discussed above. Infant mortality is rare, and it is quite possibly heterogeneously clustered in Iraq (not least, post-war, a part of the infant mortality was attributed to babies being born at home because it was too dangerous to go to hospital). So it’s not necessarily the case that one needs to have an explanation of why they might have been undersampled in this case. Since this undersampling would tend to underestimate infant mortality both before and after the war, it wouldn’t necessarily bias the estimate of the relative risk ratio and therefore the excess deaths. I’d note that my theory and Chris’s aren’t mutually exclusive; I suspect that his is the main explanation.

We now move into the area of what might be called “not intrinsically hack” critiques. These are issues which one could raise with respect to the study which are not based on either definite or likely falsehoods, but which do not impugn the integrity of the study, and which are not themselves based on evidence strong enough to make anyone believe that the study’s estimates were wrong unless they thought so anyway.

There are two of these that I’ve seen around and about.

The first might be called the “Lying Iraqis” theory. This would be the theory that the interview subjects systematically lied to the survey team. In fact, the team did attempt to check against death certificates in a subsample of the interviews and found that in 81% of cases, subjects could produce them. This would lead me to believe that there is no real reason to suppose that the subjects were lying. Furthermore, I would suspect that if the Iraqis hate us enough to invent deaths of household members to make us look bad in the Lancet, that’s probably a fairly serious problem too. However, the possibility of lying subjects can’t be ruled out in any survey, so it can’t be ruled out in this one, so this critique is not intrinsically hackish. Any attempt to bolster it either with an attack on the integrity of the researchers, or with a suggestion that the researchers mainly interviewed “the resistance” (they didn’t), however, is hack city.

The second, which I haven’t really seen anyone adopt yet, although some people looked like they might, could be called the “Outlier theory”. This is basically the theory that this survey is one gigantic outlier, and that a 2.5% probability event has happened. This would be a fair enough thing to believe, as long as one admitted that one was believing in something quite unlikely, and as long as it wasn’t combined with an attack on the integrity of the Lancet team.

Finally, we come onto two critiques of the study which I would say are valid. The first is the one that I made myself in the original CT post; that the extrapolated number of 98,000 is a poor way to summarise the results of the analysis. I think that the simple fact that we can say with 97.5% confidence that the war has made things worse rather than better is just as powerful and doesn’t commit one to the really quite strong assumptions one would need to make for the extrapolation to be valid.

The second one is one that is attributable to the editors of the Lancet rather than the authors of the study. The Lancet’s editorial comment on the study contained the phrase “100,000 civilian deaths”. The study itself counts excess deaths and does not attempt to classify them as combatants or civilians. The Lancet editors should not have done this, and their denial that they did it to sensationalise the claim ahead of the US elections is unconvincing. This does not, however, affect the science; to claim that it does is the purest imaginable example of argumentum ad hominem

Finally, beyond the ultra-violet spectrum of critiques are those which I would classify as “beyond hackish”. These are things which anyone who gave them a moment’s thought would realise are irrelevant to the issue.

In this category, but surprisingly and disappointingly common in online critiques, is the attempt to use the IBC numbers as a stick to beat the Lancet study. The two studies are simply not comparable. One final time; the Iraq Body Count is a passive reporting system[4], which aims to count civilian deaths as a result of violence. Of course it is going to be lower than the Lancet number. Let that please be an end of this.

And there are a number of odds and ends around the web of the sort “each death in this study is being taken to stand for XXYY deaths and that is ridiculous”. In other words, arguments which, if true, would imply that there could be no valid form of epidemiology, econometrics, opinion polling, or indeed pulling up a few spuds to see if your allotment has blight. This truly is flypaper for innumerates.

I would also include in this category attempts like that of the Obsidian Order weblog to chaw down the 98,000 number by making more or less arbitrary assumptions about what proportion of the excess deaths one might be able to call “combatants” and thus people who deserved to die. This is exactly what people accuse the Lancet of doing; it’s skewing a number by means of your own subjective assessment. Not only is there no objective basis for the actual subjective adjustments that people make, but the entire distinction between combatants and civilians is one which does not exist in nature. As a reason for not caring that 98,000 people might have died, because you think most of them were Islamofascists, it just about passes muster. As a criticism of the 98,000 figure, it’s wretched.

Finally, there is the strange world of Michael Fumento, a man who is such a grandiose and unselfconscious hack that he brings a kind of grandeur to the role. I can no more summarise what a class A fool he’s made of himself in these short paragraphs than I could summarise King Lear. Read the posts on Tim’s site and marvel. And if your name is Jamie Doward of the Guardian, have a word with yourself; not only are you citing blogs rather than reading the paper, you’re treating Flack Central Station as a reliable source!

The bottom line is that the Lancet study was a good piece of science, and anyone who says otherwise is lying. Its results (and in particular, its central 98,000 estimate) are not the last word on the subject, but then nothing is in statistics. There is a very real issue here, and any pro-war person who thinks that we went to war to save the Iraqis ought to be thinking very hard about whether we made things worse rather than better (see this from Marc Mulholland, and a very honourable mention for the Economist). It is notable how very few people who have rubbished the Lancet study have shown the slightest interest in getting any more accurate estimates; often you learn a lot about people from observing the way that they protect themselves from news they suspect will disconcert them.

Footnotes:
[1]This is not the place for a discussion of Bayesian versus frequentist statistics. Stats teachers will tell you that it is a fallacy and wrong to interpret a confidence interval as meaning that “there is a 95% chance that the true value lies in this range”. However, I would say with 95% confidence that a randomly selected stats teacher would not be able to give you a single example of a case in which someone made a serious practical mistake as a result of this “fallacy”, so I say think about it this way.
[2]Pedants would perhaps object that the more common mines are in the field, the less the tendency to underestimate. Yes, but a) by the time you got to a stage where an overestimate became seriously likely, you would be talking not about a minefield, but a storage yard for mines with a few patches of grass in it and b) we happen to know that violent death in Iraq is still the exception rather than the norm, so this quibble is irrelevant.
[3]And quite rightly so; if said in so many words, this accusation would clearly be defamatory.
[4]That is, they don’t go out looking for deaths like the Lancet did; they wait for someone to report them. Whatever you think about whether there is saturation media coverage of Iraq (personally, I think there is saturation coverage of the green zone of Baghdad and precious little else), this is obviously going to be a lower bound rather than a central estimate, and in the absence of any hard evidence about casualties there is no reason at all to suppose that we have any basis other than convenient subjective air-pulling to adjust the IBC count for how much of an undersample we might want to believe they are making.

{ 3 trackbacks }

Deltoid » Shannon Love and Andy S take swipes at the Lancet study
09.15.05 at 1:36 pm
Deltoid » Hitchens’ crazed fabrication
09.16.05 at 7:31 am
Deltoid » Innumerate BBC
11.08.05 at 4:41 am

{ 110 comments }

1

kevin donoghue 11.12.04 at 12:07 am

Many thanks. I suspect the study would have been ignored rather than slimed if it had been published after the election all the more reason for publishing it early and ensuring it got some attention). Was there supposed to be a link in “this from Marc Mulholland” ? Once again, thanks; you’re doing useful work here. Fuck the begrudgers.

2

dsquared 11.12.04 at 12:13 am

yes there was and I’ve put it in now, thanks very much. To be honest, the editor of the Lancet leaves a slightly funny taste in my mouth. By being so blatantly political, he gives a lot of succour to the people who are claiming that there were defects in the refereeing process, a point which I haven’t covered here because it would only matter if there were flaws in the actual paper, which there aren’t. (Some people, mainly people with no real idea about science, seem to regard “peer review” as magic oofle dust that has to be sprinkled on a piece of research to turn the words into Science). I agree with the Lancet guy that violent death is a genuine public health problem in Iraq and therefore the factors which cause it are a proper object of study for the Lancet, but, in any case, it was too late in the campaign to have ever made it as a real campaign issue, so why bother? God I feel like Fred Kaplan after all that.

3

washerdreyer 11.12.04 at 12:19 am

I expect to see a comment, other than this one, which ignores everything else in the main post and criticizes you for being a “moral relativist” due to this statement, “…the entire distinction between combatants and civilians is one which does not exist in nature.”

4

George 11.12.04 at 12:28 am

You can count me among the innumerate (no pun intended), so my comment can immediately be discounted. However, I would be interested to know in which class of hackery you would put it. According to the Guardian article, the study was carried out in “33 randomly-chosen neighbourhoods of Iraq representative of the entire population.” How much confidence do they have about this? In a war zone, and in a country closed to the outside world by sanctions for a dozen years, how accurately can one estimate demographic variables such as population density, total population, employment, illness, etc. Employment and illness might seem to be irrelevant, since the study is measuring violent death, but maybe if unemployment is high in one town, there are more people at home when the bombs fall. Maybe in one region, people are more likely to go to the hospital when they are sick. Seems like small variations in these variables could result in significant differences in the confidence interval. But what do I know, I’m not a statistician. Yet let me say this: many people found this study bogus for the simple reason that extrapolating large numbers from small numbers seems to almost always be pure guesswork. The analogy that first occurred to me was crowd size estimates: the people who put on the rally estimate that 100,000 people were there, while the cops put it at closer to 20,000. Presumably they both have some rational basis for their estimate, but the results are so divergent that there seems to be no science involved at all. Anyway, those are my thoughts on why people seem to have such a hard time with this, rather than (in most cases) an unwillingness to admit that the war could have killed people.

5

Gyan 11.12.04 at 12:29 am

About that statistics course, try here.

6

dsquared 11.12.04 at 12:33 am

George; the procedure for ensuring a random and representative sample is detailed in the paper and it’s not hard to understand – it’s nuts and bolts stuff rather than anything specifically mathematical. It looks good to me and I haven’t seen anyone seriously criticising it. I don’t agree that this is like estimating crowd sizes. As I say, I don’t like the extrapolated number, but the relative risk ratio is solid, and that’s the big thing for me. It would be very difficult to get this result from a random sample if the death rate had not risen.

7

Hansomdevil 11.12.04 at 12:37 am

If people were honest supporters of improving Iraq they would look at these numbers and try to come up with some solutions to improve the situation instead of insisting that everything is a-ok. At the very least some introspection is called for. As Daniel says, the exact number isn’t as important as the fact that things are much worse now than they were before.

8

anon 11.12.04 at 12:55 am

Hi, I was curious about something. Is the “cluster sampling method” anything like a Monte Carlo technique? I am glad you have taken the time to debunk the hack critiques that many of us have seen. Personally, if a journal like Lancet publishes a study of this nature, I would be very careful in critiquing and rubbishing it.

9

Clayton 11.12.04 at 1:08 am

This was really useful, thanks. Now if only I could get my students to actually read news articles referencing the Lancet study so they can begin the inevitable process of trying in vain to rubbish it. I’m (almost) envious of those of you who live amongst the rubbishers, at least they care enough to engage in self-serving and fallacious reasoning.

10

Donald Johnson 11.12.04 at 1:17 am

First let me take a minute or two for some wild enthusiastic cheering. Good job DD. It more than makes up for my disappointment when you attacked Steve Landsburg not on his libertarian sins, but for that paper on quantum game theory. Okay, two comments. First, why can’t major news organizations hire people like you to explain studies like this? (Consider that to be part of the cheering). Second, why can’t they manage to spend more than thirty seconds mentioning them? This story was a one or day wonder in the American press—one story, usually containing some dismissive remarks, and then that’s all the time we had for investigating the utterly irrelevant story about whether we’ve made things worse in Iraq.

11

Sean Lynch 11.12.04 at 1:19 am

Several of the external links in this article have spaces around the href attribute, inside the quotes. Most web browsers just strip them out, but they confuse our news aggregator and probably many others.

12

Kieran Healy 11.12.04 at 2:08 am

Yet let me say this: many people found this study bogus for the simple reason that extrapolating large numbers from small numbers seems to almost always be pure guesswork. George, this is the mere random sample objection, and it’s not well-founded. More generally, I’d agree with Daniel that most of the objections to the study either misunderstand the methods or just assert that because it doesn’t conform to an ideal design it can therefore be dismissed out of hand as garbage. This kind of objection is common in the first year of graduate social science programs, where people who have yet to do any empirical research find it very, very easy to trash the work of others who have. Later they tend to realise that the available methods really are very powerful, all things considered, and that even a modest empirical paper takes a lot of work to pull off. The fact that these guys did work like this under the circumstances they faced is really remarkable. Of course it’s not the last word on the subject, but it’s a real contribution with a striking finding. To write it off with malformed (or stock) objections is sheer ideology.

13

Donald Johnson 11.12.04 at 2:33 am

Browsing around, I found there was a poll in Iraq commissioned by the American Enterprise Institute which came up with interesting results, reported or misreported by Karl Zinsmeister in the September 10 2003 Wall Street Journal opinion page online at the following location— http://www.opinionjournal.com/editorial/feature.html?id=110003991 Anyway, according to Karl, half of the Iraqis knew someone killed by Saddam’s security forces (though someone else at the CASI website said that Karl was misrepresenting the poll question—-50 percent said they knew someone who either died in one of Saddam’s wars or were murdered by Saddam’s security forces). Karl goes on to say less than 30 percent (which presumably means nearly 30 percent) knew someone who died in the spring fighting. Can you tell anything from relative death rates from this kind of thing? I fell back on my Poisson distribution for dummies approach, assuming that each person in Iraq knows the same number of people and setting either 0.7 (for the invasion) or 0.5 (for the Saddam years) equal to the probability of nobody dying in a given group of known people, and if I did it right you find that the number of people who died violently under Saddam is only twice the invasion number, which seems awfully high for the invasion figure, unless the Saddam total has been inflated. I’m not clever enough to come up with alternative models that might be more realistic and don’t know if the Poisson distribution approach makes sense. My perhaps incorrect approach aside, I’m surprised that only 50 percent of Iraqis knew someone either killed in the wars or murdered by Saddam. If you took the 300,000 figure for the murders and added an equal number for deaths for war casualties, that’s 600,000 out of 24 million (these days—the population has grown, presumably). The NYC metro area has around 15-20 million, I think and out of the 3000 who died on 9/11 I didn’t know anyone, though I knew people who knew people who died. But I think that if, say, 400,000 died I’d have known quite a few and probably most people around here would have known a victim. The 50 percent number might make sense if most of the victims came from a subset of the population that didn’t mingle with the rest, I’m guessing, or if most people in Iraq don’t know very many people. (Which seems silly). I throw these amateurish efforts out there in hopes that someone (ahem) who actually understands statistics would be willing to say what, if anything, can be learned from this earlier poll.

14

wcw 11.12.04 at 2:36 am

A reasonable piece about an unreasonable subject, though rather than “flypaper” I tend to see it as just another episode of “when innumerates attack”. Fumento has been fun, though. I shall admit to writing an insulting letter to the editor in response to his column being syndicated by my local paper. One guess what happened when I CCd him.

15

George 11.12.04 at 4:41 am

dquared and kieran: thanks. I will have to take your word for it, though. If a statistics degree is necessary for one to swallow the claim that 61 deaths can be extrapolated to hundreds of thousands, you guys in the reality-based community have got a problem with the rest of us.

16

Mike 11.12.04 at 6:08 am

Add another critique to the study – the model the researchers fit allowed for an increase (and only an increase, based on their write-up – they never explicitly write down their model) in the death rate after the invasion. Thus the lowest estimate for excess death they could come up with would be 0 – no chance for a decrease in deaths. I don’t have enough experience with survival analysis to know about how this will effect the size of confidence bands, but it is suspect. It’s also another example of the authors’ bias.

17

wcw 11.12.04 at 6:28 am

Never heard of negative coefficients?

18

dsquared 11.12.04 at 6:29 am

Mike, that’s simply not true. They did explain what their model was and it did allow for a decrease. They calculated a relative risk ratio associated with the intervention (that’s their model) and if that risk ratio had been less than 1 (it wasn’t) then this would have corresponded to a falling death rate. They in fact did observe a falling death rate in the Kurdish provinces and reported it.

19

Paul Orwin 11.12.04 at 6:41 am

Kieran brings up an interesting point, and not one limited to the social sciences. In my chosen field (microbiology) I found similar patterns, and certainly participated in them. In the first year of grad school, every paper you read is crap, and any moron could do it better. By the fifth year, you start to see how hard it is to do a really great piece of research, and you learn not to throw out too many babies with the bathwater. Maybe if all the whiners and criers (yeah, you Fumento!) would actually spend some time doing research, and learning about what they are critiquing, they might have a bit more perspective. Admittedly, however, the ones who get paid to publish this drivel probably have a leg up on us poor oppressed academics!

20

Brian Weatherson 11.12.04 at 6:46 am

I think George is in for a shock when he finds out how economic indicators here in America (e.g. unemployment) are calculated, since the extrapolations from data are just as dramatic. But it should be obvious this kind of thing is basically sound. We regard it as a news story when opinion polls (which extrapolate from 1000 voters to 100 million) are off by 3 or 4%. If extrapolation was as bad as people are claiming here, getting within 10% would be miraculous, which it pretty clearly isn’t. If this study is off by 3 or 4% in its mean figures, or even 10 or 15%, that means the Iraq war has led (so far at least) to waaaay more deaths than it has prevented.

21

Shannon Love 11.12.04 at 6:58 am

dsquared, You might due your readers a service by providing the actual numbers on infant mortality in order to show just how badly off the Lancet study is. The Lancet study measured an infant mortality rate in the 18 months prior to the war (Oct 2001-Mar 2003) of 29/1000. UNICEF reported the 2002 rate at 102/1000. Richard Garfields study to which you link reports (p7-8) that: “In the context of an increase in the number of total visits to hospitals and a rapid decline in malnutrition since 2000, it is reasonable to assume that there has indeed been a decline from the very high levels of child mortality in the late 1990s. The decline from a high rate of 131/1,000 among under five-year-olds during 1995-1999 may well be to a rate of 90–100/1,000.” Since under five mortality and infant mortality track it reasonable to assume that the infant mortality dropped from in similar ranges which would put Garfield in agreement with UNICEF. There is absolutely no way that the 2002 Iraqi infant mortality rate was 29/1000. That would be the best rate ever recorded in Iraq. We have to conclude either the Lancet study or the all the other pre-war studies miss-measured the pre-war infant mortality rate. Since the Lancet study is the odd man out, Occam razor says that it’s the flawed study. The really sick thing about all this is that the same people who now trumpet the Lancet study and the very low measure of pre-war infant mortality are the very same people who trumpeted the other studies high rates. The only commonality here is those individuals opposition to US policy in Iraq. They portray pre-war Iraq either as a charnel house or as eden depending on what gives the best advantage of the moment. Dead children don’t matter, only screwing over their internal political enemies does.

22

Gary Farber 11.12.04 at 7:07 am

“…people have been lining up to take a pop at it despite being manifestly not in possession of the baseline level of knowledge needed to understand what they’re talking about.” Gosh, that makes it like most blog comments, alas. It frequently drives me crazy, to be sure. To take a random pop, the stuff early in this post about “mines” makes me wonder the relevance, given that an anti-tank mine is set off by magnetic field, or weight, and “tossing a rock” is irrelevant. So this, for one, seems to be evoking some sort of unreal idea of a “mine,” that then, when an example purporting to represent the real world, uses such a non-existent sense of a “mine,” makes me think whatever proceeds makes no sense, given the premise on a non-existent device in reality. Anti-personnel mines are even trickier in their variability, but given their imaginary use in this theoretical example, I’m uninterested in further pursuing the fantasy of an accurate conclusion. Any argument based upon things that don’t exist, though people popularly imagine them, is suspect, in my book.

23

Martin Wisse 11.12.04 at 7:07 am

The thing is, support for the Iraq invasion has become an issue of religion with many of its supporters, especially with the socalled leftists who support it, that everytime negative news about it is published it has to be denied. Which is why you have the increasingly desperate attempts to rubbish it. Then there are the professionals, who know their objections are false but who don’t care as long as they can mislead others into thinking the Lancet piece was rubbish.

24

dsquared 11.12.04 at 7:55 am

Shannon, this objection is dealt with in my piece and (with much greater thoroughness) at Chris Lightfoot’s site. UNICEF did not “report” the 2002 rate as anything. They extrapolated a 2002 rate based on the 1999 research. And from then on in, you are citing Richard Garfield as an authority in an argument against Richard Garfield. You’re also exaggerating the extent to which a mortality rate of 29 is an outlier; given the very wide confidence intervals on the estimation process as a whole (and the substantial bias toward underestimation, which, as a “service” to your own readers you have consistently failed to acknowledge), it strikes me that a 1999 rate of about 100, a guesstimated fall of 40% and a confidence interval of +/- 20 would include the sampled figure of 29 at its lower end. I see that you are no longer defending the clustering critique, for which thanks, but you seem to now be advancing a much less intellectually respectable critique that “Occam’s Razor” tells us that there are strange murky flaws in the methodology which you can’t identify but nevertheless insist are there. For which, not thanks. I see you are also continuing to attribute the worst of motives to people who produce results that you disagree with; you are presumably doing this because you are judging the behaviour of others by your own standards.

25

dsquared 11.12.04 at 8:02 am

Gary: I define a “mine” in the simplest way possible; as “a stochastic finite state automaton defined on the integer field with mutually exclusive states sampled conditionally on a matrix M”. Doesn’t everybody? The definition of a “rock” is a bit more technical …

26

nic 11.12.04 at 8:08 am

But it should be obvious this kind of thing is basically sound. We regard it as a news story when opinion polls (which extrapolate from 1000 voters to 100 million) are off by 3 or 4% Exactly, and you never see people attempting to rubbish polls especially when the numbers give pretexts for other ideological conclusions. Last year’s EU survey about European opinions on the war in Iraq and about Israel was a good example of that. Or the “moral values” election poll. Everyone has draw all sorts of conclusions but I haven’t seen anyone disputing actual numbers. People do think there is a lot of coverage about Iraq because there’s always something in the news about it, but how much of it has been about civilian deaths? So, because they don’t hear or see it on tv, it must not exist. Civilian deaths in Iraq, oh, it must be some foggy concept that can be easily put aside. (Possibly with “anyway, Saddam would have killed more” – now that’s a scientific method the Lancet authors should learn something from!). The combination of attacks on Lancet is also so self-contradictory. I remember the IBC website was considered as reliable as Indymedia and now it’s become more authoritative than a peer-reviewed medical journal. I don’t even understand the argument about UNICEF infant mortality rates presumably proving the Lancet study is all wrong. How can the criticism simultaneously be that Lancet underestimates those rates therefore it must overestimate the war casualties?

27

John Quiggin 11.12.04 at 8:11 am

Another way of looking at the opinion poll example is that a 5 per cent swing (50 people in a sample of 1000) in a well-conducted survey is (correctly) taken as strong evidence that somewhere between 2 million and 8 million people in an electorate of 100 million will change their votes.

28

stuart 11.12.04 at 9:50 am

For me the big negative about the Lancet survey is they had to do it in the first place – the Geneva Convention insists that occupying forces keeps track of these figures, of course there is little chance of the US doing this – rather they just assume that everyone blow up by a bomb was probably an insurgent anyway.

29

dsquared 11.12.04 at 10:00 am

Donald asked: Anyway, according to Karl, half of the Iraqis knew someone killed by Saddam’s security forces (though someone else at the CASI website said that Karl was misrepresenting the poll question—-50 percent said they knew someone who either died in one of Saddam’s wars or were murdered by Saddam’s security forces). Karl goes on to say less than 30 percent (which presumably means nearly 30 percent) knew someone who died in the spring fighting. Can you tell anything from relative death rates from this kind of thing? I think the answer would be “maybe, but not in any straightforward way”. Saddam was in power for twenty years, and his murders were concentrated within that period to 1985-90 (Kurds) and 1991 (Shia). The coalition, on the other hand, has only been in Iraq for eighteen months. So I think it would be difficult to straighten out the time periods and get an apples-to-apples comparison. I’m not saying it couldn’t be done, but I suspect you would have to make some truly heroic assumptions.

30

Chris Lightfoot 11.12.04 at 10:17 am

Oh, one other brief comment. Chris Williams pointed me to a poll done by the International Republican Institute which found (see the “Power Point” presentation linked at the bottom of that page) that 22% of Iraqis answered “yes” to the question, “In the past year and a half, has your household been directly affected by violence in terms of death, handicap, or significant monetary loss? (Close family member, up to 4th degree)” Now, suppose that there are ten people in an average “close” family group; then these figures imply that about 2% of individuals have been “directly affected by violence” (because each person who is killed or injured or whatever is known to about ten others). That gives a raw number of about 600,000 victims; we can’t, of course, tell how many of those were killed, how many injured and so forth (or who did the killing and injuring). But the important point about this is that, whatever the difficulties of conducting an opinion poll in Iraq and the imprecision of the number which comes out, it gives an independent estimate of the number of casualties—and that estimate is of about the same magnitude as Roberts et al.’s estimate.

31

Jack 11.12.04 at 10:49 am

What are the competing estimates? Starting with the IBC estimate, c.15,000 and filling in the gaps consisting of indirectly caused deaths, I doubt less than 5,000, and non-civilian deaths, surely at least as many as the IBC figure, gives a number well into the tens of thousands. Since the IBC figures are by design systematically at the low end, the indirect figure is probably low and hte military deaths figure very likely out by a factor of two or more, there seems nothing at all implausible about the Lancet numbers. I am surprised not to have heard more about the numbers Saddam is not killing any more. I don’t think they would wash anyway but they would be hard to count and therefore to dismiss. Has this point now been conceded?

32

dsquared 11.12.04 at 11:04 am

Chris: average size of a nonextended family in Iraq is six, according to IBC, so your numbers look pretty good.

33

John Kozak 11.12.04 at 11:22 am

The Jamie Doward piece was not in the Guardian, but the barely-credibly-awful Observer; see the article on Homo Floresiensis in the same issue for another example of how not to do science journalism

34

Barry 11.12.04 at 11:43 am

Gary – anti-tank mines are now set off by magnetic triggers? Last I heard, it was a pressure plate or a tilt rod. I think that you’re thinking of anti-ship mines.

35

Martin Wisse 11.12.04 at 2:04 pm

You’ll have to excuse Gary, he suffers form the nerdish condition known as nitpickerii pointlessia.

36

derek 11.12.04 at 2:05 pm

Fred Kaplan of Slate suggested that because the confidence interval was very wide, the Lancet paper was worthless and we should believe something else like the IBC total Does he cite a confidence interval for the Iraq Body Count? Because if not I have a hint for him: just because they don’t give one, it doesn’t mean the confidence interval is small or zero. To their credit, the IBC people emphasise that the best their figure can do is set a minimum, and that it is likely that it is a gross underestimate.

37

Brett Bellmore 11.12.04 at 2:17 pm

I think the fundamental problem with this study is not in the area of statistics, but rather moral interpretation. Is it not true that most of the excess deaths have occured, not as a result of the original invasion, and overthrow of the tyrant, but as a consequence of the “insurgency’s” efforts to retake the country, and prevent it from becoming a free democracy? So, why shouldn’t THEY take the blame?

38

Tom T. 11.12.04 at 2:31 pm

Daniel, I’m not statistically educated, and the Lancet site doesn’t seem to allow a non-subscriber to read the actual study, so excuse me if these questions are addressed in the study or are simply ignorant. 1) Does the study distinguish between combatants and civilians (I can’t tell for sure from the posts and comments here)? During the first Gulf War, the US estimated Iraqi military deaths at 100,000. Is it reasonable to believe that Iraqi combat deaths from the current war would have been comparable or even higher? I.e., is it possible that the figure of 100,000 deaths reported by the Lancet is indeed under-reported but consists largely of combatants? 2) The CIA Factbook reports Iraqi population at 25.3 million and a death rate of 5.66%, which I think suggests that about 150,000 people might die in Iraq in an ordinary year. Does this mean that 100,000 excess deaths represents a 2/3 jump in the death rate (I may be misusing figures, so please correct me)? It strikes me that even a well-functioning city would have trouble finding morgue and grave space to deal with such a number of additional bodies, and so it seems that, under the chaotic wartime conditions in Iraq’s cities, this amount of excess deaths would have resulted in heaps of bodies or mass graves. If so, I would think that America’s opponents (whether the Baathists, Zarqawi, or whomever) would have every incentive to film these corpses and smuggle out the video much as they do with the beheadings. Obviously, this is not a critique of the study but simply uninformed intuition. Am I thoroughly off base, though; is it reasonable to conclude that these excess deaths were more or less absorbed by the Iraqi infrastructure? I’m just asking because these questions occurred to me; I don’t mean to be combative toward the study or your defenses thereof, or to suggest that any number of deaths should be minimized or dismissed.

39

Donald Johnson 11.12.04 at 2:43 pm

Way to go, Brett. Fall back on the other guy made me do it argument. Americans have no choice, but to use helicopter gunships and 500-2000 lb bombs in Iraqi cities. It is possible, you know, to blame both sides in a war for their brutality. Thanks for the response, Daniel. I agree that it’d take a lot of assumptions to get any number out of the earlier poll, but it still makes me wonder what sort of assumptions would be required to give an answer of many hundreds of thousands for the Saddam years and low tens of thousands (military and civilian) for the Spring 2003 invasion, if 50 percent knew people who died under Saddam and nearly 30 percent knew people who died in the invasion.

40

Brett Bellmore 11.12.04 at 3:35 pm

Well, obvious question is: If you get blown up while waiting in line to join the Iraqi police, or gunned down execution style on your way home from boot camp, why is it OUR fault? Yes, the case can be made that we ought to be willing to accept more casualties on our side, in an effort to reduce civilian casualties. (Good luck selling that idea to the American people, by the way!) But the fact is, we already liberated Iraq, and now we’re fighting a force that’s trying to enslave the country again, and not only are THEY responsible directly for a lot of the deaths Lancet is complaining of, they’re indirectly responsible for all of them, because there wouldn’t be a war if not for their agression.

41

dsquared 11.12.04 at 3:43 pm

Tom: 1) No, the survey does not distinguish. I’m guessing that not a small reason for this would be the safety implications for the researchers if they went round Fallujah and Sadr City with a clipboard asking “Does your household contain members of the resistance?”. But in any case the property of being a combatant is a political one, not a natural one. 2) Yes; the estimate suggests about this order of magnitude increase in the death rate. But no, I don’t think that this would cause the death infrastructure to collapse. I would guess that the order of magnitude of the effect is equivalent to a medium-sized flu epidemic.

42

Rob 11.12.04 at 3:43 pm

You know Brett, what a great argument! If only the US capilulated to Osama 3000 people would still be alive. Obviously its the fault of the US! If the Israelis woudl just leave, they wouldn’t suffer terrorist attacks. Every death is their own fault! If only native americans surrendered even faster, they wouldn’t have been as fully wiped out!

43

Tim Lambert 11.12.04 at 3:45 pm

tom, last year there were many stories about the overwhelming number of dead bodies at the Baghdad morgue. That problem has now been solved—reporters are no longer allowed to visit the morgue.

44

Clayton 11.12.04 at 4:03 pm

Question for Brett, If you hit a hornet’s nest with a stick, are you not at least partially responsible for the increase in stings? We knew going in that there would be an insurgency, there was inadequate planning for it, and obviously the administration considered the losses that would come from an insurgency to be at an acceptable level. So even if a significant number of those deaths were due to the actions of insurgents, it wouldn’t show that the administration bore no responsibility for them. Perhaps more important is that regardless of whether the cause of death was the bullet or bomb of an American or an insurgent, the numbers matter for determining whether this war satisfies the macroproportionality condition on standard versions of the just war theory. It is starting to look like they don’t. Even those of us on the left gracious enough to play along and imagine that there was somewhere in the shifting sands a just cause for this war and to set aside speculation about the purity of the President’s motives can cite the Lancet number as solid evidence that this war quite simply doesn’t pass the tests imposed by traditional just war theory.

45

Ragout 11.12.04 at 4:11 pm

I don’t think your explication of the problems cluster sampling with rare events makes clear the main points: 1. Most of the time there will be a small underestimate. 2. Rarely, there will be a large overestimate. 3. The estimate is unbiased on average. To take your example (nice example, BTW): suppose there is a 1% chance of hitting a mine with a stone toss. 10 stones are tossed. Then: 90% of the time no stones hit mines, leading to an estimate of zero instead of 1%. 10% of the time, 1 stone hits a mine*, leading to an estimate of 10%. On average, the estimate is about 1% (although we’ll never actually get this estimate with 10 stones). Admittedly, this is just a pedantic point, not a devastating critique of the Lancet study: the standard errors (confidence interval) reported by the authors presumably account for the uncertainty. FN * Actually, the chance of 1 or more stones hitting a mine is 1-.99^10. I’m ignoring the possibility of hitting 2 or more mines, since it is tiny, and I’m rounding, but calculating things exactly would make little difference.

46

dsquared 11.12.04 at 4:19 pm

That’s a very nice and clear way of putting it; it explains the “Outlier Theory” – this would be the case in which a 2.5% tail event had happened and the study had significantly overestimated.

47

Brett Bellmore 11.12.04 at 4:32 pm

Hornets are not moral agents. And the “insurgents” are the agressors in this case, just as OBL was on 9-11.

48

Ragout 11.12.04 at 4:34 pm

Dsquared, You are ignoring the dynamic issues, and discussing the study as if it were a cross-sectional (point in time) analysis. Other studies which use this technique don’t push it nearly as hard, and apoligize for it more. For example, Grein et al, conduct a very similar study, but it is superior because: 1. The recall period is shorter (less than half). 2. They are careful to inquire about those entering and leaving the household. Also, their reference period is clearer, and I’d guess that in an urbanized society like Iraq, people enter and leave the household more often (schooling, military service, hospitals, etc.)than in the Angolan refugee camps studied by Grein et al. Nonetheless, Grein et al acknowledge (referring to the same method used by the Lancet study) that “the WHO/EPI sampling technique (originally conceived for immunisation and nutrition surveys) has not been fully validated as a tool to estimate mortality and alternative methods have been suggested.” This is the same point I’m making: the sampling technique is suited for cross-sectional studies, but not so well suited for longitudinal studies. This is especially the case for studies that are sloppy about including people before they entered or after they’ve left the household, as in the Lancet study. To speak in jargon, you should take the issue of informative censoring much more seriously.

49

dsquared 11.12.04 at 4:51 pm

Why are you claiming that the Lancet study was ” sloppy about including people before they entered or after they’ve left the household”? They weren’t; there’s a discussion of this issue on pages 2 and 3 of the paper. Note also that your point about recall bias is significantly attenuated by the Lancet team’s use of death certificates, which were not available in the Angolan study which you cite. This is a version of the “Lying Iraqis” critique rather than anything which really relates to the methodology of the survey. It is therefore, as I note above, potentially a valid objection, but one which depends on making assumptions about the interviewees for which there is no evidence. Your comment about the methodology not having been “validated” has a flavour of Steven Milloy about it to me. This is a blanket disclaimer pointing out for anyone unaware that cluster sampling is a difficult business (hence the wide confidence intervals). If Grein et al had really thought that their survey methodology was incapable of delivering useful resutls, they wouldn’t have bothered.

50

Ragout 11.12.04 at 5:18 pm

Dsquared, The Lancet study does discuss the issue of people entering and leaving the household. As I argued in comments on your previous post, their discussion makes clear that they are doing it wrong. The Lancet authors are basically attempting to exclude household members who leave the household, rather than include them, which is what other studies that use this technique do. This is very important, because there are many reasons to think that leaving the household could be associated with death. The disclaimer in the BMJ article I cited isn’t about cluster sampling, it’s really about the definition of the sample universe. I’m prepared to believe that this was reasonably clear in the BMJ study, but not the Lancet study. It isn’t well-defined who is supposed to be included in the Lancet study’s universe. This is an invitation to overcount the kinds of deaths the Lancet authors were looking for. This isn’t lying. The surveyors in the field may well have thought that by counting violent deaths to people whose membership in the study universe was unclear, they were collecting complete information and doing a good job. And the death certificates don’t validate anything, since they were only collected from a small, non-random, subset of the respondents.

51

Rajeev Advani 11.12.04 at 5:24 pm

This is a great summary post, and I take the Lancet’s findings seriously. If I may offer a thought, not a critique: keep counterfactuals in mind when making any humanitarian assessment. Had the war not happened, Saddam’s regime would have dissolved at some point anyway, possibly ending in civil war or the reign of his (now dead) sons. Presumably, numerous deaths would have followed. The Lancet’s excess deaths figure (obviously) can’t take these counter-factual deaths into account. In other words, many of the 100,000 may have perished anyway when the Ba’ath regime (“naturally”) crumbled, however long down the road. I acknowledge this is counter-factual speculation, but I think it deserves consideration in the humanitarian argument (it is not a critique, in any way, of the very honorable Lancet study) In light of this I’m still not sure where I stand on the humanitarian case for war. If Iraq is successful in 10 years, any ex-post justification of the war will still require pondering the question: how many lives is freedom from political repression worth?

52

dave heasman 11.12.04 at 6:08 pm

“In other words, many of the 100,000 may have perished anyway when the Ba’ath regime (“naturally”) crumbled, however long down the road.” And if they didn’t die then, they’d have died eventually, so it’s all good, eh? “how many lives is freedom from political repression worth?” Depends if it’s my life or not, don’t it? (Well, it does to me).

53

George 11.12.04 at 6:16 pm

This was getting to be a long comment, so I split it up. Both parts are important to what I am trying to say. Part 1: Brian W and John Q use the example of opinion polls to demonstrate how accurate statistical sampling can be. But that comparison only reinforces my skepticism, since in fact opinion polls are almost never right. Look at this table of various polls taken the week before the American election: http://www.realclearpolitics.com/polls.html. All are (I presume) reputable and statistically sound, and all purport to measure the same thing, yet there’s a spread of 4% between the highest and the lowest. In state polls, the spreads were higher. In other words, it’s not news if an opinion poll is 3% to 4% off – it’s news if it’s accurate. And this is in a population that pollsters know abundantly well; how much more difficult must accuracy be in a population we do not know well, and that has in fact recently been through enormous change? Granted, this is not necessarily a critique of the study itself, since I gather that that uncertainty is reflected in the wide confidence interval. But then I fall back on something like the Fred Kaplan argument, which, despite Daniel’s thorough and probably accurate criticism, nevertheless seems to hint at something true: when the actual results might be anywhere in such a wide range, does the study really say anything useful for judging whether the war was a good idea or not?

54

George 11.12.04 at 6:17 pm

This was getting to be a long comment, so I split it up. Both parts are important to what I am trying to say. Part 1: Brian W and John Q use the example of opinion polls to demonstrate how accurate statistical sampling can be. But that comparison only reinforces my skepticism, since in fact opinion polls are almost never right. Look at this table of various polls taken the week before the American election: http://www.realclearpolitics.com/polls.html. All are (I presume) reputable and statistically sound, and all purport to measure the same thing, yet there’s a spread of 4% between the highest and the lowest. In state polls, the spreads were higher. In other words, it’s not news if an opinion poll is 3% to 4% off – it’s news if it’s accurate. And this is in a population that pollsters know abundantly well; how much more difficult must accuracy be in a population we do not know well, and that has in fact recently been through enormous change? Granted, this is not necessarily a critique of the study itself, since I gather that that uncertainty is reflected in the wide confidence interval. But then I fall back on something like the Fred Kaplan argument, which, despite Daniel’s thorough and probably accurate criticism, nevertheless seems to hint at something true: when the actual results might be anywhere in such a wide range, does the study really say anything useful for judging whether the war was a good idea or not?

55

George 11.12.04 at 6:20 pm

Part 2: Let me be clear about something: I supported this war, and I did so in the full knowledge that it would kill people, including innocent people. Every fair-minded advocate of the invasion of Iraq should be willing to acknowledge that. But we would have to kill an awful lot of people even to come close to Saddam’s body count. Here’s how I figure the numbers: Saddam was in power for, what, 23 years? He was a murderous tyrant from day one (literally), but as you say, he put up the really big numbers after 1985, in his purges against the Kurds and the Shia. The estimates I’ve seen put the death toll from internal terror alone at somewhere between 150,000 and 300,000. (Also a wide confidence interval.) Over 9 years, that’s an average of 17,000-33,000 (in round numbers) per annum. But that’s not all: Saddam also prosecuted two wars of aggression, against Iran and Kuwait, in which it is estimated that 1.5 to 2.0 million people died. These wars took place during the same period (i.e., after 1985), so that’s an average of several hundred thousand per annum. (Many people respond that that’s all irrelevant, since Saddam was in a box, hamstrung by the sanctions and the no-fly zones. Someone on CT, I believe, calculated that only about 2,000 people were being killed by Saddam each year at the time we invaded, so this figure represents the alternative case to the war. I think that’s naïve. Prior to 9/11, the sanctions regime was already falling apart, as (it has been shown) Saddam was actively working with members of the very organization that had put him in the box to dismantle it. Sooner or later, Saddam was going to get fully out of the box and resume business. The Kurds, the Iranians, the Kuwaitis, the Marsh Arabs, the Shia – Saddam was a serial aggressor and mass murderer; it was absurdly optimistic to think he – or, perhaps worse, his sons—would just give it all up.) Against the status quo of several hundred thousand violent deaths (on average) per year, put the casualties that have come about from the invasion. And here’s where quantity becomes important. The Lancet study says (if I understand it) it is equally likely that either more or less than 98,000 people have been killed in a year and a half. If the actual number is less than 98,000, I’d say those numbers clearly work: if (as I assert) Saddam was inevitably going to return to his bloody average of several hundred thousand a year, then the people of the region (including for this analysis both Iraqis and Iranians) are better off for the war. If the actual casualty figure is above 98,000…well, that gives some pause, as that figure approaches Saddam’s tally. But remember, in either case, it is going to get better. Whatever the actual casualty figure, it will almost certainly be lower next year, and if we are successful (I am still optimistic, and this week’s news make me more so) it will decline further in the years to come. Look, I don’t mean to dismiss a single one of those innocent deaths. Even if “only” 8,000 innocent Iraqis died as a result of the American invasion, every one is a tragedy. But people would have died as a result of our not invading, too, and a lot more of them. Action is not alone in having moral consequences; inaction does too.

56

Rajeev Advani 11.12.04 at 6:22 pm

And if they didn’t die then, they’d have died eventually, so it’s all good, eh? Actually, no. We’re talking about excess deaths. Natural death would occur irrespective of the counter-factual. Depends if it’s my life or not, don’t it? (Well, it does to me). True, I could reframe the question to capture that effect. Suppose the US was overtaken by a nasty tyrant, and X amount of lives from the population, chosen randomly, were required to remove said tyrant via revolution. Suppose you are among the population from which the sample is chosen. How small would X have to be for you to support revolution? Pacifist Response: 0 Pro-war nut response: 300 million Anyway I don’t want to veer this thread off its discussion of the Lancet. My point was that the humanitarian case has taken a serious blow from the Lancet, but it’s not out the window just yet.

57

LizardBreath 11.12.04 at 6:27 pm

On a moderately off topic point, there’s a semi-serious suggestion in the above post that readers should spend their evenings taking statistics. Does anyone know of a respectable university-level statistics course offered on-line? I’ve wanted a better statistics grounding for awhile (mostly because I get so irritated by having to take arguments like this one on faith), but haven’t got the time to do it in person.

58

George 11.12.04 at 6:28 pm

D’oh! Despite proofreading, I made a big goof in my arithmetic. The period of time between 1985 and the invasion was 18 years, not 9, so cut all my “status quo” numbers in half. Not that it derails my basic argument, but talk about innumerate…

59

Rajeev Advani 11.12.04 at 6:28 pm

Sorry Dave, I see where I may have been misinterpreted. When I said “many of the 100,000 may have perished anyway” I was trying to make the counter-facual intuitive, but my statement was misleading. I should have said “excess deaths comparable to those found in the Lancet may have resulted from counter-factual succession/civil-war”

60

Dano 11.12.04 at 6:29 pm

Thank you. Well done. The Lancet piece is indeed a good piece of science.

Replicate this study 5 times and you’ll likely get close to a good number.

Best,

D

61

nic 11.12.04 at 6:38 pm

how many lives is freedom from political repression worth? I think you need to ask the Iraqis. Preferably by going round from house to house of a representative sample and ask them directly, then report back what you find…. (But be careful, they may be all lying/insurgents/spoilt rotten with exceedingly high expectations from living under dictatorship for three decades. In which case you can certainly take on the heavy burden of answering that question for them and declaring the humanitarian dilemma solved.)

62

Stephen Soldz 11.12.04 at 6:52 pm

I discuss the confidence interval issue in my ZNet piece: 100,000 Iraqis Dead: Should We Believe It? http://www.zmag.org/content/showarticle.cfm?SectionID=15&ItemID=6565 What I conclude is that a major reason the CIs are so large is that the authors engaged in a conservtive (in a statistical sense) analysis of their data and included the Kurdish region. In this region, there was a reversal of the pattern in the rst of the country: more deaths prior to invasion than after. If this region had been excluded, the CIs would have been narrower, and the point estimate (98,000) higher. The fact that the authors did not exclude the Kurdish region is one piece of evidence that they were NOT seeking to maximize the estimate of casualties. This type of thing is one of the things we researchers look for when judging the quality of another’s study. This study was one of the best I’ve seen. Still, given its discrepancy with the estimates of others (Iraq Body Count, Project for Defense Alternatives), I tend to believe the “true” estimate is somewhat below 98,000, but still in the many tens of thousands.

63

dsquared 11.12.04 at 7:20 pm

Ragout, I don’t understand half these points: The Lancet authors are basically attempting to exclude household members who leave the household, rather than include them, which is what other studies that use this technique do. This is very important, because there are many reasons to think that leaving the household could be associated with death. But this would lead to an underestimate rather than an overestimate, most likely. I would guess that more people left their households after the invasion than before. Indeed, I suspect that the researchers chose this criterion precisely in order to avoid picking up people who left home to join the Al-Mahdi Army and were “missing, presumed dead”. The disclaimer in the BMJ article I cited isn’t about cluster sampling, it’s really about the definition of the sample universe. I’m prepared to believe that this was reasonably clear in the BMJ study, but not the Lancet study. You’ll have to explain this more clearly. The sample universe in the Lancet study is perfectly clear; it’s the population of Iraq. It isn’t well-defined who is supposed to be included in the Lancet study’s universe. This is an invitation to overcount the kinds of deaths the Lancet authors were looking for. I disagree. It’s perfectly clear. Household members who died is what are counted, and the membership criterion is specific – people who were part of the household when they died and had been for the preceding two months. What specific sentences in the article make you think that the criterion isn’t clear? And the death certificates don’t validate anything, since they were only collected from a small, non-random, subset of the respondents. I don’t see how this is true. They were checking for two death certificates (for noninfants) per cluster. They had 66 death certificates out of 182 deaths. That’s not a small sample. I’m troubled that there is no description of how they decided whether or not to ask for death certificates, though. In general, the potential for recall bias and lying (what you’re calling “informative censoring”) is discussed at fairly decent length on the last two pages of the article. The team concludes that it’s not enough of a problem to credibly threaten their conclusion and I agree with them.

64

Jack 11.12.04 at 8:13 pm

Ragout, Do you find the results of this survey to any great extent surprising? The IBC numbers plus the number of combatant deaths would give you a low ball at best rather less than an order of magnitude smaller than the Lancet numbers. If they were really only after the sensation value they could have given estimates including the Falluja figures. At best you are going to find holes in what is already known to be a net. Whatever its shortcomings it is the best figure we have to go on and it has been peer reviewed by a reputable journal. It doesn’t say anything very surprising beyond crystalising the effects of a gradually developing situation. I’m not sure why you think its burden of proof should be so high or that it fails to meet it. Why do you think there are no official figures? Surely this number, whatever its true value is an important performance indicator? In any case you will convince more people if you can provide a persuasive case for a significantly different number rather than just trying to find ways this number might not be right.

65

dsquared 11.12.04 at 8:28 pm

I think ragout’s actually being quite intellectually honest here and I’d appreciate it if people didn’t pile on (Jack; this is a general and pre-emptive warning, not one aimed at you, I think your questions are pretty fair). There is a potential issue of reporting bias, and I was probably wrong to trivialise it with the nickname “lying Iraqis”. I maintain my view that the dataset and the design of the experiment don’t give any real reason to believe that there is a problem here, but ragout is not blowing smoke; that’s why I’m asking him for clarification. One way you can tell that ragout isn’t acting the hack, by the way, is that he’s not throwing around hysterical accusations at the paper’s authors. They’ve designed a good piece of science and carried it out well; I count it as significant progress that the debate we’re now having is one about how even a legitimate piece of science might have given us wrong numbers. While I’m on the subject, I’d add that I would have expected that, if there was recall bias (and if it differentially affected responses so as to overestimate the post-invasion death rate relative to the pre-invasion death rate, then I’m surprised that it was so consistent across non-Kurdish Iraq (including the Shia provinces, who are meant to support the coalition) and so consistently the other way in the Kurdish North.

66

Steve 11.12.04 at 9:34 pm

I’m not positive, but I don’t think this article addressed the strongest arguement agains the Lancet study: the numbers are utterly ridiculous. For 100,000 deaths to have occurred since the end of the war, that would be an average of around 175 a day (I don’t know the exact number, but its been discussed since the study came out). The 100,000 may have included additional infant mortality, and may have included additional ‘natural’ deaths that weren’t occurring under Saddam, but as I recall, it specifically said that ‘most’ deaths were occurring in violence, and ‘most’ of those were due to American (presumably unintentional) weapons. So at most 49,999 were additional infant mortality figures, (leaves ‘most’ due to violence), and at most 24,999 were non-American weapons (leaving ‘most’ due to Americans) So of 180 deahts per day, you’ve got 90 or so deaths per day due to violence, and 45 or so deaths per day due to American violence. 90 (or 45) per day, every day, for the last year and a half. Thus, in the last year and a half, when you have read in the papers about a bombing that killed 70 people, you were actually reading about a slow day! The news media has been writing headlines about the days when violence was actually lower than the norm! 100 people would have had to have been killed for a normal run of the mill day, I guess. So maybe you should address the “these numbers are absurd” statistical test. Steve

67

Dan Hardie 11.12.04 at 9:38 pm

Quick question: has anybody in the media or blogosphere both a)criticised the Lancet study and b) criticised the Allawi government’s decision to stop publicising the Iraqi Ministry of Health’s collation of death figures from Iraqi hospitals? I think maybe Fred Kaplan; otherwise, everyone who did b) but not a) wins this year’s Intellectual Dishonesty Prize.

68

The Eradicator! 11.12.04 at 9:46 pm

In light of this I’m still not sure where I stand on the humanitarian case for war. If Iraq is successful in 10 years, any ex-post justification of the war will still require pondering the question: how many lives is freedom from political repression worth? A fair question, but it would be a better one if the case for war had actually rested on humanitarian grounds. It only really did ex post facto when the WMD and “terrorist links” arguments went into the crapper.

69

Josh Narins 11.12.04 at 9:56 pm

Thx for the work. I think you should edit the parenthetical “(that is, if the war had made things better rather than worse).” For Iraqis, Saddam’s last couple years in power were pretty mild (according to HRW), and, as a matter of course, the war was going to make things worse in the short run. This is a longer short-run than Iraqis would have hoped for, but, if BushCo pulls its head from its arse and starts treating Iraqis like human beings[1], things could improve quickly. The discussion of the number of people who knew people who died under Saddam, versus now, also suffers because the timespan is shorter for information to have travelled, even if the intensity of information travel might be higher. And, Rajeev Advani’s mention of succession, the process whereby control is relinquished from one leadership group and given to another, ignores the rigged election process so prevalent in modern despotic regimes. Uday or Qusay would have “run” for strong-man, and won between 90 and 99.99 percent of cast ballots. [1] The murder rate by african-american males halved in the first thirty years after Civil Rights, declining quite steadily in all age ranges. Sadly, largely laid at the feet of the drug wars, late in that period, in the younger men’s age range, it started rising again. Something tells me that if one treats people like people, there is more incentive for them to act like people, while if one is brutal and unust…

70

dsquared 11.12.04 at 10:04 pm

I’m not positive, but I don’t think this article addressed the strongest arguement agains the Lancet study: the numbers are utterly ridiculous. That’s not the stronget argument against the study. If anything, it’s the weakest. As Marc Mulholland has pointed out, 175 deaths/day is about ten times worse than the worst year in Northern Ireland. This doesn’t seem all that unreasonable, given that the British Army was not in the habit of calling in airstrikes on the Falls Road. Forget the media reports. There is saturation coverage of the green zone of Baghdad. There is no good information about the dangerous bits of Iraq. As Tim points out above, journalists aren’t allowed to visit the morgue. I’m not saying the 100K number is gospel; as I hope I’ve been consistent in saying, I don’t like linear extrapolation. But the entire problem is that we have literally no information of sufficient quality to allow us to gainsay it.

71

dsquared 11.12.04 at 10:10 pm

btw, full disclosure; I plagiarised the joke about the British Army calling in airstrikes from Chris B and Dan Hardie.

72

Dan Hardie 11.12.04 at 10:10 pm

‘But the entire problem is that we have literally no information of sufficient quality to allow us to gainsay it.’ …Although we might well have had such information had Allawi not ordered his Ministry of Health to stop publishing the death figures they were collecting from the hospitals.

73

Dan Hardie 11.12.04 at 10:22 pm

Also- if anyone objects that collated death rates from Iraqi hospitals would be unreliable since not all corpses would be taken to hospital: agreed. But injured people would be, and the military do a lot of studies of killed:wounded ratios which would enable a good estimate of fatalities. The Iraqi Health Ministry apparently still collates these figures but is under orders from Allawi to keep them silent. This was covered in some detail back in September by Knight-Ridder and AP. Similarly, if we’re arguing about infant mortality and other forms of death related to the war but not directly caused by violence, the obvious way to go about things would be for the Iraqi Health Ministry to collate the figures (which it may well be doing) and publish them. This would also rather tend to help such matters as the supply of medicines, funding of health services etc. Those who object to the Lancet figures need to admit that the Iraqi Health Ministry, in September of this year, publicised the figures of deaths-from-violence that it had collated from April 2004 on, and estimated that persons were twice as likely to be killed by coalition forces as by ‘insurgents’. Now I suspect that a lot of those killed were in fact insurgents, but 348 of them were women and children. Two third of 348 is approx. 238 women and children in five months, which is frankly a pretty abusive use of firepower. And it is also a fact that as far as I know, the only person to have a pop both at the Lancet and at the suppression of the Iraqi Health Ministry figures was Kaplan. The US government did not raise a peep, nor did the courageous Mr Straw, who has been so loud in his condemnations of the Lancet paper.

74

Dan M. 11.12.04 at 10:53 pm

There are several clear problems with the Lancet report that have not been adaquately addressed by this blogger. I would like to address two. They are the infant mortality rate, and the clear, overwhelming overestimation of the violent deaths in the Fallujah area. First, let me consider the infant mortality rate. Quoting the report: First, the preconflict infant mortality rate (29 deaths per 1000 livebirths) we recorded is similar to estimates from neighbouring countries. The use of neighboring countries appears reasonable…given the tacit assumption that no better comparisons are available from Iraq itself. However, as the initial article here mentioned, there was a study, sponsored by UNICEF. It is available at: http://www.unicef.org/newsline/99pr29.htm Some relevant numbers are: 24,000 households in the South/Center of Iraq (85% of the population, Arabs) 16,000 households in the North of Iraq (15% of the population, Kurds) were randomly sampled. This is far greater than the Lancet survey, which surveyed, roughly, 1000 households. Second, the infant mortality rate was measured at: 108 per 1000 live births in the South/Center not given in the North but said to have fallen like the under 5 mortality rate The under 5 mortality rate was 138 per 1000 live births in the South/Center 72 per 1000 live births in North. So, we have an under-5 mortality rate of 12.8%, and an estimated infant mortality rate of 10% for the entire country in 1999. This contrasts with the remembered estimate of 29 per 1000 live births pre-invasion in the Lancet study. It is considered good technique to address previous literature in the field. This study stands out as important relevant literature. Yet, it was ignored. If these numbers were correct, then one would expect a death rate of 130/5 per year for children under 5. With infant mortality being the majority of this, we can estimate that 1006 under 5 year olds seen correspond to at least 1100 live births in the last 5 years. (This gives a birth rate that is slightly lower than the pre-invasion birth rate 226/year given in the Lancet study. Using these UN figures, we would expect that there would be 32 under 5 deaths during the pre-invasion interval. The Lancet report gives only 12 deaths under 15…and doesn’t break it down by under 5 and over 5. This difference of 20 deaths over 14.6 months is absolutely critical. With that age cohort representing 39% of the total population, it also represents a difference in the under 15 contribution to the mortality rate of 6.4 deaths/1000. With an Iraq population of, approximately, 25 million, this translates to over 125,000 deaths per year…more than the 98,000 given in the Lancet. Given this, the authors of the Lancet study have a duty to mention the UN study and then explain why they think the death toll showed an unprecedented decline between ’99 (Feb to May) and ’02. (Jan 02, to mid-Mar 03) AFAIK, this decline far surpasses any other three-year decline. Since the author of the blog states that the start of the oil for food program is the cause of the improvement, let me address that. There are a couple of problems with this: first the oil for food program started in ’96. With infant mortality, in particular, it is hard to see why it would not have had an effect on infants born two years later. Second, during that time, the oil- for-food program was raided by Hussein for other purposes. One would need to provide evidence that this stopped in ’98 and ’99 in order to argue that the infant mortality rate fell tremendously. The second suspect number is the Fallujah death toll in 8-04 and 9-04. Reading the graph, I obtain 33 deaths in two months in Fallujah. Given that the total sample rate is, roughly, 1 in every 3200 Iraqis, this can be extrapolated to more than 100,000 deaths in this area during those two months. The city only has a population of 250,000. Since civilians were allowed to escape, why would they stay when people were dying at that rate? Further, how would it go unnoticed? As mentioned in the Lancet study, deaths must be taken care of quickly in Arab countries. Wouldn’t 100,000 requests for death certificates, 100,000 funerals, be noticed. Further, since there are usually >1 wounded per death in military actions of this type, wouldn’t nearly everyone else be wounded? Wouldn’t the hospitals and morgues be more than overwhelmed? Wouldn’t someone notice the smell of the rotting bodies, since there wouldn’t be enough able-bodied people to take care of the dead? Yet, the authors simply glide over this point…stating that there was a low possibility that this outlier was improperly sanding Even decent technique would require this type of cross checking between sampling and more direct measurement. Given these two examples of bad technique; given contrary data that appears to be much more solid, I don’t think the Lancet article can be taken seriously. My question is what sort of peer review goes on at the Lancet. It might be the same folks who reviewed Sokal’s paper. :-) Dan M.

75

dsquared 11.12.04 at 11:03 pm

Dan M: I will do you the courtesy of not mincing words: I simply have no fucking idea how you can think that those two issues were not addressed in the article above. They were.

76

Dan M. 11.12.04 at 11:45 pm

I searched for “infant mortality” and “UNICEF”, and obtained one reference to that study: it is below: and the last estimate of under-five mortality was from a UNICEFsponsored demographic survey from 1999.11,12 So, it is true, that it was mentioned. But, it appears that you and I differ on what properly adressing a previous study, with far better statistics is. For example, they also quoted numbers on infant mortality. That wasn’t mentioned. So, it was only briefly mentioned in the introduction, and not considered as relevant data when the infant mortality rate was discussed. Second, the consideration of the Fallujah data was poor. Their data indicates that 100k died in two months where there was minimal US activity in Fallujah. Even if you only take half of that, its still 50k. Cluster sampling is a technique that is fraught with difficulties. Done perfectly well, it works. But, if there are any indications that there are serious problems with the techniques, a serious researcher is oblidged to deal with them directly. What they did, instead, is just say it was an outlier, and gave results with and without it. A scientist is supposed to search for every possible way his data could be wrong before publishing. Cross checking with other data sources should be done, whenever possible. A reasonable scientist, would have cross checked the Fallujah data, and known that there was a serious problem with the sampling. The authors wrote as though it was a minor, or modest problem at most. So, to be accurate, I shouldn’t say they didn’t address these two issues; rather I should say that they didn’t adress them as one would expect in a serious piece of work. If I implied the former, I apologize. One other thing, it is possible that I miscalculated a number, since I don’t perform the same due diligance in posts as I do in professional papers that I write. If you can catch an important numerical error that I made, I’d appreciate you letting me know what my mistake was. Dan M.

77

dsquared 11.13.04 at 12:04 am

Sorry, Dan. Mrs. dsquared is currently out with her friends, and as a result I am quite drunk, but that’s no real excuse. To answer your questions directly: 1) The UNICEF study is from 1999, and there is decent reason to believe that infant mortality fell substantially in Iraq between 1999 and 2002. 1a) Furthermore, the cluster sampling methodology would always tend to undersample infant deaths; but this would not affect the longitudinal study (ie, the undersample would be more or less the same for the pre- and post-invasion death rate. 2) The Fallujah cluster wa