Open Mind

PCA part 4: non-centered hockey sticks

March 6, 2008 · 531 Comments

In parts 1, 2, and 3 we’ve looked into the basics of PCA. The essence is that we can view multiple data sets as a vector data set, or as points occupying a (sometimes very) high-dimensional space. The original basis for this space is simply the variables with which we express the raw data; we could call this the canonical basis. In this sense, a “basis” is like a coordinate system for the space, and the canonical basis is the set of variables which define the raw data.


But we don’t have to use just that basis; from a purely geometrical point of view any basis will do. For position in space, e.g., we can define a standard set of x, y, z coordinates, but there’s a great deal of freedom in how to choose even those coordinates. We can pick any point as our origin (where x=y=z=0), as well as any directions for the x-, y-, and z-axes. It simplifies many things if the x-, y-, and z-directions are all perpendicular to each other. It also helps if the distances along each direction are expressed in measures of the same size; if we gave the x-coordinate in meters while y and z were in inches, it could lead to a lot of confusion.

Sometimes there are advantages to a particular basis. For example, we might take as “canonical” coordinates for position, x, y, z coordinates in which the origin is the center of the earth, x points in the direction from the center of the earth to where the prime meridian (which defines longitude 0) meets the equator, y points from the center of the earth toward where longitude 90 deg. East meets the equator, and z points from earth’s center toward the north pole. It’s a perfectly good coordinate system, but if what we’re really interested in is studying the movement of a canoe on a lake, we’ll find it not very convenient. A better choice for the canoe problem, would be to choose a reference point on the lake as our origin (maybe the point on the lake surface where we usually keep the canoe), then choose coordinates so that x points east, y points north, and z points up (this can be referred to as the ENZ system, or “east-north-vertical”). We still have 3 coordinates, which is what we need for a 3-dimensional space, but we’ll find that things are a lot simpler using this system than the other.

We might even reap simplification from the coordinate system itself. Unless we sink the canoe, or lift it high out of the water, we’ll notice that our movements all seem to be restricted to the surface of the water, which conveniently corresponds to z=0. If we took a large number of measurements of position, and performed PCA, we’d likely notice that the last (lowest-variation) PC would be simply the up-down coordinate, and that its variation is tiny compared to the others. This would indicate that we might be able to omit that coordinate (use just the first 2 PCs) without any substantive loss of information, and that would make analysis even simpler and easier.

The real essence of PCA is that it enables us to define a new coordinate system for our data space, one based on the observed variations of the data itself, which gives us an excellent chance to reduce complexity, possibly even reveal some insight. PCA by itself doesn’t alter the data at all, and certainly doesn’t “create” patterns that don’t already exist, it simply offers us new coordinates with which to alter our description of it. A classic example (dealt with in part 1) is the case in which our space is 2-dimensional. The PCs (for normalized data) end up being proportional to the sum, and the difference, between the normalized data values. This does not represent a change to the data — instead of giving x and y we give numbers proportional to x+y and x-y, but the positions don’t change. But as easy as it is to transform from canonical coordinates to sum/difference coordinates, it’s just as easy to transform back. No information is lost, we just look at it from a different viewpoint.

One of the real advantages we may reap is to reduce the number of coordinates we have to use without losing important information. The canoe problem illustrates this perfectly: even if we used some bizarre directions for the coordinate system we would still expect the final PC to be the up-down direction, that its variation would be tiny compared to the others, and that it can probably be discarded without losing anything meaningful. This can be especially desirable when estimating something (like temperature) from a very large number of proxy data sets. Sooner or later we’re likely to do some sort of multiple regression, to calibrate the relationship between our proxies and observed temperature measurements. But if we keep all the proxies, we may have so many degrees of freedom in our “predictor” variables that we end up fitting a lot of the noise in the data, rather than just the signal. This is the classic problem of overfitting, a recurrent problem in analysis, nothing to be sneezed at. So, it’s entirely natural that those who wish to use proxy data sets to estimate past temperature (the most famous being the original “hockey stick” of Mann, Bradley, & Hughes 1998, hereafter called MBH) would use PCA.

If we keep all the PCs, then we don’t reduce the problem of overfitting. But we also have zero information loss, and nobody could accuse anyone of having the massaged the data to achieve a particular result. It’s the same data, just in different coordinates! But when we omit all but the leading PCs, then we might be accused of manipulation by way of our choice of which PCs to include and which to omit. Also, if we keep all the PCs it doesn’t matter whether we normalize the data (so we use the correlation matrix) or not (so we use the covariance matrix). But if we omit PCs, then whether we normalize or not can have a profound impact on the variation present in the various PCs, and could affect our decision which to include and which to omit. In fact, we really don’t even have to zero the data! We can perfectly well do PCA based on variation from the origin rather than the average. But again, it will affect what the individual PCs look like and how much variation they contain, and might therefore affect the decision which to include/omit.

Now to the crux of one of the criticisms levelled against MBH by McIntyre & McKitrick (hereafter MM). The criticism which spurred this series on PCA in the first place is that MBH used non-centered PCA (sometimes called “de-centered” PCA) rather than the usual, centered PCA. In part 1 our very first step was to subtract from each data point the average value of the series — this is the centered case. But MBH, instead, chose to subtract the average value during the calibration interval rather than the entire time span. This makes it non-centered PCA. MM claimed that the hockey stick resulted from the use of non-centered PCA, and that non-centered PCA is just plain faulty anyway.

First let’s dispense with the last claim, that non-centered PCA isn’t right. This point was hammered by Wegman, who was recently quoted in reader comments thus:


“The controversy of Mann’s methods lies in that the proxies are centered on the mean of the period 1902-1995, rather than on the whole time period. This mean is, thus, actually decentered low, which will cause it to exhibit a larger variance, giving it preference for being selected as the first principal component. The net effect of this decentering using the proxy data in MBH and MBH99 is to produce a “hockey stick” shape. Centering the mean is a critical factor in using the principal component methodology properly. It is not clear that Mann and associates realized the error in their methodology at the time of publication.”

Just plain wrong. Centering is the usual custom, but other choices are still valid; we can perfectly well define PCs based on variation from any “origin” rather than from the average. It fact it has distinct advantages IF the origin has particular relevance to the issue at hand. You shouldn’t just take my word for it, but you *should* take the word of Ian Jolliffe, one of the world’s foremost experts on PCA, author of a seminal book on the subject. He takes an interesting look at the centering issue in this presentation.

Wegman’s right about this much: a non-standard centering convention will indeed emphasize the variation of the mean, which turns out to be the main difference between the 20th century and the rest of the time span. What he fails to realize is that that’s the point. He’s also right that the changes associated with it will become more prominent in the PCA analysis, causing those changes to “move up” the PCA list. Again, that’s the point.

PCA (centered or not) doesn’t create patterns at all, they have to be there already even to “exhibit a larger variance.” PCA most certainly cannot “create” a statistically significant relationship between that variation and the observed temperature data. That the relationship is significant emerges from both the regression itself, and the testing done on the verification data (more about that in a moment).

What impact will non-centered PCA really have? Downloading all the data used in MBH and reproducing their analysis is a lot of work, and has already been done (by several researchers), so I decided to take the simpler and easier approach of generating artificial data. I created an artificial temperature time series which follows a distinct “hockey stick” pattern, and also generated 40 separate artificial proxy data time series, all spanning the time interval from 1400 to 1980. I assumed that the temperatures from 1900 to 1980 are known, while those before 1900 are not. The proxy data series were generated by computing a response to the temperature signal and other factors as well. And all the series include random noise. The task is to use PCA to characterize the proxy series, use only a few PCs to characterize it, then determine the relationship between the PCs and observed temperature during the “known” time period (i.e., the calibration interval, from 1900 to 1980), then use that relationship to estimate temperature before 1900. Let’s do this using the non-centered approach of MBH, and the centered approach.

In reality, one wouldn’t use all the known data to calibrate the proxies. Instead, one would use part of the known data, then create the prediction, then check prediction against observation in the remaining part. This is “verificiation,” an independent check on whether or not the method is working. We can’t do this if verification uses the same data we used to calibrate — that would only show that if we fit our proxies to the calibration data, then it fits the calibration data. And we already know that! But this is just an exercise using artificial data, so I’ll skip the verification step.

First let’s use the non-centered method. I subtracted from each proxy series its average during the calibration interval. Then I normalized each series so that they’re all about the same “size.” Then comes PCA. When we do so, the very first PC is by far the biggest one of all, and it looks like this:

noncenterpc1.jpg

It looks strikingly like the (artificial) temperature data:

artitemp.jpg

If we do a regression of temperature against PC1 during the calibration interval, we get this:

noncenterfit.jpg

That’s an excellent fit! If we use the regression line to take the PC1 value and transform it to temperature, we get an outstanding match with actual temperature, not only during the calibration interval but over the entire time span. It appears that the method works excellently, and that PC1 (which dominates the variation) is sufficient to generate it.

Now let’s use the MM approach and see what happens. They claim to reproduce the MBH approach precisely except for the use of non-centered PCA. So this time, we’ll subtract from each proxy series the average value over the entire time span, normalize, and run PCA. This time, the 1st PC looks like this:

centerpc1.jpg

Whoa! That doesn’t look at all like the temperature data. If we now regress temperature against PC1 during the calibration interval we get this:

centerfit1.jpg

It’s fair to say that this fit isn’t very good. In fact, it’s fair to say (and the numbers confirm) that this fit sucks. If we use it to estimate past temperature, the estimate doesn’t match observation. At all. It shows spurious warming in the late 16th century and cooling in the mid-19th century, but no real 20th century warming and certainly no hockey stick.

What’s going on here? Didn’t we use exactly the MBH procedure, except to switch to centered PCA (which should be valid)? We ran the PCA, kept only the 1st PC, did the regression, and generated our estimate. But it came out horribly wrong! But something is going on — because the “standard” choice (centered PCA) gave us the wrong result while non-centered PCA gave the right answer! For this artificial data we already know the answer: temperature looks like a hockey stick. This is what MM did; they ran (centered) PCA on the actual data, kept only the first 2 PCs (as MBH had done), ran the regression, and generated the estimate. But it came out dramatically different from the MBH estimate.

The difference in estimates emphasizes the reason for MBH’s choice of non-centered PCA. Its purpose is to emphasize differences between the 20th century (the calibration interval, actually) and the remainder of the time span. That main difference is the 20th-century temperature increase, so that becomes the prime focus of the PCA results; that’s why most of the variation is in the 1st PC (artificial data) or first 2 PCs (real data), including the 20th-century increase which enables us to estimate the proper relationship between proxies and temperature. None of these factors are created by the non-centered PCA, but they are emphasized by it, bringing the relevant variation high up the list of PCs. That enables us to use just a few PCs and get the correct result.

With centered PCA, all the relevant variation is still there. It’s all still statistically significant. But the analysis has less emphasis on isolating the 20th-century changes, so the variation which correlates strongly with observed temperature change (which is the variation we really want) is still among the leading PCs, but no longer in the 1st PC (artificial data) or first 2 PCs (real data). Properly to reproduce the MBH procedure, when we switch from non-centered to centered PCA we also have to change the selection of PCs — because we can expect that the important variation may not be so high up the PC list as it was when the method itself (non-centered PCA) is chosen to emphasize it.

So let’s do it right. Let’s use centered PCA, but this time let’s not restrict ourselves to only the 1st PC (artificial data) or first 2 PCs (real data) just because that’s what emerged from non-centered PCA. With the artificial data, this time it’s the first two PCs that contain the bulk of the variation. We’ve already seen the 1st PC, here’s the 2nd:

centerpc2.jpg

Lo and behold, there it is! The variation relevant to temperature is still there, it was always there, it’s still among the leading PCs, it always was, but now there are two big PCs instead of just one, and the temperature-related changes are in the 2nd PC. If we regress temperature during the calibration interval against this PC, we get this:

centerfit2.jpg

Now that is a good fit! If we use this regression (more properly, a multiple regression on the first two PCs), it turns out that temperature during the calibration interval is strongly (and significantly) related to the 2nd PC, but not to the 1st PC. When we estimate temperature over the entire time span, we get: a hockey stick. It also matches (quite excellently) the actual temperature (which in this case we know, because it’s artificial data).

All of which emphasizes the fact that PCA doesn’t change the data. It only changes our description of it. The variation in proxies which correlates with temperature was always there, PCA only isolates it in a tractable way. If we use centered PCA, it turns out to be in the 2nd PC, but with non-centered PCA it ends up in the 1st PC. That’s the reason for the choice of non-centered PCA: to make the relevant, i.e. temperature-related, variation end up higher in the PC list. And: it worked.

I’d guess that some of you are thinking, “Can it really be that simple? Did MM really not get this? Did they really discard the relevant PCs just to copy the bare number of PCs used by MBH, without realizing that the different centering convention could move the relevant information up or down the PC list?”

You betcha. When done properly on the actual data, using 5 PCs rather than just 2, the hockey stick pattern is still there even with centered PC — which is no surprise, because it’s not an artifact of the analysis method, it’s a pattern in the data. Here’s a comparison of PC#1 for the North American ITRDB (international tree ring data base) data using the MBH method (red), and PC#4 from using the MM method (blue, which has been adjusted to have the same mean and amplitude as the red curve, but it’s only the relative pattern of variation that matters):

pc1pc4.jpg

Some of you might be thinking, wouldn’t MM have noticed that their fit (the one lacking a hockey stick) failed statistical significance when they did the verification step to check the correctness of their version? Probably yes. Too bad they simply didn’t do the verification step. But independent researchers have replicated the MM procedure and subjected it to verification. It failed.

The MBH hockey stick result has been the target of more scrutiny, criticism, and downright hostility, than perhaps any other aspect of climate science. That may be because there are things to criticize; not necessarily valid ones, but things nonetheless. It may be because it’s one of the things that has had the greatest impact on public perception of global warming, by making the unusual character of modern warming so vivid. It has certainly made Mike Mann one of the most frequent victims of vicious personal attacks (Mike Mann and Jim Hansen seem to be the people most hated by global warming denialists).

The use of non-centered PCA is not the only criticism of the MBH hockey stick levelled by MM. There are several — but as far as I can tell, none of them are any more valid. I have a strong suspicion that they’ll be repeated in reader comments here. Perhaps some day I’ll address those mistakes by MM as well. In the meantime, I have one request: before you raise other issues about the hockey stick, address this one.

UPDATE UPDATE UPDATE:

Readers referred to figure 2 in Wahl & Amman, so here it is (click for a larger, clearer view):

wafig2.jpg

Categories: Global Warming · climate change

531 responses so far ↓

  • caerbannog // March 6, 2008 at 6:57 pm

    In my work, I use the singular-value decomposition (SVD) method from time to time (note to lurkers: you use the SVD method to calculate principal components).

    The very *first* thing I do after computing the SVD is to look at the singular-value magnitudes. Only then will I know what do do next.

    So did M&M do anything more than demonstrate that you can screw things up if you don’t bother to look at the singular-value magnitudes before charging ahead with your data analysis?

  • The Flea // March 6, 2008 at 7:00 pm

    Wow! That’s all I can say right now!!!

    Great post!!

    The Flea

  • Hank Roberts // March 6, 2008 at 7:35 pm

    Ok. One thought — I suspect you’ll want to edit the first post to clarify it as questions arise.

    Can you footnote the actual artificial data sets made up for this illustration?

    1) an artificial temperature time series which follows a distinct “hockey stick”

    2) 40 separate artificial proxy data time series

    3) by computing

    4) the temperature signal (that’s item 1?)

    5) and other factors (what they?)

    6) all the series include random noise. (how created?)

    I know you don’t want to overload the first post, wondering if there’s a way to explain those details via links.

    When you refer out to the actual analyses done with real data, cites to those would help as well.

    I wish I could come scrub your tile or trim your hedges or something to contribute my own skills (grin) to this.

  • dhogaza // March 6, 2008 at 7:49 pm

    Why not just delete those comments that aren’t relevant to your post, i.e. attempts to divert the discussion to “strip-bark samples”, etc?

    Including this one, after you read it? :)

    Great post, BTW.

  • BoulderSolar // March 6, 2008 at 8:07 pm

    Jolliffe’s position on uncentered PCA in the reference you cited is the following:

    “It seems unwise to use uncentred analyses unless the origin is meaningful. Even then, it will be uninformative if all measurements are far from the origin”

    I think this applies for MBH. Please comment

    [Response: In this case, the origin is meaningful (it's the average during the calibration interval), and not all measurements are far from the origin.]

  • Gavin's Pussycat // March 6, 2008 at 8:21 pm

    Very nice explanation, though essentially the same as given by Schmidt and Rahmstorf in 2005 on RC (IIRC; RC is down).

    Reading this, one question came up: what if tree ring growth depends not only on temperature, but on same other factor (moisture? precipitation? we know it does…), but one that does not vary over the calibration period. Say, tree ring growth is a function of temperature and “serendipity”. But serendipity is constant over the 20th century. You can calibrate as much as you like, but you won’t be able to get a coefficient between any of the PCs and serendipity. When calibrating temperature, it’s just as if serendipity doesn’t exist.

    Then, also assume that serendipity varies wildly (or is completely different) during the Middle Ages. Then you may have the situation that also tree ring growth varies wildly, which will be interpreted in the estimation step as temperature variation.

    Clearly this is an insoluble problem with any analysis technique, not only PCA; a causative factor that’s “not there” (does not vary) in the calibration period is just impossible to take into account. That would even be so if PCA tells us about its the existence.

    Am I missing something here? Can there be serendipity-like factors? Note that a requirement is that they significantly and independently (i.e., not through temperature) affect tree ring growth. Is this an argument for including not only tree rings, but as many as possible different types of proxies, like glaciers, whatever, likely to have different dependency patterns on causative factors? Is this mathematically tractable?

    [Response: I think you've hit a nail on the head. In fact I did include a non-temperature factor which influences the proxies. I think the relevant factor is: if the non-temperature factor affects all the proxies in the same way that temperature does, it simply won't be possible to disentangle the influences. But if the temperature affects proxies one way while the other factor affects them differently, or affects different proxies in a different way which can be separated from the temperature influence, then they can be disentangled.

    All of which emphasizes that temperature reconstructions will never lack uncertainty. But the criticisms I've heard so far don't pass muster.]

  • P. Lewis // March 6, 2008 at 8:23 pm

    I have to say that the way you’ve approached this (i.e. the use of artificial data) is IMHO very clever. I look forward to reading it a couple more times in greater depth.

    HB referred to other analyses above, one of which is probably this one by Wahl and Ammann (or their actual paper). However, you don’t need to get hung up on the latter, because the blog post suffices.

  • Zeke // March 6, 2008 at 10:22 pm

    Excellent article Tamino. One thing I might add is an explicit test of MM’s argument that this technique necessarily produces hockey stick shapes. An easy method come to mind:

    Do an uncentered PCA on a randomly generated dataset that exhibits a warm period or two in the past that exceeds present temperatures. See if this warm period drops out of the results.

  • Ian // March 6, 2008 at 10:42 pm

    Boulder Solar re: uncentered analyses,

    Keep going through Jolliffe’s presentation – he elaborates on one instance of “meaningfulness” in a later slide as follows: “One case where uncentred analyses are appropriate is if we can assume that the population means of our variables are zero, although the sample means are not. This is the case when the data are anomalies.”

  • Surly // March 7, 2008 at 1:58 am

    Thanks so much for this series. I struggled through a few of the equations but appreciate the way you worked us through the steps and issues. I am glad to finally get this part as I was really looking forward to reading your take on MBH and the hockey stick issue. Can you, for the sake of those of us who want to discuss this with average people who have little background, give us a take home statement?

    Thanks again.

    [Response: Take-home points:

    * The claim that non-centered (or de-centered) PCA is invalid, is itself invalid.

    * If you switch from non-centered to centered PCA, *and* do the PC selection properly, you get the same hockey stick that MBH got.]

  • Paul Middents // March 7, 2008 at 3:03 am

    Great set of posts. There may not be enough equations to ward off the trolls. I too would be interested in the data refered to by Hank. I’ll even hold the bucket while he scrubs your tile!

  • J // March 7, 2008 at 3:05 am

    Tamino, let me apologize in advance if I’m wrong about this. But here goes.

    Let’s say you have a lot of initial data sets, some of which show a 20th-century trend (”hockey-stickish” data sets), and others of which don’t.

    If you do a PCA with the data centered on the calibration period, you’ll likely end up with a hockey-stickish PC1, because PC1 will be weighted most heavily by the raw data that showed a rising trend during the calibration period.

    If you do PCA with the data centered on the series-wide means, then the hockey-stickish pattern may be relegated to one of the lower PCs.

    Either way, of course, you could select it and use it in a regression model to predict temperature or whatever.

    Now, here’s where I think there may be some fundamental disagreement. Some people may say that’s problematic — essentially, your method preferentially selects a PC component that looks hockey-stickish (in the one case by ensuring that PC1 is primarily based on the most hockey-stickish proxies; in the other case by manually looking through the PCs and picking PC3, 4, 5, or whatever happens to be the most hockey-stickish). In essence, you’re taking a bunch of proxies, some (most?) of which don’t themselves look like a HS, and using PCA to pull out only the part of the total data set that do look like a HS.

    Other people would say that there’s nothing wrong with that — during the calibration period, we know that temperatures were trending upwards, so there’s nothing wrong with a methodology that transforms the original data in such a way that it matches that upward trend.

    I’m afraid I’m not explaining this clearly (or, I may be simply flat-out wrong). But I think this may fall into the gray zone where something isn’t objectively correct or incorrect.

    If you see the MBH method as simply trying to find a transformation of the proxy data that best matches the known 20th century trend, and using that transformed data set to examine the past, this PCA based approach looks fine. On the other hand, if you see the MBH method as trying to faithfully represent the patterns in most or all of the raw proxies, regardless of whether that matches your assumptions about the 20th century trend, then this approach seems suspect.

    Both of these viewpoints actually seem to have some merit to me. But this is all outside my own field and I ought to recognize that I’m not really qualified to comment on this.

    [Response: It is possible to "seek out" a hockey-stick pattern by picking out just the PCs that look like that -- if there is one in there. There might be, but there might not!

    But that's not what MBH did. They used non-centered PCA, then applied strict "selection rules" to choose which ones to keep. They didn't keep just the hockey-stick PC (which in their analysis is the 1st), they also kept the 2nd. Applying the *same* selection rules to the centered case indicates that 5 PCs should be retained, and the hockey stick is mainly in the 4th.

    Also, you can in some cases find a hockey-stick PC if you're looking for it (but not always, it has to be there already), but that *won't* necessarily make it fit temperature very well in the modern era. The 20th century isn't just the blade of a hockey stick, it's rise from about 1915 to 1945, rough constancy 1945 to 1975, and rise again post-1975. More to the point perhaps, it certainly *won't* make the analysis pass significance tests when applied to the verification data.

    MBH didn't "go looking" for a hockey stick. They applied non-centered PCA, standard selection rules, did the regression, confirmed it with verification, and the hockey stick emerged. I guess the real take-home point is that if you use centered PCA, you get exactly the same thing.

    One more thing, which I didn't mention in the post but is surely relevant. You can do the analysis *without* PCA. It suffers from a loss of precisoin because of the vastly greater number of degrees of freedom, but is still valid. And what emerges? A hockey stick. You can also make a random selection of only a limited number of proxies, and repeat that process many times with different random selections, and average the results. The result? A hockey stick.

    The simple fact is that the hockey stick pattern is most assuredly NOT an artifact of the use of non-centered PCA. You get the same thing with centered PCA (using proper, and objective, selection of PCs), with all the proxies as is, and with random subsets of proxies.]

  • Bob North // March 7, 2008 at 4:52 am

    HB/Tamino -
    Thanks for the articles on PCA. I am sure that I will have to go through the whole series as couple of more times before I fill confident in my understanding, though I definitely feel I better understand things as it stands.

    To kind of follow along with Hank R’s question regarding posting the underying simulated data, one thing that struck me was that the correlation of your proxies with PC1 were a “little too good”. I don’t believe you would get that good of correlation if you looked at GISS v Hadcrut v NCDC v UAH v RSS (normalized to the same base period of course). If you could give a little more information on the way you developed your simulated proxies and added noise and made sure they didn’t vary too perfectly witht simulated temperature, it would be helpful to understanding the process. Also, any further thoughts on why the uncentered analysis provided such a better match than the centered analysis would also be helpful.

    One final thing I have been thinking about relative the whole question of using PCA on proxies such as tree rings is this. What if the primary PC actually represents a combination of environmental factors (e.g., some combination of temp, rel. humidity and soil fertility) rather than a single factor such as temperature. Since a PC is nothing more than a mathematical construct, is there any reason why we should assume it would represent a single environmental variable?

    regards,
    Bob North

    [Response: Indeed the fit is a *lot* too good! But it's artificial data for illustration purposes, so it's kinda designed that way.

    In addition to an artificial temperature series I also generated an artificial "other influence" series. Temperature was designed as a hockey stick (actually, to mimic early-20th-century warming, mid-century flattening, then warming) plus random noise. It was flat (except for the random noise) before 1900. For "other influence," I just sketched a smooth curve from 1400 to 1980 in a meandering way, digitized that, and added random noise. The proxies were generated as linear combinations of the two influence curves, plus random noise. Different proxies were given different (random) response rates to both temperature and "other influence."

    The "too-too-good fit" is because there are only 2 influences (real life has many more), the response to the influences was strictly linear (I suspect in real life there's some nonlinearity), and the noise was about the right size but certainly *consistent* (although different) for all the series. Hence it was easy for PCA to separate the signal from the noise, and to separate the two different signal components.

    Finally, it's indeed possible that PCA is isolating a "combined" influence, which is common to all (or most) proxy responses, and is partly but not entirely temperature. PCA really doesn't isolate a temperature influence, or isolate physical processes -- it isolates correlated variation. It's up to the regression on the calibration data to isolate the temperature influence, and it's up to the verification data to show that the relationship is strong enough actually to estimate unknown temperature.]

  • MrPete // March 7, 2008 at 4:53 am

    What’s most interesting to me in this is the challenge of connecting PCx to any specific physical meaning.

  • Ed Davies // March 7, 2008 at 12:12 pm

    Am I right in visualizing that the effect of the centred PCA is to promote the random noise in the data to the first few PCs?

    [Response: No, not random noise. Just plain old random noise is always gonna be far down the list. It can, however, promote (only) one of the meaningful variations by magnifying its total variation.]

  • MrPete // March 7, 2008 at 12:13 pm

    Tamino, you said MBH didn’t “go looking” for a hockey stick. They applied non-centered PCA, standard selection rules, did the regression, confirmed it with verification, and the hockey stick emerged.

    Now we’re getting to the crux of the matter.

    Tamino, haven’t you just clarified for all of us kindergarten stats students ;) that PCA is simply a mathematical transformation of the data, a sophisticated “sorting” if you will? That it brings the “strongest” signal in a sense to the fore? Doesn’t matter what that “signal” means. Correct?

    So, PCA is agnostic with respect to source and meaning of signal. That tells me the interpretation is still key. You need good data. You need good reason to assign physical meaning to that data. Correct?

    More importantly, PCA is a method that will pull a strong signal to the fore in a field of mostly-weak signals. Toss a pile of data in, get the strongest signal out. Correct?

    This also means, if there’s a few strong signals present in a large data collection, most of the data I toss in has little if any impact on the outcome and can be ignored. You said it yourself: it’s a nice method of data reduction.

    The methoddoesn’t care what the “strongest signal” means, doesn’t care if it is truly significant for our study. And most important, the method doesn’t care if that “signal” has any physical validity with respect to what is being studied. That’s all in the hands of the scientist feeding data in and interpreting the information that emerges.

    What Steve demonstrated, and many others have confirmed (most recently Craig Loehle) is that the hockey stick is incredibly dependent on a small portion of the data set. More than anything else, in the proxy series used to create the infamous hockey stick graph, it is dependent on the presence of stripbark BCP data from a tiny region of the world. The rest of the proxies are so weak as to have little impact on the results.

    The hockey stick proves too much. It proves that stripbark data (up to the 1980’s) was nicely calibrated to the temp record for a certain period of time in the 20th century.

    Remove stripbark and the hockey stick begins to melt.

    Substitute a variety of other “hockey stickish” data, that clearly has nothing to do with climate at all (e.g. stock prices) and you get a hockey stick with similar stats.

    But even more telling: update the BCP data itself and you get the “divergence problem.”

    Tamino, it’s good that you’ve elucidated the analytical details of PCA. Don’t lose the forest for the trees. Step back and consider what’s going in and coming out. Dont’ get caught treating it as a magic box where you put in a number, do A, B, C and voila the answer is 42!

    You get the same thing with centered PCA (using proper, and objective, selection of PCs), with all the proxies as is, and with random subsets of proxies.

    Not if you leave BCP out. Various people believe BCP is irrelevant. If so, then it should make no difference at all if one leaves BCP’s out of the equation, shouldn’t it! Same thing with the Polar Urals data.

    There’s nothing wrong with using a method that highlights certain data in a large data set. But it must be used with care. All those “less strong” signals are telling us something as well. So is the “noise.” We can’t presume to know that the signal we’ve found is a temperature signal.

    [Response: You're right that PCA doesn't generate the patterns -- just affects the order of their prominence -- and that it doesn't care about their physical meaning.

    You're quite confident that if we remove the bristlecone pines, the hockey stick falls apart (and indeed that's a totally separate issue from PCA). But according to what I've seen, MM got *that* part wrong too!]

  • fred // March 7, 2008 at 12:47 pm

    tamino, could you explain how, if you just have a set of series, and know nothing about what they represent, and you’ve decided for some reason (not sure why you would do this) not to use the series mean, but the mean of a subset of the series, how do you pick what subset to use? Is there some mechanical method which does not depend on prior knowledge of what is in the data?

    Or is this the wrong question?

    [Response: I'm not aware of any mechanical, set procedure. I think the motivation in this case was to emphasize the differences between the mean of the calibration interval and that of the preceding centuries, so the average of the calibration interval was chosen as the "origin."

    In retrospect, I'd say it might have been better to use centered PCA -- not because it's mathematically more valid or more precise, but because it's *simpler*, gives essentially the same result, and would have made the result less vulnerable to criticism. But that's a reason relevant to persuasion rather than mathematics.]

  • steven mosher // March 7, 2008 at 1:43 pm

    Nicely done tamino. There are several other problems here that are detailed in this post.

    http://www.climateaudit.org/?p=609#comments

    [Response: He has lots of objections, both methodological and data-related. Don't take McIntyre's word for it; maybe I'll post on that issue (and others) someday.

    Despite all the criticisms that can be levelled against the MBH procedure (valid or no), the overwhelming fact is that you get the hockey stick even if you don't use PCA at all.]

  • George // March 7, 2008 at 1:56 pm

    Did MM really not get this? Did they really discard the relevant PCs just to copy the bare number of PCs used by MBH, without realizing that the different centering convention could move the relevant information up or down the PC list?”

    So, what you are saying is they discarded the hockey stick PC.

    What’s the big deal?

    It was broken anyway, right?

  • dean_1230 // March 7, 2008 at 2:23 pm

    The line in your write-up that caught my attention was the following:

    “But it came out horribly wrong! But something is going on — because the “standard” choice (centered PCA) gave us the wrong result while non-centered PCA gave the right answer!”

    In the case you show, you know that there is a “right answer” and also what the “right answer” is. But assume that you don’t know (as is the case with research, where you may have an idea but aren’t sure and want the data to confirm it). How do you tell that one method is giving the right answer and another method is giving the wrong answer?

    In other words, how do you determine that Uncentered is a better choice than centered when you have no real idea what the “right” answer is?

    You also mentioned that the MM method gives a hockey stick in PC4. Would that show that the hockey stick is a much lower level trend and not nearly as significant as some of the noise that the data has?

    [Response: As stated in an earlier response, PCA doesn't really care about the physical meaning of significant variations. Transforming statistical significance to physical meaning is the job of the regression fit, which is done *after* the PCA. And the regression uses multiple leading PCs, so that's where they get to "battle it out" over which one is really related to the physical variable under consideration (temperature).

    That's one of the reasons that the verification step is crucial. We can *always* get some kind of fit (significant or not) to whatever set of proxies we include. Verification tells us whether or not it's plausible that the relationship is meaningful.]

  • MrPete // March 7, 2008 at 3:43 pm

    Transforming statistical significance to physical meaning is the job of the regression fit

    This is a fallacy. Mathematical analysis cannot produce physical meaning.

    Finding two curves that look the same, or even are the same, is not even suggestive that they mean the same. The meaning must come from outside.

    I sure hope I’m not saying anything surprising here.

    (A fun illustration of the problem is found in Made To Stick. The Curse Of Knowledge is illustrated by the Tappers and Listeners test. Pick a well-known song. Find a friend, tap it out with your finger on a table. How often do you think they’ll be able to guess what song you’re tapping? Most people guess 50%. Actually, only a few percent succeed. Why? The meaning that is so obvious to the Tapper is not inherent in the data.)

    [Response: I'm guilty of a very poor choice of words. But I think most readers got my meaning, which is, that PCA alone doesn't reveal any relationship between signals in the data and the physical process we're trying to model. If there is one, then the regression fit will reveal it -- but even then one must abide by the ever-present caveat that correlation is not causation. That's why the verification step is so important. And even that doesn't constitute proof, just plausibility.]

  • MrPete // March 7, 2008 at 4:01 pm

    You’re quite confident that if we remove the bristlecone pines, the hockey stick falls apart (and indeed that’s a totally separate issue from PCA). But according to what I’ve seen, MM got *that* part wrong too!

    The best current exposition I’ve seen of this proposition is Loehle’s latest, which includes corrections in response to the feedback he got. He explicitly leaves all tree rings out of the picture, just to be conservative about the matter. All of the data is available, so not too hard for interested parties to do their own analysis.

    Loehle’s data is not as pretty as tree-ring-based data, because tree rings are one of the few proxies that can provide accurate dating and annual data values. But stepping back from the minutiae, doesn’t an unbiased observer have to sit up and take note when leaving out one class of data changes the hockey stick so significantly? Here is his reconstruction, including CI bounds.

    [Response: Loehle's reconstruction is a (rather bad) joke. Readers should examine this.]

  • steven mosher // March 7, 2008 at 4:31 pm

    Tamino, you didnt read the post.

    The post illustrated the point you made. It did so
    well before you posted this. So you are kinda late to this debate.

    The issue is not whether PCA can effectively mine data. The issue is can it reliably mine data.
    and what confidence can you put on that mining

    Have a look at all the tests.

    What does the mann method produce when you
    have an inverse hockey stick in your noise?

    What does the method produce when your “noise” really DOES have a signal?

    Stipulated. if you feed manns method a Real signal and noise, it will pull a signal out. DUH.
    Now feed it other food. watch out for bones in the throat

    That’s not a robust test. Feed it a

    [Response: I suspect your comment somehow got truncated.

    In any case, you and McIntyre are really clutching at straws; the desperation to discredit the hockey stick is almost hilarious. It reminds me of the con artist in Huckelberry Finn, who refuses to admit the falseness of his claims even under the most extraordinary evidence.

    Whether you use non-centered PCA, or centered PCA, or *no PCA at all*, you get a hockey stick. Unless of course you commit other errors with the effect of suppressing it.]

  • steven mosher // March 7, 2008 at 4:56 pm

    Tamino,

    You seem like a fair guy to me. You have an opinion. That’s unavoidable. It’s not a capital crime. We all have them. At times you express your opinion in ways that inflame others. Other times not. I like that. Lighting a fire under people’ s butt moves them to action. The outcome is the outcome.

    Read the post. Address the issues.

    #1. the data mining issue.
    #2. The Missed signal issue
    #3. The performance against noise signals
    that are ‘upside down’

    M&M never questioned the ability of non center ed PCA to data mine signal out of noise.

    That is not the question. The question is this.

    HOW do we verify that the SIGNAL MINED
    is , in fact, a signal?

    The method will mine a signal. prove it’s gold
    and not iron pyrite

  • fred // March 7, 2008 at 4:58 pm

    So are you saying that it makes no material difference which subset of the data you take the mean of? One subset is as good as another?

  • MrPete // March 7, 2008 at 5:05 pm

    Tamino, you’re just appealing to authority by linking to the RC posting. Even the valid RC complaints only apply to the original paper without corrections. Do your own work. Loehle’s work is no joke.

    I for one am not desperate to discredit the hockey stick. I’m desperate to find good data and good analysis.

    My frustration is that when I stand in front of a Bristlecone Pine (Almagre is visible in the distant mountains outside my window right now)… it is painfully obvious that stripbark data is not a temp proxy. Yet people persist in acting as if it were so. That is the rather bad joke.

    You can cover it up with fancy analysis. You can take potshots at the details. But Loehle did a very simple thing: he removed just bad data from the proxy pile, and the entire picture changed. Nobody has challenged that, including RC.

    You want to try, go ahead. Because it sure appears that Loehle, and others, have falsified the hockey stick premise.

    Leave out the known-bad small fraction of the data, and you don’t get The Stick.

    Try it, Tamino. Falsify Loehle. It should be simple, right? You’ve made the claim several times that the HS will show up no matter what, unless you “commit other errors.”

    All “we” are asking you to do is leave the tree rings out. Do your analysis. If it’s that easy to falsify Loehle, you’ll have a nice feather in your cap.

    [Response: Perhaps some time (maybe even soon) I'll post about Loehle. But it's important to realize that while I take (and sometimes fulfill) requests, I don't do challenges or demands.

    In fact I think it's a very common (and very effective) strategy of denialists simply to keep throwing stuff -- whether it has a point or not -- at climate science. It keeps a lot of us busy doing less-than-useful stuff, and enables the opposition to "set the pace" for the discussion. If some day I *do* falsify Loehle, it doubt it'll be a "feather in my cap." It'll just be an invitation for denialists to throw the *next* piece.]

  • Christopher // March 7, 2008 at 5:07 pm

    Any chance you could post some code to this? I’m still working thru the first 3 PCA posts and simply want to do what you did step by step. For some reason I’ve always had a hard time with PCAs. Any code (R, Matlab, pseudo-code even) would be very helpful, for me anyway, to have this finally sink in. Also, some caveats as the when to use PCA. That might sound broad but one of my laments as a past graduate student having to take a fair amount of stats coursework is that you are given canned examples with the right tool already picked. I always wanted a more practical angle where you have a problem and first need to figure out which tool in the toolbox is best. So any such rules of thumb, cardinal assumptions about your data, and the like… that you could offer based on your experience would be very helpful.

    Another comment: I’ve always found it odd that the MBH debate figures so prominently (maybe this is my perception). In terms of debunking or supporting AGW I’ve always thought it was meaningless. I’m more interested in the ice core data and how current trends fit in (they don’t).

  • George // March 7, 2008 at 5:10 pm

    Bulldog says:

    It reminds me of the con artist in Huckelberry Finn, who refuses to admit the falseness of his claims even under the most extraordinary evidence.

    Perhaps it should be renamed the “Hucky-stick” debate.

    [Response: Thanks for a good chuckle.]

  • J // March 7, 2008 at 5:26 pm

    fred writes:

    So are you saying that it makes no material difference which subset of the data you take the mean of? One subset is as good as another?

    If you’re going to be using the PCA to estimate some thing like, say, temperature, then you want to use a subset for a time period when you have known temperature data to use as calibration. Thus, the mean is calculated for the 20th century, when temperature data are already known. It wouldn’t be useful to calculate a mean for, say, the 17th century, because we don’t have calibration data for that period.

  • dhogaza // March 7, 2008 at 5:35 pm

    Towards the beginning of this thread, I posted …

    Why not just delete those comments that aren’t relevant to your post, i.e. attempts to divert the discussion to “strip-bark samples”, etc?

    And sure enough …

    What Steve demonstrated, and many others have confirmed (most recently Craig Loehle) is that the hockey stick is incredibly dependent on a small portion of the data set. More than anything else, in the proxy series used to create the infamous hockey stick graph, it is dependent on the presence of stripbark BCP data from a tiny region of the world.

    And the CA crowd are off to the races.

    I think McIntyre and the brighter inhabitants of CA have figured out long ago that M&M’s “shattering” of the hockey stick via statistical analysis has failed, despite Wegman’s supporting testimony. I imagine this is why there’s so much effort to undermine the legitimacy of the proxy data.

    But it’s a bit humorous to watch the performance here. HB writes a fine series on PCA and rather than say, “OK, McIntyre was wrong about that”, what we see is an attempt to shift the discussion to another topic altogether …

    Now, something specific …

    Tamino, you’re just appealing to authority by linking to the RC posting. Even the valid RC complaints only apply to the original paper without corrections.

    The corrections ignored most of the valid RC complaints, therefore those complaints apply equally well to the very partially corrected paper.

  • dhogaza // March 7, 2008 at 5:40 pm

    My frustration is that when I stand in front of a Bristlecone Pine (Almagre is visible in the distant mountains outside my window right now)… it is painfully obvious that stripbark data is not a temp proxy.

    *how* is it “painfully obvious”? One thing science teaches us is that things that seem intuitively obvious are often just wrong.

  • John Cross // March 7, 2008 at 7:02 pm

    steven mosher: you wouldn’t be visiting “Tammy town” since it ” is a fun place to explore rhetorical probing missions.”

    It seems like it.

    Regards,
    John

  • Lazar // March 7, 2008 at 7:05 pm

    MrPete,

    More than anything else, in the proxy series used to create the infamous hockey stick graph, it is dependent on the presence of stripbark BCP data from a tiny region of the world. The rest of the proxies are so weak as to have little impact on the results.

    In that case, when excluding bristlecone pine series and not using PCA at all, there should not be a ‘hockey-stick’ result. Then can you explain Wahl & Amman 06, Figure 2, green line?

    PS, after excluding bristlecone pine series, Wahl & Amman compute much inferior (but marginally significant) verification scores over the 15th century, suggesting that there is a scarcity of proxy data extending back to 1400, but also that BCP series are NH temperature proxies.

  • Hank Roberts // March 7, 2008 at 7:34 pm

    “rhetorical probing missions”

    Is that line from one of the troll listservs? It sounds so familiar.
    http://www.hyphenologist.co.uk/killfile/anti_troll_faq.htm

    The organized ‘missions’ disrupting conversations have long been part of life on the Net.

    Note the IP numbers used for the record. Save them. Some day web hosts will catch up with the Usenet folks in the anti-spam and anti-troll techniques, and they’ll be useful.

  • David B. Benson // March 7, 2008 at 7:40 pm

    Tamino — This is, once again, excellent!

    I currently understand the need for determining which PCAs to use prior to exploring the data, yes? But I have no conception of how to go about deciding which PCAs to use.

    Enlightment is kindly requested.

    [Response: The procedure used by MBH is this: study the spectrum of eigenvalues (the variance attached to each PC). Compare that to what would be expected if the signal were noise (with the same autocorrelation structure as the data). Those PCs whose eigenvalues are significantly higher than expected from noise, are included in the regression.]

  • cce // March 7, 2008 at 7:56 pm

    Isn’t one of the main reasons for including the BCPs (strip bark or otherwise) is that they contain an ENSO signal? Isn’t that why they strengthen the verification scores?

  • Lazar // March 7, 2008 at 8:07 pm

    Loehle… no areal weighting, no idea as to how well the series represent local or global temperatures, no verification stats, only eight series beyond 1935, out of a total of eighteen. It remains a joke.

    The corrections ignored most of the valid RC complaints, therefore those complaints apply equally well to the very partially corrected paper.

    Indeed.

  • BoulderSolar // March 7, 2008 at 8:36 pm

    After reading this post and McI post about PCA I see there is some talking past each other in a technical sense. The assumption made in MBH is that the “signal” (ie temperature) extracted in a PCA analysis MUST match the temperature in the calibration period. This means that the PC selected is one which has the best correlation to the calibration data. For MBH noncentered procedure it is PC1 and PC2. For a centered procedure it is not PC1.

    MM shows that by doing this, a real signal in the proxy data gets lost if your criteria is for best correlation in the calibration period. This can happen especially because of the very few data points in the calibration period for the proxies.

    I can see valid points for both approaches. What makes me lean toward MM’s viewpoint is that they showed how totally random proxies would always produce the hockey stick. Tamino actually confirms this when he insists that the only valid PC to use in a centered analysis is the one with the hockey stick.

    [Response: That's not what I said at all. I stated that objective criteria must be applied to selecting PCs, and the *same* criteria must be applied when using centered as non-centered PCA. Nor did I use only the "hockey stick" PC when doing the centered analysis on the artifical data. I used both the first 2 PCs in a multiple regression, and it turned out that the regression itself indicates that the 1st is no real indicator of temperature. The reason the correlation graph shows only the 2nd (hockey stick) PC is that I wanted to graph the correlation, which for 2 predictors requires a 3-D graph -- and those are far less clear than 2-D graphs (this is, after all, a blog post for illustrative purposes).

    And by the way, the mistake about non-centered PCA being invalid (which according to some reader comments, McIntyre now disavows) isn't the only mistake in the MM analysis. It's *really* not a good idea to take McIntyre's analysis, or blog posts, at face value.]

  • Lazar // March 7, 2008 at 8:43 pm

    Steven Mosher,

    Read the post. Address the issues.

    I believe that HB already has…

    #1. the data mining issue.

    which seems a non-issue, to quote HB; “a non-standard centering convention will indeed emphasize the variation of the mean, which turns out to be the main difference between the 20th century and the rest of the time span. What he fails to realize is that that’s the point. He’s also right that the changes associated with it will become more prominent in the PCA analysis, causing those changes to “move up” the PCA list. Again, that’s the point.” — HB

    #2. The Missed signal issue

    Eh?… here McIntyre has decided a priori, that is without reference to anything physical or to any aim of any experiment, (i.e. entirely arbitrarily), what he calls the (artificial) ’signal’ and what he calls the (artificial) noise, and then claims, falsely, that the signal is ‘missed’ by an MBH methodology to which the data has no relevance or meaning. Of course the ’signal’ is in a higher order PC, and an actual experiment would consider whether or not to include the higher order PC, but since the data has no relevance to any actual experiment, the point is moot and entirely tangential to anything MBH have done.

    #3. The performance against noise signals that are ‘upside down’

    M&M never questioned the ability of non center ed PCA to data mine signal out of noise.

    That is not the question. The question is this.

    HOW do we verify that the SIGNAL MINED
    is , in fact, a signal?

    HB again; “PCA most certainly cannot “create” a statistically significant relationship between that variation and the observed temperature data. That the relationship is significant emerges from both the regression itself, and the testing done on the verification data”

  • RomanM // March 7, 2008 at 8:44 pm

    You wouldn’t care to share your data with the rest of us so that we might try some experiments of our own with it, would you? It might be interesting to see what it looks like.

    Apart from the fact that your post is based on a sample of size one, you neglected to address the other side of the coin – does this non-standard methodology also create hockey sticks when they are not present in the data?

    [Response: Sigh... It doesn't -- certainly none that will pass verification. But I didn't address the issue of bristlecone pines, or tree ring data in general, or how well the instrumental record serves to calibrate proxies, or any number of others. This post is about the criticism that non-centered PCA invalidates the MBH analysis. It doesn't.

    I also get the impression that certain folks would like to take *artificial* data, created only to illustrate a point, for a blog post no less, and subject it to an "audit." No thanks -- I've got way better things to do.]

  • Lazar // March 7, 2008 at 9:08 pm

    cce;

    Isn’t one of the main reasons for including the BCPs (strip bark or otherwise) is that they contain an ENSO signal? Isn’t that why they strengthen the verification scores?

    Wahl & Amman suggest so;

    MM directly note the link between
    bristlecone/foxtail pines and precipitation (p. 85, MM05b), which is exactly the kind of largescale pattern registration that the MBH CFR method takes as axiomatic because large portions of this region are known to have important ENSO/precipitation teleconnections (cf. Cole and Cook, 1998; Rajagopalan et al., 2000). Since ENSO has a strong role in modulating global temperatures as well as affecting regional precipitation patterns, a CFR method of temperature reconstruction can effectively exploit regional ENSO/precipitation teleconnections that register in proxy data.

    RomanM;

    you neglected to address the other side of the coin – does this non-standard methodology also create hockey sticks when they are not present in the data?

    Eh? Sigh… HB has just shown, over a series of four extensive posts, that PCA does not “create” anything that is “not present in the data”.

  • MrPete // March 7, 2008 at 9:11 pm

    dhogaza, *how* is it “painfully obvious”? One thing science teaches us is that things that seem intuitively obvious are often just wrong.

    Exactly. I don’t know if this is an east-west thing, but it seems so. Scientists sitting on the eastern seaboard of the USA, or perhaps even those who live in the hot desert of Arizona, may conjecture that tree growth is temp limited — because that’s how it works where they live. Sadly, their intuition is failing them in other places.

    On the eastern seaboard, there’s generally plenty of precipitation, and even more there’s plenty of humidity. (Gotta love those 100-100 summer days :) ) You can look at a USDA plant hardiness map and see nice bands that correlate well with minimum temperature going right up the east coast. In Pennsylvania, as Tamino pointed out earlier, temperature is pretty consistent over wide areas. Same as in upstate NY where I spent a lot of growing-up years.

    Now let’s go to the high mountains of Colorado and Southeast California. First obvious thing: Here in Colorado, it is dry. Not quite desert-dry, but incredibly dry. In a good year, there’s some moisture to help trees grow well. In a bad year, there is almost none. How snow collects, how sun melts it (or not), and how the snowmelt flows makes a huge difference. On Almagre, BCP’s are found in protected spaces where snowmelt water exists.

    It’s also windy here. In other places, 40-60mph wind with gusts to 80 might be considered a small hurricane. Here? That’s a nice storm. We batten down the hatches and hope the homes under construction don’t blow away. I’ve had a wind gust break the deadbolt lock on our front door. With wind like that, slopes and hills make a huge difference in microclimate. Our home garden is placed specifically to avoid the wind. Makes all the difference in plant growth. Same thing on Almagre. BCP’s are in places at least somewhat protected from the worst wind. (And if not — then a storm is just as likely to destroy the tree as a nearby tree falls.)

    Third, there are a variety of other important factors when you’re at altitude. The available soil is not deep. Nutrients are hard to come by. Thus geology is a major factor — anything that causes available nutrients to vary over time. A good gullywasher storm can significantly change the available nutrients for many many trees.

    Finally, and most important: look at the trees. Whole bark trees grow reasonably uniformly. Strip bark trees grow irregularly. One look and it is obvious: strip bark trees are trying to recover from the damage they’ve sustained. (Dig in on that and you’ll find plenty of science — including astounding things we’re learning about plant hormones and how roots and shoots “talk” to ensure there’s an appropriate balance of nutrients, etc.)

    All the things I’ve mentioned are huge factors. They’re so obvious to plant growth experts, like gardeners, that when my wife made a presentation to the Colorado Master Gardeners** about BCP tree ring studies, the room filled with laughter at the suggestion that temp was the major signal here. And that was a room filled with pro-AGW people. They couldn’t believe their ears.

    I’m not suggesting that temp is completely irrelevant here in the mountains, but it’s proven to be less significant than other factors by plant biologists here, and it is less significant as a practical matter to those who do high altitude gardening.

    Perhaps this helps the reader understand why I’m a big advocate of multidisciplinary science. The challenges we face require the insight and understanding of folks from many areas of expertise. Climatology is one important specialty, but so is statistics, dendrobiology, geology and more.

    Tamino’s posting is a great intro to the math. But as he’s pointed out, the math does not do provide a physical interpretation of the data.

    [Just between me, you and the fencepost, I have a hunch. You can score me on this in a hundred years or so :)… my gut sense is that many stripbark measurements will tend to show a “recent” growth spurt, simply because the tree is trying to survive. If it fails, it’s dead and we don’t measure it. (Once seriously stripped, the bark never heals completely.)

    BTW, I’m not trying to draw attention away from Tamino’s very good exposition on PCA math. Gotta get the math right. We also must get other parts right. Some things have a bigger impact than others. And of course the most important by far is to know what you’re looking at. We can yell about numbers all day long but if we don’t even know what the numbers mean, their value is unimportant.

    ** PS Master Gardeners are volunteers who disseminate research-based advice, carefully limiting their statements to scientifically tested material published by the state university systems across the nation. A great use of tax dollars to promote good science!

  • Lazar // March 7, 2008 at 9:17 pm

    HB;

    I stated that objective criteria must be applied to selecting PCs, and the *same* criteria must be applied when using centered as non-centered PCA.

    I would say that if excluding a PC
    a) greatly alters the shape of the reconstruction, and b) reduces scores over a verification period, then it should not be excluded.
    Would those be ‘objective’ criteria?

  • BoulderSolar // March 7, 2008 at 9:31 pm

    Tamino,

    The verification procedure used in your and MBH’s PCA is the correlation during the 20th century temperature. This is what I refer to as the hockey stick. This is also one of your “objective criteria”. The point that I think MM seems to make is that by doing so the best PC will not produce the actual temperature for prior years. They illustrate this by using totally nontemperature dependent data sets which still produce good verification (i e hockey stick).

    As a test of this what if your test data had a temperature record which had large scale variations higher than during the verification period and that the proxy data had the limited amount of data points during the verification period that MBH had. Would we see the good correlation to actual temperature for prior years?

    I appreciate your time and effort to explain this. However your snipe about not taking anything McI posts at face value is a bit ironic coming from one who hides behind an alias

  • RomanM // March 7, 2008 at 9:33 pm

    I agree that bristlecones, calibration, etc. are important considerations to MBH, but peripheral to what you are doing here. There is also a serious problem due to the fact that proxies are time series whose correlation structure has a serious impact on the correlation or covariance matrix used to extract the PCs. However, I maintain that it is not at all clear what impact using uncentered series may have on creating hockey sticks and this is central to your demonstration.

    It is unfortunate that you seem to have so little confidence in the correctness of your analysis that it prevents you from allowing someone else to look at it. There are simply too many imprecise statements in your post that it raises questions about exactly what it was that you did:

    “I created an artificial temperature time series which follows a distinct “hockey stick” pattern, and also generated 40 separate artificial proxy data time series, all spanning the time interval from 1400 to 1980.” … “The proxy data series were generated by computing a response to the temperature signal and other factors as well. And all the series include random noise.” … “Then I normalized each series so that they’re all about the same ’size’.”

    These statements could mean just about anything. How correlated (both timewise and between each other) were your proxies? How strong was the “signal” compared to other variation. Hown variable was the time signal outside the calibration time period? Were you doing correlation or covariance PCs – how did the “normalizing” affect those results? What were the relative sizes of the eigenvalues? You are trying to convince us that this method is “correct” based on this one constructed example so answers to these questions are pretty important. I had thought that the data might provide some of the answers.

    Lazar:

    I must have missed some of the posts which contained the “proofs” that you refer to. They aren’t in the textbook that I am currently using.

    [Response: Permit me not to believe you. Here's what I think you're aiming at. If I put the data online, you'll find fault whether it's valid or not. If I make the computer code available online, you'll find fault whether it's valid or not. If I then demonstrate that your supposed fault is faulty, you'll either disbelieve the proof or find another fault. Etc., etc., ad infinitum. You'll never stop.

    That's *exactly* what McIntyre has done to the MBH results. Despite every one of his criticisms being shown to be faulty, he either maintains that it's not so, or moves on to another one -- all so he can keep attention focused on the idea that the hockey stick is wrong, rather than the simple fact that the globe is getting warmer due to human activity. It's a very effective application of the old adage, "the best defense is a good offense." But I'm not gonna play your game; I really do have way better ways to spend my time. One would hope that you do too.

    What's next? Will you try to force WordPress to disallow any posts that don't have their data archived for universal web access? Am I next in line after Al Gore to be taken to court in a lawsuit?]

  • MrPete // March 7, 2008 at 9:40 pm

    One more brief interlude from the math, then I’ll quit (gotta do my bookkeeping :-) )

    Something perhaps helpful on precipitation vs temp… which speaks to things like ENSO teleconnection etc as well. Some good studies have been done to examine tree growth at an incredibly fine scale — to see at what time of year the “annual growth” occurs, etc. What they found is that all the significant tree ring growth takes place in the spring (at snowmelt time.) If conditions are good for growth then, you get good growth. If not, you don’t.
    The growth is essentially nonexistent and also independent of climate variations that take place during the rest of the year (other than the fact that major storms, deep freeze, fire, etc can cause great damage and put the tree into “recovery” mode…)
    Sorry, I don’t have the reference handy. Perhaps someone else can supply. Truly a fascinating study!

    Back to your regularly scheduled PCA math lesson :)

  • Lazar // March 7, 2008 at 9:41 pm

    BoulderSolar;

    The verification procedure used in your and MBH’s PCA is the correlation during the 20th century temperature. This is what I refer to as the hockey stick. This is also one of your “objective criteria”. The point that I think MM seems to make is that by doing so the best PC will not produce the actual temperature for prior years.

    No, the data is ‘trained’ over the period 1902-1980, but the verification period is 1854-1901. To pass, the reconstruction scores need to be significant over both periods.

  • Spence_UK // March 7, 2008 at 9:43 pm

    Ermmm… no.

    The MBH98 PCA approach mines for hockey stick shaped data. What this means is that:

    1. Historical temperature is hockey stick shaped – MBH98 method will produce a hockey stick shape

    2. Historical temperature is NOT hockey stick shaped – MBH98 method will produce a hockey stick shape, IF just one “rogue” hockey stick proxy unrelated to temperature is used.

    All you seem to have done is demonstrated is (1), which we already knew. Unfortunately you have failed to touch on the real issue, which is (2).

    In statistical language, MBH98 has no power to determine whether history was hockey stick shaped or not, because it is predisposed to producing hockey stick shaped output, even if there is just one rogue hockey stick proxy. Von Storch demonstrated this with German unemployment and McIntyre demonstrated it with tech stocks.

    Another way of putting it: the hockey stick shape is built into the MBH98 assumptions. Likewise here, by assuming there is a hockey stick shape up front, you have also put your conclusions into your assumptions.

    McIntyre understood this; Wegman understood this; Von Storch understood this; the GRL reviewers understood this. Evidently still not everyone does.

    That said, if you think you’ve got a point, write it up and submit to GRL. Here’s a tip: it won’t get published, because it misses the point. That’s why Steve’s work got published, and yours won’t.

    [Response: Here's a tip for you: faulty results (like MM) get published all the time, even in GRL. And far more often in "Energy & Environment." Von Storch at least had the honesty to *admit* his mistake.

    As for illustrating the faulty nature of MM's work, that too made it to the peer-reviewed literature: Wahl & Amman, Climatic Change, DOI 10.1007/s10584-006-9105-7, and Amman & Wahl, Climatic Change, DOI 10.1007/s10584-007-9276-x, to name just a few.]

  • dhogaza // March 7, 2008 at 9:44 pm

    HB said, quite clearly

    one would use part of the known data, then create the prediction, then check prediction against observation in the remaining part. This is “verificiation,” an independent check on whether or not the method is working. We can’t do this if verification uses the same data we used to calibrate — that would only show that if we fit our proxies to the calibration data

    Yet BoulderSolar persists in believing that

    The verification procedure used in your and MBH’s PCA is the correlation during the 20th century temperature.

    No, that’s the period used for calibration, not verification.

    HB didn’t do the verification work in his post, indeed stated explicitly that he’d skip that step in his example, for simplification.

  • dhogaza // March 7, 2008 at 9:47 pm

    However your snipe about not taking anything McI posts at face value is a bit ironic coming from one who hides behind an alias

    The use of one’s real name doesn’t change the usefulness of their arguments one bit. Nor does the use of a pseudonym. That’s an ad hominem attack on your part.

    Have you ever read The Federalist Papers?

  • cce // March 7, 2008 at 9:52 pm

    Forgive me if this has already been tried, but if the BCPs contain a signal for ENSO, PDO, or whatever, isn’t that easy to test? We have the new Ababneh chronology that the Auditors keep talking about. We have various indexes, (this one going back to 1850 and beyond, http://jisao.washington.edu/datasets/globalsstenso/). Don’t we have everything we need?

  • Lazar // March 7, 2008 at 10:01 pm

    RomanM;

    I must have missed some of the posts which contained the “proofs” that you refer to.

    Then I would suggest you read the first sentence in this blog entry.

  • Spence_UK // March 7, 2008 at 10:09 pm

    Tamino -

    Faulty work gets published in peer review all the time – agreed, but that applies equally to MBH98 and Amman and Wahl. Note that got rejected from GRL too, just like your post here would. Seems like E&E and Climatic Change are two peas in a pod.

    Anyway, a challenge to you. In your response to RomanM on the same point I made:

    “Apart from the fact that your post is based on a sample of size one, you neglected to address the other side of the coin – does this non-standard methodology also create hockey sticks when they are not present in the data?”

    you replied:

    “[Response: Sigh… It doesn’t — certainly none that will pass verification. ”

    OK, so if the hockey stick is “artificial” and not in the original data, the verification period will have problems. Now, you’ve tested the quality of the match with those pretty scatter plots above (very informative, thanks for those).

    How about plotting from MBH98 – just the final data set for the 1400AD step, so you don’t have to do all the work, the data is publically available – a scatter plot for the calibration period, and a scatter plot for the verification period. All fair, using the techniques you’ve applied above. And we can see for ourselves, using your own methods, how well the MBH98 verification performs for the 1400AD step.

    If you don’t want to do it, I’ll happily do it for you and link it here in the comments. Just let me know.

  • Otto Kakashka // March 7, 2008 at 10:13 pm

    MrPete,

    perhaps you should take a look at Rossi et al., 2007: Evidence of threshold temperatures for xylogenesis in conifers at high altitudes.

    http://www.springerlink.com/content/978017037815120w/

    First line of abstract: “Temperature is the most important factor affecting growth at high altitudes.”

  • null{} // March 7, 2008 at 10:20 pm

    tamino/HB, a most excellent direct response to the technical issues mentioned by RomanM // March 7, 2008 at 9:33 pm.

  • dhogaza // March 7, 2008 at 10:22 pm

    The MBH98 PCA approach mines for hockey stick shaped data. What this means is that:

    2. Historical temperature is NOT hockey stick shaped

    Even if your first statement were true, which it’s not, your second statement does not follow.

    And it flies in the face of the fact that other analysis done by others, which doesn’t use PCA at all, reveals …

    a hockey stick.

    Your argument boils down to “it’s not a hockey stick because I don’t believe it’s a hockey stick”.

  • BoulderSolar // March 7, 2008 at 10:25 pm

    Like the replies! I’m learning. So after doing a PCA we pick the PC(s) which have the best correlation to the verification period. In this case the verification period is just prior to the calibration period. The question that is paramount is that by doing this do we get the actual temperature for prior periods? Why does using the NASDAC index still produce a result which passes verification?

    [Response: No! We don't "pick the PC(s) which have the best correlation to the verification period." We pick the PCs which pass the selection criteria -- all of them. Then we regress temperature against *all* those PCs (multiple regression) during the calibration period. It turns out that those which don't correlate with temperature get very low coefficients in the expansion, and so end up having very little impact on the reconstruction (which is as it should be); those that do correlate with temperature get larger coefficients and *do* end up having a significant impact on the reconstruction (which is as it should be).

    Only *then* do we check the reconstruction against observed temperature during the verification period. That's "verification." It seems to be the one step that critics consistently omit.

    The level of illogic, and falsehoods about what I've said or intended to say or meant, has become downright disturbing.]

  • MrPete // March 7, 2008 at 10:39 pm

    Tamino, you said about Steve M’s assertions: Despite every one of his criticisms being shown to be faulty…
    I’m sorry but that’s a bit much, unless you are saying Wegman was full of it. If so, we’ve got quite the scientific tussle going here, don’t we?! As a peanut gallery observer, this is sounding like more he said / she said argument.

    Hard part is that us non-experts have no way to evaluate the assertions.

    Looks to me like Steve and Tamino need to sit down and have a face to face chat. You’re talking past one another. As real people, I’m confident much would be better understood if you both found a way to talk it over.

    Speaking of which, Lazar, I’m sorry but I can’t help you with your question. I don’t have access to the papers nor diagrams you refer to. I can answer field questions and a bit more, but will bow out from the math food fights. I’m not a player in that game.

  • MrPete // March 7, 2008 at 10:55 pm

    Tamino, something you just said confused rather than enlighted. Probably my own stupidity :)

    [PC's] that do correlate with temperature get larger coefficients and *do* end up having a significant impact on the reconstruction…
    Only *then* do we check the reconstruction against observed temperature during the verification period. That’s “verification.”

    So all this is to say:
    * We use PCA to pull out the strongest “signal” curves, i.e. the curves with largest non-noise wiggles, so to speak.
    * We regress against an (arbitrary, hopefully) calibration period (C) to find the best match.
    * We then test (validate) against a different period (V), to discover if the signal fits both calibration (C) and verification (V) periods.

    Sounds to me like the argument over “mining for desired signal” is really a question of which of the following is going to emerge. Either

    a) There’s a “signal” that matches both C and V. We then assume it is temperature.
    b) There’s a “signal” that matches C but not V. Hypothesis falsified.
    c) No signal matches C. Hypothesis falsified.

    The problem seems to be: how probable is it that a C+V match is actually temperature?

    One side argues “highly probable; if it were not temp it wouldn’t match.”

    Other side argues “not so probable. In fact, we can plug in almost any kind of random or arbitrary ’stick-ish’ data and get the same result. The method just pulls out hockey sticks.”

    Leaving aside the acrimony. my question is: how DOES one calculate the probability or CI that the signal is actually temp? How does such a calculation take into account the results found by plugging in other essentially random sticklike data that produces statistically similar results? On the surface, it would seem the method is not actually telling us much.

    IIRC, someone created a stick-generator that used the MBH method plus random noise to generate hockey sticks. If so, the fact that the method produces results that pass C+V seems less than exciting.

    Where am I confused?

    [Response: Generally, "stick generators" don't pass verification. There's also a bit of a logical inconsistency in the phrase "essentially random sticklike data."

    Then of course there's the fact that even if a result is ridiculously improbable -- say 1 in 10,000 (which is plenty to claim statistical significance), if you try to find it 10,000 times you probably will. Such an effort is called a "fishing expedition."

    And of course there are other pesky facts ... like if you use centered PCA you get a hockey stick. If you don't use PCA at all, you get a hockey stick.

    There's no absolute guarantee, no Godlike assurance from on high, that the signal which mimics temperature in the calibration interval, and reproduces it correctly during the verification interval, is actually caused by temperature. There's *always* a chance (possibly a ridiculously tiny chance) that the result it due to accident.]

  • dhogaza // March 7, 2008 at 10:58 pm

    I’m sorry but that’s a bit much, unless you are saying Wegman was full of it.

    Well, yes, that’s essentially what HB said, didn’t you read the post carefully?

    In his post, HB said…

    First let’s dispense with the last claim, that non-centered PCA isn’t right. This point was hammered by Wegman, who was recently quoted in reader comments thus

    followed by the quote (which you can read above).

    HB then goes on to say

    Just plain wrong. Centering is the usual custom, but other choices are still valid; we can perfectly well define PCs based on variation from any “origin” rather than from the average. It fact it has distinct advantages IF the origin has particular relevance to the issue at hand. You shouldn’t just take my word for it, but you *should* take the word of Ian Jolliffe, one of the world’s foremost experts on PCA, author of a seminal book on the subject.

  • Lazar // March 7, 2008 at 11:05 pm

    In statistical language, MBH98 has no power to determine whether history was hockey stick shaped or not, because it is predisposed to producing hockey stick shaped output, even if there is just one rogue hockey stick proxy. Von Storch demonstrated this with German unemployment and McIntyre demonstrated it with tech stocks.

    Am I reading you correctly? Are you implying that, supposing the real temperature history is not ‘hockey stick’ shaped, i.e. it has ‘warm’ 15th century values, and you have 94 proxy series which correlate with that assumption, and use one ‘rogue’ series of tech stock figures to represent a ‘flat’ 15th century and 20th century upturn, as a 95th proxy, and you feed all that through the MBH98 method, decentered PCA, regression, and verification, that you will a) get a reconstruction with a ‘flat’ 15th century, that b) gets a significant score over the calibration period, and c) also over the verification period?
    If not, then what are you saying?
    Thanks in advance.

  • Lazar // March 7, 2008 at 11:09 pm

    MrPete,

    Speaking of which, Lazar, I’m sorry but I can’t help you with your question. I don’t have access to the papers nor diagrams you refer to.

    No problem, but the Wahl & Amman paper is available online, and might be worth reading…

  • Mike D // March 7, 2008 at 11:09 pm

    You can use any of the trees you want as long as they match the temp some where at some time.
    Such as trees in No. Am. that match temp in Europe. Who cares it is only Climatoligy

  • Boris // March 7, 2008 at 11:14 pm

    The issue of Bristlecone pines is interesting, but yet another red herring offered up by hole punchers.

    1. BCP are a temperature proxy.
    2. The NAS recommended that BCP (strip bark) not be used in temp reconstructions because it appears to be affected by CO2 fertilization.
    3. MBH are aware of this issue and account for it in their analysis.

    I haven’t investigated whether or not the BCP are the key proxy, but I strongly believe this is BS because McIntyre and company have repeatedly misled people on what the NAS report on surface temp. reconstructions actually said.

  • BoulderSolar // March 7, 2008 at 11:23 pm

    Tamino,

    Thankyou for your patience here. I am an EE with no climate and limited statistics background. Can you enumerate the “selection criteria” for the PCs?

    Again the crucial test is whether this type of PCA analysis actually produces the real temperature prior to the verification period. Since your data set did not have any significant variation prior to the “hockey stick” what would happen if there was a robust variation in temperature during that time? Would we see that with the same PCA analysis?

    [Response: The procedure used by MBH is this: study the spectrum of eigenvalues (the variance attached to each PC). Compare that to what would be expected if the signal were noise (with the same autocorrelation structure as the data). Those PCs whose eigenvalues are significantly higher than expected from noise, are included in the regression. The expected eigenvalue spectrum (and confidence intervals) for a given autocorrelation structure can be estimated by Monte Carlo simulations (a fanciful term for a large number of computer simulations).

    Whether or not a reconstruction reproduced a robust variation in past temperature, depends on whether or not the *proxies* show the response to it. One assumes that they would (in fancy terms, that the relationship between temperature and proxies is "stationary").]

  • JeanS // March 7, 2008 at 11:34 pm

    There is a lot of rubbish is this post! Too sad, you had a couple nice post about PCA, and now you write this. For anyone who does not have technical skills to undeestand this issue, I strongly suggest that you complete ignore this post. Or take it to your stats professors, they’ll explain you.

    Here are a few points that really caught my eye:

    1) Centering is the usual custom, but other choices are still valid; we can perfectly well define PCs based on variation from any “origin” rather than from the average.

    BS. It does not matter how many times you or some other people repeat this lie, it won’t make it true anyhow. Citing Jolliffe does not help: nowhere in his presentation nor in the book he advocates centering on partial average. The reason is simple: by not centering, you loose all your theoretical properties (such as those you nicely developed in the previous posts). In other words, you loose meaning of you transformation. Other “conventions” just lead to ad-hoc, arbritary (orthogonal) transformations.

    [Response: I must have really hit the bulls-eye with this post, because now I can *smell* the desperation.

    No matter how often, brashly, or rudely you repeat the assertion that "you loose [sic] meaning,” it won’t make it true. Your own words are unambiguous: you assert that non-centering itself is not valid (there’s no other interpretation of “by not centering, you loose [sic] all your theoretical properties…” etc.). But Jolliffe flatly contradicts you (that’s what his presentation is about), so you brush that off with the pathetic statement that he doesn’t specifically mention using a partial average.

    Jolliffe’s entire presentation is about the fact that centering is not an essential aspect of PCA. But because he doesn’t specifically mention choosing a partial average, you feel free to ignore his point. That of course enables you to repeat the crap about non-centered PCA being invalid. Now *that* is rubbish, and particularly stinky rubbish to boot.

    I suspect that you *know* your amateurish claim is bullshit, but you’re *counting* on readers not having the expertise to catch your lie.]

    2) Wegman’s right about this much: a non-standard centering convention will indeed emphasize the variation of the mean, which turns out to be the main difference between the 20th century and the rest of the time span. What he fails to realize is that that’s the point. He’s also right that the changes associated with it will become more prominent in the PCA analysis, causing those changes to “move up” the PCA list. Again, that’s the point.

    What a spin-doctor you are! Why would you want to emphasize series whose 20th century average deviates much from the overall average? This is exactly what Mann’s “non-centered PCA” is doing: it rewards series whose average on the chosen calibration period differs from the overall average. Do you have some a priori knowledge to justify that?

    [Response: It turns out to emphasize exactly the series that correlate with temperature. Imagine that.]

    3) You example is extremely misleading, to say the least.

    PCA is not capable of separating different signals in general. So if you have, say k factors and n-dimensional observation space and you project your data to k dimensions (with normal PCA), you do get rid of most of noise, but the k projected factors are essentially an arbitrary orthogonal transformation of the originals plus some remaining noise. That is, after PCA you can not expect you signals be separated unless you have, say, one extremely strong component. You need other methods such as the Independent Component Analysis.

    But that’s not the point in MBH, if properly done. What you want is to reduce your data (tree ring indecies) to those determing k factors. The separation (of temperature, moisture etc. factors) is expected to happen later. Again, this is achieved by the centered PCA, but not by the “non-centered convention”. Now in your example you use all the factors to generate your pseudoproxies. This is not the case for the question under study: those proxy sets contain a few series (e.g. bristleconde pines) that do not represent the “overall general variation in data set” (i.e., are not factors determining most series). Normal PCA handles that nicely by putting that “variation” to a low numbered PC, which gets disregarded as noise. On the other hand, “non-centered” Mannian “PCA” tuned to pick up the 2oth century mean vs. overall mean difference places that noise incorrectly to high ranked PC, and gets passed to the future stages of the algorithm.

    [Response: Bullshit. It's the abandonment of valid PC selection procedures that enabled MM to discard exactly the variation that corresponds to the hockey stick. Unfortunately, that particular variation *should* have been included according to objective selection criteria. As a result it's no surprise that the MM result fails verification.]

    So if you have any intellectual honesty left, you could try the following experiment:

    - generate 70 (eg white noise) proxies with approximately the same overall mean, and then artificially lower the 20th century of one of the proxies (an outlier series). Run centered and Mannian PCAs on that data set.

    [Response: Try this experiment: do the analysis of the actual data with *centered* PCA, and without repeating the other mistakes of MM. Wait, that's already been done. Result: hockey stick. Do the analysis *without PCA at all*. Wait, that's already been done too. Result: hockey stick. Try selecting random subsets of the proxies. Oh my, that's been done too. Result: hockey stick.

    You prove by example that there are people who will go to *any* length to discredit the hockey stick. That pretty well defines a lack of intellectual honesty.]

  • John Cross // March 8, 2008 at 12:00 am

    Hank Roberts: I kid you not, the phrase (including “rhetorical probing missions” ) is a direct quote from a post Steve Mosher made over at CA in regards to this site!

    Hansen’s Bulldog: Man did you ever make a good call when you said: “In the meantime, I have one request: before you raise other issues about the hockey stick, address this one.” Of course it doesn’t stop them from doing it!

    Regards,
    John

  • luminous beauty // March 8, 2008 at 12:03 am

    The mechanical correlation between plant growth and annual temperature variation can be and has been empirically determined to be physically caused. It is not spurious.

    One can prove it to one’s own satisfaction with an environmentally controlled grow room. Or just take up gardening for a number of years.

    None of these proxies were selected just because they correlate strongly to the calibration interval, but because from multiple lines of corroborating evidence, painstakingly accumulated over decades, researchers have developed strong physical reasons to expect they really are proxies for temperature.

  • kim // March 8, 2008 at 12:15 am

    This is complete rubbish.

    H/t JeanS
    =======

    [Response: WOW!!!. How interesting. I have to wonder why you give a "H/t" to JeanS, and repeat his assertion of "rubbish," even before his comment has appeared -- 'cause just like this post, it's in the moderation cue at this very moment. I smell a conspiracy; are the CA people are so stung by this post that they're organizing a mob to assault my blog?

    Not gonna happen.]

  • MrPete // March 8, 2008 at 12:30 am

    Lazar, thanks for the pointer. Now, with respect to your question: can you explain Wahl & Amman 06, Figure 2, green line?

    Looks to me like we’re unable to even comment on it. The green line doesn’t show up in the “hockey stick” section at all, so it’s a bit hard to see what’s going on.

    I’ve learned enough from watching these arguments to know that it is not as simple as reading the “surface” of a paper like this and deciding whether the claims made are correct. The devil’s in the nit-picky details. For example, apparently WA isn’t using the same data — correlation 0.7 with MBH data.

    AFAIK, the only serious comment made on this WA paper at CA can be found in the comment (not main post) section here. A hint, if you’re new to browsing CA: in that thread, “Jean S”, “UC” and Steve M are serious stats people. Others are mostly lurkers.

    I’ll be curious to see what else is said in the days to come.

    [Response: I've updated to post to show WA figure 2.]

  • Spence_UK // March 8, 2008 at 12:43 am

    Lazar,

    No, that’s not what I’m saying. You’ve inserted assumptions which are not part of my argument, and indeed not even part of the MBH98 assumptions. For example, your statement “correlate with that assumption” is just peculiar. I’m not sure how you go about correlating a data series with an assumption. If you mean do the series correlate with the global mean temperature, then no, that isn’t an assumption that even goes into MBH98. In fact, the proxies are not even required to correlate with local gridcell temperatures (and good job too, because most of them don’t). Remember the signal to noise is low, and the residuals are highly autocorrelated, because they have confounding factors (that’s why they are called “proxies”). If there was a simple correlation, we wouldn’t have to worry about all of these complex procedures to try and tease temperature out. I’ve probably just misunderstood your point here, so feel free to restate it in clearer terms.

    Secondly, I assume you are aware that there are two PCA steps in MBH98, one of which is uncentred and one which is centred. If you want to expand this discussion to the whole suite, you have missed that step. What’s the rationale for using different forms of PCA? If the uncentred method is so great, why use a different method elsewhere? If there are no clear justifications as to why one should be used in one location and the other elsewhere, then it leaves a degree of freedom within the study to fine tune the results, which in turn lowers the effective statistical significance of the results (not dissimilar to the form described by Burger and Cubasch in their 2005 paper).

    Your point (c) is obscure. Why would tech stocks pass verification tests? As Tamino notes, this would not be expected. Indeed, MBH98 itself fails multiple verification tests – it doesn’t even pass RE when properly benchmarked (as per McIntyre’s reply to Huybers, which also addresses Wahl and Amman’s claim and is the reason WA were rejected from GRL)

    Anyhow, it looks like Steve McIntyre is, at some point, likely to post a response to this which will likely argue the points better than I will here, so I would recommend you read that when it becomes available. He should put the case more clearly and more succinctly than I do.

  • Lazar // March 8, 2008 at 12:45 am

    Boris,

    1. BCP are a temperature proxy.
    2. The NAS recommended that BCP (strip bark) not be used in temp reconstructions because it appears to be affected by CO2 fertilization.
    3. MBH are aware of this issue and account for it in their analysis.

    Good points. Again, Wahl & Amman show a 20th century upturn without BCP series, and here is the upturn again, produced without including any dendro series whatsoever.

  • Lazar // March 8, 2008 at 1:13 am

    I’m still struggling to get to terms with what Steven Mosher;

    The issue is not whether PCA can effectively mine data. The issue is can it reliably mine data. and what confidence can you put on that mining

    [...]

    Stipulated. if you feed manns method a Real signal and noise, it will pull a signal out. DUH.
    Now feed it other food. watch out for bones in the throat

    [...]

    M&M never questioned the ability of non center ed PCA to data mine signal out of noise.

    That is not the question. The question is this.

    HOW do we verify that the SIGNAL MINED
    is , in fact, a signal?

    The method will mine a signal. prove it’s gold

    and Spence_UK;

    The MBH98 PCA approach mines for hockey stick shaped data. What this means is that:

    1. Historical temperature is hockey stick shaped – MBH98 method will produce a hockey stick shape

    2. Historical temperature is NOT hockey stick shaped – MBH98 method will produce a hockey stick shape, IF just one “rogue” hockey stick proxy unrelated to temperature is used.

    … are claiming.

    That decentered PCA emphasizes, and increases the chance of selection, of series that diverge from the 20th century calibration period mean?
    Yes.
    That those series might not be temperature proxies?
    Yes.
    But they might just as well be?
    Yes.
    That, the logical counterpoint, decenterd PCA de-emphasizes series which do not deviate from the 20th century calibration period mean?
    Yes.
    That those series might be temperature proxies?
    Yes.
    But they might also not?
    Yes.

    So, in other words, PCA just selects patterns, but doesn’t care either way with whether those patterns either correlate to, or in fact represent, anything physical such as temperature.
    Which HB has pointed out.
    Isn’t that why MBH98 do linear regression and validation statistics against atemperature data? Do your criticisms in have anything at all to do with PCA?

    So, I’ll ask again, in response to Spence_UK;

    In statistical language, MBH98 has no power to determine whether history was hockey stick shaped or not, because it is predisposed to producing hockey stick shaped output, even if there is just one rogue hockey stick proxy. Von Storch demonstrated this with German unemployment and McIntyre demonstrated it with tech stocks.

    Am I reading you correctly? Are you implying that, supposing the real temperature history is not ‘hockey stick’ shaped, i.e. that it has instead ‘warm’ 15th century values, and you have 94 proxy series which correlate with that assumption, and use one ‘rogue’ ‘hockey-stick’ series of tech stock figures to represent a ‘flat’ 15th century and 20th century upturn, as a 95th proxy, and you feed all that through the MBH98 method, decentered PCA, regression, and verification, that you will a) get a reconstruction with a ‘flat’ 15th century, that b) gets a significant score over the calibration period, and c) also over the verification period?
    If not, then what are you saying?
    Thanks in advance.

  • MrPete // March 8, 2008 at 1:26 am

    LB, The mechanical correlation between plant growth and annual temperature variation can be and has been empirically determined to be physically caused.

    That’s quite a general statement. And yes it’s true in the general case. One could substitute a variety of other metrics for “temperature” and still have a valid statement.

    The challenge is to determine whether it is true for a particular species in a particular environment. There can be any of a number of limiting factor(s) to growth.

  • MrPete // March 8, 2008 at 1:56 am

    Tamino, no worries about conspiracies or time travel :) — Jean S’s “rubbish” quote was simply from his CA comment. What a world we live in when people can kinda-sorta track conversations all over the planet in real time!

    I’m glad to see some experts engaging with one another. It’s a “vigorous discussion” :-) but hopefully something good will emerge from it all.

    You’re *all* reasonable people with reasonable perspectives. Obviously can’t all be correct about every point made here… so some things will ultimately be shown incorrect. That’s ok — science thrives that way.

    [Response: You might be right, but I still smell a rat. I suspect I'll be deluged with critical comments from regulars at CA who hate this post almost as much as they hate Mike Mann, Jim Hansen, and the hockey stick. The result will be endless back-and-forth which will ultimately serve to confuse rather than enlighten readers.

    What they do in their house is their business. This is my house.]

  • dhogaza // March 8, 2008 at 2:07 am

    Well, it’s going to be interesting, given this quote by McIntyre

    Some of the contortions that people make on behalf of the Stick are quite amazing. I’ll post on this in due course.

    (he’s speaking of HB’s post)

    It will be interesting to see if he attempts a coherent attack on HB’s statistics or if he armwaves like his supporters here are doing.

    It would also be a bit more interesting if he were to post here where the readership isn’t quite so worshipful as it is at CA.

  • Lazar // March 8, 2008 at 2:20 am

    Bear in mind, HB, before taking Jean S too seriously, that he has previously…

    a) claimed that removing the ‘hockey stick’ PC ‘preserves the temperature signal in the proxy data’.
    b) Criticized MBH98 regarding potential for “overfitting”, whilst ignoring, or forgetting, the validation period. Isn’t reference to a ‘validation period’ the standard way by which ‘overfitting’ is understood/taught? Perhaps Jean S can explain, he claims to teach stats.
    c) signed onto this nonsense criticizing the employment of RE in MBH98, see, if you place a small ‘random’ wiggle (11 points) right on top of a small section of data (11 points) far away from the mean of a 113 point calibration period, you may find that the small wiggle matches the small section of data better than the calibration mean. You ignore the lack of an RE score over the calibration period, as the ‘random’ data doesn’t exist for it, and you ignore that the circumstances under which MBH98 get high RE scores include a 78 pojnt calibration period and 58 point validation period.

  • Lazar // March 8, 2008 at 2:30 am

    Spence_UK,

    assumption = supposing the real temperature history is not ‘hockey stick’ shaped
    correlate with that assumption = proxy series correlate with a real temperature history that is not ‘hockey stick’ shaped

    If this still is not clear, please reiterate.
    Thanks.

  • Spence_UK // March 8, 2008 at 3:01 am

    Lazar,

    Thanks, but this:

    correlate with that assumption = proxy series correlate with a real temperature history that is not ‘hockey stick’ shaped

    is not an assumption in MBH98, and indeed is not evident in the data set, for the reasons explained above. Correlations between proxies and both global and local temperatures are often insignificant, particularly for tree rings; if they were significant, we wouldn’t need to be doing this complex procedure to tease the temperature signal out.

    Having thought about it, we are opening a whole can of worms here that is likely to drag this thread way off-topic. I’ll explain my reasoning.

    Tamino has raised a discussion here on the narrow point of the PCA decentring issue. If you want to investigate that in isolation, that is one thing. If you want to address the whole MBH98 study, you have to fully understand the entire algorithm and assumptions that go into it, AND you have to take into account the interplay between the PCA decentring and several other issues that Steve McIntyre raised. You can’t disaggregate the PCA effect and look at the whole algorithm – the interplay between the problems are important. Such a discussion would just explode, in fact I might go so far as to suggest you could write an entire blog on the topic ;)

    For this reason, I will focus on the narrow topic of PCA rather than the wider issue of MBH98: your characterisation of PCA (”emphasising” or “de-emphasising” signals) isn’t really a good one to help understand the issue here – it really rotates the data matrix with the goal of reducing the dimensionality by aligning the common signals along the principle axes. What you want to avoid is aligning an outlier signal on one of those principle axes and inflating its “variance” (which isn’t really even a good measure variance in the decentred case) – which is exactly what decentred PCA can end up doing, as illustrated by McIntyre and Wegman. Promoting outliers as significant common signals in this case is an undesirable property for such a statistical tool to have, which is why the short-segment centring PCA of this type is (as far as I know) never used by statisticians.

  • Lazar // March 8, 2008 at 3:07 am

    Jean S;

    This is not the case for the question under study: those proxy sets contain a few series (e.g. bristleconde pines) that do not represent the “overall general variation in data set” (i.e., are not factors determining most series). Normal PCA handles that nicely by putting that “variation” to a low numbered PC, which gets disregarded as noise. On the other hand, “non-centered” Mannian “PCA” tuned to pick up the 2oth century mean vs. overall mean difference places that noise incorrectly to high ranked PC, and gets passed to the future stages of the algorithm.

    And how do you presume to know that the PC is noise?
    Because it represents 8% of variance under centred PCA?
    Why does 8% demarcate noise, is it some kinda magic number?
    What if the signal, i.e. that which relates to temperature, is a small component of variance?
    How do you tell?
    Oh, that’s right, you do regression and test with stats.
    And when we remove the ‘hockey stick’ PC the scores drop to zero or negative over the calibration and validation intervals.
    And if we procede without using PCA so that nasty ‘hockey stick’ doesn’t dominate the other series beyond it’s natural (actually, arbitrary) bounds, we still get a hockey stick. And good scores.
    As HB has pointed out.
    Repeatedly.
    In the post you describe as rubbish.

    So if you have any intellectual honesty left, you could try the following experiment:

    - generate 70 (eg white noise) proxies with approximately the same overall mean, and then artificially lower the 20th century of one of the proxies (an outlier series). Run centered and Mannian PCAs on that data set.

    If you “have any intellectual honesty left”, you’ll try to describe what the point is, that you’re, presumably, trying to make…

  • George // March 8, 2008 at 3:30 am

    If the hockey stick shape is indeed an artifact of PCA, as some claim, then why do I see it when I look at this?

    Apparently I am not completely alone, since at least one other person (Eli) sees it too.

    Then again, perhaps it is a characteristic common to all humans. Perhaps we have actually evolved as a species to just look at data and automatically do de-centered PCA.

    Possible, but I’d have to say not likely. What evolutionary advantage would it give (other than perhaps to hockey players)

    I understand the point of illustrating the issue to those not familiar with PCA (and can appreciate that some have their very identity riding on the “debunking” of the hockey stick), but if the hockey stick shape is independent of whether one uses PCA, isn’t it sort of a waste of time to debate the issue ad nauseam?

  • Lazar // March 8, 2008 at 3:33 am

    MrPete,

    Looks to me like we’re unable to even comment on it. The green line doesn’t show up in the “hockey stick” section at all, so it’s a bit hard to see what’s going on.

    ‘Hockey stick’ refers both to an upturn in 20th century temps and relatively flat temps extending from the 15th to 19th centuries (actually it’s only the flatness of the 15th century which is at dispute).
    The reason you can’t see the green line in the 20th century is because it’s right on top (or underneath) of the instrumental temperature record and the decentered PCA reconstruction.

    The green line is the reconstruction using no PCA at all, and no bristlecone pine series.

    I’ve learned enough from watching these arguments to know that it is not as simple as reading the “surface” of a paper like this and deciding whether the claims made are correct. The devil’s in the nit-picky details. For example, apparently WA isn’t using the same data — correlation 0.7 with MBH data.

    Probably the relevant quote from WA06 is;

    Our emulation of the MBH98 results is shown in Figure 1. The Northern Hemisphere mean surface temperature reconstruction (WA) is nearly identical to the original MBH98 reconstruction (Fig. 1, red and grey lines). The slight differences are related to our methodological simplifications of using a consistent annual time step throughout the
    Feb 24, 06 Wahl and Ammann Climatic Change, in press 26 reconstruction process and equal weighting of the proxies.

  • MrPete // March 8, 2008 at 3:59 am

    Tamino said: this is my house.

    Of course. And the brief “rubbish” quote that alarmed you was written without noting where it came from… so no surprise at all that you were upset. [How's anyone to know who/where/how people are communicating beyond what you can see?! Craziness.]

    Lazar said: Probably the relevant quote from WA06 is…

    Could be. Except the “0.7 correlation” is with respect to the data before any analysis, if I’m understanding correctly what Steve M said in the CA thread linked above. So no reconstruction or weighting involved. Supposedly the same source data, obtained from Mann.

    What a mess. In my most numeric work — demographic analysis — when things got this contentious we’d slow down, expand our data until we could see exactly what was same/different, and then move forward. It helps if one has cooperative people all around who want to get the various methods and data sources in sync. In this case, that’s not a valued goal (yet).

    Well, maybe someday. At least the various parties are interacting now. I’m an optimist that this will work out over time. :-)

    ‘night all. Combine this with some bookkeeping nightmares and my head hurts :-)

  • Lazar // March 8, 2008 at 4:01 am

    Spence_UK,

    For this reason, I will focus on the narrow topic of PCA rather than the wider issue of MBH98

    Ok, in which case am I correct in assuming that focus applies below; that you are restricting your claims to the use of (decentered) PCA?

    In statistical language, MBH98 has no power to determine whether history was hockey stick shaped or not, because it is predisposed to producing hockey stick shaped output, even if there is just one rogue hockey stick proxy. Von Storch demonstrated this with German unemployment and McIntyre demonstrated it with tech stocks.

    In which case,

    In statistical language, MBH98 has no power to determine whether history was hockey stick shaped or not

    Agreed.

    because it is predisposed to producing hockey stick shaped output

    No, not because decentred PCA is predisposed to producing a certain type of pattern, but because PCA “doesn’t care either way with whether those patterns [elicited] either correlate to, or in fact represent, anything physical such as temperature”, regardless of the centering mechanism. Which is why regression and statistical tests and selection of suitable proxies etc.

    Again, I must ask you, when you write;

    In statistical language, MBH98 has no power to determine whether history was hockey stick shaped or not, because it is predisposed to producing hockey stick shaped output, even if there is just one rogue hockey stick proxy. Von Storch demonstrated this with German unemployment and McIntyre demonstrated it with tech stocks.

    … could you explain what are the point(s) you are trying to make?

    You can’t disaggregate the PCA effect and look at the whole algorithm – the interplay between the problems are important. Such a discussion would just explode,

    Unfortunately I think that was doomed from the start, unless HB starts heavily censoring. Primarily because the various issues are so difficult if not impossible, to meaningfully, as TCO puts it, disaggregate. E.g. one can’t easily speak of the relevance of decentered PCA without describing the effects on the reconstruction, the aims of MBH98 etc.

  • Lazar // March 8, 2008 at 4:25 am

    Spence_UK,

    Your point (c) is obscure. Why would tech stocks pass verification tests? As Tamino notes, this would not be expected.

    So why raise the issue of tech stocks at all? MBH98 is the reconstruction and the verification stats. No (or different) verification stats, no result as exists in MBH98.

    Correlations between proxies and both global and local temperatures are often insignificant, particularly for tree rings; if they were significant, we wouldn’t need to be doing this complex procedure to tease the temperature signal out.

    … doesn’t the movement of the ‘hockey stick’ from PC4 to PC1, and the reduction (to abject failure) in verification stats when excluding the ‘hockey stick’ PC, suggest that the decentered method is rather effective at teasing the temperature signal out?

  • bouldersolar // March 8, 2008 at 4:56 am

    What would happen if the verification and calibration periods were switched? Should we expect the reconstruction to be the same?

    Can anyone enlighten me on how many data points each proxy had during the calibration and verification periods?

    Emotions running high but I do still see nuggets of info that is helpful.

  • Christopher // March 8, 2008 at 5:18 am

    “I also get the impression that certain folks would like to take *artificial* data, created only to illustrate a point, for a blog post no less, and subject it to an “audit.” No thanks — I’ve got way better things to do.”

    I feel this might have been directed at me. I have no intention of auditing a single thing, nor the hubris to assume that would even be valuable. My goal is to understand the whole PCA bit and your example seems like a useful place to start as I have some background in biometeorology and see great value, for me personally, to figure out PCAs, and I’m interested in the whole thing. Like I said before, this whole debate and diatribe is so irrelevant wrt AGW. I’m not sure why people get so upset about it. It’s like AGW requires MBH98, which is just boggling.

  • Hank Roberts // March 8, 2008 at 5:39 am

    Good pointer, that, to the AR4:

    “McIntyre and McKitrick (2005a,b) raised further concerns about the details of the Mann et al. (1998) method, … using Principal Components Analysis. The latter may have some theoretical foundation, but Wahl and Amman (2006) also show that the impact on the amplitude of the final reconstruction is very small (~0.05°C; for further discussion of these issues see also Huybers, 2005; McIntyre and McKitrick, 2005c,d; von Storch and Zorita, 2005).”

    M’n'M get credited appropriately, they pointed out a method issue that makes a quantifiable difference (in the North American number). Five percent of a degree C isn’t insignificant. Well, actually …. maybe it is.
    Did anyone check the significance?

  • MarkR // March 8, 2008 at 5:53 am

    This has all been done to death before. Read what Wegman (”a professional statistician for some 38 years. I have served as editor of the Journal of the American Statistical Association and served as coordinating editor, associate editor, member of the editorial board and a
    number of other editorial roles for many journals during this time period. I am
    currently on the Board of Directors of the American Statistical Association as the
    publications representative and will become the Chair of their Publications Committee as of 1 January, 2007.”) had to say in front of the US Congress:

    “Without attempting to describe the technical detail, the bottom line is that, in the MBH original, the hockey stick emerged in PC1 from the bristlecone/foxtail pines. If one centers the data properly the hockey stick does not emerge until PC4. Thus, a substantial change in strategy is required in the MBH reconstruction in order to achieve the hockey stick, a strategy which was specifically eschewed in MBH. In Wahl and Ammann’s own words, the centering does significantly affect the results.”

    http://www.climateaudit.org/?p=2322

  • fred // March 8, 2008 at 6:02 am

    Lazar Are you implying that, supposing the real temperature history is not ‘hockey stick’ shaped….and you feed all that through the MBH98 method, decentered PCA, regression, and verification, that you will a) get a reconstruction with a ‘flat’ 15th century, that b) gets a significant score over the calibration period, and c) also over the verification period?

    That’s what I think, rightly or wrongly, both McIntyre and Wegman are saying.

    Spence says “short-segment centring PCA of this type is (as far as I know) never used by statisticians.” It was my own impression too. I would really like a textbook reference to the contrary showing in detail the recommended method of doing it.

    It would also help if we could ban the expression ‘intellectual honesty’ from this debate – which should be about something perfectly objective, the use of statistical methods. Helps not in the slightest.

  • Armagh Geddon // March 8, 2008 at 9:02 am

    Tamino, I would just like to congratulate you on your calm, dispassionate, professional and objective approach to this issue. Well done.

  • James Lane // March 8, 2008 at 9:03 am

    I’ve been using PCA in social science applications for over 20 years.

    As a general point, I would say that applied PCA is not a black-box procedure but requires a lot of iteration and inspection of the data, especially the identification possible outliers.

    When you run PCA, the first thing you do is ID the series that load heavily on each component. If I looked at my PC1 and determined that it consisted mainly of high altitude BCPs from California, I’d be very concerned that I have an outlier. I certainly wouldn’t think “that’s the temperature signal!”

    OTOH, If I looked at my PC4 and determined that it consisted mainly of high altitude BCPs from California, I’d be less concerned, but more interested in assigning meaning to the higher order components. If you’re going to interpret PC4 as the temperature signal, what’s the signal in the other PCs?

    Anyway, you’d go and have a good look at the raw data for these possible outliers, and you’d run PCA again with and without them, to see the effect on the “face-validity” of the solutions.

    Then you have to deide how many PCs to retain for the regression stage. Tamino may have given the impression that this is all mechanical “nothing to see here” but in fact it’s an inexact science. There are at least a dozen “rules” for component retention. Take your pick.

    After the centering issue was raised, Mann invoked Preisendorfer’s Rule N to allow retention of PC4 (and hence the BCPs) in the centered scenario. In fairness to Mann, he did use Preisendorfer on his temp PCA (different part of the method) although it’s not discussed in the context of the proxy network.

    (BTW, it would be fascinating to know what the actual eigenvalues for the PCs under the centered and decentered options for MBHx).

    But PC retention is very important, as the regression stage doesn’t “care” about the order of the PCs, the percentage of variance explained or the interpretation of the component. Again, in practice you’d run the regression with 2, 3, 4 & 5 PCs and see how the results vary.

    That’s how it’s done in the real world.

    Finally, it seems to me that the centered/ decentered debate (while interesting) is essentially a side-show. It’s all about getting the BCPs into your reconstruction. Of course you can do that without PCA. In fact, many of the non-PCA reconstructions include Mann’s PC1!

  • Mike N. // March 8, 2008 at 11:34 am

    hey John Cross, you’re a cool guy, just don’t sweat the technique on a brotha like steven mosher, c’mon. He’s funny; plus he gets into downright dirty Detail in his analyses.

  • mikep // March 8, 2008 at 12:36 pm

    This discussion is losing sight of the simple issues. The MBH reconstructions was supposed to be of the Northern Hemisphere temperature. There are too many data series, it was claimed, to do a sensible regression of temperature against all the potential proxies, which are supposedly highly correlated with each other anyway. One standard solution is to use
    PCA as a purely mathematical transform of the original series to give components which desribe the dominant pattern of variation in the data and reduce the number of variables to put on the LHS of the regression. These components may or may not have a physical interpretation. Mannian PCA (which no one who just read the original article would have known he was using, incidentally) instead efficiently looks for those components in the data with a deviation from the mean at the end. It answers a different question from standard PCA. And it efficiently picks up hockey sticks. This has never been in dispute. What this means in practice is that the Mannian PC1 picks out a very small number of series, notably the bristlecone pines, which correlate well with temperature in the 20th century and gives them a very heavy weighting. What mannian PCs don’t do is represent the dominant pattern of variation in the sum total of the original proxy data. In other words, for practical purposes MBH could have thrown away most of their data and just gone to the regression step using the bristlecones and a few other similar series. And yes, of course if you systematically mine for data with the “right” shape you will get decent verification stats.

    But the claim to be reconstructing the Northern hemisphere temperature rests on using bristlecones as a kind of “Standard Candle” for the Northern Hemisphere temperature. If we had good reasons independently for thinking they did act as standard candles well and good. But let’s bear in mind.
    There is no physical theory or independent data that says bristlecone pines accurately ( and linearly) track temperature.
    The bristlecone ring widths do not correlate with local temperatures. Indeed the original collectors thought the 20th century growth spurt was caused by carbon fertilisation.
    Most importantly the best test of all, seeing if bristlecones track late 20th century and early 21st century temperatures, is failed badly. This is the so-called divergence problem.
    What all this suggests to me is that MBH is a essentially a classic example of the well known “spurious regression” problem (NB this is not simple name-calling but a well-defined phenomenon first discussed by Yule in teh 1920s and, following on the work of Novel prize-winning econometrician Clive Granger now standard fare in most modern econometric textbooks i.e not something particularly esoteric).

  • Lazar // March 8, 2008 at 1:04 pm

    James Lane,

    Anyway, you’d go and have a good look at the raw data for these possible outliers, and you’d run PCA again with and without them, to see the effect on the “face-validity” of the solutions.

    Then you have to deide how many PCs to retain for the regression stage. Tamino may have given the impression that this is all mechanical “nothing to see here” but in fact it’s an inexact science. There are at least a dozen “rules” for component retention. Take your pick.

    I have a lot more sympathy for that approach than the approach described by Wegman;

    Wahl and Ammann reject this criticism of MM based on the fact that if one adds enough principal components back into the proxy, one obtains the hockey stick shape again. This is precisely the point of contention. It is a point we made in our testimony and that Wahl and Ammann make as well. A cardinal rule of statistical inference is that the method of analysis must be decided before looking at the data. The rules and strategy of analysis cannot be changed in order to obtain the desired result.

    He’s saying you would have to guess beforehand (a fraught process) which, and how many, PCs to include. If your guess is wrong, if you include irrelevant PCs, or worse, if you exclude the signal you’re interested in, you would have to pack up tools and go home. After spending years collecing data. According to his ‘rules’, the correct decision process is the worse one. It doesn’t work in the real world. The worse error in logic is that, he ignores “the rules and strategy of analysis” were changed when M&M altered the centering mechanism.

    Finally, it seems to me that the centered/ decentered debate (while interesting) is essentially a side-show.

    Then HB is correct, decentered PCA is not per se wrong.

    It’s all about getting the BCPs into your reconstruction. Of course you can do that without PCA.

    As Wahl & Amman show, the hockey stick shape back to 1400 is not dependent on the presence of bristlecone pine series if you do the analysis without PCA, but the verification stats most definitely are. Those to me are strong grounds for including bristlecone pine series.

    Anyway, thanks for your input.

  • Lazar // March 8, 2008 at 1:25 pm

    mikep,

    And yes, of course if you systematically mine for data with the “right” shape you will get decent verification stats.

    a) The verification stats are not dependent on ‘data mining’ for the ‘right’ (hockey stick) shape, but on the scarcity of data.

    b) Neither is the hockey stick shape dependent on ‘data mining’.

    Eg…

    If you do not use PCA, and therefore do not mine for the ‘right’ shape, the hockey stick shape (flat) over 1400-1450 (and upturn over the 20th century) remains, and the verification stats unchanged.
    If you then exclude bristlecone pine series, the hockey stick shape still does not change.
    But the verification stats do change over the 1400AD step.
    When M&M obtained their spurious 15th century warmth, and failed verification stats, they removed a lot more than bristlecone pine series.

    Regards

  • luminous beauty // March 8, 2008 at 1:31 pm

    MrPete,

    Yes temperature sensitivity in plants is dependent on environment. A tree in a certain environment might be more or less sensitive to confounding factors.

    Your assignment as a budding dendroclimatologist is to determine where to find environments in which confounding factors are minimalized.

    Think you can do it? How many tree cores from how many locations do you think it will take?

  • Barton Paul Levenson // March 8, 2008 at 1:38 pm

    BoulderSolar writes:

    [[However your snipe about not taking anything McI posts at face value is a bit ironic coming from one who hides behind an alias]]

    How many times are we going to see this same stupid charge repeated? HB/Tamino is “hiding behind an alias, hiding behind an alais, hiding behind an alias.” Nya nya na nya nya. Grow up, for Christ’s sake.

  • Hansen's Bulldog // March 8, 2008 at 1:40 pm

    You know that prank where you take a paper bag, fill it with manure, set it on somebody’s porch, light it on fire, and ring the doorbell? The plan is to get the homeowner to open the door and stomp on the bag trying to put out the fire, so he’ll end up spreading crap all over his porch, his shoes, and whatever else might be nearby. This morning I found another burning bag of crap on my doorstep. I’m not gonna stomp on it. I’m not gonna bring it into the house either.

  • Lazar // March 8, 2008 at 2:00 pm

    Stephen Mosher writes at CA;

    Tammy town is a fun place to explore rhetorical probing missions. When Tammy said he was doing PCA I figured he would get to the decentered issue. I even praised him for educational efforts. Now, we shall see how he extricates himself. Should be good theater. Popcorn is on me.

    Well Steven, I’m a little underwhelmed so far. Some rather confused claims expressed by yourself, the Steve McIntyre post you link, and some others here, seem to boil down to…

    1) PCA is the factor which isolates temperature signal from proxy data.

    Wrong. In MBH98 it is PCA + regression against temperature data + statistical tests on the model, and PCA is not necessary.

    2) If you apply decentered PCA to tech stock series of a ‘hockey stick’ shape, that the method might (except eigenvalues aren’t mentioned) select series that bear no relation to temperature, and you conclude therefore the MBH98 decentered method, and from 1), therefore MBH98 itself, is wrong, wrong, wrong.

    Nope. Tech stock series can be selected by centered PCA if you include enough series, or indeed if you do no PCA at all. I’m not entirely sure, but this may be why MBH98 don’t include tech stock series in their data, and why they b) apply regression against temperature data, and c) test the results of that regression against temperature data. PCA identifies and weighs patterns based on variance, not a relation to a specific physical quantity. The amount of variance may or may not be related to the amount of signal contained, of any physical quantity that we desire to measure. The decentered method in MBH98 is an informed guess which turned out to be correct.

    3) Dear old Jean S concludes that a PC with 8% variance must be noise. Wrong for the previous reasons. Also note, a rough guide to a likely safe noise threshold is 100%/number of series, which, over the 1400AD step, 100%/22 < 5%.

    Pass the popcorn.

  • dhogaza // March 8, 2008 at 2:45 pm

    hey John Cross, you’re a cool guy, just don’t sweat the technique on a brotha like steven mosher, c’mon. He’s funny

    Yeah, he’s funny, alright. MBH98 is a fraud equivalent to Piltdown Man. Yeah, I’m ROTLMAO over his wit.

  • fred // March 8, 2008 at 2:45 pm

    Tamino, any comment on this?

    http://www.uoguelph.ca/~rmckitri/research/PCnotes.pdf

    very different angle on it.

    [Response: I'll have to study it before I can comment. The first thing I notice is that it appears to be by McIntyre and/or McKitrick, so it's no surprise there's a different angle.]

  • luminous beauty // March 8, 2008 at 2:55 pm

    “The bristlecone ring widths do not correlate with local temperatures. Indeed the original collectors thought the 20th century growth spurt was caused by carbon fertilisation.”

    Yes. When carbon fertilization is corrected for, high correlation with local temperatures is confirmed.

    So much for the diversion problem.

  • Lazar // March 8, 2008 at 2:59 pm

    Fred,

    That’s what I think, rightly or wrongly, both McIntyre and Wegman are saying.

    Spence_UK now says that is not what he is claiming, and I’ve come to believe it is also not what Steve McIntyre and some others are claiming.

    If not, then I want the claim clearly enunciated and signed on the dotted line. Then proven.

    Spence says “short-segment centring PCA of this type is (as far as I know) never used by statisticians.” It was my own impression too. I would really like a textbook reference to the contrary showing in detail the recommended method of doing it.

    No, Fred, nooo! HB’s work allows you to understand what PCA does to the data, what decentering does to PCA, and how those mesh with the aims of MBH98. I won’t accept you falling back on an appeal to authority to defend M&M’s criticisms. Make an effort to understand. Please.

  • Hank Roberts // March 8, 2008 at 3:10 pm

    There’s one climatologist, as far as I know, participating at CA, that’s Dr. Curry.

    Who wrote on the dot.earth blog recently, replying to Andy Revkin’s invitation for argument there:

    “… I am astonished to see such tilting at windmills ….
    … For the record, I view the IPCC 4th Assessment Report to be the best available statement of the state of climate science at the time it was written. Policy makers do not have a better document or analysis from which to work with in grappling with the myriad of issues associated with climate change.

    … we need to do more and better science, and more extensive assessments. We are wasting time attacking each other’s credentials and motives. Andy Revkin is right in desiring to switch the focus to policy, management and technology solutions to these complex issues.

    — Posted by Judith Curry

    http://dotearth.blogs.nytimes.com/2008/01/24/earth-scientists-express-rising-concern-over-warming/#comment-10491
    … we need to do more and better science, and more extensive assessments. We are wasting time attacking each other’s credentials and motives. Andy Revkin is right in desiring to switch the focus to policy, management and technology solutions to these complex issues.”

    — Posted by Judith Curry

    http://dotearth.blogs.nytimes.com/2008/01/24/earth-scientists-express-rising-concern-over-warming/#comment-10491

  • Hank Roberts // March 8, 2008 at 3:21 pm

    Last “paste” in my previous hiccuped, once is enough

  • George // March 8, 2008 at 3:38 pm

    The goal posts seem to be moving …even as Bulldog writes.

    First, McIntyre et al claimed that it was decentered-PCA that “created” the hockey stick shape….

    Then, when it was shown that this was not the case — indeed that you can do the analysis without even using PCA (which is probably only a surprise to some statisticians and spectroscopists who treat PCA like it was the greatest thing since sliced bread) — they transitioned smoothly (watch carefully or you might miss it) into “the strip-bark bristlecones are not valid temperature proxies”

    *…and, as everyone knows, hockey sticks are always made from Bristlecone pine, because of the tree’s inherent hockey-stick shape.

    Don’t believe it? Visit a stand of Bristlecones some time. Every branch is “hooked” at the top. It may be genetic or it might just be the prevailing winds at the top of mountain ridges where the trees grow (or perhaps a combination of the two).
    At any rate, word has it among those in the know about this kind of stuff that you can make a great number of hockey sticks from just one tree, with virtually no waste!

    I’m really glad that we are having this conversation because a lot of people are unaware of the latter important fact.

  • luminous beauty // March 8, 2008 at 3:40 pm

    A question for all those CAuditors who believe they know everything there is to know about dendroclimatology.

    Consider a single core sample in which there is poor correlation to local temperature as ascertained only from annual ring widths:

    Can you disaggregate the temperature signal from confounding factors? What other measurements besides annual ring width would you use?

  • MrPete // March 8, 2008 at 3:42 pm

    LB sez: “Your assignment as a budding dendroclimatologist is to determine where to find environments in which confounding factors are minimalized.”

    I’m 100 meters away from a nice greenhouse :-)

    Personally, and speaking with others who have significant bio experience, I’m cautious about ever finding such an environment. My null hypothesis is that it does not exist.

    Back to the books… certainly a lively discussion here!

    [Totally OT but hopefully lightening the atmosphere a bit... why do we need greenhouses at 7000+ ft altitude? Sure, extending the growing season is one. Not much grows when the night time low is 10F. However, even more pertinent to a successful growing season is: avoiding hail (ever seen 50cm of hail fall? How about 5-6cm hailstones? Ouch!), wind, and other craziness. My wife has developed a mini-greenhouse technique for gardening, involving PVC hoops and plastic to make "Conestoga Wagons." Even with that, our outdoor tomatoes were decimated in last fall's hail storm. Punched right through all but the toughest materials.]

  • MrPete // March 8, 2008 at 3:50 pm

    LB,

    a) We sure don’t claim to know all about d-c. Far from it. More like, there’s a lot of disciplines involved in understanding this stuff. Some of us have expertise in surprisingly-related areas that too easily are ignored.

    b) There’s been extensive discussion of your question. Google Almagre on CA.

    c) Our Starbucks adventure did involve other measures. Check out the provenance files. Not sure what if anything will emerge from all that. Also not sure if some of the other analyses, like soil samples, have or will turn into useful data. We accomplished the primary field goals; other data may or may not have been successfully collected and processed.

  • George // March 8, 2008 at 4:28 pm

    Judith Curry quote:

    ‘We are wasting time attacking each other’s credentials and motives.”

    Actually, the entire problem is the “we”.

    By and large, the “we” are not scientists and I would guess that the vast majority of them have little to no idea what they are talking about when it comes to climate science.

    Scientists (at least the ones who pay attention outside science) know who the “we” are and it is very easy for them to recognize that the vast majority of the “we” have no background whatsoever in the relevant scientific areas they are critiquing. The “we” are simply speaking gibberish, as far as the scientists are concerned. And most scientists simply ignore them.

    In the pre-internet age, these people, the vast majority of whom do not publish in scientific journals, would never get the time of day and their complaints would fall only on the ears of their (unfortunate) spouse and perhaps fellow workers.

    But, for worse, in this case their “voice’ has been amplified by a factor of 1000 or even 1,000,000 — at least when it comes to the public perception — for no other reason that they have a blog that is popular among the contrarian crowd and that their nonsense gets air time (amplified yet again) on FOX News.

  • Spence_UK // March 8, 2008 at 4:36 pm

    Lazar,

    I understand your question now, and appreciate the scientific style in which your question was posed. Your question is, in essence, does this problem matter? Does the rest of the algorithm cope with artefacts introduced by the decentred PCA?

    I don’t have time to go into great detail now, but my answer would be – yes, it does. A simple test is to apply trendless noise to the algorithm both before and after the decentred PCA step – which McIntyre effectively did in his reply to Huybers. There is no temperature signal going in, so we should get no real difference on the output. But there is a huge difference – the RE significance without decentring is 0.0, the RE significance with decentring is 0.54. Clearly, the step which is supposed to be throwing out the non-temperature data is failing to deal with what the decentred step is providing it. I’ll touch on my view as to why this might be below.

    Robustness is the second issue, and the results of MBH98 were specifically claimed to be robust. Robustness in statistics is achieved by removing sensitivity to outliers, not increasing it. See here for example:

    http://en.wikipedia.org/wiki/Robust_statistics

    For an uncontroversial example, imagine averaging 100 time series containing consistent signal and orthogonal noise. You should get a tenfold improvement in signal to noise. If I throw out 10 samples, I get a trivial loss of 5% of my signal to noise. If that tiny change substantially changes my conclusions, it should be of great concern – what if I had only taken 90 samples? If I took 110, could it change my conclusions again?

    Of course, McIntyre performed this test and found that by eliminating particular samples, the conclusions changed markedly. The reason for this is not just the decentred PCA, but the whole algorithm; clearly, having an up front process that can promote outliers is not a good start to developing a robust algorithm. Note that once you have identified a lack of robust to one small group of samples, you cannot reinstate it by testing against throwing out other groups.

    The failure of the step that selects temperature signals is not suprising. Inject trendless noise into the decentred PCA, and you get a band-limited step function out, with the centre of the step around 1902. The band limiting is a function of the colour of the injected trendless noise. This might be, say, a rising ramp from 1902 to around 1940, followed by a plateau from 1940 to 1980. That sounds to me like it might well be close enough to the historical temperature record for the algorithm to grab it with both hands. Looking at the proxy weights out of the algorithm does indeed support this.

  • fred // March 8, 2008 at 4:52 pm

    Lazar, stats is not my field. Are you saying there is no textbook or reference manual published in which how to do decentered PCA is is explained at somewhat greater length, and with some worked examples. We can find plenty of stuff on how to apply every other stats technique. There must be some material someplace on how to do this, besides Tamino’s blog!

  • Phil B. // March 8, 2008 at 6:14 pm

    Tamino, I just got through rereading MBH98 and I am a bit confused about your above post. MBH98 did a PCA on 1082 monthly grid cell temperatures during the calibration period and then averaged the PCs to obtain annual temperature PCs. MBH98 then regressed the proxies sets in a stepwise manner against the largest variance temperature PCs to obtain weights for each proxy to reconstruct the temperature PCs. IMO, Dr. Mann centered the proxies around the calibration period so he didn’t have to calculate offset terms as the grid cell temp PCs had zero means. Did I miss the PCA on the proxies?

    I was surprised that you did your PCA on synthesized data rather than actual proxy data that MBH98 used. Dr. Mann has made the data readily available and the files are relatively small. I think that it’ s useful to perform PCA on the proxies sets and look at scree plots of the eigenvalues/singular values. Caerbannog, as the first commenter, suggests that this is what he would do. I performed a PCA on MBH98 proxy set data 1400.dat for the 1400AD to 1980AD period (22 proxies) and the scree plot looks like what one might see from noise, but I would welcome your observations.

    Phil B.

  • Gavin's Pussycat // March 8, 2008 at 6:20 pm

    It warms this old furry feline heart to see some territorial instinct on display here… go get them Bulldog!

  • nanny_govt_sucks // March 8, 2008 at 7:07 pm

    [Response: Bullshit. It’s the abandonment of valid PC selection procedures that enabled MM to discard exactly the variation that corresponds to the hockey stick. Unfortunately, that particular variation *should* have been included according to objective selection criteria. As a result it’s no surprise that the MM result fails verification.]

    I’m confused. In the comments of the first part of this series you said:

    “Indeed there aren’t hard-and-fast rules about how many PCs to include in all cases.”

    So, what are “valid PC selection procedures “, and what are “objective selection criteria”?? It would seem that you are talking about hard-and-fast rules here.

    [Response: There aren't any hard and fast rules to cover all cases. In fact, Jolliffe mentions case in which the low-variance PCs can be what you're really looking for (like outlier detection). But MBH applied a procedure for selection. MM didn't follow that procedure when claiming to emulate them, just copied the bare number included.]

  • TCO // March 8, 2008 at 7:37 pm

    Did the MBH article mention the use of the decentered transform? Should this aspect of methodology been shared?

  • Ron Cram // March 8, 2008 at 8:37 pm

    Tamino,

    Did you bother to ask any statisticians about your post before posting it? You are not just opposing McIntyre and McKitrick. You are going up against Wegman and Von Storch and Zorita as well. Von Storch and Zorita published a peer-reviewed article on this topic. They found McIntyre was correct that decentering the data was wrong and resulted in an artificial hockey stick. They also claimed it didn’t “matter.” I had a long email exchange with Zorita about this. I asked how it could not matter and Zorita replied that it was because MBH had so many other errors. Von Storch and Zorita believe in global warming. They just want to see science done correctly. MBH was not.

    If you really think you are correct here, you should try to have your analysis published in a journal. If you can find a lazy editor willing to publish it, I can guarantee a clear thinking rebuttal from one of any number of well known statisticians.

  • Surly // March 8, 2008 at 9:07 pm

    Can some kind soul link to whatever article discusses adjustments to account for BCP carbon fertilization and the resulting temperature reconstruction? The claims that the hockey stick is “broken” seem to be based on the two issues: MBH’s PCA methodology and BCP CO2 fertilization. I am trying to get a handle on both those claims on the part of the “auditors” and would appreciate some references. Thanks!

  • nanny_govt_sucks // March 8, 2008 at 9:58 pm

    But MBH applied a procedure for selection.

    What was the MBH procedure for selection of PCs? Where is it described?

  • MrPete // March 8, 2008 at 10:14 pm

    Surly,
    There was a 2006 discussion of this at CA with a dendroclimatologist. He’s been studying BCP’s etc at Cirque Peak, a wetter location (near Sequoia N.P.).

    There’s conjecture that stripbark growth is due to fertilization, but it doesn’t really fit. We have an alternate hypothesis in development that is essentially non-climatic in nature. The following quote from the linked discussion may be helpful.

    To tie into the present PCA discussion… this is what makes statistical analysis interesting. If the physical evidence doesn’t really match the parameters you know, it is important to be very cautious about presuming that a good statistical “fit” actually means something.

    I (and probably most others) am not really certain how strip bark growth relates to photosynthetic area, root area and other metrics that drive conifer growth. It is also quite possible that the strip bark growth mechanism has little to do with climate – in which case the models for strip bark growth and normal trees might be quite different. I am not aware of any studies that have looked at this- not really my area of expertise.

  • dhogaza // March 8, 2008 at 10:15 pm

    I had a long email exchange with Zorita about this. I asked how it could not matter and Zorita replied that it was because MBH had so many other errors.

    List them. As presented, this is an empty assertion, since we have no way of verifying 1) whether Zorita said that 2) whether he was correct 3) whether you’re even telling the truth about having had such an exchange in the first place.

  • dhogaza // March 8, 2008 at 10:25 pm

    The claims that the hockey stick is “broken” seem to be based on the two issues: MBH’s PCA methodology and BCP CO2 fertilization.

    Given that you don’t need to use MBH’s PCA methodology to get a “hockey stick” from the data, why is this even an issue ten years later?

    Is there any reason other than to smear Mann as being incompetent, dishonest, or both?

  • John Cross // March 8, 2008 at 11:14 pm

    Nanny: You must have missed the part of Tamino’s post where he said “I have one request: before you raise other issues about the hockey stick, address this one.”

    So can you please point me to the part of your discussion where you addressed this. Thanks,

    John

  • TCO // March 8, 2008 at 11:16 pm

    Dghosza: Yes, there are reasons other than persecuting poor Mike to examine this work and look for errors. Importance of the result to policy. Sheer math/science interest. And note, it’s not just about “other methods give the same error”. In school, you get marked off if you have two errors that correct for each other for instance. This is not solely an advocacy issue. There is also a dimension of using proper stats methods.

    Maybe the Zorita comment is empty. But then maybe many blog comments are empty. These things are relative. Sure, I’d LOVE to have a complete discussion from Zorita of what he means and how it happens. But the comment in and of itself is interesting to me. As it encapsulates a concept. A concept that cuts both ways…against Mann…and against Steve M. IOW, the step while flawed is compensated for with other effects. The way that this cuts against McI is that he has been unwilling (and sophistic) in describing the exact nature and impact of each posited flaw. McI tries to conflate things so that the headliner issue decentering carries the weight of other aspects of the algorithm. Also, Zorita is one of the few (Huybers another) that I really trust to look at this thing without rhetoric games that I (an opinion, but based on 43 years on the planet and lots of science and lots of debates on science and observation of human nature) see from Mike and Steve.

  • Surly // March 8, 2008 at 11:16 pm

    dhogasa, that is precisely why it is still alive because this is not about evidence or science but about smearing and raising doubt. In my experience on various unrelated forums, “skeptics” inevitably bring up the hockey stick as proof that global warming is wrong. They dredge up the PCA argument and BCP argument and I for one would like to be able to respond . The “hockey stick is broken” meme is still out there circulating in the public debates and I don’t have anything to post in response to it. I can at least point to this post to respond to the PCA issues, but the BCP argument remains.

  • Hank Roberts // March 8, 2008 at 11:19 pm

    Look, he didn’t use the best possible tool in the best possible way to get the correct result, he just got the correct result (within 0.05 degree C, as noted).

    The 0.05 degree C error is proof he wasn’t perfect.

    Notice none of the climate scientists even get involved in these discussions any longer? Nobody who’s published in the field previously has even a “submitted to” paper online.

    Hobby horse stampede.

  • TCO // March 8, 2008 at 11:35 pm

    Hank: how do you know he got the right result? Have you been in a time machine to check the MWP? Lots of other reconstructions (Moberg, Esper) have different MWPs. Also, the Burger 2005 paper shows how different methods choices can make large differences in the recons (see the spaghetti graphs).

    If you’re just talking about there being a .05 difference, but about McI inflating it…well…I sympathize. McI has a nasty tendancy to conflate things, to resist disaggregation and examination of scale.

    BTW, if he DID use the wrong method, why is it so hard to get an admission? Why the refusal to just rerun with correct method? That is ego science…and I know ego scientists.

  • Hank Roberts // March 9, 2008 at 12:00 am

    >admission
    Read subsequent papers; methods are always improved. The only time you find an ‘admission’ is something like the Christy mistake where he did admit an error, or the journal where the editors resigned after publishing a bad paper. That’s how you tell an error in a profession.

    If everyone still used the same methods used 20 years ago you’d think that was proof they were doing the wrong thing too, wouldn’t you?

    You folks setting yourselves up as judges are like the people in the bleachers shouting at the players on the field who you’re there to watch.

    They’re not there for your coaching.

  • Lee // March 9, 2008 at 12:32 am

    TCO says:
    “Why the refusal to just rerun with correct method? That is ego science…and I know ego scientists.”

    TCO, I strongly, strongly (did I say strongly, strongly enough?) disagree. That is normal science.

    When someone’s early work is improved by later analyses, one does not normally go back and redo the early work with the new methods. The published improved analyses ARE the improved analyses. Good scientists take the improvements and corrections and apply them to new work going forward.

    Why should Mann go back and redo this founding (and therefore by definition least sophisticated) work, when there are many subsequent improved works already out there?

    To me, the continued hammering on the founding paper by Stevie Mac et al is an avoidance of the newer work – and it does not place any onus on Mann to redo the founding work.. This seems especially true since I spent more time than I like to admit poring through CA trying to figure out what the hell they were saying in their occasional scattershot comments on the newer work. It has become hard for me to believe that the incredible obfuscation of argument by SM over there is entirely accidental – he’s too smart for that. The fact that he engages in what is IMO dishonest behavior – altering his posts without notice, for example – doesn’t help.
    Why should Mann go back and redo superceded work because someone like SM refuses to stop chewing on it?

  • JCH // March 9, 2008 at 12:50 am

    Whoops.

    Well, if he did, would all these baying boobs of bountiful boo-boos finally shut the fiddlesticks up?

  • TCO // March 9, 2008 at 1:03 am

    If it was in error, he should admit the error. There is something truculent about both Mann and McI.

  • Hank Roberts // March 9, 2008 at 1:08 am

    Remember you’re in the bleachers. You’re part of the audience, and screaming instructions at the player is merely recreation.

    The important papers around the editorial resignations are here, as references –see links at July 1.
    http://www.arp.harvard.edu/sci/climate/journalclub/

    That’s an example of a mistake.

    This is an example of finding a mistake:

    Correcting Temperature Data Sets
    John R. Christy, Roy W. Spencer;, Carl A. Mears, Frank J. Wentz;, Steven C. Sherwood, and John R. Lanzante
    Science 11 November 2005 310: 972-973 [DOI: 10.1126/science.310.5750.972] (in Letters)

    ……correction being used by Christy and Spencer for the lower…troposphere had the opposite sign from their correction for the middle troposphere sign, we knew that something….
    We question why Christy and Spencer adopted … found a previous error in their methodology……

    Read the followups to see Christy et al. acknowledge that a mistake was made, and subsequent corrections propagating through other people’s work who had relied on the mistaken paper:
    http://www.sciencemag.org/cgi/search?session_query_ref=rbs.queryref_1205024415139&COLLECTIONS=hw1&JC=sci&FULLTEXT=%28christy+AND+sign+AND+error%29&FULLTEXTFIELD=lemcontent&RESOURCETYPE=HWCIT&ABSTRACTFIELD=lemhwcompabstract&TITLEFIELD=lemhwcomptitle

  • Surly // March 9, 2008 at 1:55 am

    “It has become hard for me to believe that the incredible obfuscation of argument by SM over there is entirely accidental – he’s too smart for that. ”

    Agreed. One recent post regarding Hansen has the “I don’t mean to suggest any wrong-doing, but” tone to it. Too many posts there do the “When did you stop beating your wife”trick and while SM is unwilling to say it outright , the other posters aren’t so reticent. Then SM says he’s too tired to clean up the posts. Sure.

  • George // March 9, 2008 at 2:15 am

    TCO says:

    “If it was in error, he should admit the error. There is something truculent about both Mann and McI.”

    Why? What difference does it make?

    As far as the science is concerned, even if Mann got the “right answer for entirely the wrong reason”, that would not matter one bit.

    The only thing that matters is whether his conclusion is correct — and if the analysis done without using PCA also yields a hockey stick (which it does), then whether Mann did the PCA analysis correctly is basically moot as far as the science is concerned.

    Johannes Kepler made errors in his calculations and nonetheless got the right answer.

    No one tries to claim today that “his findings are bunk because he made arithmetic errors”. That would be absurd.

    From the standpoint of science, the only thing that matters is that he was right.

  • MrPete // March 9, 2008 at 3:33 am

    From the standpoint of science, the only thing that matters is that he was right.

    Unprovable.

    Methods and data are important for many reasons, not least because others use the same or similar methods and data. If bad data and methods are reused significantly in a small community, you get a “consensus” that’s built on a house of cards.

    When methods are obfuscated such that others are unable to replicate the work, and the methods cannot be confirmed as having statistical validity, we’re left not knowing if the results are correct or not.

    Worse, when earlier work does not disclose statistical methods, and years later after the methods are decoded, it is discovered the earlier work did not use standard statistical methods… do we just say “hey, move on” and ignore the issue?

    Do we sit by quietly if the same “game” is being played with newer work? What should be done if “alternate” or “refined” statistical analysis techniques are employed? And details are not disclosed? And the new techniques are not run through statistical peer review for validation?

    I suppose we should appreciate that as creative entrepreneurialism in 21st century science?

  • John Mashey // March 9, 2008 at 3:50 am

    TCO mentioned “Importance of the result to policy.”

    As far as I can tell, none of this has the slightest importance for policy. [It's wonderful tutorial.]

    Suppose a) we had 2000 years’ of current-technology temperature data [time amchine] or b) We didn’t have any of the data used in MBH before 1500, or c) MBH had never written the papers.

    What, exactly, in case a) or b) or c) would we do going forward? We’d know more in a), a little less in b), and in c), I suspect others would have done similar reconstructions sooner or later.

    Would Greenhouse physics change? Would First and Second Laws of Thermodynamics be repealed? Would the energy imbalance of the Earth disappear?
    I.e., see Fig 2 of http://pubs.giss.nasa.gov/docs/2005/2005_Hansen_etal_1.pdf

    If not, exactly what policy difference is there?

    Anyway, can anyone suggest *exactly* what the policy difference would be?

    All of this seems like: a $B company is losing money, but someone (not even a strockholder) thinks its CFO was off $1 in the 1998 tax returns and they should go back and GET THAT FIXED before they do anything else, and if turned out it was right, well, there’s always 1999.

  • George // March 9, 2008 at 3:50 am

    Mr pete:

    Analysis done without using PCA also yields a hockey stick.

    So, how does it matter one way or the other whether Mann was right or wrong with the PCA analysis?

  • Hank Roberts // March 9, 2008 at 4:02 am

    What matters in science is kind of like what matters in evolution, where fitness is approximated by how many grandchildren you have.

    In science, it’s how many citations build on your work, how many people found something useful to extend, or improve on, or confirm, or even disprove.

    That’s why I pointed to the way the Christie error was handled — it was the basis for quite a few years of other people’s work and a gazillion blog posts; it was in error; it was acknowledged; the papers citing it now point back to the correction and much got rewritten.

    This is why you can click those buttons for subsequent citations at science sites.

    Very few papers have corrections that significant (where a sign change changes the basic conclusion that two things were different — and the conclusion is that they weren’t different after all).

    What matters now is his subsequent work and what others find useful.

    Science isn’t a mighty oak with a single deep root.

    Science is kudzu. It grows at the young new end fresh each time putting roots wherever there’s fresh substance to draw on.

    Science thanks the old work with citations — but it’s moved beyond them.

    That’s why the first step is to replicate — your own lab, your own data collection — the previous work.

    This is why scientists don’t “audit” — they replicate.

    It’s not a religion. Founders are nice people but they’re just people, and their work is no better than it has to be.

    What counts is what’s going on now.
    The people doing science in this area moved on long ago.

    Funny how come so many people want to pretend they’re discussing climate change but don’t talk about current work.

    Why? Know anything about what’s being published these days?

    Scary, isn’t it?

  • MrPete // March 9, 2008 at 4:33 am

    Analysis done without using PCA also yields a hockey stick.

    Depends on the data and methods. Analysis with/without PCA also yields MWP and LIA. So what?

    Me? I just wanna see some good solid studies: well-founded data, well-founded methods, proper CI analysis, etc etc. There’s not much of that in the “contested” arenas at this point.

    Funny how come so many people want to pretend they’re discussing climate change but don’t talk about current work.

    Funny how so many people want to “pretend” (your word, not mine) they’re making progress on climate change research but don’t talk about current data either.

    Hank, I wouldn’t slam the skeptics too much for being a bit behind. It’s easier to crunch through everything when you have a budget, cooperation and visibility.

    Plenty of surprising/confusing material out there to keep everybody curious, AFAIK :)

    (Why does it take so much longer for good data and good work to be published, just because it doesn’t fit the received wisdom? No reason to be reluctant to publish divergence-problem data, etc. It’s that stuff that keeps science lively.)

  • TCO // March 9, 2008 at 4:53 am

    The rationale for admitting errors however small is so that we can have real examination of things that moves to truth. This is the way of the scientist, not of the sophist, not of the lawyer. Both McI and Mann are truculent. They refuse to answer questions. Refuse to admit minor errors. I assume they justify this with some feeling that others will seize on admissions to make wider claims of error. And based on what I usually see…I agree that their opponents would do that. But it doesn’t matter. If you are a true scientist. A true mathematician.

  • nanny_govt_sucks // March 9, 2008 at 5:18 am

    Nanny: You must have missed the part of Tamino’s post where he said “I have one request: before you raise other issues about the hockey stick, address this one.”

    Tamino brought this up on this thread, not me. He said: “But MBH applied a procedure for selection.”. I’d like to know what he’s talking about.

  • dhogaza // March 9, 2008 at 5:40 am

    Depends on the data …

    In the context of evaluating whether or not Mann’s statistical approach is valid or not, this is …

    a goalpost move.

    Why do you people always do that?

    Many posts ago (over a 100) I said to HB:

    Why not just delete those comments that aren’t relevant to your post, i.e. attempts to divert the discussion to “strip-bark samples”, etc?

    And that’s EXACTLY what you’re doing when you say “depends on the data”.

    Which, I might add, is a tacit admission that the statistical attacks don’t add up to SQUAT.

    When you move the goalposts, you are tacitly conceding the original point (not that I expect you to be honest enough to admit it).

    Why does it take so much longer for good data and good work to be published, just because it doesn’t fit the received wisdom?

    Prove your assertion. What good data isn’t being published and good work isn’t being published because it doesn’t fit the received wisdom.

    Sorry, as long as HB allows me to post here, I’m not going to allow unsubstantiated bullshit like this appear without my challenging it.

    Prove your statement. Prove that 1) the data you claim is suppressed is “good” and 2) the work is “good” (I presume you mean analysis?) and 3) it is because it doesn’t meant “received wisdom”.

    You do realize that when Hansen etc first started making claims they were laughed out of the house, right? Yet they managed to get published. They never whined the way you are.

  • George // March 9, 2008 at 6:14 am

    Mr Pete claims:

    I just wanna see some good solid studies: well-founded data, well-founded methods, proper CI analysis, etc etc. There’s not much of that in the “contested” arenas at this point.

    And you are the one who decides what is “good”. “well-founded” and “proper”, right?

    How convenient. :)

    Care to tell us what your qualifications are for making such a sweeping judgment?

    University credentials in climate science or a directly relevant area?

    Peer reviewed papers in climate science?

  • Hank Roberts // March 9, 2008 at 6:21 am

    http://www.realclimate.org/index.php/archives/2007/05/hansens-1988-projections/
    and
    http://pubs.giss.nasa.gov/abstracts/1988/Hansen_etal.html
    “… Principal uncertainties in the predictions involve the equilibrium sensitivity of the model to climate forcing, the assumptions regarding heat uptake and transport by the ocean, and the omission of other less-certain climate forcings.”

  • fred // March 9, 2008 at 6:50 am

    I am still puzzling over this question of decentering. Its not my subject of course.

    Can no-one point me to a textbook account of how to do this, with a few worked examples, and some discussion of why it is the same and in what circumstances its the same as conventional PCA? The presentation cited is not what’s meant, I can’t get to a recipe and explanation from it.

    My confusion is something like this. We seem to have a procedure which conventionally is carried out using the mean of the series. Someone carries it out using a mean of a subset. I can’t see how this can be either correct or give the same results as using the mean. Once we start picking something which is not the series mean, can we pick anything? Like, can we pick in a 100 item series either data points 90-100, or else 1, 5, 26,31….etc? If not, why not?

    In fact, to generalize it, why not pick any number at all? If you are not using the series mean, why not just make up a number?

    Obviously you cannot pick anything but the series mean when computing most basic statistics. So why is PCA different?

    Perhaps my professional background with its focus on defined recipes and procedures is getting in the way here, but it is not clear. I can’t tell from the descriptions and comments how to implement this thing, or how to know when its been done correctly. Were one writing a program which would do decentered PCA in some cases but not others, at the moment I’d have no idea how to either write the decentered PCA part, or to write the selection algorithm so it picked the ones where it is appropriate.

    Always knew that sooner or later this was going to lead to learning R. Oh well. There are a couple of good textbooks on that.

    [Response: I don't know if there's a textbook exposition. But textbooks are generally behind the times when it comes to analysis. If you restricted yourself to textbook methods on Fourier or Wavelet analysis, you'd not only be behind the times, in some cases (especially regarding the analysis of unevenly sampled data), you'd actually be in error.

    But for your sake, I'll do *yet another* post in this series. About non-centered PCA in general. I'm working on a post on sea level at the moment (at another reader's request), so it'll have to wait for that.]

  • Patrick Hadley // March 9, 2008 at 10:27 am

    John Mashey asks whether all this is important to policy. On the one hand he is right, the physics of GHG is not in any way dependent on MBH.

    There are however four aspects in which this argument is very important.

    The first two relate to the medieval warm period. The graph used by IPCC in 1995 showed a sustained warm period between 1100 and 1400 AD. Before MBH it was accepted that the MWP was warmer than the present and that this period was a period of general worldwide prosperity.

    If the MWP did exist then it would help (but obviously not prove) two arguments used by sceptics: that the world benefits from warmer temperatures; and that the recent rise in global temperatures is not unprecedented and therefore one cannot argue that just because warming has followed a rise in CO2 it must therefore be caused by this rise.

    The third way in which this argument is important is in questioning the methods and principles of the climate science consensus. If it could be shown that a very influential and widely distributed graph was based on poor scientific methodology and dodgy statistics, this would give critics the chance to question whether other pillars of the AGW argument are similarly defective.

    Fourthly the argument is important because if it can be shown that after serious errors in MBH were proved by McIntyre and McKitrick, the AGW establishment, far from welcoming this criticism as a major contribution to the scientific process, instead mocked MM’s work; and various attempts were made to cover up the errors they found and deny them. If this were true then it would also greatly undermine the credibility of the global warming consensus.

  • Chris O'Neill // March 9, 2008 at 11:02 am

    George:

    If the hockey stick shape is indeed an artifact of PCA, as some claim, then why do I see it when I look at this?

    Indeed, and I could also ask why we see it when it is produced by up-to-date methods such as regularized expectation maximization. I think the answer is that in hockeystick denialism the strategy is to concentrate on an obscure debatable issue (non-centered PCA), even though it is rendered completely insignificant by later developments (the use of regularized expectation maximization instead of PCA), in the hope that enough people won’t realize its complete insignificance. Anyway, it’s still worth annoying the hockeystick denialists (by pointing out their misunderstanding of non-centered PCA) if for no other reason than confirming the understanding of people who are not as well educated as our blog host.

  • MrPete // March 9, 2008 at 12:46 pm

    dhogaza, you said prove…

    Obviously, some of these things are scientific statements, and some are “intent” statements impossible to prove. All one can do is look at evidence and watch.

    ‘twould be nice if someone had time to keep a record of such things, like the truth in advertising folks. I don’t have that kind of time.

    But there’s plenty of evidence out there.

    AR4 was published in 2007. Compare:

    * the speed with which certain “confirming” papers got referenced (with original paper, references, and AR4 dates all piling up at the “finish line” to get hurried into print, even though the “confirming” paper came incredibly close to not being published at all due to its serious problems). Obviously I’m thinking of W&A etc.

    * The lack of uncomfortable data and papers surrounding the infamous Divergence Problem. Happily, some younger dendros have been getting involved recently, yet in general, data from updated tree ring studies gets published and archived with not-quite-glacial slowness (particularly relative to other papers such as mentioned above.) And the young dendros weren’t exactly represented in AR4, although the related data has been reportedly being collected for many years.

    * We have the intriguing case of a grad student working under “H” of MBH fame. After his student’s work was approved as a PhD thesis, the thesis advisor himself still used the old data. And the new data has gone missing. And nothing has been done.

    To echo Hank’s question, why are W&A, H, AR4, etc etc so intent on delving further into propping up the old hockey stick formulation based on old data, when new data has been collected, worked through the grad student slave labor mill (how do you do that cross-out thing, Tamino? :-) ), and certainly ought to be available for analysis and publication.

    From personal experience, it does not take years to move tree ring data from collection straws to usable form.

  • Hansen's Poodle // March 9, 2008 at 1:02 pm

    Tamino, any reason why you refuse to release Jean S’ answer? I hope it’s not censorship and it was lost in cyberspace.

    [copy of JeanS's "answer" removed]

    [Response: It's personal.

    We were having a heated discussion, tempers flared from time to time, passions were aroused, but it was pretty much at least at a *minimal* level of civility -- until JeanS came along. Then it's "repeat that lie" and and "bullshit" and the statement that I have no intellectual honesty, and other things specifically targeted at me, designed to annoy me. Personally. It did. If he chooses to make it personal, so be it. I don't have to host it on my blog.

    The "answer" has more of the same, pretty much repeats the same claims as made in the original, and waves credentials around. I find him offensive, and arrogant on a galactic scale. If that's the way he chooses to argue, that's his choice. I don't have to pollute my blog with it.

    I didn't just make this stuff up. As an earlier comment points out, my rebuttal isn't really my own creation, it borrows heavily from the response on RealClimate by Mike Mann himself, here and here. My main contribution was the exposition. Those who are keen on this issue can find more information there, including discussion of *more* of the mistakes of MM. Highly recommended reading.

    It's crystal clear that he will *never* concede anything, he will *never* let it go, he'll *always* have to get the last word, and it won't be polite either. Hell, McIntyre has been hammering on this for years. It's the unruly but often effective strategy of dominating an argument by volume. I've stated my case, he's made his. I guess they really don't have anything better to do with their time than continue arguing forever; I do.

    I'm confident those who are desperate to know, can read all about it on CA.]

  • fred // March 9, 2008 at 1:22 pm

    http://www.uoguelph.ca/~rmckitri/research/fallupdate04/MM.resub.pdf

    Tamino, it may be a bit different from what you think. Go through the bits about weighting. Or maybe I’m not understanding it, that is possible too.

  • John Cross // March 9, 2008 at 2:16 pm

    Nanny: well, over a half a year ago I submitted a list of replies to some points that you made (you can find it in this thread ).

    You brought that up on that thread and I would like to know what you are talking about. In fact, you have said several times that you were working on an reply. Perhaps you should hold off opening new cans until you can finish your detailed analysis.

    Awaiting your answers.

    John

  • Meltwater // March 9, 2008 at 3:00 pm

    fred said:

    It would also help if we could ban the expression ‘intellectual honesty’ from this debate – which should be about something perfectly objective, the use of statistical methods. Helps not in the slightest.

    On the contrary, people just beginning to do their homework on this issue need to be warned that they will encounter debaters as well as arguments that are intellectually dishonest. Frankly, they need to be warned also that their own sincere questions may be mistaken for such dishonesty because of the extent to which it has come to pollute the debate. mikep said:

    This discussion is losing sight of the simple issues.

    Keeping the simple issues in sight can help keep us honest. One of the simplest is to keep asking: Given the accelerating rate of human unburied carbon emissions, how could a temperature ‘hockey stick’ not result? CO2 is one of several gases that absorb or scatter infra-red photons, mostly from the surface of the Earth, and heat up as their molecules are kinetically excited by the scattering and absorption. This fact of radiation physics was discovered by the likes of J.B.J. Fourier and John Tyndall between 190 and 150 years ago. What physical mechanism could possibly prevent climate sensitivity to so much kinetically excited CO2? A few have been proposed but those proposals have not fared well in the scientific literature. Unless a proposal of greater merit can be advanced then the smart money will still bet that the ‘hockey stick’ is right. Remember, it is part of a larger picture that has been patiently assembled over several half-centuries of peer reviewed science. Temperature measurements, weather observations, paleoclimate core and proxy data, and satellite surveillance together yield a picture of climate that cannot be modeled, reproduced or explained unless the CO2 sensitivity is around 2.3 Celsius or Kelvin degrees, or 4.14° Fahrenheit, at a minimum. People intent on finding fault with the ‘hockey stick’ deserve to have their arguments and their intellectual honesty challenged. For the ‘hockey stick’ to be wrong, a great deal of other science has to be wrong too. How could it all be wrong? Where is a compelling reason to conclude that so much science is wrong? By giving us this lesson on Principal Components Analysis, Hansen’s Bulldog, slash tamino, strengthens our confidence in an alternative explanation: the statistical methods that produced the ‘hockey stick’ are under politically motivated attack, and some of the most prominent attacks are without valid mathematical or scientific foundation.

  • Surly // March 9, 2008 at 3:43 pm

    “If it could be shown that a very influential and widely distributed graph was based on poor scientific methodology and dodgy statistics, this would give critics the chance to question whether other pillars of the AGW argument are similarly defective.”

    And this is the crux in a nutshell — the goal of the denialists (I won’t call them skeptics) is to discredit the hockey stick and thus, they hope, discredit the whole science of global warming. That’s why this is more than an academic debate, although it is that, too. It’s a political and policy debate as well. As long as the meme of the “broken hockey stick” remains in circulation, the other half of its premise — broken theory — will circulate and give denialists something to wave in front of everyone.

  • Hank Roberts // March 9, 2008 at 3:44 pm

    > To echo Hank’s question,

    Don’t put words in my mouth with your dirty hands. You’re getting your material from CA, don’t pretend you’re echoing my question.

    Third derivative of acceleration.

  • P. Lewis // March 9, 2008 at 3:55 pm

    Re George // March 9, 2008 at 6:14 am

    Mr Pete claims:

    And you are the one who decides what is “good”. “well-founded” and “proper”, right?

    How convenient. :)

    Care to tell us what your qualifications are for making such a sweeping judgment?

    University credentials in climate science or a directly relevant area?

    Peer reviewed papers in climate science?

    You forgot one or two probabilities possibilities, George: Climate Audit? Watts Up With That?

  • mmghosh // March 9, 2008 at 4:36 pm

    Mr Hadley

    I do not think that anyone disputes the presence of a medieval warm period in Northern Europe. The point that is made by by the climate scientists consensus, is that it was not a global phenomenon, because it is not found in the paleoclimatology data from the rest of the world.

    The next point about the MWP is that it is really a red herring. And that in the sense that even it were true that a warmer earth could sustain the population of the Middle Ages, the higher temperatures that are seen today and, if they do rise higher, will cause serious problems for an estimated population of 9 billion humans and commercial establishments in 2050.

    Many orders of increases of magnitude of people now live in flood plains – New Orleans or Galveston or Gangetic Bangladesh are examples. We are talking about a possible very large migration of poulations which are not feasible today as they perhaps were in the 1300s.

    As you are aware, the next cycle of warming will include the next next El Nino phenomenon and a rising curve of solar irradiance sometime in the next five years. GHG aggravation of such a warming cycle has the potential for major consequences.

  • George // March 9, 2008 at 4:40 pm

    Mr pete said above

    I just wanna see some good solid studies: well-founded data, well-founded methods, proper CI analysis, etc etc. There’s not much of that in the “contested” arenas at this point.

    When I asked him above about his qualifications/expertise related to climate science that enabled him to make such a sweeping generalization — “There’s not much of that in the “contested” arenas at this point” — I simply got silence

    Perhaps Mr Pete missed my question but I would just point out that expertise in the area in question is highly relevant when assessing the current state of the science.

    After all, if someone with no background in physics says that there’s not much in the way of good science going on in the area of high energy physics today, at a minimum, we need to know whether he was competent to make such an assessment to begin with. We don’t just take his word for it.

    I would note that, without listing EVERY current study, it is virtually impossible to prove the claim that “There’s not much of that in the “contested” arenas at this point.”

    And the implication that what is going on is “not solid”, “not good”, “not well-founded”, “not proper” etc, is highly subjective, at any rate, so such a claim has questionable value, at any rate (IMHO).

    When it comes down to it, we simply MUST rely on the judgment of the person making the statement — which depends a great deal on their knowledge/background in the area they are judging.

    So, Mr. Pete.

    Let me ask you again, upon what climate science-related “expert basis” have you come to the conclusion that “There’s not much of that [well-founded data, well-founded methods, proper CI analysis, etc etc] in the “contested” arenas at this point.”

    If I get no answer about your credentials in climate science or peer-reviewed papers or even links to other non-peer-reviewed papers that you have written, I will have to assume that you have no expertise in the relevant area and therefore are simply not knowledgeable enough to be making such a sweeping assessment.

    …and I will feel confident that i can safely ignore your sweeping claims about how bad the current state of the science is from here on in. :)

  • TCO // March 9, 2008 at 4:57 pm

    Irregardless of denialists trying to shoot down MBH as a pillar of the temple before going after the rest of the building…or of alarmists propping up weak timber because the whole building is so important…THE QUESTIONS themselves are analytically interesting. It’s just so easy for both sides to refuse to engage on specifics because of not trusting the other side. It’s like the refusing to admit an error because the other side will think that it implies more faults elsewhere. WHO CARES. Be UNILATERAL!

    I understand Tammy’s concerns with JeanS. He does spout on so much about credentials. Big whoop that he has a Ph.D. It’s always the weakest ones who insist on being called Doctor. However, Jean does understand the math here better than most of us commentators so it would be interesting to see Tammy and he examine things to deeper levels. I have also found JeanS to be a bit imperious…to get upset at valid even interesting points (the times I’ve gotten “nuggets” by chewing away without knowing the math) if not expressed with knowledge of linear algebra and the like. But the good thing is that Tammy and Jean both know R and linear algebra and the like. So there is not the room for such tangential putdowns of knowledge level to distract from critical disaggregation and examination of issues.

    It’s unfortunate that Tammy sees this as a situation of “public teaching” relying heavily on RC defenses of the Mann paper (Mann is a prominent member of RC…so he’s defending his own work). Instead a fresh look at things would be helpful. Less Bulldog, less Pussycat. More Richard Feynman (he pissed NASA off by asking penetrating question after penetrating question, when they wanted to give fluffy slide presentations…but Dick actually wanted to understand stuff. He told the NASAites, “yes this seems painful, but trust me, I speed up. Taking things apart like this is needed” (or words to that effect, don’t have the book in front of me.)

    (I have a bunch of methodology questions left, which I will post separately. Unfortunately, I don’t know lin alg. or theoretical stats. However, it doesn’t scare me a bit from asking critical questions. I am really cocky enough to show vulnerability.

  • TCO // March 9, 2008 at 5:23 pm

    1. The “canonical” word makes me wince. Reminds me of stat mech that I never understood. Can you say what this means with a simpler word? Please?

    [Response: It's the word commonly used. One might, I suppose, say "standard" or "universal" or "most common," although in this particular case none of those -- not even "canonical" -- necessarily applies.]

    2. With the canoe (great example btw), this feels like an example where we don’t want to standardize. Not only do we not want to inflate the noise (or perhaps slow sinking) of the canoe in the Z direction, we want to understand EW versus NS in terms of absolutes. Am I thinking about this right? Comments?

    [Response: You're quite right. This is a case where we know ahead of time that the units for EW, NS, and vertical, are the same size, and we can fully expect that the errors will be the same size in all directions, while standardization will in fact undo that useful property. So in fact this isn't such a great example -- it illustrates one issue very well, but fails to deal with the normalization issue. For the canoe problem, we should probably use the covariance matrix (no normalization) rather than the correlation matrix.]

  • null{} // March 9, 2008 at 5:30 pm

    George, how are we to determine that you are qualified to evaluate/assess the qualifications of MrPete? Is it necessary for you too also pony up some peer-reviewed papers published in all the right journals, of course. so that we can make that determination.

    “Third derivative of acceleration.”

    I think you mean third derivative of displacement.

  • TCO // March 9, 2008 at 5:38 pm

    3. “The real essence of PCA is that it enables us to define a new coordinate system for our data space, one based on the observed variations of the data itself, which gives us an excellent chance to reduce complexity, possibly even reveal some insight. ”

    Yeah…no…agreed. PCA is this really cool tool for data compression (if you’re short of data space, which isn’t the case much nowadays), for calculational efficiency (similar), and ESPECIALLY for looking for patterns. Everyone who talks about PCA, always mentions it’s strength in terms of initial inspection, pattern finding. But if you KNOW what your input variable of interest is…that’s not initial inspection. I mean in the book SUPER CRUNCHERS (written by a liberal, btw, and very popular), they talk about a model for predicting red wine taste upon aging which is based on two factors (summer temp and something else). Was that honker derived by regression against PCs (leaving part of the data behind) or by regression against the data itself? It’s one thing to find patterns using PCA. Another to do the regressions only against PCs or to declare that temp is concentrated in a particular PC, etc. etc. And what does that mean if temp is “prominent” in PC1 based on the (somewhat unusual and non-documented) acentric standardixation), but not with a normal correlation matrix. (sorry if I wander, I’m drinking bloody maries and typing on the wireless.)

  • fred // March 9, 2008 at 5:59 pm

    “People intent on finding fault with the ‘hockey stick’ deserve to have their arguments and their intellectual honesty challenged.”

    No, they need to have their arguments challenged. The rest is pointless personal insults which get no place.

  • TCO // March 9, 2008 at 6:02 pm

    4. I really don’t get your concern with the overfitting on regressions. So much to the extent, that I wonder if you are not quite considering this correctly. Yes, I know the sheer impertinance of suggesting such! :) Seriously, I can do a regression of temp versus ring width and see the impact and make a model with minimal factors. That’s perfectly normal. People build MC models all the time in chemical engineering and leave all kinds of degrees of freedom left over and never touch a PCA.

    5. There is also an issue (regardless of using PCA or some other form of training and weighting individual series) of how important are the series really? You might see this superimpressive number of proxies…but some are carrying much more weight than others. So that the degrees of freedom and overfitting concern becomes more of an issue. It’s not like having 100 independant predictors. It’s like having 30. Because the rest got fed through the training hopper and rejected. Capisce?

    6. There is also an important issue of what we train on. Is it really reasonable to have local proxies training on the global temp. Fancy it up and call it a “field”, but it is still an ambitious teleconnection. Is there a strong (proven statistically, prior) basis for saying that local proxies measure the global field? And the claims of having worldwide coverage become a bit tricky. You DON’T have a set of global gridded temps like with Hansen and the thermometers. What you have is a global set of “global thermometers”.

  • TCO // March 9, 2008 at 6:05 pm

    Tammy: It’s a great example. And don’t worry that the example spoke against a different aspect. I think it’s nice that we can occasionally reach some understanding despite disagreement on other issues, like the greatness of the Washington Redskins and George Allen. Thanks for kind reply.

    [Response: Agreed on George Allen. I suppose if I mention Bill Belichick I'll get slammed?

    (Note: just humor. Please nobody start a Belichick war.)]

  • Patrick Hadley // March 9, 2008 at 6:12 pm

    mmgosh, I did not intend to introduce a lot of red herrings into the thread by mentioning the MWP, benefits of warming etc, just to give my opinion about why the MBH hockey stick is still an important topic of discussion.

    Incidentally, while I gave four reasons to justify the continued debate about the HS, if Meltwater was right when he wrote ” For the ‘hockey stick’ to be wrong, a great deal of other science has to be wrong too. ” then it would seem that the stakes are even higher.

    [Response: I disagree strongly with the "a great deal of other science has to be wrong too" part. But I don't speak for my readers, just as they don't speak for me.]

  • TCO // March 9, 2008 at 6:21 pm

    I guess anothe vague question/concern that I have is in the method of training for decision to keep/reject proxies (or to weight them). For instance, is this like “wiggle matching” of tree rings themselves, where you really have a lot of individual data points and variability and can really feel good that you have validated a match. Or is this essentially a case of a broady increase in 20th century of both temp and of bcps, but with little “wiggle matching” so that you are really fitting to single data point matching (a trend to a trend). Therefore, we don’t feel as confident in the physical driver as if there were a lot of wiggling and response going on.

  • TCO // March 9, 2008 at 6:33 pm

    Previous was 7, this is 8.

    8. Is there a book or a procedure that tells one, that one should do PCA, throw away the lower PCs and then build a predictive model based on the regression of temp versus higher PCs?

    9. I wonder if we do what I said in 8, how it affects predictive efficiency and cofidence intervals. Let’s say we go back to the wine Super Cruncher’s example. And we use PCA to find some tricky hard to find signal. And then build a predictive model based of off it. Won’t it have huge CIs because the noise is still there (that was a driver for using the PCA to find a tricky signal in the first place). And will one really get better predictive efficiency for wine values in the future? If you were a Bayesian betting man, would you prefer the simple model trained on the data itself or the one trained on the PCA? And this is actually an economic esmple. you really can decide to speculate on new wine….

  • george // March 9, 2008 at 6:36 pm

    Null asks:

    “George, how are we to determine that you are qualified to evaluate/assess the qualifications of MrPete? Is it necessary for you too also pony up some peer-reviewed papers published in all the right journals, of course. so that we can make that determination.”

    I would think that it would be pretty obvious that the answer to your last question is “no.”

    I am not the one making sweeping generalizations about the state of the science (at least in the “contested arena” in question), as Mr. Pete has done above:

    I just wanna see some good solid studies: well-founded data, well-founded methods, proper CI analysis, etc etc. There’s not much of that in the “contested” arenas at this point.

    And as far as assessing Mr. Pete’s qualifications, where did I do that?

    I am merely interested in knowing whether to take him seriously.

    It is important (for me, at least) to know whether he has the relevant background to be making the general claims he has.

    If he has no qualifications (or if he chooses not to answer), I for one, will assume that he is not qualified.

    Others — including yourself — are free to make their own decision in that regard.

    I would note that it is not specific claims/implications (ie, ones that can be checked) that I am questioning. One can assess those based on their own merits. But it is the very broad claims/implications that I question: there are no “good solid studies: well-founded data, well-founded methods, proper CI analysis, etc”.

    Surely, you can see the difference?

  • TCO // March 9, 2008 at 6:50 pm

    10. Jollife presentation:
    A. I don’t understand all the terminology and linear algebra. Is the centering he’s talking about the same as what we talk about? What’s “column centering” or “double centering”? does he ever really adress the issue of Mannian decentering?

    B. Is this the best? There’s no academic paper? No math runs showing how different methods (acentered, decentered) give different numbers in different situations and how one is better for a problem than another?

    C. Jolliffe presentation seems to have several cautions against not centering (slides 21 and 38).

    D. Doesn’t seem like he is really discussing “acentric offsets” as much as to center or not to center (i.e. raw versus mean deducted).

    P.s. if in your response, you can distinguish between places where I just don’t understand Jollife and where we are in disagreement but I have a point, I appreciate it.

  • TCO // March 9, 2008 at 6:53 pm

    1o.e.: If you go to the Jolliffe site itself he has several posters on PCA or presentations on it that are interesting. I couldn’t get understand them all, but they mostly seemed to concern being careful (both positive and negative) about what you say you can accomplish with PCA. Good stuff.

  • John Mashey // March 9, 2008 at 6:54 pm

    Patrick Hadley
    “I did not intend…”

    Really? I’ve seen your posts elsewhere…

    Why don’t you attempt to *answer* the questions I actually asked, rather than write what looks like a standard set of arguments for wanting to argue about MBH *forever*? I’ve seen all these before I asked those questions.

    Did you read the Hansen/Nazarenko paper yet?

    Do you understand the difference between physics-based analysis [which work] and modeling stock market prices [which don't]?

    From this comment in RC, http://www.realclimate.org/index.php/archives/2008/02/antarctica-is-cold/langswitch_lang/index.php?p=529#comment-81391, I rather doubt it.

  • TCO // March 9, 2008 at 6:57 pm

    11. Are you really sure that the acentric centering to promote recent temp rise series “was the point”? Mann has never answered the question as to whether this transform was deliberate. And his write-up in science did not explain the transform or tell the story of how he had done it on purpose?

  • TCO // March 9, 2008 at 7:10 pm

    12. Your comment about what impact will decentering really have (after feeding through the rest of the algorithm) is interesting. McI has been somewhat disingenuous in confounding impact on PC1 versus impact on “the hockey stick”. However, if the impact is minimal, because of the rest of the algorithm, why use this relatively uncommone (and not properly explained in the methods) offset? Also, I believe there is one sentence in MBH where they tout the prominent “signal” in PC1. (There are graphs also.) Given that one has to use the acentric transform to get that, this should be explained. While it may not be important to the end state, it is important to the intermediate.

    [Response: As I said in a response to fred, there are enough points of confusion that I'll do yet *another* post in this series, about non-centered PCA in general. I'll keep your issues in mind. I may even feature them; give me time to consider them in the detail they deserve before deciding. I would like to avoid, if possible, off-the-cuff responses, especially when irritated by juveniles (I'm not referring to you). Perhaps one of the sources of confusion is hurried, knee-jerk comments and responses.

    As I also said to fred, I'm working on a post about sea level (yet another reader request), and it'll have to wait until after that. And I'm just one man, with a job and a family and a *life*. Patience...]

  • nanny_govt_sucks // March 9, 2008 at 7:11 pm

    John Cross, your incessant needling won’t make me get to a response any quicker. Asking for a clarification from Tamino is hardly a “new can”, unless you are implying that Tamino’s response will open a new can of worms, as the saying goes. But why would that be, John? Shouln’t it be easy to point to the MBH procedure for selection of PCs?

  • MrPete // March 9, 2008 at 7:22 pm

    Hank, I get my material from lots of places. I happen to give a lot of links to CA in these responses right now because IMHO they do a pretty reasonable job of assembling material. And, I don’t have a lot of time right now for digging out material, even from my own laptop. (And no, my hands aren’t dirty at the moment — too early for gardening season. Shoes are dirty though — just shot a bunch of tree-trunk photos :-) )

    george, I actually did miss your earlier comment. At the moment, I’m stopping by here on occasion as a break from some Real World stuff… I don’t have much available time.

    First, your recent comments have stretched what I said beyond the breaking point. I would never suggest nor imply that there are no good studies, data or methods in all of climate science. That would be beyond silly.

    I said there’s not much to be found, in the contested areas. I like good work, and want to see better work; hopefully you do too. I’m frustrated and saddened that politics and policy have so greatly intruded on scientific process. It’s apparently inevitable, because good science costs real money. Still, regrettable.

    As for qualifications, there are many ways to evaluate qualifications, as well as contributions. From your question, it’s likely you’ll not be satisfied by any answer I can give. I have exactly zero peer reviewed papers to my name, so you will almost certainly consider my multidisciplinary expertise of little value.

    Not to start an academy vs non-academy war, but more as a lighthearted illustration, I could likewise give you a challenge. Since my expertise and contributions in support of scientific progress are not to be taken seriously, it would be inappropriate for me to unduly influence the work of climate scientists, correct?

    So, my suggestion is that you encourage them to avoid making use of the methods, products and tools that I have significantly influenced over the years. They’re all questionable when it comes to climate science. So, please encourage the climate scientists to stop using computers, databases, geographic information and analysis systems for their work…unless they only use mainframes and punched cards :-) [I'd toss in digital telephones and laser printers as well, but that would be mean :-) ]

    In case you continue to read this posting despite my horrific lack of qualifications, I’ll answer your question a bit more directly: I am a generalist who chose a non-academic route through life. However, I’m hardly disconnected from the scientific arena.

    Meanwhile, back to you, George. What are your peer-reviewed credentials in statistics, seminumerical algorithms, field biology (several specialties), history and management of science, and data management practice? It would seem these are relevant topics to assessing work done in this subarea of climate science. If a reader lacks peer-reviewed expertise, how can they expect to evaluate the expertise of anyone else?

    Personally, I consider “real world” intelligence, education and experience to be just as valuable as rigorous academic analysis. My suggestion: let’s respect one another as intelligent people.

    I’ve said it before; I’ll say it again: what has surprised me no end is that some (climate scientists) have seen value in any of my “citizen science” work. Appreciated, not expected. Certainly doesn’t make me an expert.

  • fred // March 9, 2008 at 7:54 pm

    meltwater, it will help to get talk about different things differently.

    The hockeystick, MBH98 and subsequent papers, could be wrong. Decentered PCA could also be wrong. And it still could be warming due to CO2.

    You don’t have to accept MBH to be convinced by the arguments for CO2 driven warming.

    Then, the amount of this you say is 4 degrees. Well, it may be, but there are two distinct steps, one of which is more certain than the other. One is the 1.2 due to CO2 directly. The other is feedback loops.

    There is different evidence, and differently compelling evidence, for each.

  • TCO // March 9, 2008 at 8:30 pm

    13. How about the autocorrelation? How does that fit into an examination of training? Espeically training versus a broad uptrend period? Versus wiggle mathcing?

  • TCO // March 9, 2008 at 8:48 pm

    (response to (12)): Tammy take your time. I don’t think you should try to have all the answers before having a discussion. If you need time to go over things that’s fine. Also perhaps (impertinant of me, I know) discussion with the internet community will make you think of new things or re-examine some ideas. That’s all fine. No one expects an oracle. The tendancy to “on high professor teaching to adoring acolytes” attitude of the RCers is really annoying. I prefer the brutality of a Univ Chicago grad student of econ seminar.

  • TCO // March 9, 2008 at 8:53 pm

    14. WRT the actual experiment you ran, I didn’t quite get it all intuitively. Partially my fault, I’m sure. I guess, I agree that using the acentered method (for this set of data) allows a smaller choice of PCs. However, I really wonder if one should just use the data itself without all this separating into PCs stuff. I might have some more thoughts (more critical), but am having a hard time getting my head wrapped around what type of data input is reasonable as a test of methods, etc. What is circular. And what the impact of different scenarios of data (with/withouth MWP) would be if one were to have them. I guess at a certain point, I wonder if we are just pickiung the few series that show recent warming (cherry picking) and then by average, they oscillare around zero in the shaft portion of the stick. But I wander…

  • mmghosh // March 9, 2008 at 9:10 pm

    Mr Pete

    I think you have raised some important points.

    While many of us have some mathematical or statistical training at grad school level, I don’t think we can follow linear algebra or even PCA although I think HB has done a great job in explaining as simply as possible.

    All of us have a stake in this debate, though, as global warming and measures to combat them could affect our lives and pockets directly. So I think it is certainly the business of the experts to explain things to concerned laymen – more so than, say, the space program.
    I don’t think having specialised climatology experience is necessary to follow the debate, most of us can read the information in NASA, NCDC, HadCRU and follow the graphs.

    However, the whole apparatus of science, publication in peer-reviewed journals, international conferences, reviews and consensus documents are part of the standard academic processes is designed to provide the public the maximum guarantee that these subjects are debated among experts. In the same way that, say, it is difficult for us to follow medical debates. What I find interesting is the fact that otherwise scientifically minded laymen are willing to listen to the experts in other fields, but somehow reserve a level of vituperation and hostility to the science and scientists in this field, unmatched in other fields. And coupled to this is a strange eagerness to take the work of amateurs as being superior to the professionals.

  • Lazar // March 9, 2008 at 9:10 pm

    Spence_UK,

    Excellent points.
    I’ve read through the GRL paper, response to Huybers (again), and some blog posts.
    There are a few niggles…
    Steve McIntyre has not, to my mind, produced a head-on, convincing response to questions over the issue of persistence (e.g. here and here.)
    Secondly, and if MBH98 really did apply the Preisendorfer selection rule, then that must form part of the test… and I can’t see how one would reliably get ’simulated’ PCs past when comparing two eigenvalue spectrums from essentially the ’same’ source, red noise with the same properties, passed through the same procedure (decentred PCA), unless I’m misunderstanding Preisendorfer here.
    PC1 eigenvalues… I know Steve says they’re not important ‘coz variance is scaled, but there must, surely be some form of selection before that step even if it isn’t Preisendorfer.
    If those are non-issues, then I would agree that decentered PCA does indeed pass red noise through and fool the rest of the algorithm.
    I’ll assume Steve is right.
    Because it’s fun.
    Those are strong theoretical criticisms of MBH98.
    In practise do they matter?
    Use centered PCA and include PCs1-4, and the problem goes away, including issues regarding the benchmarking of RE.
    Or use no PCA.
    Dependency on bristlecones… use no PCA and no bristlecones, the ‘hockey stick’ shape remains over 1400-49AD but the RE scores are lower over the same period though still significant. Use centered PCA, exclude bristlecones, and use PCs1-2, and you get high temperatures in 1400-49, the ‘double-bladed hockey stick’, but scores fail over that period. However, over 1450-49 produces a flat hockey stick and good scores.

  • TCO // March 9, 2008 at 9:13 pm

    sharing of the data and method of your example. While I’m sire that some of the examination of what you did would be annoying or ven flawed, there is a possiblity of other insteresting things coming from it. Looking at type 1 versus type 2 errors.seeing how mehtod and data interact. Etc. etc. And at the end of the day, if it’s good neough to refer to as a toy example, it’s good enough to let people examine and see places where the toy example might be off. You should share the info. And you don’t need to worry about me. I’
    m lazy. I’ll just bug you with little master’s degree thesis questions every post or so…MWAHAHAHAH!

  • TCO // March 9, 2008 at 9:50 pm

    10.f (sorry, was drinking bloodies, forgot a Jolliffe point): Based on the references in the Jolliffe ppt, that presentation was made AFTER the MBH paper came out! Heck, it may have been influenced by the MBH paper and controversy surrounding it. In any case, it was not something that was public for Mann to look at, for reviewers, for readers, etc. WHEN MANN published! So it doesn’t help with the issue of unusualness of the Mannian acentric correction and with Mannian failure to cite and describe his method.

    You’re a physicist (I think). So a knuckle-cruncher example. When I do a conductivity measurement on a sample and write a paper about it, I cite relevant publications in the footnotes within the methods. I will say something like: “Conductivity was measured with the 4-point method using linear spring-loaded tips and a Keithley nano-voltmeter.” Correctional factors for pellet geometry were made from tables of Smits(fn), which uses the method of images to compensate for longer current paths through a finite thickness sample. Pellets were measured by hand using a micrometer and were assumed to be perfect cyclinders despite small warpage, estimated by eye to be less than 10%.” Actually…that’s a little loquacious. But I will cite the Smits paper. You need to see it to know that I did the work properly, that I did not assume a thin film configuration. Also, almost as important, you need to have that footnote, so that you can go out and do good similar science of your OWN. Not just so you can “second check me for mistakes”, but for promotion of proper procedures. Thirdly (related to the first reason) if someone finds some flaw with Smits (1958 Bell Technical Journal) in the future, it makes it easier to still get value out of my work. One may be able to put corrections for instance on top of my work. Important especially if I have not reported every raw form of data (I should, but let’s be real, some of it is in Excel and in lab notebooks).

  • Patrick Hadley // March 9, 2008 at 10:44 pm

    John Mashey, your questions are interesting but are Off Topic for this thread, which is about PCAs and the MBH hockey stick. You argued that the hockey-stick has “no importance for policy”, and in response I mentioned some areas where I thought the question of whether or not the hockey stick was a good representation of global temperature history still had the potential to influence policy makers. I tried to make my post pretty neutral, since I did not want to raise red herrings since this is not the thread on which to discuss those issues, but obviously I failed.

  • George // March 10, 2008 at 12:59 am

    Mr pete says:

    “So, my suggestion is that you encourage them to avoid making use of the methods, products and tools that I have significantly influenced over the years.”

    Thanks for the reply and being honest.

    It’s great that you have worked on computers, databases, geographic information and analysis systems and other tools used by scientists and I commend you for your contribution, but that is largely (if not completely) irrelevant to what I referred to above.

    I wanted to know what your expertise was in the climate science area that would allow you to assess the current state of climate science — in the particular “contested’ arena that you referred to or any other.

    With all due respect, having worked on computers (as have I and a significant fraction of engineers in this country) has no bearing whatsoever on assessing the state of climate science (in the “contested area or any other) — something that I was not the one doing, by the way (so my credentials are irrelevant)

  • TCO // March 10, 2008 at 2:04 am

    I suspect 95% of the commenters here do not hae a basis for making braod statements about the quality of climate science literature. I think you need to have read a lot of it before you can really do so. McI has done that and so has some basis to make such a statement. I do worry a but that he has a bit of a screw looose and that he has not done much peer reviewed work himself. I don’t think mpublication in climate is needed to judge the litareature. Part of getting a Ph.d. is acquirinng a skill that allows you to move to new fields to see where they aer weak or sttrong etc. And people all the time do this as they change workpalces and the like.

  • MrPete // March 10, 2008 at 2:54 am

    George, you misunderstood. I did not mean “worked on” computers. I meant, developed underlying mathematical tools for computation, analysis, data manipulation, analysis and management, that climate (and other) scientists typically assume are working well as they do their work. Not many engineers are familiar with the strengths and weaknesses of computation systems for retaining (or destroying) data integrity as it is mathematically processed.

    Read up on seminumerical algorithms sometime. As Bill Gates once said (only partly in jest), if you can make it all the way through Knuth’s books, give Bill a call — he’d like to hire you.

    I have a bit of experience in professional handling and analysis of quantitative data. Once you’ve addressed issues in a few dozen areas of science, business, medicine, etc… one more is not a big deal. I may not know the mathematical details (at least up front), but I do know what good work looks like, whether at the input, analysis or reporting stage. There’s more, but I’ll stop there.

    As for why I asked you — well, you expressed the sense that you could evaluate whether I’m able to assess the quality of scientific work. I’m suggesting it’s a bit hard to do that from a distance, no matter what measure is used.

    By your measure (which is not unusual, honestly I’m not picking on you :-) ), I’m actually unqualified even in my own arena: I have zero certifications and have only taken two computer classes in my life.

    To consider my background and experience irrelevant and insignificant for the issues at hand is no surprise. As I said earlier, the very nature of your query assured me of your perspective. You’re likely impressed by those who also measure significance by metrics that can be seen from a distance, rather than qualities that can be experienced in person. From that perspective, a lifetime of excellence and influence over science, business, etc has no meaning if not accompanied by the appropriate publications and accolades.

    In my experience, such measures are immaterial. I have had the privilege of great productivity, great influence, etc, all with an unlisted phone number, no advertising, and no acclaim. Try it sometime, it’s incredibly freeing :-)

    Enough of this; it’s really OT. This will be my last on qualifications and such. Any more can be taken to private email or some other venue.

  • MrPete // March 10, 2008 at 3:06 am

    mmghosh, you surfaced an important perspective: “I find interesting is the fact that otherwise scientifically minded laymen are willing to listen to the experts in other fields, but somehow reserve a level of vituperation and hostility to the science and scientists in this field, unmatched in other fields.”

    Let’s leave out the extremists (who hiss with vituperation and hostility.)

    Is this area of science in need of more remediation than most others? In my experience, when the published shenanigans in this subarea are shared with other scientists, leaving all the details out so as to avoid the “heat”… they tend to react with shock, disbelief, or sad laughter. The most mature response tends to be along the lines of “hey, it’s obviously a young science; they’ll eventually clean up their act.”

    If you have yet to see issues worthy of scientific shock and disbelief, I would suggest you have not yet learned to step away from the fray to look at it dispassionately.

  • Hansen's Bulldog // March 10, 2008 at 3:14 am

    I’m posting this to multiple threads. Things have gotten a bit out of hand. The level of hostility is … well … over the top.

    So I urge everyone to try an experiment. For you next comment, make the exact point you wanted to make, but leave the hostility at the door. I really don’t want to censor viewpoints. But as has been pointed out, too much insult not only turns people off, it interferes with communication. Meanwhile, it’s time for me to reconsider my moderation policy … because “moderation” doesn’t seem to apply.

    And please don’t send a comment saying it’s really my fault. I already know.

  • dhogaza // March 10, 2008 at 3:45 am

    Is this area of science in need of more remediation than most others? In my experience, when the published shenanigans in this subarea are shared with other scientists, leaving all the details out so as to avoid the “heat”… they tend to react with shock, disbelief, or sad laughter. The most mature response tends to be along the lines of “hey, it’s obviously a young science; they’ll eventually clean up their act.”

    I want proof. I want cites. I want specific quotes.

    You’re just another internet poster who, AFAIK, is just a random anti-science denialist hack.

    And you continue to post based on personal authority, without back up …

    So, who are those other scientists?

    What exactly are the “shenanigans”?

    If you have yet to see issues worthy of scientific shock and disbelief, I would suggest you have not yet learned to step away from the fray to look at it dispassionately.

    An argument from personal authority.

    Tch, tch.

    Meanwhile, it reached 16C today in PDX, and while the month has overall been warm, we’ve had some cooler days.

  • anonymous // March 10, 2008 at 3:55 am

    If the analysis of MBH were correct, I suspect long ago some professional academic statisticians would have backed them up. There is no downside for them to do so. They would not be picking a fight on a highly politicized topic with their colleagues in atmos or geo.On the contrary, they would be helping them. Yet I have not seen this to be the case. Where I am located Wegman is taken seriously by statisticians. Has any statistician publicly refuted his critique? It is
    very disconcerting to see climatologists continue to defend MBH without at least either finding serious statisticians to back them up or stating explicitly that the analysis they are doing is a new method, and justified by the following papers. I do not need to wade through the discussion here and at CA; I am unqualified to judge it professionally. However, many people are qualified and they are not backing up MBH, but rather remaining silent. It is a rather bizarre situation, really.

  • mmghosh // March 10, 2008 at 4:31 am

    I agree with HB. Thank you for your moderation.

    Part of the attraction of this site is the education that one receives.

    Mr Pete,

    I have plenty of experience with scientific disagreement, and I have been involved in several of them myself. The debate is between the scientists themselves. The point is that this particular debate is being carried out between scientists and laymen about issues which are not issues in science in general.

    I’ll just pick out some examples from the recent past on these threads – Q – is every single datum archived? A – No scientist in any field in my experience archives every single dataset. Q – Why is averaged data used? A – Scientists do it all the time. Q – Criticism based on not understanding the difference in baselines, or understanding what an anomaly is – commented on by HB in the past few posts. A – read up the subject before making basic errors.

    My point is that question of this sort are not made in other disciplines.

    Once again I have gone off topic. I apologise to HB and will comment only on the open thread.

  • EliRabett // March 10, 2008 at 4:48 am

    I think what HB has brought out very well is that people who think about physical systems view mathematics/statistics as a tool. This drives mathematicians/statisticians mad which is OK. Think of Feynman/Dyson, etc.

    I think that the point here is that given the data (and remember that Bradley and Hughes knew the available data, indeed they created a great deal of it), the non-centered PCA was the best choice.

    One more comment. Somewhere above someone claimed that one should choose the depth of the PCs that one will consider before doing the analysis. This is, well, auditory. What one should choose is the criteria for including a PC.

  • John Mashey // March 10, 2008 at 5:32 am

    Edward Wegman got it right:

    ‘As we said in our report, “In a real sense the paleoclimate results of MBH98/99 are essentially irrelevant to the consensus on climate change. The instrumented temperature record clearly indicates an increase in temperature.” We certainly agree that modern global warming is real. We have never disputed this point. We think it is time to put the “hockey stick” controversy behind us and move on.’

    http://energycommerce.house.gov/reparchives/108/Hearings/07272006hearing2001/Wegman.pdf

    They said it in the report that I studied, and then Wegman said again at the later hearing. This was July 2006. That’s about as clear as it gets.

  • Hank Roberts // March 10, 2008 at 5:53 am

    Wegman said it was less than an ideal approach, and that it was time to move on.

    Wegman also said he believed CO2 forms a layer in the atmosphere according to its weight.

    He was right about the conclusion, but he was wrong about his understanding of atmospheric physics.

    But he got the right answer anyhow.

    Time to move on. CO2 is well mixed.

  • fred // March 10, 2008 at 6:35 am

    There is really only one way of getting to the bottom of this thing for yourself. R is freely available to download for all operating systems, and (not speaking from experience yet) cannot be any more difficult than any other high level language or specialist application. So harder than Excel, but probably no harder than Photoshop. Haven’t fired it up yet, though.

    The best regarded introductory book seems to be Dalgaard “Introductory Statistics with R”.

    Another well reviewed introduction is Crawley “Statistics, an Introduction using R”.

    Verzani’s book is apparently the most basic intro. “Using R for Introductory Statistics”.

    There is a Springer text on time series analysis which Tamino probably knows and could comment on.

    And there’s a Cambridge title by Maindonald and Brain “Data Analysis and Graphics Using R: An Example-based Approach”, which has been well recommended to me to learn R with. Amazon reviews differ, some being very positive on it, others less so. Its not cheap.

    The great advantage if we were to do this, is we can very simply for ourselves actually do some PCA without having to struggle with setting the thing up in Excel or some other spreadsheet. Spreadsheets are fine in their place, but for this stuff I can’t see it, its too easy to make mistakes because you’re effectively programming your own statistics package. We can also see whether or not the MBH approach does produce hockey sticks from red noise, and verify all the other contentions.

    And we can get the debate onto a level where content and not insult will dominate.

  • fred // March 10, 2008 at 8:13 am

    meltwater, just to continue the issue of climate sensitivity. People usually say, when the issue of feedbacks is raised, that one is moving to something called denialism if one doubts the existence of positive feedbacks of the level needed to raise the temperature by 3 or 4 degrees.

    There are a number of papers (naturally from authors who were instantly then labelled denialists, if they had not already been!) which argue that the total sensitivity of global temperatures to a doubling of CO2 is around 1 degree. Maybe the most thought provoking line of argument on this comes from the observation that the preindustrial level of 280ppm has risen to the recent level of 380ppm. Because the effects of CO2 are not linear, a great deal of the warming from a rise from 280 to 560 must already have occurred. The percentage of the total rise will be greater than the percentage of the rise in CO2 that has already happened.

    So the counter argument is, lets look at how much it has warmed already for that rise, and use that to estimate sensitivity, plotting it on the log curve to allow for decreasing returns as indicated above. Several people looking at this have come to estimates of around 1 degree – which is actually sometimes less than the 1.2 degrees everyone agrees is injected simply from the CO2 effect.

    I don’t know if this argument is right or wrong, or exactly what it shows if its right, but it is not obviously silly or in bad faith. It is quite possible to look at this, accept the physics of CO2 heat absorption, but not be persuaded that it works out in the real world so as to produce sensitivity levels of 4 degrees.

    You don’t have to reject 200 years of science. Its a bit like saying, I know this car has x amount of energy in its tank. If it uses it to some level of efficiency, it should do 100 miles with it. But in fact, owing to the way it is constituted or driven, it may in fact do 50 or 150. This isn’t physics, it is empirical observation of all the various factors and how they work together.

    Motl has an article which summarizes the argument, though it then gets into solar, which is another and perhaps further fetched story. Motl of course is a denialist, which doesn’t shed much light on the argument one way or the other.

    http://lubos.motl.googlepages.com/cep-eng.doc

    Pielke has also argued that land use is a very important driver. Not saying any of these are right, just that reasonable people trying to get to grips with it can accept CO2 heat absorption, while not being convinced of the usual estimates of total climate sensitivity. It is a complicated subject and its quite wrong to think people who are half way through forming opinions on it are in some way bad, or in bad faith.

  • ChrisC // March 10, 2008 at 8:19 am

    Anonymous@10th March 3:55am

    Firstly, the 2006 NRC synethesis report into temperature reconstruction, which according to Pielke Jr. was a “near-complete vindication for the work of Mann et al.” included numerous statistians as authors. These include Prof. Peter Bloomberg, a professor of stats at North Carolina State University, Dr. Douglas Nychka, from NCAR. The relevant IPCC chapters also include various statistians.

    Secondly, Tamino/Hansen’s Bulldog _is_ a professional statistician. I would say this post is is backing up the MBH1998 analysis.

    People ARE backing up MBH, and have been for years. It helps to do some research before making claims such as “However, many people are qualified and they are not backing up MBH, but rather remaining silent.”

    Just as an aside, PCA and SVD are not used solely by statistians. I’ve used both in Signal Processing and physical modelling applications. The input of statistians into trivial applications of the techniques is often not required.

  • fred // March 10, 2008 at 9:18 am

    Couple more useful R sites:

    http://rss.acs.unt.edu/Rdoc/library/pcaMethods/doc/index.html

    the docs on the pca methods package.

    http://cran.r-project.org/other-docs.html

    quite a bit of free docs in pdf form.

  • Don Fontaine // March 10, 2008 at 12:55 pm

    I am one of the guys who sometimes uses the methods people like you invent. Would doing the analysis using nonmetric multidimensional scaling, rather than PCA, have eliminated some of the questions you have had to spend time refuting here? Since NMDS uses ranked data it doesn’t depend on the units of measurement. I suppose you could still subtract out different subsets of the data prior to the analysis creating questions about which choice is best. I learned of NMDS from PRIMER software from Plymouth Marine Laboratory (originally) http://www.primer-e.com/Primary_papers.htm.

  • George // March 10, 2008 at 1:19 pm

    I messed up the closing blockquote for the quote above

    TCO:

    “I suspect 95% of the commenters here do not hae a basis for making braod statements about the quality of climate science literature. I think you need to have read a lot of it before you can really do so.

    Agreed.

    He continues:

    McI has done that and so has some basis to make such a statement.”

    It’s not simply reading that is important.

    It is understanding.

    Does some guy with no background in dendrochronology who starts reading the literature in that area somehow immediately understand the issues better than those who have been working in the area for 30 years?

    Tell me another funny one.

  • Barton Paul Levenson // March 10, 2008 at 1:23 pm

    Mr. Pete posts:

    [[Funny how so many people want to “pretend” (your word, not mine) they’re making progress on climate change research but don’t talk about current data either. ]]

    And why do you suppose that is, especially in this blog? Could it be because you and your pals continually bring up MBH over and over and over and over and over again so that we have to spend most of our time countering your false statements and mistakes?

    You want to talk about the contemporary work? Go read through the last year’s worth of JGR-Atmospheres, J. Atmos. Sci., Monthly Weather Review and Icarus. Bring up a paper there you don’t like, and we’ll talk.

  • MrPete // March 10, 2008 at 1:30 pm

    dhogaza, sorry, I’m not going to do your work for you. I can turn on the lights, but you’ve got to follow the path for yourself. It would be a disservice to you to sit here and get into a big argument while spoon feeding you what is easily available. You need to value the issue enough to go find it. Step back, look at the best material available, assess what is said while ignoring the heat.

    Tamino’s concern about flame wars is well placed. To get past that, we need to develop a higher level of respect for one another. Part of that comes from trusting enough to then invest personally in digging out honest answers.

    [And no, I'll not draw my personal relationships into the public eye. I'm public (more than I'd like but it's wayyy too late), but privacy is valued by many of my friends. Yes, that means I'm limited to sharing "stories" in some cases, which obviously carry no scientific rigor. A limit I place on myself.]

  • Barton Paul Levenson // March 10, 2008 at 1:38 pm

    Patrick Hadley writes:

    [[If the MWP did exist then it would help (but obviously not prove) two arguments used by sceptics: that the world benefits from warmer temperatures; and that the recent rise in global temperatures is not unprecedented and therefore one cannot argue that just because warming has followed a rise in CO2 it must therefore be caused by this rise. ]]

    The latter point is a straw man which reflects a mistaken belief of AGW deniers, apparently including you — that AGW theory is based on some kind of statistical analysis of climate data. Apparently the idea is that scientists said, “Hey, CO2 is rising — and temperature is also rising! The CO2 increase must have caused the temperature increase!”

    No. Wrong. The prediction of global warming from increase CO2 is based on radiation physics. CO2 is a greenhouse gas; John Tyndall proved that back in 1859. That means that the more of it you have in an atmosphere, the warmer the ground is going to get, all else being equal. To avoid it there would need to be some kind of countervailing process, and nobody has been able to find one sufficient to prevent a rise of several degrees this century. Lindzen tried with his “tropical infrared iris” idea, but satellite observations shot that down. Note, by the way, that Lindzen’s denialist argument here was a real scientific hypothesis, that it was accepted by a journal, and that it was rejected simply because empirical tests falsified it.

    If it were somehow proved that the Medieval Warm Period was global, rather than largely confined to Europe, it would not disprove AGW theory at all. It would simply indicate that some other factor back then must have been large enough to cause a huge worldwide temperature anomaly. Maybe the Earth’s albedo was lower then, due to all the sprays of blood from sword-fighting.

  • luminous beauty // March 10, 2008 at 2:12 pm

    anonymouse,

    Wegman was correct as far as he went. Having no knowledge of the nature of the data, one cannot tell a mechanical correlation from a spurious one. If he had bothered to inform himself on the hundred years of research done validating the data, he might have made a different conclusion.

    It was not what Joe Barton asked him to do.

    It is not really a question of statistics. It is a question of whether tree-ring growth has a causal, non-probabilistic relation to climate variation.

    It does. The various ‘divergence’ problems don’t contradict this long established science, but rather suggest, since the mid 20th century, the planet is going through a climatic shift.

    Most professional statisticians don’t work in the physical sciences, but fields like economics and sociology where the data are often, if not typically, probabilistic, and in which causality is infinitely more difficult to determine than in the physical sciences.

  • fred // March 10, 2008 at 2:22 pm

    “CO2 is a greenhouse gas…. That means that the more of it you have in an atmosphere, the warmer the ground is going to get, all else being equal.”

    This is not true, as a matter of science, for two reasons. First that as CO2 rises, incremental rises have diminishing warming effects. The curve is logarithmic. It is not true that the more you have of it the warmer it gets in any unqualified sense.

    Second, because CO2 warming in itself is about 1.2 degrees. What produces the rest of the warming is the alleged amplification of the initial warming.

    The statement makes the whole thing seem much simpler and more certain and one dimensional, and susceptible of simple evidential proof, than it really is.

  • Hank Roberts // March 10, 2008 at 2:45 pm

    > alleged

    See, this is why it’s hard to discuss current science. The old lack of understanding keeps being reiterated.

    Lawyer language, yet.

    Waster of time.

  • Martin Vermeer // March 10, 2008 at 2:46 pm

    This is not true, as a matter of science, for two reasons. First that as CO2 rises, incremental rises have
    diminishing warming effects. The curve is logarithmic

    Well, we got this far. Do you know what a logarithmic curve looks like? Don’t you agree that it goes monotonously up?

    …and by the way, what in “all else being equal” exceeded your power of comprehension?

  • dhogaza // March 10, 2008 at 2:53 pm

    I’m not going to do your work for you. I can turn on the lights, but you’ve got to follow the path for yourself. It would be a disservice to you to sit here and get into a big argument while spoon feeding you what is easily available. You need to value the issue enough to go find it. Step back, look at the best material available, assess what is said while ignoring the heat.

    More handwaving from Mr. Pete.

    Without specific information, I’m afraid I’m going to have to rely on the IPCC reports. Nothing I see there supports your statement that climate science is in a laughable state, and that other scientists who look at the field believe it to be laughable.

    You really think I’ll consider someone who considers CA to be a reliable source to be a reasonable judge as to the state of climate science?

    You do write with a clear, arrogant, condescending style that I imagine you believe masks the fact that you rarely say anything of substance. Doesn’t work with me, sorry.

  • Ian // March 10, 2008 at 3:09 pm

    fred,

    You mentioned papers arguing that “total sensitivity of global temperatures to a doubling of CO2 is around 1 degree” – were you thinking of Stephen Schwartz? If so, have a look at James Annan’s take on it (I found his criticisms apt and convincing): http://julesandjames.blogspot.com/2007/08/schwartz-sensitivity-estimate.html .

    In short, Schwartz has a couple of calculation errors and a strange assumption of a 5-yr fixed climate response.

    If this is not what you meant, then when looking at the amount of atmospheric warming so far, are you factoring in that the oceans have kept atmospheric temp from rising as much as it would have in their absence?

  • P. Lewis // March 10, 2008 at 3:26 pm

    fred

    To a first approximation for CO2 forcing, Delta(F) = a ln(c/c_{0}), where Delta(F) is the change in forcing, a is a constant (value unimportant here), c is CO2 final concentration and c0 is CO2 initial concentration. Plug some figures in for, say a doubling of CO2 from 100 to 200 ppm, 150 to 300 ppm, 200 to 400 ppm, 250 to 500 ppm.

    Actually, you should be able to do this in your head, because it doesn’t really matter what the actual result is for this illustration.

  • George // March 10, 2008 at 3:33 pm

    Fred says

    This is not true [that "the more of it you have in an atmosphere, the warmer the ground is going to get, all else being equal.”] as a matter of science, for two reasons. First that as CO2 rises, incremental rises have diminishing warming effects. The curve is logarithmic. It is not true that the more you have of it the warmer it gets in any unqualified sense.”

    Actually, Fred, is is strictly true that “the more of it [GHG] you have in an atmosphere, the warmer the ground is going to get, all else being equal.”

    To say that the ground gets “warmer” just means its temperature goes up. It says nothing at all about how it goes up with increasing concentration of GHG’s — exponentially, linearly, logarithmically etc.

    “Logarithmic warming” with GHG concentration increase is still “warming”.

    Though the amount of warming (temperature increase) for a given incremental increase in GHG concentration does diminish as the concentration increases (as you correctly point out), as long as the concentration continues to increase, so does the warming (all else being equal).

  • Gavin's Pussycat // March 10, 2008 at 3:33 pm

    fred:

    Because the effects of CO2 are not
    linear, a great deal of the warming from a rise from 280 to 560 must already have occurred. The percentage of
    the total rise will be greater than the percentage of the rise in CO2 that has already happened.
    So the counter argument is, lets look at how much it has warmed already for that rise, and use that to estimate
    sensitivity, plotting it on the log curve to allow for decreasing returns as indicated above. Several people
    looking at this have come to estimates of around 1 degree – which is actually sometimes less than the 1.2
    degrees everyone agrees is injected simply from the CO2 effect.

    Yes, this has been studied. If you do it properly, you have to take into account that the ocean takes up a large
    amount of the greenhouse energy imbalance, slowing down the observed warming. For sensitivity you have to look at
    equilibrium warming, i.e., the (hypothetical) temperature reached after the ocean has had time to warm up as
    well.

    Let’s do that: we observe some 0.7 deg C for 385 ppm, i.e., 1.375x pre-industrial, the 2log of which is 0.46.
    The sensitivity to doubling is then 0.7/0.46 = 1.5 deg C.

    Now take the ocean heat storage into account. According to most studies, some 0.6 deg C is “missing” from the
    observed 0.7 deg C, most of it on the Southern hemisphere where the oceans are. Equilibrium heating would be
    0.7 + 0.6 = 1.3 deg C (BTW don’t mix this up with your 1.2 deg doubling sensitivity. Apples and oranges. Just in
    case.) Now we get a doubling sensitivity of 1.3/0.46 = 2.8 deg C.

    (Did you read the article on RC on Antarctica recently? Discusses also this issue:
    http://www.realclimate.org/index.php/archives/2008/02/antarctica-is-cold
    )

  • Luke Silburn // March 10, 2008 at 3:35 pm

    Fred quoted Barton:
    “CO2 is a greenhouse gas…. That means that the more of it you have in an atmosphere, the warmer the ground is going to get, all else being equal.”

    and then said in reply:
    “This is not true, as a matter of science, for two reasons. First that as CO2 rises, incremental rises have diminishing warming effects. The curve is logarithmic. It is not true that the more you have of it the warmer it gets in any unqualified sense.”

    Sorry, what? How does a logarithmically-declining warming effect mean that ‘the more you have of it the warmer it gets’ becomes an untrue statement?

    Regards
    Luke

  • luminous beauty // March 10, 2008 at 3:54 pm

    fred,

    Once again. The uncertainty in feedback response goes both ways. The likelihood that it is less than the agreed mean value from several methods of calculation is the same that it is greater.

    One cannot honestly argue for one direction of uncertainty without simultaneously arguing for the other, given you have no physical reason for doing so.

  • elspi // March 10, 2008 at 3:54 pm

    Fred brings the stupid:
    “The curve is logarithmic. It is not true that the more you have of it the warmer it gets in any unqualified sense.”

    Fred, the log is a strictly increasing function.
    Therefore, IT IS TRUE THAT THE MORE YOU HAVE OF IT THE WARMER IT GETS.

    Furthermore fred, the part of the function that we have witnessed in the modern era is very close to being linear.

    Venus, on the other hand, has the property that any additional CO_2 will not change things much. (The point being that the concentration is so high that the curve is almost horizontal)

    If it ever gets to that point on Earth, we will all be WELL DONE anyway.

    Also there is a feed back loop. It see how it works, try opening a can of soda at 120F.

    How much simpler could it be?

  • Nick Barnes // March 10, 2008 at 4:37 pm

    fred:

    This is not true, as a matter of science, for two reasons. First that as CO2 rises, incremental rises have diminishing warming effects. The curve is logarithmic. It is not true that the more you have of it the warmer it gets in any unqualified sense.

    Um, yes it is. A logarithmic relationship is still monotonic. “The more [...] the warmer [...]” describes any monotonic relationship. The more you press on the accelerator of a car, the faster it goes. Not linear, but monotonic. The higher a mountain you climb, the thinner the air is. Not linear, but monotonic. The louder the denialists shout, the flimsier their arguments. Not linear, but monotonic.

  • luminous beauty // March 10, 2008 at 4:52 pm

    fred,

    The last time I looked at a logarithmic curve, the abscissa value continues to increase as the ordinate value tends to infinity. It is only the rate of increase that goes down.

  • chriscolose // March 10, 2008 at 4:57 pm

    I finally have a more detailed of the greenhouse effect here, and a second part on the runaway and anti greenhouse effects in Part 2. http://chriscolose.wordpress.com/

    CO2 is a greenhouse gas. CO2 WILL warm the surface. The logarithmic relationship in absorption is already accounted for when people say that its doubling will result in x amount of warming. The best constraints on climate sensitivity clearly show around 3 C (such as work on the LGM). It is not true that we should have gotten most warming already, and you can work out that the rest of the way to 560 is nearly the same RF as what we got so far. In addition, you still need to consider positive feedbacks, the fact climate sensitivity applies only at equilibrium and so we still have more to go, and the fact that aerosols have offset warming. Getting climate sensitivity over the last few decades is not a great idea, but the rise so far clearly indicates a sensitivity beyond the planck response. It is true you need positive feedbacks to get to 3 C, but these are not just “alleged.” There is subtantial evidence to back their existence.

    Gonig back to this topic, I am really not sure it matters much anymore. The comments that a MWP being warm shows that warmth is a good thing is nonsense on several grounds. For one thing, small amounts of warmth have shown to be disastrous to many cultures (see the drought effects on the Mayans). I suppose the fact that Viking agriculture faired a bit better in Greenland is a sign that the world will be a much more prosperous place, but that is not the best global indicator. Next, a lot of conditions are different today such as the amount of infrastructure and people on the coasts, popualtions, etc. Finally, the temperature projections don’t stop at 2008, they keep rising. So, what we are gonig to get could potentially make current warmth (or medieval warmth) quite small. It should not be too hard to think that getting 3 C in 100 years can have large consequences.

  • RomanM // March 10, 2008 at 5:07 pm

    So far it appears that the discussion on the use of uncentered “PCs” as a statistical procedure you and your supporters have been advocating their use through arm waving that it was a perfectly good and valid technique which does exactly what one wants it to do. But no one has offered any evidence to support that claim other than vague references to their statisticians knowing more than someone elses statisticians. So I decided to do some investigation for some facts. However, I think some explanation of how things work in mainstream statistics might be in order for some of your readers. If someone produces a new methodology or statistical procedure, they first specify all of the assumptions that are needed for the data or situation where the procedure will be applied. Then, under those assumptions, they derive the properties of the results showing, as the
    case may be, that the results are unbiased for what is being estimated, what impact the randomness inherent in the situation may have, determining optimality of results, etc. If the assumptions are reasonably satisfied and if the procedures are followed as set out, then you know how to interpret what you get at the end of the procedure. Should someone alter the assumptions or change the procedure in any way, it is incumbent on them to provide clear evidence that either the procedure
    still works as before or to derive new results that incorporate the changes. Any modifications not supported by such evidence would be considered invalid.

    I found a book on Principal Components on the web and did some reading on centered and uncentered “PCs”. On p. 392 of this book I ran across some interesting statements:

    “(i) the columns of X are left uncentred, that is xij is now the value for the ith observation of the jth variable, as originally measured;

    (ii) both rows and columns of X are centred, so that sums of rows, as well as sums of columns, are zero.

    In either (i) or (ii) the analysis now proceeds by looking at linear functions of x whose coefficients are the eigenvectors of X’X, with X now non-centred or doubly centred. Of course, these linear functions no longer maximize variance, and so are not PCs according to the usual definition, but it is convenient to refer to them as non-centred and doubly centred PCs, respectively.

    If the data are such that the origin is an important point of reference, then this type of analysis can be relevant. However, if the centre of the observations is a long way from the origin, then the first ‘PC’ will dominate the analysis, and will simply reflect the position of the centroid.”

    There are several things that seem to be relevant to the issues here. The first is that “uncentred PCs” are in fact NOT PCs although some people may still refer to them as such. The author actually refers to centering at the origin, but the same observation applies to centering at any other chosen point as a new origin. Because this is a drastic change to ordinary PC thory, the procedure needs genuine justification for the results to be interpretable in any meaningful fashion. It also means
    that eigenvalues no longer are measures of “explained variability” and methods of choosing the number of eigenvectors (not PCs) are also not applicable. The author of the book (oh, did I mention the title – Principal Component Analysis by Prof. I.T. Jolliffe) does not seem to give any references to this technique from the mainstream statistical literature. I presume that he would have done so had such references existed.

    The last part of the quote relates to the hockey stick. He does say that if the origin is meaningful then “this type of analysis can be relevant”. Whether centering at an arbitrary interval in time over a short range of the data is “an important point of reference” is certainly questionable. But the important issue is that if the center chosen is far from the average of the entire data, the first eigenvector will dominate the results as a function of that diference.

    What does this mean exactly? Let’s do a little math. I’m sure that you will correct me if I make an error – it’s been known to happen. What effect does choosing another centre other than the average have on the matrix which is used to determine the “PCs” for the Tamino example and the MBH data. If I understand the centering procedure correctly, a little algebra says that you can get that matrix by starting with the ordinary covariance matrix of the data and then inflating values on the diagonal by adding (basically) the square of the difference between the average of the variable in the section chosen for the centering and the average over all of the variable’s data. This means that variances” of variables which have past values simiar to the present will not be altered. However, the corresponding terms for variables whose earlier values are much lower (or much higher) than present (including hockey stick proxies) will increase by possibly large amounts. The “covariances” will
    also change but that will tend mainly to increase the size of first eigenvalue. As you well know, an eigenanalysis of a covariance type matrix puts a heavy emphasis on those variables whose “variances” are larger and a much smaller weight to the others. The net effect is to pick out the HS – not because it is necessarily the right answer – but, because the altered mathematics of the situation does so automatically, right or wrong. On the other hand, any proxies that might indicate
    that the past and present are similar will receive a much smaller role in determining the PC because of the center chosen. This is what Prof. Jolliffe referred to in the last part of the above quote.

    Does Prof. J. advocate the use of this methodology? A quote from his powerpoint on To Centre or Not to Centre:

    “Standard EOF analysis is (relatively) easy to understand – variance maximisation

    For other techniques it’s less clear what we are optimising, and how to interpret the results

    There may be reasons for using no centering or double centering, but potential users need to understand and explain what they are doing”

    Sounds lukewarm. Indeed, users DO need to understand the effects of AND EXPLAIN what they are doing and not present it as a magic black box.

    [Response: I'd say there are some minor errors here -- the first which springs to mind is that non-centering effects more than the diagonal elements of the covariance matrix -- but they're simple misunderstandings which don't alter the fact that you've put your finger squarely on some of the important issues involved. I'm not saying I agree (or disagree), but you have focused attention on some places where it actually belongs.

    I really do want to contribute to understanding of those issues, but it's pretty clear by now that they're complex enough to deserve more than just a response to a comment. That's why (as I stated earlier) I'm planning yet another entry in the series, on non-centered (no longer really PCA) PCA in general. But there's more to life than PCA and hockey sticks, so I'm not changing my plan to do a sea level post first. Also, a member of my wife's immediate family has passed away (he lived a long, good life and "died with his boots on" so it's not as tragic as it may seem) which takes more of what may be the most valuable quantity of all -- time -- so I'll have to ask readers to stretch their patience a little further.

    I'll also say that commentary at one point became rather contentious, then got downright nasty, but has now begun to focus more clearly. We might actually be making some progress.]

  • RomanM // March 10, 2008 at 5:09 pm

    Sorry about the format of the previous post. I pasted it from Notepad and it came out as poetry! Can it be fixed?

    [Response: *mostly* fixed.]

  • Hansen's Hamster // March 10, 2008 at 5:11 pm

    “Um, yes it is. A logarithmic relationship is still monotonic. “The more […] the warmer […]” describes any monotonic relationship. The more you press on the accelerator of a car, the faster it goes. Not linear, but monotonic. The higher a mountain you climb, the thinner the air is. Not linear, but monotonic. The louder the denialists shout, the flimsier their arguments. Not linear, but monotonic.” Hmmm… is this thesis peer reviewed?

  • caerbannog // March 10, 2008 at 5:23 pm

    Just a question for the hockey-stick auditors. When are you going to wrap up your auditing of Mann’s nearly decade-old reconstruction and turn your attention to the more recent reconstructions that roughly agree with Mann’s original work?

    [Response: How many roads must a Mann walk down, before they call him a Mann ...

    (apologies to Bob Dylan)]

  • RomanM // March 10, 2008 at 5:35 pm

    Thank you for fixing the initial post.

    My comment that “the “covariances” will
    also change but that will tend mainly to increase the size of first eigenvalue. ” is exactly an agreement with you that the off diagonal elments are affected as a product of the two differences between centre and mean for the two variables involved.

  • Gavin's Pussycat // March 10, 2008 at 6:03 pm

    One more comment. Somewhere above someone claimed that one should choose the depth of the PCs that one will consider before doing the analysis. This is, well, auditory. What one should choose is the criteria for including a PC.

    Wasn’t it simply “if the result changes significantly if you include more, you should include more”?

  • dhogaza // March 10, 2008 at 6:05 pm

    I’ll also say that commentary at one point became rather contentious, then got downright nasty, but has now begun to focus more clearly. We might actually be making some progress.

    Not really. They’re just over at CA engaging in scientific arguments like … name-calling. Yes. That seems to be the major rebuttal to your post and your follow-ups. Calling you names apparently proves you wrong.

    [Response: I'm not surprised if *they* aren't making progress. But I hope that *we* are. And "we" doesn't include "them."

    As for calling me names ... the more time and effort they waste doing that, the less they have to work other mischief.]

  • Hank Roberts // March 10, 2008 at 7:08 pm

    They don’t need to work at trolling if you go there looking. That’s a monkey-jar trap. Eschew!

  • MrPete // March 10, 2008 at 7:51 pm

    dhogaza, I’m surprised I need to say this. Look at the stated purpose of IPCC. The reports are only scientifically “robust” to the extent that the reported science is in line with the (political/policy) purpose of IPCC. Y’gotta dig deeper than IPCC to get a good scientific picture. IPCC will not publish anything out of line with their defined purpose.

  • luminous beauty // March 10, 2008 at 8:17 pm

    “If the data are such that the origin is an important point of reference, then this type of analysis can be relevant. However, if the centre of the observations is a long way from the origin, then the first ‘PC’ will dominate the analysis, and will simply reflect the position of the centroid.”

    Is it important that Mann’s reconstruction is calculated beginning with the most recent data and working back in time?

    One might think.

  • John Mashey // March 10, 2008 at 8:28 pm

    HB: this is a great series.

    When you do part 5, might you consider talking some about uncertainty, error bars, etc? It’s clear that lots of people don’t seem to understand what they mean in such reconstructions or in general.

    Personally, although I understand the lure of a compelling graphic when trying to summarize material for a general audience:

    MBH** had lots of caveats and clear (& substantial) error bars:
    http://holocene.meteo.psu.edu/shared/articles/mbh98.pdf, http://www.ncdc.noaa.gov/paleo/pubs/mann_99.html
    Eyeballing these, I’ll admit I don’t understand why the pre-1400AD bars aren’t bigger, but I don’t care much.

    The IPCC TAR, Fig 2.21 had several reconstructions, and the AR4’s TS.20 has even more, and AR4s section 6.6 even more.
    http://www.ipcc.ch/pdf/assessment-report/ar4/wg1/ar4-wg1-chapter6.pdf

    My take was that the original MBH’s were heroic early attempts to extract signal from noisy, minimal data, and a bunch of years of studies have been close enough [Fig 6.10 of AR4, p467 in printed version.] that’s exactly what I’d expect experimental studies to look like. I’d be *astonished* if they all came out with the same line and tiny error bars.

    re: Wegman
    Wegman is clearly a world -class statistician, but cannot be accused of overly-favoring Mann & co, so when I say he got “it” right, I meant the part I quoted. (”move on”) He clearly got other things wrong, as statisticians sometimes do when they get outside their own turf. Statisticians are always niggling at others for not using statistics right, but there are not enough statisticians around for there to be one looking at every single paper in the world upfront, especially in universities, but even in the old Bell Labs, where we had *awesome* statisticians like John Tukey and Joseph Kruskal. if you wrote a paper for external publication, it had to be reviewed by 2 divisions outside your own. Guess where papers with serious stat in them went? Did people learn to check out anything tricky with them first? (yes) And people did lots of approximate”back-of-the-envelope” stat, which was often good enough.

    Tukey wrote:

    “Far better an approximate answer to the right question, which is often vague, than an exact answer to the wrong question, which can always be made precise.”

  • cce // March 10, 2008 at 8:42 pm

    Re: Support from statisticians

    Everyone should listen to the NRC Panel press conference. Because until something major comes around that changes things, they will be the final word on these issues, not Mann, McIntyre, Wegman, or Tamino.
    http://video.nationalacademies.org/ramgen/news/isbn/0309102561.rm

    This is from statistician Peter Bloomfield:
    “Subsequent work has carried out very similar analyses with principal components replaced by much simpler methods like simple averaging and has arrived at essentially very similar reconstructions. So the committee reviewed that and other statistical issues that had been identified in that first analysis and while finding that the issues are real but that they had a minimal effect, not a material effect, on the reconstruction.”

    “[Mann et al’s methods] were all quite reasonable choices. I think in some cases a lot of subsequent, hard work by others in following up on that have showed that some of those choices could have been made better, but they were quite plausible at the time. I would not have been embarrassed by that work at the time, had I been involved in it and I certainly saw nothing that spoke to me of any manipulation or anything other than an honest attempt at constructing a data analysis procedure.”

  • dhogaza // March 10, 2008 at 9:54 pm

    MrPete …

    The Intergovernmental Panel on Climate Change (IPCC) was created in 1988 by the World Meteorological Organization (WMO) and the United Nations Environment Programme (UNEP) to “assess the scientific, technical, and socio-economic information relevant for the understanding of the risk of human-induced climate change.”

    Nothing there to support your claim of predetermined results of the assessment process.

    This:

    The reports are only scientifically “robust” to the extent that the reported science is in line with the (political/policy) purpose of IPCC. Y’gotta dig deeper than IPCC to get a good scientific picture. IPCC will not publish anything out of line with their defined purpose.

    is simply false if your claim is that their “defined purpose” is to promote the AGW hypothesis.

  • MrPete // March 10, 2008 at 10:16 pm

    cce, why should scientists be paying attention to a press conference?

  • TCO // March 10, 2008 at 10:44 pm

    George:

    Part of getting your Ph.D. is getting the balls and experience to move into alternate fields. I have done it in psych and econ and alternate science fields. Not every person can do that well…but many scientists can.

    I’m not so sure about McI. I think he has some fundamental problems in terms of fairness and disaggregation and organized presentation. So you have to tak him for what he’s worth. I think he does surface interesting questions.

    I know a lot of people here on both the right and left are more interseted in winning the war. And that’s fine. It could be an important war. But I’m more interested in intellectual curiosity. And in an occasional trollish blogfight. And drinking and eating.

  • TCO // March 11, 2008 at 12:01 am

    Tammie:
    A. Sorry about your wife’s male rel. You are my favorite lib. Pretty cool and fair for an alarmist. Actually rab is ok too. Well you know…smart for a marine. ;)

    B. Actually very cool of you to engage RomanM. good on ya man.

    C. buncha other comments, but I don’t feel like rebutting the Groundhog like arguments of others.

  • per // March 11, 2008 at 12:05 am

    joliffe’s powerpoint shows several ways that PC analysis can go wrong, and offers no specific comment on MBH or MM. It does finish with a clear admonishment that users of non-centred PCA must “explain what they are doing”. One wonders how Tamino would apply this advice to MBH, who didn’t even write in their original methods that they were doing non-centred PCA.

    It is fascinating to see Tamino berating Wegman (http://www.galaxy.gmu.edu/stats/faculty/wegman.html). Strangely, when Gerry North, chair of the NRC review panel, was asked to comment on the wegman report, he said that the findings of the wegman and NRC panels on the statistical issues were very similar. Why is it that eminent scientists agree with Wegman ?

    Tamino himself points out that the non-centred pca pushes the spurious PC to be more prominent, and says this is the point. MBH’98 specifically emphasises that their PC1 is the dominant component of variance. It certainly takes the shine off when you admit that the “dominant component of variance” is an artefact of your analysis method.

  • elspi // March 11, 2008 at 12:31 am

    per brings the stupid^2

    “Tamino himself points out that the non-centred pca pushes the spurious PC to be more prominent, and says this is the point.”

    Actually he said that the non-centered pca pushes the temperature signal PC to be more prominent, which, for the non-short-bus crowd,
    is indeed the WHOLE POINT.

    In his example, the non-centered pca is better than the centered pca, just as it was in MBH98.

  • cce // March 11, 2008 at 1:00 am

    MrPete,

    Scientists are free to do what they will, but it would be wise for the public to listen to the NRC Press Conference and keep in mind their findings the next time someone tries to tell them “the Hockey Stick was broken by the NAS” or “the Hockey Stick is bad science,” or “no one supports Mann” or similar falsehoods.

  • TCO // March 11, 2008 at 1:09 am

    I read the McI posting. A translation:

    Para 1: no math/science content. Quote whining, expressed in a pompous manner.

    Para2: no math/science content: Blathering about the context of literature. APomctually here, I agree with his point against Tammy and perhaps Tammy starting to as well.

    Para3: Pompously expressed but probably has a point about the Mann lack of description of methods. Not really math/science.

    Para4: Basically a math/science point here, but still one that cuts both ways and at times shows McI to be sophistic (for instance he will complain about aspects of methodology as independant errors, yet refurse to show that they are compensated for or that they do not actually cause the effect alleged (acting singly).

    Para5: Some science content here. Some disorganized mixing of issues. The explanatory point on temp and proxy PCs…is explanatory. There is a later bit of kitchen sink excusism (that is not proven) wrt precip series. The point about time dependance is fundamentally interesting, but not well developed.

    Para 6: A really, really long meandering nonsequitery para. I think you could boil it down from 8 lines to 1: “Mann’s code shows that he used rule N with temp data.”

  • TCO // March 11, 2008 at 1:19 am

    Para 7: Another overly long para that could be boiled down to “Preisendorfer” is a good ref. Not sure that that is earth-shaking, but at least the para is slightly more readable, slightly less pretentious.

    Para8: An interesting point is buried in here. I sorta notice the same thing with Jollife: that climate PCA is usually concerned with some sorts of wave like looking patterns across regions. Not sure how this differs with MBH, if there is some stochastic to continuous issue…or some difference in z’s of interest (in one case caring about the regional look and using methods to look at that, and in another honed in on a global average and perhaps not needing same methods).

    Para9-11: If the “purpose of the post is not to discuss short centering”, why does he do so? Such meandering is p….ompous. However the points made and the quotes from Presie are quite relevent. A small nit is that “not centering” is not well differentiated from short centering.

  • TCO // March 11, 2008 at 1:31 am

    Para 12: Interesting, reasonable point per se…but inserted in isolation. It means something to a general argument (strained one) that McI likes to have about what is “conventional”. In any case, as it’s put in isolation, with no development even to the usual arguments, it serves no purpose. In addition, I really wish he would move beyond this tendancy to try to prove himself right versus really fairly examining correlation versus covariance. I learned about this much better and more fairly by reading medical stuff on the web. Looking only at the McI defenses, would not well explain to one when to use corr and when cov. (He gets so wrapped in rhetoric, that he misses science explanation and intuition.)

  • MrPete // March 11, 2008 at 1:38 am

    elspi, what’s interesting here is how “better” is determined. Only analogy that comes to mind right now, and this is obviously stupid and simplistic yet perhaps good for us peanut gallery folk:

    “round(avg(max(1,5,3,4))) is better than max(round(avg(1,5,3,4))), because it better fits the reference value (which is 5).”

    Both methods are valid ways to use the functions. Unfortunately, the example is silly because there’s no physical theory supporting either formula. And that’s why fitting sample data to reference data is not enough. One must have a physical basis for the analysis method.

  • Spence_UK // March 11, 2008 at 1:47 am

    Lazar, many thanks for the reply, raising further good points. I may be a little rusty on some of this stuff, although your questions are helping to remind me of some of it!

    With regard to the AR1 significance, yes the outcome was significant (albeit only just, but significance is a pass/fail anyway) using the original test applied by Mann (for which, I believe, example Scree plots exist). However, it is true that the analysis conducted by Ritson does tend to suggest that the AR1 model is not a good one. But the model applied by McIntyre was the same one used by Mann in the original study – so if you argue that McIntyre’s method is inappropriate, then you are arguing that Mann’s method is inappropriate. Sounds like a thorny issue from both sides! Ironically, I believe the signficance occurs because the standard statistical test doesn’t really work properly for short segment decentred PCA, because the mathematical properties are not well understood…

    That said, there are still other compelling reasons not use decentred PCA (in addition to the statistical significance). I discussed the reasons I felt the selectivity would be compelling to the regression step, but I did not define the mathematical relationship that leads to this, which I think is worthwhile.

    Beforehand, a brief non-controversial example. If I were to try to estimate temperature from a large set (say 1000 time series) of noisy data with long histories, I could just average, or I could select the 100 that best correlate with instrumental data in recent times. As with any other form of selectivity, this could affect the significance requirements of my results, which would become more demanding. If the goal of my study was to prove that my long histories correlate well with the instrumental record, such a selectivity step would be a disaster because it would insert my conclusions into my assumptions.

    So, what of the selectivity imposed by short segment decentred PCA? For the AD1400 step, the “variance” will be inflated in relation to the difference between the 1400-1901 mean and the 1902-1980 mean. Now, firstly, I would like to see an a priori justification of this selectivity; secondly, I would doubt any conclusion related to the relative temperatures of these periods based on an algorithm that contained such selectivity.

    To me, the three reasons (inflated significance requirements, unjustified selectivity which impinges on conclusions, poor mathematical understanding of consequences) are good enough cause to drop the short-segment PCA. The persistence issue you raise is a valid point, but I would be deeply uncomfortable arguing a modern-medieval temperature relationship based on the remaining assumptions.

    OK, so if we say – let’s assume uncentred PCA is the better way to go until a better argument is put forward for its case. You raise some interesting points on the uncentred PCA case, but I’ve waffled on enough for now – maybe I’ll address these another time. Meanwhile, I would draw your attention to the most recent post at CA which discusses PC selection, which acknowledges one of the loose threads you highlight.

  • TCO // March 11, 2008 at 1:51 am

    para 13-14: seems like decent stuff, although not developed to a complete point.

    Para 16-18: good point on use of PCA for exploration of spaces vice as a method of processing. I would like to see a fuller description on the concept of PCing and throwing away and then model buliding on that versus just doing regressions.

    Para 19-20: This part is a bit slick here. He is correct that the under N PCs are not gauranteed to be significant (even the first is not). However, I’m not sure how that relates to an argument of truncations (where a more conservative approach is to include more…actually to the extent of not even doing PCA and just doing MC.) Also, at this point, the continued references to source materials (appeal to authority) rather than demonstration of points by math itself, starts to drag.

  • TCO // March 11, 2008 at 2:15 am

    21-22: nice explanatory science content. Although all agree that not much turns on it. the vieled kvetching against correlation matrix is mildly off, but again…little turns on it.

    23-26: Actually a pretty interesting detail-type point about what the meaning of “rule N” is. apparently Mann did not exactly use what was out of Priesie or if he did, did not show the validation study to show his thumbrule rationale.

    27: bit of non-disaggregation here. there is a methodology issue here…and a data issue. And McI just can’t seem to get his head wrapped around the 2 by 2 matrix comprised of right/wrong for both method and data separately. It’s just a mental block with him. Probably comes from never having done designed experiments, full factgorials, etc.

    28-29: some blathering about “moved goalposts” where McI fails to appreciate that it’s not the time basis of when he complained about things, but that he tends to confound things and tries to have one potential flaw take the numeric impact of several.

    29 further: poor concluding paragraph where he kitchen sink like brings in other issues of general Mann badness (like verification stats) that he does not have the room to develop and that are not needed for examination of former issues.

  • Lazar // March 11, 2008 at 3:10 am

    Okay, I’ve been fiddling about with R and have a few results.
    I think I now understand Steve McIntyre’s (and Ross McKitrick’s) argument, something as follows…
    … decentered PCA applied to red noise creates a ‘hockey stick’ PC1, so there is a danger in doing decentered PCA of emphasizing sommat which is just noise.
    … the 1400AD step is highly dependent on the PC1 from the back-to-1400AD North American ITRDB network, which they claim is ‘mainly’ bristlecone pine.
    … if you supply red noise in place of the NA ITRDB network, the decentered PCA may ’shape’ a PC1 hockey stick in a way that resembles
    instrumental temperature enough to fool the algorithm.
    … he concludes MBH98 were wrong when they benchmarked significance by regressing red noise against temperature without first passing through decentered PCA. MBH98 RE benchmark significance at 0.0 (99%).
    … in the GRL paper (MM2005a), M&M try to do just that by passing red noise through decentered PCA, but, as far as I can make out, they don’t emulate the ‘full’ MBH98 algorithm, as I understand it.
    … I don’t understand why they didn’t.

    In order to obtain more accurate significance benchmarks, we regressed each of the 10,000 simulated PC1s against the MBH98
    northern hemisphere temperature series (the ’sparse’ subset used in MBH98 for verification
    )
    in the 1901-1980 calibration period — a procedure which more closely emulates actual MBH98 methods.

    Benchmark signficance at 0.59.

    Huybers commented,
    M&M did not carry out an (undisclosed) variance scaling step, resulting in inflated RE scores. With variance scaling, Huybers reported benchmark significance same as MBH98, 0.0.

    M&M respond, Huybers did not implement the variance scaling step correctly, and after implementation according to MBH98 code, M&M benchmarked signficance at 0.54.

    Steve’s R code for creating ‘red noise’ PCs is here, and the Wahl & Amman R code for emulating MBH98 is here.
    I reproduced their (WA) verification score for the 1400AD step, 0.48, then substituted two red noise ‘hockey stick’ PCs, lifted straight from 2004GL021750-hockeysticks.txt, for the PC1 and PC2 used to represent the NA ITRDB network. Ran the simulation. RE score 0.18 this time, lower but still significant according to MBH98.
    Tried again with another two red noise ‘hockey sticks’, 0.34 this time, quite high.
    Are the red noise proxies contributing?
    Remove both red noise proxies, down to 20 proxy indicators now and no NA ITRDB, and the score is not much different, 0.27.
    For fun, substitute a red noise ‘hockey stick’ for all 22 proxies / PC summaries, and the score is… -1.48.
    I need to do more runs, graph the results, and double check.
    Am I doing something wrong here (anything obvious)?
    Anyone else wanna have a go?
    PS, graphed the 22 proxies, apart from the 2 NA ITRDB PCs and one other, 8 out of the remaining 19 proxy indicators have a ‘hockey stick’ shape.

  • fred // March 11, 2008 at 3:23 am

    Luminous, you said “It is not really a question of statistics. It is a question of whether tree-ring growth has a causal, non-probabilistic relation to climate variation. It does.”

    This may be a misunderstanding of the issue (though I’m not totally confident of having understood it). There is a purely statistical issue. Even after you have accepted that tree rings are a temperature proxy, you’re only a small part of the way to the HS. You have no HS yet. You just have a series of what you believe to be temperature observations. But if you just plot them all on a chart, they will not be very informative, because they have so much random variation. They are very noisy. Its the submarine problem. You are listening for propeller noises, but the ocean is a noisy place, and if there is a signal, its obscured by the other noises.

    You have at this point a large number of series of data, too many, and have to simplify them to make sense of them. If memory serves, MBH had about 1,000 series and a much larger number of data points. The question is, what do they show and how do you display it.

    One thing you might try is averaging them – simply plotting the means by date. This has been done, and it also looks trendless, (which does not mean it really is).

    You could also do standard PCA. This does show a trend, but (apparently, I’ve not done it) where the first PC is also trendless.

    The contention of M&M, which was endorsed by Wegman, is that the method used by MBH takes some of the 1,000 series, and gives a stronger weight to them. The claim is also that it gives a stronger weight to those series with strong rises in the 20C. The amounts of the weighting have been calculated. The contention is that BCPs from Sheep Mountain have nearly 400 times the weight of other series.

    This gives rise to the remark, which sounds like a jibe but is not, that they could have simply used the BCPs by themselves and thrown out the rest of the data.

    I don’t think Tamino has addressed this issue in his posting, or that any of the other defenders of MBH have done so. As a matter of statistics, this seems not be a correct procedure. This is what M&M tried to show by applying the method to trendless red noise. Their contention was that if you do it, you get a HS shape 99% of the time. If this is true, then the HS as shown around the world is an artifact of the method.

    Continuing with the logic, the issue in all this is not simply whether there is an HS shape someplace in the data, but how important it is. How much of the variation in the data does it represent? The contention is that it is there OK, but represents under 10%, if you do PCA properly. That is, it is the fourth principal component not the first and not of any great significance. If however you do PCA in the way that MBH did it, not using the mean of the series, but the mean of a subset of it, it shows up as the first PC and represents most of the observations.

    It is not enough to say ‘you still get the HS’. You will always get it as long as any of your series has it. The question is whether it represents most of the data, or only a tiny proportion of it. Is it PC1 or PC4? This is the real issue.

    You see that the issue is independent of what you think of the relation of tree rings to temperatures. Even after you agree they measure temperature, the question remains, what your temperature series mean and what shape are they. This is the reason why it is a statistical question.

  • dhogaza // March 11, 2008 at 3:49 am

    “round(avg(max(1,5,3,4))) is better than max(round(avg(1,5,3,4))), because it better fits the reference value (which is 5).”

    Both methods are valid ways to use the functions. Unfortunately, the example is silly because there’s no physical theory supporting either formula. And that’s why fitting sample data to reference data is not enough. One must have a physical basis for the analysis method.

    There may be no physical theory, but surely, given your expertise (as expressed elsewhere), you understand that delaying an integer-to-floating point conversion, especially when later combined with an explicit rounding from float to int, is not a bad thing?

  • George // March 11, 2008 at 4:01 am

    TCO

    Part of getting your Ph.D. is getting the balls and experience to move into alternate fields.

    Perhaps, but I’d say just as (if not more) important is leaning to recognize ones own limitations and ignorance (especially outside one’s own narrow field).

    Sometimes (more often than not?) when one attempts to characterize the current state of a scientific area that is not one’s own, one merely makes a fool of oneself.

    Even Nobel caliber scientists have fallen into this trap.

  • fred // March 11, 2008 at 4:04 am

    A brief comment on this question of CO2 again. It really is misleading to say in this context and without qualification that the more CO2 you have, the higher the temperature. The real situation is that raising CO2 from 280ppm to 380 has a largish effect. Raising it from 380 to 480 has a much smaller one. From 480 to 580 a still smaller. And so on. Yes, its literally a continuing rise for ever. But it needs qualification.

    “The more CO2, the higher the temperature, with decreasing effects up to about x%, when the effect becomes insignificant”.

    This would be a correct, and less alarmist, way to put it.

    It is also true to say that this effect, for doubling of CO2 from 280ppm, is 1.2 degrees. The evidence required for feedback amplifying this 1.2 degree increase is quite different from the evidence for the warming from CO2. Not that it does not exist. But it is quite different. If we discover feedbacks don’t exist or are negative, or even are higher than the IPCC supposes, no modification of laws of physics will be needed. Whereas to refute or increase the warming effect of CO2, the laws of physics would have to be changed.

    To say this is not an sort of ‘ism’, its just stating facts. You can only get to proper assessments of AGW if you disentangle the different elements. Doing which is, to disentangle yet another thing, completely different from being a “denialist”. It is rather like the difference between finding out what the doctrine of the Real Presence is, as opposed to the heresy of denying it.

  • dhogaza // March 11, 2008 at 5:11 am

    So over at CA, where McIntyre “demolishes” Tamion, we have the other Mc posting:

    It’s very telling that Tamino fails to link to a single one of our papers

    Well, yes, it’s very telling. Why should he or anyone else?

  • Bob North // March 11, 2008 at 5:12 am

    Having waded through all of this, I guess one fundamental question remains: Given the various climatic/weather/local site factors that do affect tree ring growthm(or many of the other proxies), are we trying to tease too much out of the available data, regardless of statistical technique? I have no doubt that there is some temperature signal in tree rings (probably more of a growing season signal than an actual annual signal) but as rightly pointed out by MrPete (I think, it was so long ago), there are a lot of other factors that affect plant growth, even in areas where, in theory, growing season temperture is a dominant limiting factor.

    Also, I am probably wrong, but I seem to remember from reviewing various papers some time ago, that the correlations between temperature and tree ring width and max wood density are ok, but not great. Based on all this, I seriously doubt that we can nail down the average global temperature 800 years ago to within a few tenths of a degree C.

    Now, as noted above, none of this really affects the basic premise that CO2 is a greenhouse gas and that man has the potential to significantly affect global climate, whether through GHG emissions or land use changes, etc. What it does affect is the conclusion that the current temperatures are unprecedented and the extent to which limited future warming (1-3 degrees C)will be “catastrophic”.

    Finally, HB if you could address more clearly in your next post on PCA why the centered PCA didn’t pick up the hockey stick in your hypothetical example, but the decentered “PCA” did pick it up, it would be much appreciated. I am not sure that you answered this question when you responded to my previous post early on in this thread.

    Regards,

    [Response: But the centered PCA *did* pick up the hockey stick -- it was in PC2 rather than PC1. Likewise, with the actual data it's picked up by centered PCA, mainly in PC4 rather than PC1.]
    Bob North

  • dhogaza // March 11, 2008 at 5:20 am

    McKittrick makes an interesting claim:

    I guess Tamino also hasn’t inspected the CENSORED folder or grasped the meaning of what was in it. Without the BCP’s the decentering doesn’t matter since the remaining series all have stable means, and no hockey stick emerges in any PC.

    So now they’re conflating two supposed evils, without justification AFAICT

    1. BCP proxies are bad.

    2. Including them in the analysis shows that decentered PCA is bad.

    3. The hockey stick doesn’t appear w/o BCPs, no matter what analysis you do (false, others have done this)

  • per // March 11, 2008 at 9:00 am

    “Actually he said that the non-centered pca pushes the temperature signal PC to be more prominent, which, for the non-short-bus crowd,
    is indeed the WHOLE POINT.”

    well, he has made a synthetic example where he gets different results with the two methods. But the example doesn’t prove the general case; it is just an isolated example.
    “Wegman’s right about this much: a non-standard centering convention will indeed emphasize the variation of the mean, which turns out to be the main difference between the 20th century and the rest of the time span. What he fails to realize is that that’s the point. ”
    This is fixing the analysis method, so that you get the result that you desire. This is a faulty approach.
    joliffe goes out of his way to say that you must explain why you use a non-centred PCA, because it can introduce artefacts. Saying that it gives the “result” you want is not a scientific explanation. Saying that it emphasises variation from the mean (the 20th century increase) means that you know you are fixing the results. Again, not science.

  • MrPete // March 11, 2008 at 11:15 am

    dhogaza, don’t miss the joke…

    “surely… you understand that delaying an integer-to-floating point conversion… is not a bad thing?”

    I sure do. I suppose my example may have been more successful than expected! The rounding is obfuscation in this case. Surely you understand how avg(max(…)) is both similar to and quite different from max(avg(…)) … and both in a “silly” way :-)

    Solve the riddle: what do these expressions reduce to (in general, ignoring the specific data)?

  • MrPete // March 11, 2008 at 11:33 am

    George, “Sometimes (more often than not?) when one attempts to characterize the current state of a scientific area that is not one’s own, one merely makes a fool of oneself.”

    Well said! So the trick is to understand what “one’s own” area(s) are.

    Personally, I have confidence about a very restricted set of knowledge that’s actually “mine.”

    For the rest, I listen broadly and try to see things from a variety of perspectives, recognizing that many areas of expertise will bring valuable understanding to a particular set of issues.

    That’s different from expecting the experts in a single area to be the sole authority for issues that fall more or less into their bailiwick.

    Another way of putting it: I’ve found that, even for brilliant experts, today there are very few applied issues (brain freeze – not sure that’s the right term at 5am :-) ) that 100% fall into “their” area of science or knowledge. What do you think?

    To be extreme, show me a piece of nature where a single scientific specialty “owns” all knowledge relevant to that bit. Isn’t the world “naturally” multidisciplinary?

    (Interesting, I started searching for definitions of “climate science” — it’s not universally agreed what that actually is.)

  • Meltwater // March 11, 2008 at 12:54 pm

    Patrick Hadley wrote:

    Incidentally, while I gave four reasons to justify the continued debate about the HS, if Meltwater was right when he wrote ” For the ‘hockey stick’ to be wrong, a great deal of other science has to be wrong too. ” then it would seem that the stakes are even higher.

    Hansen’s Bulldog, aka tamino, added:

    [Response: I disagree strongly with the “a great deal of other science has to be wrong too” part. But I don’t speak for my readers, just as they don’t speak for me.]

    And fred said:

    meltwater, it will help to … talk about different things differently.

    The hockeystick, MBH98 and subsequent papers, could be wrong. Decentered PCA could also be wrong. And it still could be warming due to CO2.

    You don’t have to accept MBH to be convinced by the arguments for CO2 driven warming.

    All these comments left me thinking that in my own earlier comment I carelessly mis-phrased what I was trying to say. I did not mean to suggest that the soundness of the overall case for anthropogenic global warming (AGW) somehow immunizes the Mann et al. (MBH98) ‘hockey stick’ against error or criticism. Nor did I mean to imply that any possible refutation of MBH98 would also discredit the overall case. Frankly, I did not realize then that my comment could be taken in such ways, although it seems obvious now.

    When I asked—

    Given the accelerating rate of human unburied carbon emissions, how could a temperature ‘hockey stick’ not result?

    —and then asserted—

    For the ‘hockey stick’ to be wrong, a great deal of other science has to be wrong too.

    —I intended to argue broadly that some sort of ‘hockey stick’ temperature graph has to be right, not necessarily that the specific MBH98 graph has to be right.

    I wrongly imagined that by saying “a temperature ‘hockey stick’” in my question I had made this clear. Regrettably, I undid any such clarity when I said “the ‘hockey stick’” in my assertion, which apparently referred to MBH98.

    I’m sorry that my verbal clumsiness wasted so much time and bandwidth, and I ask you all to look away from that, focusing instead on my intended point:

    Since human unburied carbon emissions have driven CO2 concentrations upward in a manner that would make a ‘hockey stick’ shape on a graph of atmospheric composition over the last few hundred thousand years,

    —and since we know from radiation physics that atmospheric CO2 molecules absorb IR, get kinetically excited and heat the surrounding gas,

    —the smart money would bet that a resulting ‘hockey stick’ shape would be found on a graph of global mean temperatures over a comparable time-span.

    Indeed, for such a shape not to result would mean something surprising, like a physical mechanism that prevents climate sensitivity to CO2, and/or a great deal of other science being wrong.

    Now that I have phrased my point more carefully, I ask you all, including Hansen’s Bulldog: Do you still disagree? If so, why?

    [Response: I agree that for such a shape not to emerge would be surprising. But there's one caveat. So far we seem to have experienced about 0.75 deg.C modern warming (depends on exactly where you start, but that's in the ballpark). Climate sensitivity (to forcing, not to doubled CO2) seems to be around 0.75 deg.C/(W/m^2) (not precise but in the ballpark). So we might have experience comparable warming (or cooling) in the not-too-distant past IF there was a forcing change of about 1 W/m^2 sustained long enough for its impact to be felt, or a greater forcing change for a more brief time. This is implausible but not impossible.]

  • Surly // March 11, 2008 at 2:06 pm

    According to one poster over there, MM’s breaking the hockey stick saved the world from needless destruction. I feel so relieved.

  • fred // March 11, 2008 at 2:24 pm

    “with the actual data it’s picked up by centered PCA, mainly in PC4 rather than PC1″

    Yes, but that is the point at issue, surely? PC1 explains most of the variation. PC4 explains very little. Its not a small thing to demote the HS from PC1 to PC4. Its the difference between it being the main trend, and something quite close to noise, no?

  • Hank Roberts // March 11, 2008 at 2:35 pm

    dz, it’s a monkey trap. There are dozens of blogs making claims about beliefs. Copying that stuff here is second hand, why bother? It’s ’see what they said’ if attributed. If not attributed, it’s just copypaste tactical stuff (A stinging remark! I’ll go pretend it’s mine by pasting it into a climate blog).

    You realize they fish for goats, don’t you? Don’t let them catch yours!

    Review the copious notes, there’s extensive documentation on how it’s long been done on Usenet. Recognize this, don’t be caught up in copypaste.

    alt.syntax.tactical FAQ – Dealing with Trolls Crossposting and Flames:
    Playtime (1 off trolling) * Tactical (long term trolling by a single character) * Strategic (long term trolling using multiple characters and different …
    http://ddi.digital.net/~gandalf/trollfaq.html

  • George // March 11, 2008 at 2:40 pm

    I’m still a little unclear exactly why PCA would be used in this case.

    How (or more to the point is) the answer obtained with PCA somehow superior (more reliable, more robust, etc) than the answer obtained with simpler, more straightforward less manipulative, techniques?

    It strikes me that sometimes itis better to do less data processing than more.

    I guess the main question for me is this:

    Why change the data representation with PCA at all if there is the possibility that it might introduce ambiguity in the final result? (depending on which components are selected, in this case)

    If one has to include a certain (possibly unspecified) number of the components to be certain that one has not missed something important, what has one gained?

  • Tom C // March 11, 2008 at 2:44 pm

    So over at CA, where McIntyre “demolishes” Tamion, we have the other Mc posting:

    It’s very telling that Tamino fails to link to a single one of our papers

    Well, yes, it’s very telling. Why should he or anyone else?

    Why? Because then they could read the papers themselves and would realize how Tamino has distorted the content.

  • Meltwater // March 11, 2008 at 2:45 pm

    Quoting me, fred responded:

    “People intent on finding fault with the ‘hockey stick’ deserve to have their arguments and their intellectual honesty challenged.”

    No, they need to have their arguments challenged. The rest is pointless personal insults which get no place.

    A more civil and reasonable discourse of the kind fred invites would be wonderful but, in the world we have, I am sadly sure that challenging arguments alone is not enough.

    If it were enough, why do denialist polemics like that of Singer and Avery keep repeating arguments that were scientifically refuted long ago? To give only two of many examples:

    Singer and Avery suggest that human activity is too ‘puny’ to affect the climate even though Högbom and Arrhenius showed more than a century ago that accessible coal and oil suffice if combusted to multiply the atmospheric concentration of CO2 several times over, and that merely doubling CO2 can change the climate.

    Singer and Avery suggest that the IR spectral bands absorbed by CO2 could be saturated, and more CO2 can be added to the atmosphere with relative impunity after saturation. This was a respectable view back in 1900 when Knut Ångström reported an experiment that seemingly confirmed it, but E.O. Hulburt challenged it as early as 1931, Lewis D. Kaplan had proved aspects of it wrong by 1952, and Gilbert N. Plass finished it off in 1956. There has not been any scientific basis for the spectroscopic saturation argument for more than half a century.

    Laypeople can be forgiven for not knowing how thoroughly the arguments for and against AGW have been weighed in the scientific literature over several generations. Yet some prominent ‘experts’ exploit such popular ignorance by repeating long-refuted arguments, with no other motive than to prevent timely political action to curtail unburied carbon emissions. Naomi Oreskes discusses them and their agenda on this video.

    Their behavior is not so easily forgiven. Not only are they intellectually dishonest; tragically, their dishonesty has prevailed. It is probably now too late for timely emissions curtailment. Disastrous warming sufficient to destroy whole cities and bioregions has likely become inevitable. Less timely action, which at best can only minimize the harm to come, is the most we might be able to manage now.

    We are all emitters, and we all share the blame for what’s coming. But the people who deliberately misled and still mislead the public deserve more blame than the rest of us. If we let their intellectual dishonesty go unchallenged, they will keep ‘swift-boating’ real science and real scientists, and thereby continue to thwart even the less timely emissions curtailment of which some prospect still remains.

  • fred // March 11, 2008 at 3:49 pm

    George

    “How (or more to the point is) the answer obtained with PCA somehow superior (more reliable, more robust, etc) than the answer obtained with simpler, more straightforward less manipulative, techniques?”

    I haven’t yet done PCA in anger, so treat this with caution. Maybe an analogy will help, imagine a text consisting of words in half a dozen languages. We can’t read it. If we can extract the English in one piece, we can read some of it. Then the Dutch…etc. Or we are listening for screw noises in a noisy ocean. Can’t distinguish them. Signal processing can do a better job than our eyes and ears and brains unaided.

    All you’re really doing with PCA is to extract the signal, or signals, from the noise. The key part is how important the extracted signal is as a percent of the whole. PC1 is usually going to explain a lot of what there is to be explained. PC4 and lower, very little.

    If you just look at the raw data you will see noise. This doesn’t mean there is no signal however.

    Have a look at Ross McKitrick
    “What is the Hockey Stick Debate About?”, fig 6. Whether or not you agree with the paper, the diagram shows the process. Remember, just because you can’t see it with the naked eye does not mean it is not there.

  • chriscolose // March 11, 2008 at 4:10 pm

    Meltwater, I definitely agree. There is a line between “skepticism” and “lying” and Singer and co. have crossed that a long time ago.

    What escapes me is why AGW arguments are held to such a high standard (i.e., bad proxy usage, and we never hear the end of it for a a decade now), but the denial propaganda is allowed to regurgetate the same garbage such as “CO2 lags temperature, so now AGW” or “1940-70 cooling, so no AGW” and you simply hear more of it, even after it is invalidated. Because the science is there, we have much more interesting things to talk about than conspiracies and dwelling on decade-old arguments which didn’t work out.

    What is strange to me is that this whole hockey stick argument can somehow be taken as a criticism of the enhanced greenhouse effect. The hockeystick doesn’t really show the warming, but rather a slow decline in temperatures over a long time, until an enhanced greenhouse effect kicks in (and this is based more in the instrumental record, and not so much PCA). The argument that an as warm as, or warmer than today MWP invalidates AGW is high level nonsense, as the medieval warmth is not part of the attribution effort. It also says nothing about what will happen in the future.

  • George // March 11, 2008 at 4:48 pm

    fred claims:

    “It really is misleading to say in this context and without qualification that the more CO2 you have, the higher the temperature.”

    Misleading? How so?

    The statement as it stands is quite correct on its own.

    Fred rationalizes:

    … it needs qualification.

    “The more CO2, the higher the temperature, with decreasing effects up to about x%, when the effect becomes insignificant”.

    This would be a correct, and less alarmist, way to put it.

    No, specifying an effective “cutoff” would not be a correct way to put it.

    It would, however, be a more subjective way to put it, that much is certain.

    Words like “insignificant”, “alarmist”, etc are subjective. Insignificant compared to what? By what measure? Alarmist by whose definition?

    To say that “more CO2 will lead to more warming– all else being equal” is quite accurate. Adding subjective qualifications to the statement makes it less — not more — accurate.

  • David Holland // March 11, 2008 at 5:08 pm

    When Michael Mann mentioned Ian Jolliffe on RC in connection with his non centred PCA, I could not see that the link he gave helped him. When he didn’t answer question as to whether Jolliffe endorsed his method I asked Jolliffe myself and this was his reply dated 25th Feb 2005:

    “I’m afraid that I can’t offer you much enlightenment. I did not hear Michael Mann on the Today programme. Nor do I know what ’seminal work’ you refer to, or how or why he references me. From your email it may be a talk I gave in Cape Town last year, which was a brief review of alternative centerings – I can’t see that it said enough to used as a recommendation.
    My one (anonymous) interaction with Mann, his co-workers and his critics was last year when I acted as a referee for an exchange of views submitted to Nature. After a couple of iterations I came to conclusion that I simply could not understand what was being done sufficiently well to judge whether Mann’s methodology was sound, but I certainly would not endorse it. At least one other referee came to same conclusion. Although the exchange was not published in Nature I believe it may have appeared on a web site. I don’t know whether the methodology noted in your email is the same as that which referees found too opaque and/or complicated to understand.”

  • David Holland // March 11, 2008 at 5:37 pm

    Chriscolose says,

    “What is strange to me is that this whole hockey stick argument can somehow be taken as a criticism of the enhanced greenhouse effect.”

    In the Attribution Chapter 12 of the TAR you can read why. It says on page 702:

    We expect, however, that the reconstructions will continue to improve and that palaeo-data will become increasingly important for assessing natural variability of the climate system. One of the most important applications of this palaeoclimate data is as a check on the estimates of internal variability from coupled climate models, to ensure that the latter are not underestimating the level of internal variability on 50 to 100 year time-scales.

  • Barton Paul Levenson // March 11, 2008 at 5:48 pm

    anonymous writes:

    [[If the analysis of MBH were correct, I suspect long ago some professional academic statisticians would have backed them up. ]]

    They did. Fourteen other studies replicated MBH’s results. Read the NAS report.

  • George // March 11, 2008 at 5:54 pm

    Fred says:

    “Remember, just because you can’t see it with the naked eye does not mean it is not there.”

    Fred. I know what PCA is, but I am trying to understand how it is superior in this case to simple methods that also extract the signal from the noise. Averaging combined with piecewise linear regression is the one that comes immediately to mind.

    My question was really directed to Bulldog or someone else who is not only conversant with PCA but with lots of other data analysis techniques (including linear regression, which Bulldog uses quite a bit, at any rate)

    In this case, I actually can see it [the hockey stick] with the naked eye!

    Sure, the eyes/brain sometimes play tricks, but I have done enough spectroscopic analysis to appreciate that the power of the human eye/ brain combination should not be underestimated when it comes to finding patterns in noisy data. In fact, I’d say that in some cases, the human eye/brain combination is unmatched by even the best data analysis software running on the most powerful computers.

    …which, combined with the fact that simple techniques often work quite well in extracting a signal from noise, makes me wonder “why all the fuss over PCA”?

  • Barton Paul Levenson // March 11, 2008 at 5:54 pm

    fred writes:

    [[“CO2 is a greenhouse gas…. That means that the more of it you have in an atmosphere, the warmer the ground is going to get, all else being equal.”

    This is not true, as a matter of science, for two reasons. First that as CO2 rises, incremental rises have diminishing warming effects. The curve is logarithmic. It is not true that the more you have of it the warmer it gets in any unqualified sense.]]

    Which part of “warmer” did you think included “is not on a logarithmic basis?”

  • Layman // March 11, 2008 at 6:02 pm

    Per Wegman:

    “If the variance is artificially increased by decentering, then the principal component methods will “data mine” for those shapes. In other words, the hockey stick shape must be in the data to start with or the CFR methodology would not pick it up… Most proxies do not contain the hockeystick signal. The MBH98 methodology puts undue emphasis on those proxies that do exhibit the hockey-stick shape and this is the fundamental flaw. Indeed, it is not clear that the hockey-stick shape is even a temperature signal because all the confounding variables have not been removed.”

    The issue is that Mann’s method inappropriately amplifies signals from one set of proxies in a way that drowns out all other signals. De-Centering is a non-standard technique. It is incumbent on Mann to justify its use. For now, I assume it mines for hockey sticks unless shown otherwise.

  • per // March 11, 2008 at 6:06 pm

    “They did. Fourteen other studies replicated MBH’s results. Read the NAS report.”

    you may not understand that “replication” has a specific meaning; it means exact repetition. There may have been fourteen other reconstructions, but there are not 14 replications. In fact, even wahl and ammann were unable to exactly replicate MBH’s results. The shocking thing is that they were able to exactly replicate M&M.

    For the record, the fourteen similar studies have many of the same problems as MBH. That is why theNAS panel concluded right in the summary, that the error in reconstructions going back before thermometers is unquantifiable. In science terms, it is difficult to think of a more damning criticism.

  • Barton Paul Levenson // March 11, 2008 at 6:14 pm

    fred writes:

    [[It really is misleading to say in this context and without qualification that the more CO2 you have, the higher the temperature.]]

    No matter how many times you say this, it still won’t be true.

    [[ The real situation is that raising CO2 from 280ppm to 380 has a largish effect. Raising it from 380 to 480 has a much smaller one. From 480 to 580 a still smaller. And so on. Yes, its literally a continuing rise for ever. But it needs qualification.

    “The more CO2, the higher the temperature, with decreasing effects up to about x%, when the effect becomes insignificant”.

    This would be a correct, and less alarmist, way to put it.]]

    It would be an absurdly Pollyannish way to put it, since it gives the mistaken impression that we don’t have to worry too much about CO2 rising. We do. And your “no further effect” level is not reached even on Venus, which has 88 bars of CO2, a surface temperature averaging 735.3 K, and on which, according to Bullock’s simulations, you could probably reach 900 K under varying atmospheric conditions. The upper limit is irrelevant to any real planet, at least in this solar system.

    ]]

  • dhogaza // March 11, 2008 at 6:18 pm

    My one (anonymous) interaction with Mann, his co-workers and his critics was last year when I acted as a referee for an exchange of views submitted to Nature. After a couple of iterations I came to conclusion that I simply could not understand what was being done sufficiently well to judge whether Mann’s methodology was sound, but I certainly would not endorse it.

    This is a far cry from “Mann’s methodology was wrong” or “the hockey stick is broken”.

  • Chris O'Neill // March 11, 2008 at 6:31 pm

    fred:

    “The more CO2, the higher the temperature, with decreasing effects up to about x%, when the effect becomes insignificant”.

    This would be a correct, and less alarmist, way to put it.

    No it would not be correct and it would be a denialist way to put it. There is no “x%, when the effect becomes insignificant” that we would want to go anywhere near. Maybe the effect is insignificant on Venus but I hope you’re not suggesting we go anywhere near that situation. Although it is true that by the time we get to 560 ppm the effect of each new molecule of CO2 will be half what it was at 280 ppm, I don’t think this is going to keep us out of trouble. The warming from one doubling of CO2 is a very serious issue indeed.

    It is also true to say that this effect, for doubling of CO2 from 280ppm, is 1.2 degrees. The evidence required for feedback amplifying this 1.2 degree increase is quite different from the evidence for the warming from CO2. Not that it does not exist. But it is quite different. If we discover feedbacks don’t exist or are negative, or even are higher than the IPCC supposes, no modification of laws of physics will be needed.

    Which is an academic question anyway because the warming produced by CO2 alone, according to the laws of physics, is only 26% of the empirically observed warming and only a bit more than that including non-feedback gases such as N2O and CH4, i.e. without feedbacks there would only be a bit more than 26% of the empirically observed greenhouse effect.

  • fred // March 11, 2008 at 6:34 pm

    If anyone wants to check out BPL’s statement, the report is here:

    http://www.nap.edu/catalog.php?record_id=11676

    It would be helpful to have a reference for where the fourteen other studies replicating MBH are mentioned. I don’t recall them from a first reading, and didn’t find them on a superficial second glance. Which does not prove they are not there. But it would help if someone could point to exactly where they are.

  • fred // March 11, 2008 at 6:45 pm

    George, the point is not really a combative one, and it is surely reasonable. When we say in ordinary English that the more you get of A the more you get of B, most people would assume that it happens in roughly equal proportions.

    What they would not expect is that for every extra pound or gallon of A you get less and less more of B. The more gas you have, the further you can drive on that tank. Not that adding gas to your tank is subject to a law of diminishing returns.

    When this isn’t so, we usually are very careful to caution people. As in, for instance, we don’t just say the faster you are going, the further it takes to brake. We are very careful to say that its not linear. The same with speed and pedestrian deaths. We don’t just say, the faster you are going, the greater the risk of death. We are careful to indicate that risk of death rises faster than speed. This is why we have the shorthand ‘law of diminishing returns’. Its because of the general assumption of linearity, which people expect to have exceptions signalled to them.

    I am not a great advocate of raising CO2 levels, it seems at best imprudent. But I am a great advocate of careful characterization, and one ought to be clear in speaking about the relationship that it is far from linear. Its not JUST a question of the more you add the hotter it gets. We should follow the usual conventions in making this clear and explicit.

    In the same way, its not JUST a question of the less you have, the cooler. Think about this the other way. If everyone is right, and if we should be reducing CO2 at some point, it will not either be very useful to tell people the more you take out the cooler it will get. You’re going to have to explain that at first, what you take out will have little effect, but that as you keep it up, later takeouts will have more and more. There will be pain without gain at the start. Just as there will be gain without very much pain at the end, going the other way.

  • Chris O'Neill // March 11, 2008 at 6:49 pm

    fred:

    The key part is how important the extracted signal is as a percent of the whole. PC1 is usually going to explain a lot of what there is to be explained. PC4 and lower, very little.

    No, that’s not the key part. The key part is which PC has the signal that we want (e.g. temperature). If there are stronger PCs that do not correlate with our signal then they are just noise. Being stronger doesn’t give them any special significance at all. Just means they are something we are not interested in.

  • Will Richardson // March 11, 2008 at 7:13 pm

    Re: Levenson @ 5:48

    No, the NAS report referred to other studies, most of which use similar analyses as MBH and the same data sets, including Mann’s improperly derived PC1, but did not examine those studies to determine whether those studies suffered from the same errors and weaknesses contained in MBH. Those other studies cannot be deemed independent when they use the same data, in the same way as in MBH. The NAS report did state that NAS agreed with McIntyre’s and Wegman’s statistical criticism of MBH, and agreed that the bristlecone proxies should not be used. All of the fourteen studies NAS referenced used some form of the bristlecone proxies.

  • per // March 11, 2008 at 7:29 pm

    “The key part is which PC has the signal that we want (e.g. temperature). If there are stronger PCs that do not correlate with our signal then they are just noise. Being stronger doesn’t give them any special significance at all. Just means they are something we are not interested in.”

    this is wrong. The whole point in having all these proxies, is that they are meant to behave as thermometers, each representing the temperature in a different part of the world. That is why it was so important for MBH to say that PC1 was the dominant pattern of variance, because it suggests that temperature was the dominant cause of variation.

    If you are simply using the PCA to select for a few records which show a correlation with temperature, then you might as well throw the other records out; but then, you would have no basis for saying that your method was a global (or northern hemisphere) reconstruction of temperature.

    And for the record; it isn’t a god-given truth that any of the proxies studied are actual accurate temperature proxies. If you actually test this in a PCA/ verification analysis, and you find that only 8% of the signal (PC4) gives a borderline significant (if that) association with temperture record, then you have a very shaky basis for even believing that the PC is a temperature proxy.

  • luminous beauty // March 11, 2008 at 7:37 pm

    “One of the most important applications of this palaeoclimate data is as a check on the estimates of internal variability from coupled climate models, to ensure that the latter are not underestimating the level of internal variability on 50 to 100 year time-scales.”

    One might think this means that a higher level of past internal variability means that some larger portion of temperature increase from the instrumental record might be due more to internal variation than EGE.

    One would be wrong.

    It would mean a greater natural variation upon which the EGE must then be added for the purpose of model projections. It just means more wobble in the curve. It doesn’t diminish the greenhouse effect. In fact, it suggests the opposite. A system that is more sensitive to unforced internal oscillation is likely to be more sensitive to external forcing as well.

  • dhogaza // March 11, 2008 at 7:50 pm

    Per:

    That is why theNAS panel concluded right in the summary, that the error in reconstructions going back before thermometers is unquantifiable. In science terms, it is difficult to think of a more damning criticism.

    Not quantifiable, huh?

    Less confidence can be placed in large-scale surface temperature reconstructions for the period from A.D. 900 to 1600. Presently available proxy evidence indicates that temperatures at many, but not all, individual locations were higher during the past 25 years than during any period of comparable length since A.D. 900. The uncertainties associated with reconstructing hemispheric mean or global mean temperatures from these data increase substantially backward in time through this period and are not yet fully quantified.

    Gee, not exactly what the summary actually says, is it, Per?

    And what, exactly, is damning about a summary that concludes:

    The basic conclusion of Mann et al. (1998, 1999) was that the late 20th century warmth in the Northern Hemisphere was unprecedented during at least the last 1,000 years. This conclusion has subsequently been supported by an array of evidence that includes both additional large-scale surface temperature reconstructions and pronounced changes in a variety of local proxy indicators, such as melting on ice caps and the retreat of glaciers around the world, which in many cases appear to be unprecedented during at least the last 2,000 years.

    The reason why we say the NAS report declared that Mann was right in his basic conclusion is because, AS YOU CAN READ YOURSELF, that’s exactly what the summary says.

    What’s damning, Per, is your playing “fast and loose” with the report. Don’t worry, others on your side have assured us that playing “fast and loose” isn’t dishonest, apparently it’s just fine to do so…

  • mikep // March 11, 2008 at 7:59 pm

    But we just get back to the point that in MBH nearly all the hockey stick series are bristlecone pines. The reconstruction therefore depends not on the whole proxy set but on a tiny subset, and therefore in no sense a northern hemisphere reconstruction except insofar a bristlecone pines have a reliable relationship – unlike everything else – not with local temperatures but with the Northern hemisphere temperature. There are very good grounds for doubting that bristlecone pines do act as such standard candles. The best evidence for this is that their growth does not correlate with late 20th century temperature increases. Ever since M&M isolated which series were picked out (nb not created) for differential weighting by de-centred PCA this has been the issue. (And incidentally M&M were well aware that conventional PCA put the bristlecones and hence the hockey stick in PC4 – it’s discussed in their 2005 articles). The danger is of spurious regressions. That’s the point of repeating the MBH exercise with stock prices instead of bristlecones: stock prices do just as well as bristlecones in predicting temperature. This suggests we need to dig a bit deeper than simple verification stats before we conclude we have have found a magic species whose growth. almost alone among trees, acts as an accurate thermometer (and not for local, but for hemispheric temperatures).

  • dhogaza // March 11, 2008 at 8:36 pm

    I didn’t post enough of that report, but I imagine most here are familiar with it, and with the fact that they state that they think Mann’s conclusions are “plausible”. Which outside the report, members have stated was meant to mean “more likely than not”.

  • David Holland // March 11, 2008 at 8:50 pm

    Barton Paul Levenson said:

    “They did. Fourteen other studies replicated MBH’s results. Read the NAS report.”

    I read it and they do not. You should read what those involved in the NRC (NAS) Report said (under oath) to the Whitfield Subcommittee in the hearing transcript.

    1 Ralph Cicerone President of NAS who set up the NRC Panel said Wegman was credible on statistical issues.

    2. Gerald North, Chairman of the NRC panel and panel member Peter Bloomfield both said they agreed with Wegman and the NRC report says pretty much the same as Wegman.

    3. Gerald North said in written answers “for the record”:

    “I may have mistakenly mentioned the “two to one odds” figure in the oral press release of the report, and it may also have appeared in some press accounts, but it does not appear in the report, and I avoided using it in my sworn testimony.

    In our view it is not possible to quantify all of the inherent uncertainties associated with reconstructing surface temperatures from proxy data, which in turn precludes assigning numerical probabilities to statements regarding the unique nature of recent warmth.”

    Now what is true is that the NRC panel looked at virtually the same studies, as did the IPCC Chapter 6 including Wahl and Ammann and Rutherford et al as well as M&M and made not one criticism of M&M but all the ones that you’ve been told of here of Mann et al. – PCA, Bristlecones and verification failure.

    So my question is, how do the IPCC guys figure a probability of up to 90% based on the same proxies which they admit have problems and are “not entirely independent”?

  • Hank Roberts // March 11, 2008 at 8:53 pm

    > stock prices instead of bristlecones:
    > stock prices do just as well as
    > bristlecones in predicting temperature

    Citation please? I’m very curious to look at the tulip stock bubble. Was that the MWP? This may be important.

    They looked at the full history of stock trading, many centuries’ worth -right?

    Any subsequent citations in later journal articles, please?

  • BoulderSolar // March 11, 2008 at 8:59 pm

    Tamino,

    You said:
    “Did they really discard the relevant PCs just to copy the bare number of PCs used by MBH, without realizing that the different centering convention could move the relevant information up or down the PC list? You betcha”

    In MM2005 they stated:

    “Under the MBH98 data transformation, the distinctive contribution of the bristlecone pines is in the PC1, which has a spuriously high explained variance coefficient of 38% (without the transformation – 18%). Without the data transformation, the distinctive contribution of the bristlecones only appears in the PC4, which accounts for less than 8% of the total explained variance.”

    So it looks like you owe them a retraction.

  • David Holland // March 11, 2008 at 9:03 pm

    And as for “pausible” meaning more likely than not, look it up. My dictionary says of statements it means “specious, seeming reasonable or probable” and of persons “persuasive but deceptive” .

    And who are the et al in Rutherford’s paper? IPCC do not say but it’s Mann Bradley Hughes Briffa and Jones. Briffa was a Chapter 6 lead author and so was Ammann’s boss at UCAR.

  • Chris Colose // March 11, 2008 at 9:04 pm

    Fred, using decent numbers, say

    5.35 * ln(380/280) = 1.6 W/m2
    5.35 ln (560/380)= 2.07 W/m2

    This means we still have a bit more from the rest of the way to 560 than we had so far, but you also need to consider what aerosols have done, as well as “in the pipeline” warming from thermal intertia, and non-linear effect of feedbacks. We have obviously already included the logarithmic relationship, and under busness-as-usual scenarios out to 2100 it may be possible to even triple pre-industrial CO2. We’re already talking over 3 degrees, which is rather significant. At no point will further CO2 have no effect, so I really do not know what your point is.

  • cce // March 11, 2008 at 9:08 pm

    It’s worth repeating, once again, that the Hockey Stick emerges if you just averaged the proxies together, or if you remove the BCPs entirely. The BCPs are included because they strengthen the verification of the early part of the reconstruction. And the reason that the NRC report doesn’t quantify the likelihood of modern temps vs historical temps (thus relying on statements of “high confidence,” “less confidence” and “plausible”) is because it’s an expert opinion or judgment call.

  • Lazar // March 11, 2008 at 9:14 pm

    If I were to use Steve’s code to generate red noise series for each (individual) proxy in the 1400AD step, form ‘hockey stick’ PC summaries where appropriate, and substitute these for Wahl & Amman 06 proxy series and see what comes out; what the shapes are, what the scores are, and then do the run 100 times, benchmark RE at 99%, and compare with MBH98… would that be a fair and relevant evaluation? Will I be doing anything obviously stupid? Anyone, yes, no?
    If the results show MBH98 is significant, does it refute the possibility that MBH98 resulted from a decentered PCA pulling ‘hockey sticks’ from noise?
    Dependency on bristlecones, and the potential for decentered PCA to give low mean values of the 1400-1901AD period are separate issues that have, I think, been succesfully refuted by Wahl & Amman 06.

  • cce // March 11, 2008 at 9:28 pm

    It was not Gerald North who referred to 2:1 odds (~”Less Confidence” in the NRC report, ~”Likely” in the IPCC language), it was Peter Bloomfield, a statistician on the panel. John Wallace, an atmospheric physicist, also said that it was “likely.” They didn’t want to get into assigning numbers because they are arbitrary. The IPCC assigns numbers because policymakers demand them. Otherwsie, people sit around debating ambiguous language like we are doing now.

  • per // March 11, 2008 at 9:31 pm

    dear dhogaza
    thanks for your interesting post. It is a year or so since last I looked at the NRC report, so I think my recall is pretty good.
    “The uncertainties associated with reconstructing hemispheric mean or global mean temperatures from these data increase substantially backward in time through this period and are not yet fully quantified.”
    let’s go through this step by step, because you need a helping hand. If I ask you currently if we can quantify the uncertainty in reconstructions, then – according to the NRC report- the answer is “not yet fully”. You may have difficulty parsing this; so let me make it simple for you. NO.

    If we can’t “yet”, “fully” quantify something, it’s unquantifiable.

    I am also amused by your suggestion that the NRC supported Mann’s conclusions by saying that they are “plausible”. Science is about rigorously testing hypotheses, colloquially referred to as proof. Plausible is a word you use to describe the latest James Bond film or episode of Baywatch.

    Since you don’t seem so good at numbers, “more likely than not” includes ratios like 50.001: 49.999.

    All of which is a long, long way from the entirely spurious 95% confidence limits in MBH. Just ask yourself the simple question; if the NRC believed in MBH’s 95% confidence limits, why didn’t they say so ? why are they saying not yet, fully quantified ?

  • per // March 11, 2008 at 9:37 pm

    David Holland nails it with Gerry North’s comment. It is very clear, it is sworn testimony, and North was the chairman of the NRC panel.

    Dhogaza, i invite you to read what North said, and consider it carefully.

  • Paul G // March 11, 2008 at 9:47 pm

    = dhogaza said: =

    ==”they state that they think Mann’s conclusions are “plausible”. Which outside the report, members have stated was meant to mean “more likely than not”.”==

    “Probable” means “more likely than not”. Plausible is much weaker, inside the report, or outside of it.

  • dhogaza // March 11, 2008 at 9:57 pm

    And as for “pausible” meaning more likely than not, look it up. My dictionary says of statements it means “specious, seeming reasonable or probable” and of persons “persuasive but deceptive” .

    Members *explained* what they meant by “plausible”. I’m not depending on a dictionary definition, I’m depending on what THEY’VE said they mean.

    I’m certain you know this. I’m certain you don’t care.

  • dhogaza // March 11, 2008 at 10:02 pm

    and of persons “persuasive but deceptive”

    Nice. A claim that the report accused Mann of deceptiveness. Just one more slander.

  • Hank Roberts // March 11, 2008 at 10:02 pm

    >plausible … dictionary

    Citatin, please? What’s your dictionary?

    It appears “plausible” is used one way in academic research and the opposite way in rhetoric and debate. Curious.

    A few selections from the online American Heritage® Dictionary of the English Language: Fourth Edition. 2000.

    5) recension.
    …1. A critical revision of a text incorporating the most plausible elements found in varying sources. 2. A text so revised. Latin recnsio, recnsion-, a reviewing,…

    6) implausible.
    …Difficult to believe; not plausible. im·plausi·bili·ty -NOUN im·plausi·bly …

    8) plausive.
    …1. Showing or expressing praise or approbation; applauding. 2. Obsolete Plausible. From Latin plaudere, plaus-, to applaud….

    9) convincing.
    …1. Serving to convince: a convincing argument; a convincing manner. 2. Believable; plausible: a convincing story. See synonyms at valid. con·vincing·ly

    10) believable.
    …Capable of eliciting belief or trust. See synonyms at plausible. be·lieva·bili·ty -NOUN be·lieva·bly -ADVERB…

    ——————-
    Flammable! Inflammable!

  • Hank Roberts // March 11, 2008 at 10:05 pm

    The earlier results:

    Search Results for “plausible”
    http://www.bartleby.com/61/

    1) sophism.
    …1. A plausible but fallacious argument. 2. Deceptive or fallacious argumentation. Middle English sophime, sophisme, from Old French sophime, from Latin sophisma,…

    2) plausible.
    …1. Seemingly or apparently valid, likely, or acceptable; credible: a plausible excuse. 2. Giving a deceptive impression of truth or reliability. 3. Disingenuously…

    3) credible.
    …1. Capable of being believed; plausible. See synonyms at plausible. 2. Worthy of confidence; reliable. Middle English, from Latin crdibilis, from crdere, to believe….

    4) sophistry.
    …Inflected forms: pl. soph·is·tries1. Plausible but fallacious argumentation. 2. A plausible but misleading or fallacious argument….
    ——————————–

    Would anyone care to illustrate sophistry?

  • mikep // March 11, 2008 at 10:07 pm

    The work on stock prices – actually high-tech stocks can be found at

    http://www.climateaudit.org/?p=370

  • mikep // March 11, 2008 at 10:36 pm

    CCE you say that
    ” the Hockey Stick emerges if you just averaged the proxies together, or if you remove the BCPs entirely.”
    M&M have done the clacualtions – see their 2005 E&E article and the answer they get is
    “If de-centered PC calculation is carried out (as in MBH98), then MM-type results
    still occur regardless of the presence or absence of the PC4 if the bristlecone pine
    sites are excluded, while MBH-type results occur if bristlecone pine sites (and PC4)
    are included. Mann’s FTP site [Mann, 2002–2004] actually contains a sensitivity
    study on the effect of excluding 20 bristlecone pine sites5 in which this adverse
    finding was discovered”

    No one to my knowledge has ever shown this to be wrong.

  • David Holland // March 11, 2008 at 10:54 pm

    Hank

    Pocket Oxford, Sixth Edition, 1982.

    But what really matters is what Gerry North said the NRC meant by it. The IPCC say that second half of last century was 66 to 90% likely to be exceptional. The NRC have the honesty to say they that believe it is but that the data does not allows a numerical probability to be calculated.

    If you take into account the vested interest of the lead authors and review editors of AR4 Ch 6 it was like asking turkeys to vote for Christmas to expect them to admit the conclusions that they were a part of in 2001 were wrong.

  • cce // March 11, 2008 at 11:01 pm

    Here are the relevent sections from Jerry North’s followup questions of his testimony:
    http://frwebgate.access.gpo.gov/cgi-bin/getdoc.cgi?dbname=109_house_hearings&docid=f:31362.wais
    http://frwebgate.access.gpo.gov/cgi-bin/getdoc.cgi?dbname=109_house_hearings&docid=f:31362.pdf

    This is the definition of plausible used by the NRC Panel:
    “Our working definition of ‘plausible’ was that the assertion is reasonable, or in other words there is not a convincing argument to refute the assertion. We used this term to describe our assessment of the statement that ‘the last few decades of the 20th century were warmer than any comparable period over the last millennium’ because none of the available evidence to date contradicts this assertion. In our view it is not currently possible to perform a quantitative evaluation of recent warmth relative to the past 1,000 years that includes all of the inherent uncertainties associated with reconstructing surface temperatures from proxy data. This precludes stronger statements of confidence, but it does not mean that the assertion is false. In fact, all of the large-scale surface temperature reconstructions that we examined support the assertion that global-mean temperatures during the last few decades of the 20th century were unprecedented over at least the past 1,000 years, and a larger fraction of geographically diverse proxy records experienced exceptional warmth during
    the late 20th century than during any other extended period from 900 A.D. onward.”

    North on Wahl and Amman and von Storch:
    “My personal impression of these two papers is that the quote cited above (From the report: ‘In practice, this method, though not recommended, does not appear to unduly influence reconstructions of
    hemispheric mean temperature’) still applies, that is, none of the statistical criticisms that have been raised by various authors unduly influence the shape of the final reconstruction. This is attested to by the fact that reconstructions performed without using principal components yield similar results.”

    North on Strip Bark:
    “In summary, it appears that there is a carbon dioxide fertilization effect in some trees, but not in all the places where the samples used in the Mann et al studies were taken. Also note that this section of the report discusses the calibration of tree-ring records since atmospheric carbon dioxide levels started to increase around 150 years ago. Hence, in context, what the clause “strip-bark samples should be avoided for temperature reconstructions” was intended to convey is that strip-bark samples from the mid-19th century to the present are very difficult to calibrate against instrumental records of temperature, and the easiest solution is therefore not to use them. However, strip-bark data are considered suspect only after the modern increase in atmospheric carbon dioxide concentrations. This is why other studies that rely on strip-bark pine records only use them to infer past temperatures prior to 1850 (e.g., Biondi et al. 1999).”

    (Mann attempted to correct for cO2 fertilization)

    North on the Hockey Stick error bars and whay the do and do not mean:
    “As noted in our report, these error bars provide an indication of how well the reconstructed temperatures match observations during the ‘calibration period,’ but they do not represent all of uncertainties inherent in reconstructing surface temperature from proxy data. The actual uncertainties in the reconstruction would be somewhat larger, and difficult to quantify.”

    North on the Wegman Report:
    “Dr. Wegman’s criticisms of the statistical methodology in the papers by Mann et al were consistent with our findings. Our committee did not consider any social network analyses and we did not have access to Dr. Wegman’s report during our deliberations so we did not have an opportunity to discuss his conclusions. Personally, I was not impressed by the social network analysis in the Wegman report, nor did I agree with most of the report’s conclusions on this subject. As I stated in my testimony, one might erroneously conclude, based on a social network analysis analogous to the one performed on Dr. Mann, that a very active and charismatic scientist is somehow guilty of conspiring or being inside a closed community or ‘mutual admiration society’. I would expect that a social network analysis of Enrico Fermi or any of the other scientists involved with the development of modern physics would yield a similar pattern of connections, yet there is no reason to believe that theoretical physics has suffered from being a tight-knit community. Moreover, as far as I can tell the only data that went into Dr. Wegman’s analysis was a list of individuals that Dr. Mann has co-authored papers with. It is difficult to see how this data has any bearing on the peer-review process, the need to include statisticians on every team that engages in climate research (which in my view is a particularly unrealistic and unnecessary recommendation), or any of the other findings and recommendations in Dr. Wegman’s report. I was also somewhat taken aback by the tone of theWegman Report, which seems overly accusatory towards Dr. Mann and his colleagues, rather than being a neutral, impartial assessment of the techniques used in his research. In my opinion, while the techniques used in the original Mann et al papers may have been slightly flawed, the work was the first of its kind and deserves considerable credit for moving the field of paleoclimate research forward. It is also important to note that the main conclusions of the Mann et al studies have been supported by subsequent research. Finally, while our committee would agree with Dr. Wegman that access to research data could and should be improved, as discussed on page 23 of the prepublication version of our report, we also acknowledge the complicated nature of such mandates, especially in areas such as computer code where intellectual property rights need to be considered.”

    Note that they agreed that the statistical methods of the Hockey Stick could have been better, but what you will not find is a statement from North or anyone else on the panel saying that any of these flaws were significant or that the Hockey Stick was “bad science.”

  • Lazar // March 11, 2008 at 11:05 pm

    MrPete,

    I’ve learned enough from watching these arguments to know that it is not as simple as reading the “surface” of a paper like this and deciding whether the claims made are correct. The devil’s in the nit-picky details. For example, apparently WA isn’t using the same data — correlation 0.7 with MBH data.

    if I’m understanding correctly what Steve M said in the CA thread linked above

    You might have been pretty tired at the time! The quote refers to a paper by Bo Li (Casper Amman is a coauthor), which uses data purportedly from MBH99, that is proxy series which stretch back to 1000AD. (See comments #12, #15, #21, #22 and #29 again).

    The data in Wahl & Amman 2006 is from MBH98, the same data downloaded from the same ftp site used by Steve McIntyre.
    Steve has commented here on WA06;

    The order of the proxies in Ammann is a little different from MBH and Ammann provides no index for the proxies. In the AD1400 roster, the Stahle/SWM PC1 usually in the 16th spot occurs in the 22nd spot. I don’t plan to examine other steps. A first small difference – rounding before normalization of proxies. As far as I can tell, Mann’s practice is to normalize without rounding. Ammann sometimes rounds to the first decimal place before rounding. The differences can be as much as 0.15 Ã?Æ’ (e.g. below for the Tasmania temperature reconstruction. I don’t suppose that it matters much, but, if he’s trying to replicate, it would be easier to do little things the same. Otherwise, our proxy data set in our MBH98 replication is virtually identical to Ammann.

    I’m progressing nicely with the process of parsing Ammann. The correlation between our AD1400 emulations of the MBH98 reconstructed PC1 is 0.9993201! See illustration below. The emulations are virtually identical up to scaling/centering. W-A do not use archived MBH temperature PCs, but re-calculate them; this appears to account for scaling differences identified so far (and they should get washed out).

    The correlation between our coefficents and theirs is 0.9992717. There’s a difference in scaling – I’m not sure why right now, but I can’t see why it would matter (since it merely affects downstream coefficients).

    All in all, this is looking pretty good so far. One could scarcely have contemplated that WA should replicate our results so accurately. The irony is delicious.

    … so, back to the question!

    More than anything else, in the proxy series used to create the infamous hockey stick graph, it is dependent on the presence of stripbark BCP data from a tiny region of the world. The rest of the proxies are so weak as to have little impact on the results.

    … how does this explain the green line in figure 2 of WA06 (which HB reproduces in the post body), showing a hockey stick result when bristlecone pine series are excluded :-)

  • per // March 11, 2008 at 11:16 pm

    “Dependency on bristlecones, and the potential for decentered PCA to give low mean values of the 1400-1901AD period are separate issues that have, I think, been succesfully refuted by Wahl & Amman 06.”
    hmm. Presumably you are referring to Figure 4 of W&A, where the exclusion of bristlecones (the purple or pink lines) raise the temperature in the 1400s by 0.5 Celsius ? I am just struggling with how that is a refutation ?