Open Mind

World Wide Web of Science

April 20, 2008 · 27 Comments

The latest issue of Scientific American has an interesting article on the use of the internet for scientific research. The web is revolutionizing the way a lot of scientists do things by making it easier to communicate, access resources, acquire data online, and collaborate in ever larger and more productive ways. It’s also changing the fundamental way in which scientists communicate with the public, in some cases making the very thought processes behind scientific research accessible to a wide audience.


One of the advantages of being a mathematician specializing in data analysis is that scientists in most fields need my skills; one of the disadvantages is that I lack the skills to acquire for myself the very data which is the fuel for my work. I depend on other scientists (both professionals and amateurs!) to do the field work and the theorizing, but they depend on me for sophisticated analysis and experienced interpretation when the data have been acquired. It’s a powerful symbiosis, enabling each participant to make the most effective contribution.

Just by making communication (especially of data) so much easier, the internet has made such collaboration faster and more effective. I’ve collaborated with neuroscientists in Bangalore, India, to analyze sodium-ion channel conductivity data, as a result of which I’m a co-author of a paper in the European Journal of Neuroscience. But I don’t really know what sodium-ion channels are, and neuroscience is a topic about which my knowledge wouldn’t fill the space available on a postage stamp. Only the internet makes transmitting data, results, and discussion fast enough and inexpensive enough that such long-distance teamwork is practical. One disadvantage is that I’ve never seen the faces of my collaborators on that project.

The web helps me in other ways too. When I need a scientific paper my best chance of finding it is online. Now that I work in industry rather than academia my access to scientific libraries is limited to a very narrow field. But outside that field I have an excellent chance of finding a copy of any paper I want on the web. Even before works are published, they are often available (as submitted manuscripts or preprints) in various online archives; the Arxiv system is a revolutionary blessing, not just for me personally but for the entire scientific community. I have the highest regard for those fields, and those researchers, for whom open access to published research is a priority.

It’s not just collaboration and publications that can no longer do without the internet. When I first got interested in climate science I was pretty ignorant of the subject. I have a strong background in physics but mainly about the more esoteric theoretical stuff, so I had a lot to learn; the internet has made it so much easier. Also, contrary to what some readers may think I tend to carry skepticism to an extreme (my wife teases me about it). I remain at least a little skeptical of scientific claims unless I’ve seen the data for myself. And I know just how easy it is for even skilled researchers to err in data analysis so I always want to do it for myself; only when I personally have “run the numbers” do I have complete confidence in the soundness of scientific claims. That’s not to say that I consider the findings of other researchers unreliable, quite the contrary; I just have a hard time feeling comfortable accepting conclusions until I’ve seen it with my own eyes.

The internet has made astounding amounts of data available at the touch of a few buttons. Want to know the temperature history of Boston? Go to GISS and you can get data back to 1880, go to the GHCN version 1 and you can get it well before that. Does the Vostok ice core data excite you? It’s easy to find. Do you want to see for yourself how sea level has changed, or estimates of total solar irradiance, or historical sunspot counts, or any number of other sets of data? They’re there, they’re available, they’re usually pretty easy to find. I’m a data analysis junkie, and the climate data on the internet could keep me busy for a number of lifetimes. It’s only because of easy access to vast amounts of climate data, that I’ve become so strongly convinced that global warming is real, man-made, and one of the greatest threats to modern civilization.

If you’re reading this, then you’re aware of the existence of blogs and other community-based internet forums. This enables easy and rapid communication among all sorts of people, including a public view inside the minds of working scientists. I’ve benefitted immensely from the intimate and up-to-date reports on new discoveries in climate science to be found in the blog posts on RealClimate. I’ve also learned a lot from the commentary there, often from interested lay readers. And I’ve learned quite a bit, and been directed to many valuable resources of which I wasn’t aware, by readers right here on this blog.

There’s a downside to the internet explosion as well. As powerful a medium as it is for the rapid and widespread communication of scientific information, it’s equally potent for the communication of misinformation. Nearly everybody I’ve met who thinks global warming is a fraud, was convinced by information obtained online. I can sometimes tell, from the (mis)information supplied and the arguments chosen, exactly where they got their faulty opinions. I don’t care to argue in this thread about the validity of global warming; if you yourself are convinced it’s a fraud, then you can consider advocates of my own belief to be equally guilty of using the web to spread misinformation.

In part, web access to data and tools makes misinformation not just more easily communicated, but that much more possible. It’s just as easy for someone else to download the Central England Temperature data as it is for me, and there are enough analysis tools available online to enable the web-savvy to analyze the dickens out of it. This has spawned an army of armchair quarterbacks, working feverishly to draw conclusions from climate data. It’s especially risky business from my perspective, because data analysis can be a very intricate endeavor and over my career I’ve learned how strewn is the battlefield with land mines, even for those who are trained in it. It’s extremely difficult for amateurs to undertake without missteps, and in fact it just might be the single most common sources of erroneous results by professionals in peer-reviewed publications. These days anyone can run a bunch of numbers through a black box to get other numbers. Very few indeed are good at knowing what they really mean.

The blogosphere also generates a lot of heat in the form of arguments over the truth or falsehood of climate science. It can get downright nasty at times. Passions are inflamed, and the anonymity of blog commentary (even those who comment under their real names don’t exchange glances with their fellow community members) lowers the threshold of civility in conversation. This, in my opinion, necessitates moderating comments on a blog with any significant readership.

But that conflicts with the unrestricted freedom of expression which I passionately regard as the foundation of a free society. I have no problem deleting long strings of grotesque obscenity. I do have a problem deleting (and avoiding!) insulting commentary. I’m not sure where the line properly falls between what may have “redeeming social value” in spite of offending some readers, and I don’t believe I should be the sole arbiter of what might have some redeeming value.

Still, I envy the blogs that are more strongly moderated. Even those who agree with the regulars at ClimateAudit can’t deny that the commentary there is less than perfectly civil. RealClimate is so much easier to take because there’s such a high standard imposed by the moderators; it’s not perfectly civil but it’s much more so than ClimateAudit, or this blog for that matter.

But protecting readers’ sensitivities isn’t very high on my priority list. The real reason I so strongly envy the more civil discussions is that they’re so much more productive. When anger is aroused, the signal-to-noise ratio of the blog commentary drops. I believe that there’s nothing wrong with calling a scoundrel a scoundrel. But I also believe that increasing the information content of what’s said is one of the merits in restraint.

Perhaps the best solution is not to let the scoundrel through the door in the first place. This is reminicent of censorship, but it’s not really! A blog is the “intellectual property” of the blogger, and although I defend the right of scoundrels to express themselves publicly I’m certainly under no moral obligation to let ‘em do it in my house. But it’s always a tradeoff; even a scoundrel may have something worth saying, and even idiocy can lead to enlightenment if only through its clear refutation. I have yet to figure out where the lines fall or what the best practice is. I probably never will, I’ll have to be satisfied if I keep getting closer without ever actually getting “there.”

But of this much I’m confident: the internet has made science, communication, and my work so much richer in possibilities and opportunities, that taken all in all it’s one of our most prized creations. As a younger man I concluded (and I still believe) that the two most potent creations of the human race were the invention of writing, and the spread of the moveable-type printing press. I would now add a third “wonder of the world” to that list: the computerization of the masses. Play ball!

Categories: Global Warming

27 responses so far ↓

  • Hank Roberts // April 21, 2008 at 1:30 am

    Wish list: parallel threads on new and interesting subjects, one for general public use, the other one for scientists by invitation with the shared agreement to further the conversation not derail it.

    That’s doable now with current technology.

    Regrettably the Usenet’s answer to this same problem (the threaded newsreader and killfile) still isn’t doable using current WWW technology

  • TCO // April 21, 2008 at 2:34 am

    1. Please give me your perspective on Hotelling’s seminal work on what it means to be a statistician, where he speaks for instance of the importance of even theoretical types having some connection to real problems, data and practioners. (Read it if nesc.)

    2. The debate on moderation versus freedom is a tricky one. especially because the community tends to make the blog or forum.

    3. I agree with you on the trickiness of data analysis and the many ways someone can go wrong. That’s why I think armchair types should be ignored (including SM) until they have put their results into real papers and submitted them to peer review. There is a lot of laziness when people don’t think they are trying to get through a hurdle. Even on issues not in controversy. At least sending stuff to journals like Climate of the Past Discussions (where even submitted and rejected papers are in a sense published and archived) or making real working papers (and circulating those) would be a good solution. The cyrrent skeptic blogs are a real waste for really spreading analysis. They’re so confounding with education/slight-fights/politics.

    4. I’m fascinated with your background in math and data analysis. I think sometimes you hold back a bit with both Mann and SM and would really appreciate it instead if you engaged more. I think there are places where they are each wrong, but where the things they argue about are fascinating. Would think that this would excite curiosity in you as well. And that you would go after things. I welcome all to do that who can push things forward. Have a lot of respect in particular for Huybers, Von Storch, Zorita and Burger.

    5. What is your general impression of economist type time series studiers? As scientists/mathematicians?

    6. I was fascinated by the Jolliffe comment. I was not so surprised or impressed that he said he did not really uinderstand the Mann algorithm or what issues it might be prone to. I would expect that of any good scientist. Especially considering how complex that thing is. To me this is rather subtley telling you that Mann may have had a pretty akward setup. And would think you would want to set it straight. If only for the sake of promoting proper practices (even if the impact on AGW is minor). For the sake of good math/stats.

  • John Mashey // April 21, 2008 at 3:37 am

    http://johnquiggin.com/ has had some recent good discussions on moderation, worth reading, and that blog is a pleasure as a result.

  • Paul Middents // April 21, 2008 at 4:46 am

    Please keep doing what you are doing. The trolls are probably beyond hope but some have inspired you to some really good analysis. I would hope that Svaalgard starts pushing in some more productive directions than the nits he has so far chosen to pick.

  • Nick Barnes // April 21, 2008 at 8:28 am

    Hank Roberts:

    Regrettably the Usenet’s answer to this same problem (the threaded newsreader and killfile) still isn’t doable using current WWW technology

    It’s a SMOP. If only I had the time.

  • Barton Paul Levenson // April 21, 2008 at 12:26 pm

    The internet would be a much more useful source of scientific data if most of the new papers in peer-reviewed science journals weren’t locked behind a paywall. I can’t pay $32.00 or whatever for each article I want to read. And they are asking that much for articles that go back 30 and 40 years. I think this trend is a very, very bad one for science. It’s restricting information to the professionals, which just makes them look like a closed circle immune to outside criticism.

  • climatewonk // April 21, 2008 at 2:27 pm

    Hank, I agree. I want to see posts where only invited scientists discuss and analyse topics. The internet is great for providing us ordinary folks with a place to read informed opinion and find evidence, but I’m tired of cranks and deniers taking over threads and muddying things. There should be threads for us ordinary folks to discuss what this all means to us, but I need good information and good data and analysis and when the cranks and denialists move into an otherise-decent science thread, that gets lost.

  • Gavin's Pussycat // April 21, 2008 at 2:35 pm

    I like RC’s standard of moderation.

    It’s not about reader sensitivities, but about their bothering to wade through three feet of muck. In other words, S/N, precisely.

    We’ve been there before :-)

  • Rainman // April 21, 2008 at 7:07 pm

    Being a ‘lukewarmer’, I am very interested in the nuts and bolts behind climate science, wherever I can find the informatiuon.

    I frequently read your site and learn something in every post. Thanks for the work.

    When people go ‘flame on’ it takes away from the informational discussion, but as you said, sometimes there is a tidbit of interesting info that comes out of it… as hard as it is to find that tidbit.

    In case you haven’t seen the term, ‘lukewarmer’ was coined (perhaps not originally) at Watts’ site. I know the climate is warming, I’m just not convinced it’s all about CO2. And so here I am reading the information provided by those ‘for’ CO2 and other sites ‘against’…

    The ‘final’ truth is out there. Just hope we all agree what it is before it is too late. (Myself included)

  • David B. Benson // April 21, 2008 at 11:28 pm

    Despite the difficulties with ‘doing science on the web’, it takes us back to the early days of science, when scientific ressults written in London were published directly and read by scientists in Paris only a week later.

    Much better than waiting months for peer review, as I currently am. *sigh*

    Rainman // April 21, 2008 at 7:07 pm — I encourage to take the time to read historian Weart’s “The Discovery of Global Warming”:

    http://www.aip.org/history/climate/index.html

  • frankbi // April 22, 2008 at 4:49 am

    Rainman, another resource I’d recommend is the list of arguments on Skeptical Science.

  • Christopher // April 22, 2008 at 5:45 pm

    “3. I agree with you on the trickiness of data analysis and the many ways someone can go wrong. That’s why I think armchair types should be ignored (including SM) until they have put their results into real papers and submitted them to peer review. There is a lot of laziness when people don’t think they are trying to get through a hurdle.”

    Really disagree with this. As someone who has published I can relate that waiting for your article to cycle through the queue and actually get disseminated can be painfully slow. It’s a mismatch to the suddenness of the times we live in and how quickly information gets cycled based on the internet, aot. This has nothing to do with laziness but it is a turnoff. I’m trying to publish because, at the beginning of my career, I need numbers. If that were not the case then there are other avenues to get the word out. And using these other avenues is not lazy at all. Blogs allow for a discussion-like format. I was tickled to read the back and forth between Climate Audit and Tamino wrt PCAs. That’s good stuff. The notion of cycling such a discussion thru peer reviewed channels? Yuck! It takes way too long and peer-review is, imho, not a failsafe. Science is an iterative process and slowing it down too much, which peer-review has the potential to do, is not good thing. There are some journals who have got wind of this and offer rapid communications, online publishing and the like. I’m hopeful editors will actually get paid for their work –in my field they do not. Also, writing useful blog entries is nontrivial. Laziness is forming an opinion without having tried to gather information yourself. Or shooting for the hip because you heard some factoid on rightwingnutcase.com or leftwingwacko.com…

    I think this blog is quite valuable and that the level of civility is par for the course. The only blogs I know of where civility is the standard (like Freakonomics) have a paid staffer looking over things. I would imagine doing a blog as a drain from more productive endeavors. You seem to find that compromise quite well from my perspective.

  • dhogaza // April 22, 2008 at 7:39 pm

    Blogs allow for a discussion-like format.

    With no quality control, frequently driven by non-scientists with obvious political agendas, and an anti-science bias that rises to the level of claiming scientific misconduct and fraud on the part of professionals at every opportunity.

    Yeah, this is going to overturn the traditional process and lead to improved quality and a faster pace of scientific progress, alright.

    Pardon my cynicism, but if five minutes at CA don’t make you puke, you’re not paying sufficient attention.

  • Hank Roberts // April 22, 2008 at 8:16 pm

    > puke
    And if they do, you’ve GOT to learn how to ignore stuff. Life is too short to pay attention to everything.

  • TCO // April 22, 2008 at 11:12 pm

    Christopher: In SM’s blog postings axes are not well labelled. Ideas are not well layed out (there are segues). Even things like clear footnotes are not used. You may think this is pedantic. But it’s not. The IDEAS themselves are tricky. So we need to do a good job of keeping obvious things clear. Then we can engage on the non-obvious. there is nothing stopping someone like SM from writing papers and circulating them in pre-prints. And his ideas would be more formed if he did.

    It’s like this one time I took a seminar class that was truly a discussion around a round table. Every other week, we had to write a paper. The discussions on those weeks were much better than the weeks where we had not written a paper. It’s common sense. The result of tough analysis and prep. Clear writing DRIVES clear thinking. And clear thinking is NOT TRIVIAL. We need to encourage it.

    McK complains of review holdups. But he does not show what he submitted. And I have seen his first drafts for presentations at meetings (and they were god awful. no respect for the auidience.) Heck, if McK submits to CotPD, even his initial submittals are “published. He doesn’t submit because he’s lazy and cowardly. And I would say it to his face. Do you doubt me?

  • TCO // April 22, 2008 at 11:41 pm

    I mean McI (fraudian slip).

  • Christopher // April 23, 2008 at 3:01 am

    >you’ve GOT to learn how to ignore stuff

    That’s the key. I’ve learned things at numerous blogs, Open Mind and CA included. You just need to filter information based on quality, which is a central skill for navigating the internets anyway…

  • EliRabett // April 25, 2008 at 3:36 am

    One of the things that drives me nuts about SM and AW is that my training is to think first about the physics (or chemistry or biology) using basic principles, and then to fill in the math/statistics as detail. They go at it backwards which means they never get to the question of “does this make sense”

  • TCO // April 25, 2008 at 5:27 pm

    One thing that annoys me from SM is that he only shows tests that find fault:

    There will be some paper that he wants to examine. SM will go and subject various parts of the work to scrutiny. But in his blog, he only reports the tests where problems are found. In cases where (and there must be some!) where his testing makes a paper’s conclusions MORE ROBUST, he doesn’t post it.

    ———————————-

    Yeah, I agree that SM drives me nuts with the backwards focus.

    And you have to watch both SM and AW, because they are looking so hard for flaws, that at times they don’t consider alternative explanations. AW is the worst. He was talking about station record shapes and had found one that was a nice site instrumentation (no nearby problems for the detectors), but still showed an uptrending temp. He said, well this looks like a classic UHI effect than. It’s as if he wasn’t even COMPREHENDING the POSSIBILITY of the sensor recording a CHANGE IN THE CLIMATE.

  • steven mosher // April 25, 2008 at 7:49 pm

    1. TCO, are we going to have this debate again. It’s a seminar format. A dialogue not a dissertation. You want papers, buy a blunt wrap
    and chill.

    2. I agree with ELI, fancy that, doing statistics in advance of physical understanding is often futile and misleading.
    ( see BCPs)

    3. TCO Focusing on flaws. I’m used to a different world. In my world test involved challenging folks to find a flaw. red team, blue team. So St.Macs approach seems right to me.
    Peer review is pansy review. papers in science should be subject to inquisition. hehe.

  • TCO // April 25, 2008 at 8:43 pm

    SM: The whole thing with SM’s “journal” reminds me of a friend of mine riding around San Diego. He had this whole theory for crossings: “I’m a car”, “I’m a pedestrian”, he would cry out as he followed whichever gave him the right of way.

    It’s fine if SM wants to have a bullphone/seminar/lab journal (which he controls and edits without notes, etc.). But scientists should not have to bother with anything he writes until he spends the time to write it up properly (so that it is easily read, so that it’s cogent, so that it can be pinned down, etc.) And I think it’s a hoot when the whole cheering choir section thinks that SM has done some grand things and that the field needs to pay attention to him.

    You should watch him, SM. He’s been dishonest in the past. Nigel Persuad sock puppet. Tucson airport photo. etc. Just watch out, big guy.

  • TCO // April 25, 2008 at 8:45 pm

    SM: I’m fine with him taking apart things and seeing what makes them tick. With him finding flaws. With him exploring how algorithms interact with data. I got no problems with taking on any scientist and looking for flaws. Honest.

    What I have a problem with is half-assed work, with PR trumpeting.

  • TCO // April 25, 2008 at 9:01 pm

    I mean just look at CA’s having Watts as a moderator/thread starter. Steve is endorsing a moron there. Just because the guy is on his “side”, but we’ve seen with this guys solar silliness, that he’s not a scientist. Heck, when the Lohler paper came out, SM pulled his punches like crazy. There were all kinds of things messed up about it…and instead of calling him out….Steve just kept his mouth shut. That’s a “sided” behavior. He ripped Moberg to shreds! But with Lohler, instead of “ripping to shreds”, he made oblique, tit for taty argument statements like “if Lohler has mistakes, so does Moburg”. Well that’s bunk. He already definitively said Moburg was crap. So there’s no if/then required. Just call out Lohler as crap.

    The guy is shifty, mosh-pit. Watch out.

  • TCO // April 25, 2008 at 9:11 pm

    SM: I don’t fault him for looking for flaws. I fault him for only reporting the flaws. For not reporting the analyses that hold up. He is giving a biased view and playing PR games talking to the hoi polloi like jae or Watts or the rest of the nitwits.

  • TCO // April 26, 2008 at 5:18 pm

    Here is an interesting article, discussing use of statistical terms (”significance”) for their debate-moving connotation rather than with mathematical precision or limits. While Tufte’s target is popularist economists (who do a lot of damage when they over-puff insights in complicated fields like sociology or history), the issues are generalizable to much of intellectual discourse. And to writings from both sides in the AGW wars. I find the Mann c. 98-9 and McI c.now both often guilty of this. I think it’s something for us all to watch out for. Especially when blogs are used as substitutes for composed/reviewed/edited technical reports/articles.

    BTW, Tufte is famous for having eviscerated NASA for their use of PowerPoint which played a large role in causing the return to earth and crash of the foam-damaged shuttle.

    http://www.edwardtufte.com/bboard/q-and-a-fetch-msg?msg_id=0001Zl

  • TCO // April 26, 2008 at 5:20 pm

    (please let thru, Tammy). SM, I am drinking a bloody in your honor…

  • Tenney Naumer // May 23, 2008 at 4:50 am

    Dear Tamino,

    Your blog is one of the best written, especially when it comes to your explanations of the data. Thanks so much for taking the time. Your wonderful illustrations make all of us more productive.

Leave a Comment