Please, now you are spinning - it is not plural and if he used the middle initial it would be "fi-mi-lastname". He arbitrary used the first and middle initial and that is not the methods documented in the paper.
Actually
if you look at the PRALL list you provided it is quite clear THEY USD "PD JONES".
Click on any of the links within the Google Scholar search column and
you will see it explicitly there. It reproduces the search. Look at the top and you'll see: "PD Jones"
And I fail to see how it is arbitrary.
As for you "fi-mi-lastname", I will point out that since this is not a search constructed in which "field designators" need be explicitly stated (as you find in some database queries) there is no
exact limitation that "fi" can
only be first initial.
And clearly,
as your own reference shows, this is indeed how they approached it.
I am well aware Anderegg et al. did not do their work. They do not include the lists of papers that support their totals, they instead used numerical totals from Google Scholar search queries that are not reproducible and deviate from the methods explicitly outlined in their paper.
You make it sound like they merely plugged in "P Jones" and read the "hit count" in the upper right hand corner of the screen and called it a day!
Here, again, is what they explicitly state:
Anderegg said:
Between December 2008 and July 2009, we collected the number of
climate-relevant publications for all 1,372 researchers from Google Scholar
(search terms: [FONT=AdvOT118e7927+20][FONT=AdvOT118e7927+20][/FONT][/FONT]
author:[FONT=AdvOT118e7927+fb][FONT=AdvOT118e7927+fb]fi[/FONT][/FONT]
-lastname climate[FONT=AdvOT118e7927+20][FONT=AdvOT118e7927+20][/FONT][/FONT]
)
When I click on Prall's "Climate"+author
D-Jones search (HERE) I get a list that when I click through it at least for the first many pages I see almost exclusively Phil Jones climatology articles. So it seems that the technique is relatively robust.
Now of course a rational view of what they did would include an assessment of the fact (as you yourself have provided in your Prall link) that they used 3 search versions on Phil Jones. One was "all" "author : PD-Jones", one was the same search only in Physics, Astronomy and Planetary science, and one was the same with the term "climate" in the search.
As you are no doubt familiar with Venn diagrams I shouldn't need to explain that this can result in a simple process of determining using these three a relatively robust number of documents that fulfill the requirement of "climate relevant" document that are all attributable to Phil D. Jones of UEA.
The fact that they didn't explain this step by step to you is not indicative of the very real possibility that that is exactly what they did.
I honestly think you are trying to find a reason to knock this out by relying on the hopes that the researchers are simply stupid. Which is nowhere in evidence.
As I said, they clearly explain it to you when they state:
Anderegg said:
Between December 2008 and July 2009, we collected the number of
climate-relevant publications for all 1,372 researchers from Google Scholar
(search terms: [FONT=AdvOT118e7927+20][FONT=AdvOT118e7927+20]
Anderegg said:
author:[FONT=AdvOT118e7927+fb][FONT=AdvOT118e7927+fb]fi[/FONT][/FONT]-lastname climate[FONT=AdvOT118e7927+20][FONT=AdvOT118e7927+20][/FONT][/FONT])
[/FONT][/FONT]
It makes no mention of review. They mention "climate-relevant" because the search query included the word, "climate".
-sigh-
I don't know how to help you if you wish to find gross error. I will suggest you actually look on the lists provided by Prall
that you introduced.
But I will also caution you that
this is a statistical analsysis and the authors themselves clearly and repeatedly stated the potential shortcomings even going so far as to explicitly state some of the potential problems you mention.
In a statistical study the point being to find as "robust" an analysis as you can. This does not mean (nor is it
ever suggested in science) that a data sampling is "perfect". It simply cannot be.
However since
you have Prall's list you are free to click on every one of these and run the numbers yourself.
You can easily with just a few weeks worth of efforts
count every single hit from the search that Prall links to and show how far off from the value that the number is.
(Make sure to limit the publication dates to
only those before July 2009.)
Let's say you do the count and you find that it actually contains a few "false" hits. Let's say you find 3 hits that are not PD Jones of UEA but Paul Dingleberry Jones of Larry's Clown College and the article is about the climate of sadness clowns live in. Out of total of 724 you have found an error of 0.4% !!!!!
Now take into account you will have to repeat the exercise on all of the other authors and let's assume you find that a couple authors are
undercounted by 0.4%.
This is kind of how data collection works. In huge samples which are part of larger populations you have to accept that there are "errors".
It's kind of why the whole article is
FOUNDED ON STATISTICS. (What do you think the Mann-Whitney U Test is all about? It is a non-parametric hypothesis test to assess whether one sample contains larger values than another, note the p-values on these various tests!)
Thus falsifying their data and the bogus statistics they are based on.
So you found one erroneous "hit" (did it show up in the Prall list? You have to make sure that the same hit came up in the original authors' work for it to be a problem for them).
So now you have found...wait for it....0.14% error!
Are you expecting absolute 0.0...0% error? Perfect data?
Interesting. Because even the authors never claimed that. But I can understand. Like when debating creationists you realize they want "perfect" data but since that never happens (hence the need for statistics...like...well, Mann-Whitney U tests) I guess all of science is doomed.
I am well aware they do not provide direct links to it in the paper. As usual with these "studies" no one attempted to verify the data.
LOL. I can see you have never published in the sciences. In the days before the huge reach of the intarwebs I published stuff whose data was only in my lab notebooks. If you wanted to see the data you'd have to write me and ask me for it and I would provide it to you assuming I had the time. If an official investigation was ongoing I would have the data but you really should stick to the kind of publications you do for your critiques.
You like worthless statistics based on data that is not reproducible and filled with erroneous results?
If I relied on only one data point that would be a fair cop. But since I've already explained by background and the mass of information I base my acceptance of the hypothesis on I will traverse your "suggestion".