If you guys held the USGA to the same standards of proof as you hold the USGA's detractors, then maybe we wouldnt be in the mess we are in now.
Normally, I would like to see a correlation of at least .5 between two variables on an interval scale before I would consider the correlation meaningful (I'm purposely avoiding using the word signficant). If the idea is that distance is truly the lynch pin for performance on Tour, in my opinion both of the correlations you show are low. However, there is no way to test for any significant differences between your two r values or two r squared values.
JAL,
I just can't see how this is correct. No one I know has ever claimed that distance is the only factor which determines success. Rather, the claim is twofold:
1. Distance has become a much more important factor than it used to be.
2. Distance is has become too important, compared to the other factors.
The first claim is potentially verifiable from the statistics, and it looks to me that the statistics do indeed verify this. Distance is more closely correlated to money than it used to be.
The second claim is much more subjective and requires that we decide how much of a correlation is too much. I think truth of the first the first claim provides pretty good evidence of this second, but I may have a lower tolerance for the changes in the game than you.
In either case, to dismiss the correlation because it is not equal or greater than 0.5 is just too draconian. I just dont think there is anything magical about r >=0.5 from a statistics standpoint. If it was a magical threshhold, multiple variables would render any meaningful statistical analysis of multiple cause events meaningless. I just dont think this is the case.
While I think it's a bit of a random approach, my idea for a t-test for significant differences would be to take the ten longest drivers from 1980 and 2005 (ten being the random number - it could be anything we wanted), finding and listing the percent of the purse each player won in each tournament he played, and running an independent samples t-test between the two data sets. This would simultaneously circumvent the issues associated with differing numbers of tournaments played and inflation.
I think you overestimate the degree of variance in the tournaments issued I am only using stats from those who had enough rounds to qualify in the driving stats. Also, I'd be willing to bet the the top money guys actually started less tournaments. If so, this works for my point, not against it.
As for inflation, I dont think it matters much, if at all. Run the correlation with money rank instead of the dollar figures, and you will see that there was a much stronger correlation in 2005.
I had Excel run a ttest on both sets of data using all the drives and got extraordinary low values. Something like 2.34E-30.
_________________________________
David, besides JAL's comment, seems to me another point weakens the correlation. You are looking for trends for the entire tour. Well, leave out just three players -- Tiger, Mick and Vijay -- and I think your trend line might change dramatically. Without doing any calculations, I'm guessing your trend line would have around zero slope. If so, doesn't that mean that for around 98.5% of all players, the correlation is near zero?
Let me get this straight . . . we are trying to test for a correlation between distance and success, and you want to throw out the best three players on tour because they are too long and too successful? For what are we trying to correlate again?
If you want to throw some guys out, why not throw out Furyk, DiMarco, and Funk. Would that make any sense at all? What do you suppose would happen to the chart then?
If you think the top 3 just made too much money, then knock them down to Furyk's level and you get substantially the same results as posted. But elimating guys because you think they are too long and too successful just isnt going to wash.