Statistics is to engineers, like lamp-posts are to drunks - they are used more for support than for illumination.This got me thinking - where is statistics used for illumination? Before we proceed it would be worthwhile checking the definition of statistics given here.
Census, I feel, is a practical branch of statistics mainly because its sample is the entire population. Analyzing the census data could illuminate what already exists, but when one tries to use this data for prediction, it is no longer practical. Similar is the case with polls. How scientific are scientifically conducted polls?
Michael Schemer, differentiating between anecdotal thinking (that leads to superstition) and science in his column Skeptic in The Scientific American May 05 issue says:
What we havehere is a signal-to-noise problem. Humans evolved brains that are pattern-recognition machines, adept at detecting signals that enhance or threaten survival amid a very noisy world. This capability is association learning - associating the causal connections between A and B - as when our ancestors associated the seasons with the migration of game animals.
Unfortunately, the system has flaws. Superstitions are false associations - A appears connected to B, but it is not (the baseball player who doesn't shave and hits a home run). Las Vegas was built on associated learning.
..... - if you scan enough noise, you will eventually find a signal, whether it is there or not.
We evolved as a social primate species whose language ability facilitated the exchange of such association anecdotes. The problem is that although true pattern recognition helps us survive, false pattern recognition does not get us killed, so the overall phenomenon has endured the winnowing process of natural selection. .... Anecdotal thinking comes naturally; science requires training.
The key from the piece above is - if you scan enough noise, you will eventually find a signal, whether it is there or not. This is similar to the fact that any set of data can be fitted to a polynomial equation - I vaguely remember reading that the figure of an elephant can be fitted with a polynomial equation. But what value does this equation have in predicting the next data point?
Statistics is a form of pattern recognition, or rather a method for pattern recognition. If used without the right set of data, it becomes nothing more than anecdotal thinking. Thus statistics are a rigorous method in science is questionable.
Statistics has played, and certainly will play, a huge part in science and engineering - in my area of expertise, thermodynamics, Statistical Thermodynamics is important to answer question Why? rather than How? answered by Classical Thermodynamics. I understand that regression analysis, parameter estimation etc are widely used in science and engineering. But the most important footnote when using any of these methods is their range of applicability - which again depends on the data used.
What I am trying to convey through this muddled post is that unless used within a clear predefined set of constraints or bounds, the use of statistics is equivalent to anecdotal thinking and is divorced from science.