Fundamental Principles of Statistics

design
measurement
principles
2017
2023
This brief note catalogs what I feel are some of the most important principles to guide statistical practice.
Author
Affiliation

Vanderbilt University
School of Medicine
Department of Biostatistics

Published

January 18, 2017

Modified

October 16, 2023

There are many principles involved in the theory and practice of statistics, but here are the ones that guide my practice the most.

  1. Use methods grounded in theory or extensive simulation
  2. Understand uncertainty, and realize that the most honest approach to inference is a Bayesian model that takes into account what you don’t know (e.g., Are variances equal? Is the distribution normal? Should an interaction term be in the model?)
  3. Design experiments to maximize information
  4. Understand the measurements you are analyzing and don’t hesitate to question how the underlying information was captured
  5. Be more interested in questions than in null hypotheses, and be more interested in estimation than in answering narrow questions
  6. Verify that the sample size will support the intended analyses, or pre-specify a simpler analysis for which the sample size is adequate; live within the confines of the information content of the data
  7. Rather than making dichotomous decisions about inclusion of variables or complexities (nonlinearities, interactions, unequal variances, etc.) in the model, which results in uncertainty intervals that are too narrow, put parameters in the model for aspects you don’t know. Bayesian models that put priors on complexities provide continuous solutions that automatically relax assumptions as the sample size grows. The resulting uncertainty intervals will not be falsely confident since model uncertainty is taken into account.
  8. Use all information in data during analysis
  9. Use discovery and estimation procedures not likely to claim that noise is signal
  10. Strive for optimal quantification of evidence about effects
  11. Give decision makers the inputs (other than the utility function) that optimize decisions
  12. Present information in ways that are intuitive, maximize information content, and are correctly perceived
  13. Give the client what she needs, not what she wants
  14. Teach the client to want what she needs

… the statistician must be instinctively and primarily a logician and a scientist in the broader sense, and only secondarily a user of the specialized statistical techniques.

In considering the refinements and modifications of the scientific method which particularly apply to the work of the statistician, the first point to be emphasized is that the statistician is always dealing with probabilities and degrees of uncertainty. He is, in effect, a Sherlock Holmes of figures, who must work mainly, or wholly, from circumstantial evidence.

Malcolm C Rorty: Statistics and the Scientific Method
JASA 26:1-10, 1931

The statistical method is more than an array of techniques. The statistical method is a mode of thought; it is sharpened thinking; it is power.

William Edwards Deming, International Statistical Institute Meeting, Sept 1953

Science is not about falsifying hypotheses. Science is about determining what component of a set of observations are reproducible, and then trying to determine the mechanism or rule to explain those observations… again, reasoning from data, not by starting with an axiom.

David J Glass


See this post for related thoughts.