List of publications in statistics
Encyclopedia
ProbabilityProbabilityProbability is ordinarily used to describe an attitude of mind towards some proposition of whose truth we arenot certain. The proposition of interest is usually of the form "Will a specific event occur?" The attitude of mind is of the form "How certain are we that the event will occur?" The...
Théorie analytique des probabilités
- Author: Pierre-Simon LaplacePierre-Simon LaplacePierre-Simon, marquis de Laplace was a French mathematician and astronomer whose work was pivotal to the development of mathematical astronomy and statistics. He summarized and extended the work of his predecessors in his five volume Mécanique Céleste...
- Publication data: 1820 (3rd ed.)
- Online version: Internet Archive; CNRS, with more accurate character recognition; Gallica-Math, complete PDF and PDFs by section
- Description: Introduced the Laplace transform, exponential families, and conjugate priorConjugate priorIn Bayesian probability theory, if the posterior distributions p are in the same family as the prior probability distribution p, the prior and posterior are then called conjugate distributions, and the prior is called a conjugate prior for the likelihood...
s in Bayesian statisticsBayesian statisticsBayesian statistics is that subset of the entire field of statistics in which the evidence about the true state of the world is expressed in terms of degrees of belief or, more specifically, Bayesian probabilities...
. Pioneering asymptotic statistics, proved an early version of the Bernstein–von Mises theoremBernstein–von Mises theoremIn Bayesian inference, the Bernstein–von Mises theorem provides the basis for the important result that the posterior distribution for unknown quantities in any problem is effectively independent of the prior distribution once the amount of information supplied by a sample of data is large...
on the irrelevance of the (regular) prior distribution on the limiting posterior distribution, highlighting the asymptotic role of the Fisher informationFisher informationIn mathematical statistics and information theory, the Fisher information is the variance of the score. In Bayesian statistics, the asymptotic distribution of the posterior mode depends on the Fisher information and not on the prior...
. Studies the influence of medianMedianIn probability theory and statistics, a median is described as the numerical value separating the higher half of a sample, a population, or a probability distribution, from the lower half. The median of a finite list of numbers can be found by arranging all the observations from lowest value to...
and skewnessSkewnessIn probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable. The skewness value can be positive or negative, or even undefined...
in regression analysisRegression analysisIn statistics, regression analysis includes many techniques for modeling and analyzing several variables, when the focus is on the relationship between a dependent variable and one or more independent variables...
. Inspired the field of robust regressionRobust regressionIn robust statistics, robust regression is a form of regression analysis designed to circumvent some limitations of traditional parametric and non-parametric methods. Regression analysis seeks to find the effect of one or more independent variables upon a dependent variable...
, proposed the Laplace distribution and was the first to cprovide alternatives to Carl Friedrich GaussCarl Friedrich GaussJohann Carl Friedrich Gauss was a German mathematician and scientist who contributed significantly to many fields, including number theory, statistics, analysis, differential geometry, geodesy, geophysics, electrostatics, astronomy and optics.Sometimes referred to as the Princeps mathematicorum...
's work on statisticsStatisticsStatistics is the study of the collection, organization, analysis, and interpretation of data. It deals with all aspects of this, including the planning of data collection in terms of the design of surveys and experiments....
. - Importance: Topic creator, Breakthrough, Influence
Mathematical statistics
Mathematical Methods of Statistics- Author: Harald CramérHarald CramérHarald Cramér was a Swedish mathematician, actuary, and statistician, specializing in mathematical statistics and probabilistic number theory. He was once described by John Kingman as "one of the giants of statistical theory".-Early life:Harald Cramér was born in Stockholm, Sweden on September...
- Publication data: Princeton Mathematical Series, vol. 9. Princeton University Press, Princeton, N. J., 1946. xvi+575 pp. (A first version was published by Almqvist & Wiksell in UppsalaUppsala- Economy :Today Uppsala is well established in medical research and recognized for its leading position in biotechnology.*Abbott Medical Optics *GE Healthcare*Pfizer *Phadia, an offshoot of Pharmacia*Fresenius*Q-Med...
, Sweden, but had little circulation because of World War II.) - Description: Carefully written and extensive account of measure-theoretic probabilityProbability theoryProbability theory is the branch of mathematics concerned with analysis of random phenomena. The central objects of probability theory are random variables, stochastic processes, and events: mathematical abstractions of non-deterministic events or measured quantities that may either be single...
for statisticians, along with careful mathematical treatment of classical statistics. - Importance: Made measure-theoretic probability the standard language for advanced statistics in the English-speaking world, following its earlier adoption in France and the USSR.
Statistical Decision Functions
- Author: Abraham WaldAbraham Wald- See also :* Sequential probability ratio test * Wald distribution* Wald–Wolfowitz runs test...
- Publication data: 1950. John Wiley & Sons.
- Description: Exposition of statistical decision theory as a foundations of statistics. Included earlier results of Wald on sequential analysisSequential analysisIn statistics, sequential analysis or sequential hypothesis testing is statistical analysis where the sample size is not fixed in advance. Instead data are evaluated as they are collected, and further sampling is stopped in accordance with a pre-defined stopping rule as soon as significant results...
and the sequential probability ratio testSequential probability ratio testThe sequential probability ratio test is a specific sequential hypothesis test, developed by Abraham Wald. Neyman and Pearson's 1933 result inspired Wald to reformulate it as a sequential analysis problem...
and on Wald's complete class theorem characterizing admissible decision rules as limits of Bayesian proceduresBayesian statisticsBayesian statistics is that subset of the entire field of statistics in which the evidence about the true state of the world is expressed in terms of degrees of belief or, more specifically, Bayesian probabilities...
. - Importance: Raised the mathematical status of statistical theory and attracted mathematical statisticians like John von NeumannJohn von NeumannJohn von Neumann was a Hungarian-American mathematician and polymath who made major contributions to a vast number of fields, including set theory, functional analysis, quantum mechanics, ergodic theory, geometry, fluid dynamics, economics and game theory, computer science, numerical analysis,...
, Aryeh DvoretzkyAryeh DvoretzkyAryeh Dvoretzky was a Russian-born Israeli mathematician, the winner of the 1973 Israel Prize in Mathematics. He is best known for his work in functional analysis, statistics and probability.-Biography:...
, Jacob WolfowitzJacob WolfowitzJacob Wolfowitz was a Polish-born American statistician and Shannon Award-winning information theorist. He was the father of former Deputy Secretary of Defense and World Bank Group President Paul Wolfowitz....
, Jack C. KieferJack Kiefer (mathematician)Jack Carl Kiefer was an American statistician.- Biography :Jack Kiefer was born on January 25, 1924, in Cincinnati, Ohio, to Carl Jack Kiefer and Marguerite K. Rosenau...
, and David BlackwellDavid Blackwell-Honors and awards:*President, Institute of Mathematical Statistics, 1956*National Academy of Sciences, 1965*American Academy of Arts and Sciences, 1968*Honorary Fellow, Royal Statistical Society, 1976*Vice President, American Statistical Association, 1978...
, providing greater ties with economic theoryMathematical economicsMathematical economics is the application of mathematical methods to represent economic theories and analyze problems posed in economics. It allows formulation and derivation of key relationships in a theory with clarity, generality, rigor, and simplicity...
and operations researchOperations researchOperations research is an interdisciplinary mathematical science that focuses on the effective use of technology by organizations...
. Spurred further work on decision theoryDecision theoryDecision theory in economics, psychology, philosophy, mathematics, and statistics is concerned with identifying the values, uncertainties and other issues relevant in a given decision, its rationality, and the resulting optimal decision...
.
Testing Statistical Hypotheses
- Author: Erich Leo LehmannErich Leo LehmannErich Leo Lehmann was an American statistician, who contributed to statistical and nonparametric hypothesis testing...
- Publication data: 1959. John Wiley & Sons.
- Description: Exposition of statistical hypothesis testingStatistical hypothesis testingA statistical hypothesis test is a method of making decisions using data, whether from a controlled experiment or an observational study . In statistics, a result is called statistically significant if it is unlikely to have occurred by chance alone, according to a pre-determined threshold...
using the statistical decision theory of Abraham WaldAbraham Wald- See also :* Sequential probability ratio test * Wald distribution* Wald–Wolfowitz runs test...
, with some use of measure-theoretic probability. - Importance: Made Wald's ideas accessible. Collected and organized many results of statistical theory that were scattered throughout journal articles, civilizing statistics.
Bayesian statisticsBayesian statisticsBayesian statistics is that subset of the entire field of statistics in which the evidence about the true state of the world is expressed in terms of degrees of belief or, more specifically, Bayesian probabilities...
An Essay towards solving a Problem in the Doctrine of ChancesAn Essay towards solving a Problem in the Doctrine of Chances
An Essay towards solving a Problem in the Doctrine of Chances is a work on the mathematical theory of probability by the Reverend Thomas Bayes, published in 1763, two years after its author's death. It included a statement of a special case of what is now called Bayes' theorem. In 18th-century...
- Author: Thomas BayesThomas BayesThomas Bayes was an English mathematician and Presbyterian minister, known for having formulated a specific case of the theorem that bears his name: Bayes' theorem...
- Publication data: 1763-12-23
- Online version:
- Description: In this paper Bayes addresses the problem of using a sequence of identical "trials" to determine the per-trial probabilityProbabilityProbability is ordinarily used to describe an attitude of mind towards some proposition of whose truth we arenot certain. The proposition of interest is usually of the form "Will a specific event occur?" The attitude of mind is of the form "How certain are we that the event will occur?" The...
of "success" — the so-called inverse probabilityInverse probabilityIn probability theory, inverse probability is an obsolete term for the probability distribution of an unobserved variable.Today, the problem of determining an unobserved variable is called inferential statistics, the method of inverse probability is called Bayesian probability, the "distribution"...
problem. It later inspired the theorem that bears his name (Bayes' theoremBayes' theoremIn probability theory and applications, Bayes' theorem relates the conditional probabilities P and P. It is commonly used in science and engineering. The theorem is named for Thomas Bayes ....
). See also Pierre Simon de Laplace. - Importance: Topic creator, Breakthrough, Influence
On Small Differences in Sensation
- Author: Charles Sanders Peirce and Joseph JastrowJoseph JastrowJoseph Jastrow was an American psychologist, noted for inventions in experimental psychology, design of experiments, and psycho-physics. Jastrow was one of the first scientists to study the evolution of language, publishing an article on the topic in 1886...
- Publication data:
- Online version: http://psychclassics.yorku.ca/Peirce/small-diffs.htm
- Description: Peirce and Jastrow use logistic regressionLogistic regressionIn statistics, logistic regression is used for prediction of the probability of occurrence of an event by fitting data to a logit function logistic curve. It is a generalized linear model used for binomial regression...
to estimate subjective probabilities of subjects's judgments of the heavier of two measurements, following a randomized controlledRandomized experimentIn science, randomized experiments are the experiments that allow the greatest reliability and validity of statistical estimates of treatment effects...
repeated measures designRepeated measures designThe repeated measures design uses the same subjects with every condition of the research, including the control. For instance, repeated measures are collected in a longitudinal study in which change over time is assessed. Other studies compare the same measure under two or more different conditions...
. - Importance: Pioneered elicitation of subjective probabilities.
Truth and Probability
- Author: Frank P. RamseyFrank P. RamseyFrank Plumpton Ramsey was a British mathematician who, in addition to mathematics, made significant and precocious contributions in philosophy and economics before his death at the age of 26...
- Publication data: * Ramsey, Frank PlumptonFrank P. RamseyFrank Plumpton Ramsey was a British mathematician who, in addition to mathematics, made significant and precocious contributions in philosophy and economics before his death at the age of 26...
; “Truth and Probability” ( PDF), Chapter VII in The Foundations of Mathematics and other Logical Essays (1931). - Online version: http://cepa.newschool.edu/het//texts/ramsey/ramsess.pdf
- Description: Ramsey proposes elucidatingOperationalizationIn humanities, operationalization is the process of defining a fuzzy concept so as to make the concept clearly distinguishable or measurable and to understand it in terms of empirical observations...
a person's subjective probability for a proposition using a sequence of bets. Ramsey described his work as an elaboration of some pragmatic ideasPragmaticismPragmaticism is a term used by Charles Sanders Peirce for his pragmatic philosophy starting in 1905, in order to distance himself and it from pragmatism, the original name, which had been used in a manner he did not approve of in the "literary journals"...
of C. S. Peirce, which were expressed in "How to Make Our Ideas Clear". - Importance: Popularized the "Ramsey test" for eliciting subjective probabilities.
Probability, Induction and Statistics
- Author: Bruno de FinettiBruno de FinettiBruno de Finetti was an Italian probabilist, statistician and actuary, noted for the "operational subjective" conception of probability...
- Publication data: New York: John Wiley & Sons, Inc., 1972.
- Description A collection of de Finetti's essays on subjective probability.
- Importance: Proved de Finetti's theoremDe Finetti's theoremIn probability theory, de Finetti's theorem explains why exchangeable observations are conditionally independent given some latent variable to which an epistemic probability distribution would then be assigned...
on the representationChoquet theoryIn mathematics, Choquet theory is an area of functional analysis and convex analysis created by Gustave Choquet. It is concerned with measures with support on the extreme points of a convex set C...
of an infinite sequence of exchangeable random variableRandom variableIn probability and statistics, a random variable or stochastic variable is, roughly speaking, a variable whose value results from a measurement on some type of random process. Formally, it is a function from a probability space, typically to the real numbers, which is measurable functionmeasurable...
s by a mixture of independentStatistical independenceIn probability theory, to say that two events are independent intuitively means that the occurrence of one event makes it neither more nor less probable that the other occurs...
random variables.
Theory of Probability
- Author: Bruno de FinettiBruno de FinettiBruno de Finetti was an Italian probabilist, statistician and actuary, noted for the "operational subjective" conception of probability...
- Publication data: Two volumes, A.F.M. Smith and A. Machi (trs.), New York: John Wiley & Sons, Inc., 1974, 1975.
- Description The first detailed statement of the operational subjective position, dating from the author's research in the 1920s and 30s.
- Importance: Emphasizes exchangeable random variables which are often mixtures of independent random variables. Argues for finitely additive probability measures that need not be countably additive. Emphasizes expectations rather than probability measures.
Introduction to statistical decision theory
- Author: John W. PrattJohn W. PrattJohn W. Pratt is Emeritus William Ziegler professor business administration at Harvard University. He has made contributions to research in risk aversion theory, notably with Kenneth Arrow on measures of risk aversion....
, Howard RaiffaHoward RaiffaHoward Raiffa is the Frank P. Ramsey Professor of Managerial Economics, a joint chair held by the Business School and the Kennedy School of Government at Harvard University...
, and Robert SchlaiferRobert SchlaiferRobert O. Schlaifer was a pioneer of Bayesian decision theory. At the time of his death he was William Ziegler Professor of Business Administration Emeritus of the Harvard Business School.... - Publication data: preliminary edition, 1965. Cambridge, Mass.: MIT Press, 1995.
- Description Extensive exposition of statistical decision theory, statistics, and decision analysis from a Bayesian standpoint. Many examples and problems come from business and economics.
- Importance: Greatly extended the scope of applied Bayesian statistics by using conjugate priorConjugate priorIn Bayesian probability theory, if the posterior distributions p are in the same family as the prior probability distribution p, the prior and posterior are then called conjugate distributions, and the prior is called a conjugate prior for the likelihood...
s for exponential families. Extensive treatment of sequential decision making, for example mining decisions. For many years, it was required for all doctoral students at Harvard Business SchoolHarvard Business SchoolHarvard Business School is the graduate business school of Harvard University in Boston, Massachusetts, United States and is widely recognized as one of the top business schools in the world. The school offers the world's largest full-time MBA program, doctoral programs, and many executive...
.
Multivariate analysisMultivariate analysisMultivariate analysis is based on the statistical principle of multivariate statistics, which involves observation and analysis of more than one statistical variable at a time...
An Introduction to Multivariate Analysis
- Authors: Theodore W. AndersonTheodore Wilbur AndersonTheodore Wilbur Anderson is an American mathematician and statistician who has specialized in the analysis of multivariate data.Born in Minneapolis, Minnesota, he was awarded a Guggenheim Fellowship in 1946....
- Publication data: 1958, John Wiley
- Description:
- Importance: This textbook educated a generation of theoretists and applied statisticians, emphasizing hypothesis testing via likelihood ratio tests and the properties of power functionStatistical powerThe power of a statistical test is the probability that the test will reject the null hypothesis when the null hypothesis is actually false . The power is in general a function of the possible distributions, often determined by a parameter, under the alternative hypothesis...
s: AdmissiblityAdmissible decision ruleIn statistical decision theory, an admissible decision rule is a rule for making a decision such that there isn't any other rule that is always "better" than it, in a specific sense defined below....
, unbiasednessBias of an estimatorIn statistics, bias of an estimator is the difference between this estimator's expected value and the true value of the parameter being estimated. An estimator or decision rule with zero bias is called unbiased. Otherwise the estimator is said to be biased.In ordinary English, the term bias is...
and monotonicity.
Applied statistics
Statistical Methods for Research WorkersStatistical Methods for Research Workers
Statistical Methods for Research Workers is a classic 1925 book on statistics by the statistician R.A. Fisher. It is considered by some to be one of the 20th century's most influential books on statistical methods. According to ,...
- Author: R.A. FisherRonald FisherSir Ronald Aylmer Fisher FRS was an English statistician, evolutionary biologist, eugenicist and geneticist. Among other things, Fisher is well known for his contributions to statistics by creating Fisher's exact test and Fisher's equation...
- Publication data: Edinburgh: Oliver & Boyd, 1925 (1st edition); London: Macmillan, 1970 (15th edition)
- Online version: http://psychclassics.yorku.ca/Fisher/Methods/
- Description: The original manual for researchers, especially biologists, on how to statistically evaluate numerical data.
- Importance: Hugely influential text by the father of modern statistics that remained in print for more than 50 years. Responsible for the widespread use of tests of statistical significanceStatistical significanceIn statistics, a result is called statistically significant if it is unlikely to have occurred by chance. The phrase test of significance was coined by Ronald Fisher....
.
Statistical Methods
- Author: George W. SnedecorGeorge W. SnedecorGeorge Waddel Snedecor was an American mathematician and statistician. He contributed to the foundations of analysis of variance, data analysis, experimental design, and statistical methodology. Snedecor's F distribution and the George W...
- Publication data: 1937, Collegiate Press
- Description: One of the first comprehensive texts on statistical methods. Reissued as Statistical Methods Applied to Experiments in Agriculture and Biology in 1940 and then again as Statistical Methods with Cochran, WG in 1967. A classic text.
- Importance: Influence
Principles and Procedures of Statistics with Special Reference to the Biological Sciences.
- Authors: Steel, R.G.D, and Torrie, J. H.
- Publication data: McGraw Hill (1960) 481 pages
- Description: Excellent introductory text for analysis of variance (one-way, multi-way, factorial, split-plot, and unbalanced designs). Also analysis of co-variance, multiple and partial regression and correlation, non-linear regression, and non-parametric analyses. This book was written before computer programmes were available, so it gives the detail needed to make the calculations manually.Cited in more than 1,381 publications between 1961 and 1975.
- Importance: Influence
Biometry: The Principles and Practices of Statistics in Biological Research
- Authors: Robert R. SokalRobert R. SokalRobert Reuven Sokal is an Austrian-American biostatistician and anthropologist. Distinguished Professor Emeritus at the State University of Stony Brook, New York, Sokal is a member of the National Academy of Sciences and the American Academy of Arts and Sciences...
; F. J. Rohlf - Publication data: 1st ed. W. H. Freemann (1969),; 2nd ed. W. H. Freemann (1981); 3rd ed. Freeman & Co. (1994)
- Description: Key textbook on Biometry: the application of statistical methods for descriptive, experimental, and analytical study of biological phenomena.
- Importance Cited in more than 7,000 publications.
Statistical learning theoryStatistical learning theoryStatistical learning theory is an ambiguous term.#It may refer to computational learning theory, which is a sub-field of theoretical computer science that studies how algorithms can learn from data....
On the uniform convergence of relative frequencies of events to their probabilities
- Authors: V. VapnikVladimir VapnikVladimir Naumovich Vapnik is one of the main developers of Vapnik–Chervonenkis theory. He was born in the Soviet Union. He received his master's degree in mathematics at the Uzbek State University, Samarkand, Uzbek SSR in 1958 and Ph.D in statistics at the Institute of Control Sciences, Moscow in...
, A. ChervonenkisAlexey ChervonenkisAlexey Jakovlevich Chervonenkis is a Soviet and Russian mathematician, and, with Vladimir Vapnik, was one of the main developers of the Vapnik–Chervonenkis theory, also known as the "fundamental theory of learning" an important part of computational learning theory. As of September 2007, Dr... - Publication data: Theory of Probability and its Applications, 16(2):264--280, 1971
- Description: Computational learning theoryComputational learning theoryIn theoretical computer science, computational learning theory is a mathematical field related to the analysis of machine learning algorithms.-Overview:Theoretical results in machine learning mainly deal with a type of...
, VC theory, statistical uniform convergence and the VC dimensionVC dimensionIn statistical learning theory, or sometimes computational learning theory, the VC dimension is a measure of the capacity of a statistical classification algorithm, defined as the cardinality of the largest set of points that the algorithm can shatter...
. - Importance: Breakthrough, Influence
Variance component estimation
On the mathematical foundations of theoretical statistics- Author: Fisher, RARonald FisherSir Ronald Aylmer Fisher FRS was an English statistician, evolutionary biologist, eugenicist and geneticist. Among other things, Fisher is well known for his contributions to statistics by creating Fisher's exact test and Fisher's equation...
- Publication data: 1922, Philosophical Transactions of the Royal Society, London, Series A, volume 222, pages 309-368
- Description: First comprehensive treatise of estimation by maximum likelihood.
- Importance: Topic creator, Breakthrough, Influence
Estimation of variance and covariance components
- Author: Henderson, CRCharles Roy HendersonCharles Roy Henderson was a statistician and a pioneer in animal breeding — the application of quantitative methods for the genetic evaluation of domestic livestock. He developed mixed model equations to obtain best linear unbiased predictions of breeding values and, in general, any random effect...
- Publication data: 1953, BiometricsBiometricsBiometrics As Jain & Ross point out, "the term biometric authentication is perhaps more appropriate than biometrics since the latter has been historically used in the field of statistics to refer to the analysis of biological data [36]" . consists of methods...
, volume 9, pages 226-252 - Description: First description of three methods of estimation of variance components in mixed linear models for unbalanced data. "One of the most frequently cited papers in the scientific literature."
- Importance: Topic creator, Breakthrough, Influence
Maximum-likelihood estimation for the mixed analysis of variance model
- Author: H. O. HartleyHerman Otto HartleyH. O. Hartley , born Herman Otto Hirschfeld but commonly called HOH, was a German-American statistician. He developed Hartley's test for equality of variances . In 1967 he and J.N.K. Rao published a maximum likelihood method for finding variance components in mixed models...
and J. N. K. Rao - Publication data: 1967, BiometrikaBiometrika- External links :* . The Internet Archive. 2011....
, volume 54, pages 93-108 - Description: First description of maximum likelihood methods for variance component estimation in mixed models
- Importance: Topic creator, Breakthrough, Influence
Recovery of inter-block information when block sizes are unequal
- Author: Patterson, HD; Thompson, R
- Publication data: 1971, BiometrikaBiometrika- External links :* . The Internet Archive. 2011....
, volume 58, pages 545-554 - Description: First description of restricted maximum likelihood (REML)
- Importance: Topic creator, Breakthrough, Influence
Estimation of Variance and Covariance Components in Linear Models
- Author: Rao, CR
- Publication data: 1972, Journal of the American Statistical AssociationJournal of the American Statistical AssociationThe Journal of the American Statistical Association is the most prestigious journal published by the American Statistical Association, the main professional body for statisticians in the United States...
, volume 67, pages. 112-115 - Description: First description of Minimum Variance Quadratic Unbiased Estimation (MIVQUE) and Minimum Norm Quadratic Unbiased Estimation (MINQUE) for unbalanced data
- Importance: Topic creator, Breakthrough, Influence
Survival analysisSurvival analysisSurvival analysis is a branch of statistics which deals with death in biological organisms and failure in mechanical systems. This topic is called reliability theory or reliability analysis in engineering, and duration analysis or duration modeling in economics or sociology...
Nonparametric estimation from incomplete observations
- Author: Kaplan, EL and Meier, PPaul Meier (statistician)Paul Meier was a statistician who promoted the use of randomized trials in medicine. He is also known for introducing, with Edward L. Kaplan, the Kaplan–Meier estimator, a tool for measuring how many patients survive a medical treatment.-External links:...
- Publication data: 1958, Journal of the American Statistical AssociationJournal of the American Statistical AssociationThe Journal of the American Statistical Association is the most prestigious journal published by the American Statistical Association, the main professional body for statisticians in the United States...
, volume 53, pages 457-481 - Description: First description of the now ubiquitous Kaplan-Meier estimatorKaplan-Meier estimatorThe Kaplan–Meier estimator, also known as the product limit estimator, is an estimator for estimating the survival function from life-time data. In medical research, it is often used to measure the fraction of patients living for a certain amount of time after treatment. In economics, it can be...
of survival functions from data with censored observations - Importance: Breakthrough, Influence
A generalized Wilcoxon test for comparing arbitrarily singly-censored samples
- Author: Gehan, EA
- Publication data: 1965, BiometrikaBiometrika- External links :* . The Internet Archive. 2011....
, volume 52, pages 203-223 - Description: First presentation of the extension of the Wilcoxon rank-sum test to censored data
- Importance: Influence
Evaluation of survival data and two new rank order statistics arising in its consideration
- Author: Mantel, NNathan MantelNathan Mantel was a biostatistician best known for his work with William Haenszel which led to the Mantel–Haenszel test and its associated estimate, the Mantel–Haenszel odds ratio...
- Publication data: 1966, Cancer Chemotherapy Reports, volume 50, pages 163-170. PMID 5910392
- Description: Development of the logrank testLogrank testIn statistics, the logrank test is a hypothesis test to compare the survival distributions of two samples. It is a nonparametric test and appropriate to use when the data are right skewed and censored...
for censored survival data. - Importance: Topic creator, Breakthrough, Influence
Regression Models and Life Tables
- Author: Cox, DRDavid Cox (statistician)Sir David Roxbee Cox FRS is a prominent British statistician.-Early years:Cox studied mathematics at St. John's College, Cambridge and obtained his PhD from the University of Leeds in 1949, advised by Henry Daniels and Bernard Welch.-Career:He was employed from 1944 to 1946 at the Royal Aircraft...
- Publication data: 1972, Journal of the Royal Statistical SocietyJournal of the Royal Statistical SocietyThe Journal of the Royal Statistical Society is a series of three peer-reviewed statistics journals published by Blackwell Publishing for the London-based Royal Statistical Society.- History :...
, Series B, volume 34, pages 187-220 - Description: Seminal paper introducing semi-parametric proportional hazards modelsProportional hazards modelsProportional hazards models are a class of survival models in statistics. Survival models relate the time that passes before some event occurs to one or more covariates that may be associated with that quantity. In a proportional hazards model, the unique effect of a unit increase in a covariate...
(Cox models) for survival data - Importance: Topic creator, Breakthrough, Influence
The Statistical Analysis of Failure Time Data
- Author: Kalbfleisch, JD and Prentice, RL
- Publication data: 1980, John Wiley & SonsJohn Wiley & SonsJohn Wiley & Sons, Inc., also referred to as Wiley, is a global publishing company that specializes in academic publishing and markets its products to professionals and consumers, students and instructors in higher education, and researchers and practitioners in scientific, technical, medical, and...
, New York - Description: First comprehensive text covering the methods of estimation and inference for time to event analyses
- Importance: Influence
Meta analysis
Report on Certain Enteric Fever Inoculation Statistics- Author: Pearson, KKarl PearsonKarl Pearson FRS was an influential English mathematician who has been credited for establishing the disciplineof mathematical statistics....
- Publication data: 1904, British Medical JournalBritish Medical JournalBMJ is a partially open-access peer-reviewed medical journal. Originally called the British Medical Journal, the title was officially shortened to BMJ in 1988. The journal is published by the BMJ Group, a wholly owned subsidiary of the British Medical Association...
, volume 2, pages 1243-1246 - Description: Generally considered to be the first synthesis of results from separate studies, although no formal statistical methods for combining results are presented.
- Importance: Breakthrough, Influence
The Probability Integral Transformation for Testing Goodness of Fit and Combining Independent Tests of Significance
- Author: Pearson, ESEgon PearsonEgon Sharpe Pearson, CBE FRS was the only son of Karl Pearson, and like his father, a leading British statistician....
- Publication data: 1938 BiometrikaBiometrika- External links :* . The Internet Archive. 2011....
, volume 30, pages 134-148 - Description: One of the first published methods for formally combining results from different experiments
- Importance: Breakthrough, Influence
Combining Independent Tests of Significance
- Author: Fisher, RARonald FisherSir Ronald Aylmer Fisher FRS was an English statistician, evolutionary biologist, eugenicist and geneticist. Among other things, Fisher is well known for his contributions to statistics by creating Fisher's exact test and Fisher's equation...
- Publication data: 1948, The American StatisticianThe American StatisticianThe American Statistician, established in 1947, is a magazine published quarterly by the American Statistical Association.- External links :*...
, volume 2, page 30 - Description: One of the first published methods for formally combining results from different experiments
- Importance: Breakthrough, Influence
The combination of estimates from different experiments
- Author: Cochran, WGWilliam Gemmell CochranWilliam Gemmell Cochran was a prominent statistician; he was born in Scotland but spent most of his life in the United States....
- Publication data: 1954, BiometricsBiometricsBiometrics As Jain & Ross point out, "the term biometric authentication is perhaps more appropriate than biometrics since the latter has been historically used in the field of statistics to refer to the analysis of biological data [36]" . consists of methods...
, volume 10, page 101-129 - Description: A comprehensive treatment of the various methods for formally combining results from different experiments
- Importance: Breakthrough, Influence
Experimental design
On Small Differences in Sensation- Author: Charles Sanders Peirce and Joseph JastrowJoseph JastrowJoseph Jastrow was an American psychologist, noted for inventions in experimental psychology, design of experiments, and psycho-physics. Jastrow was one of the first scientists to study the evolution of language, publishing an article on the topic in 1886...
- Publication data:
- Online version: http://psychclassics.yorku.ca/Peirce/small-diffs.htm
- Description: Peirce and Jastrow use logistic regressionLogistic regressionIn statistics, logistic regression is used for prediction of the probability of occurrence of an event by fitting data to a logit function logistic curve. It is a generalized linear model used for binomial regression...
to estimate subjective probabilities of subjects's judgments of the heavier of two measurements, following a randomized controlledRandomized experimentIn science, randomized experiments are the experiments that allow the greatest reliability and validity of statistical estimates of treatment effects...
repeated measures designRepeated measures designThe repeated measures design uses the same subjects with every condition of the research, including the control. For instance, repeated measures are collected in a longitudinal study in which change over time is assessed. Other studies compare the same measure under two or more different conditions...
. - Importance: The first randomized experiment, which also used blindingBlindingBlinding can refer to:*The act of making someone blind**Metaphorical and extended uses of same: see blindness#Metaphorical uses*Blinding , a technique by which an agent can provide a service to a client in an encoded form without knowing either the real input or the real output*Blinding , a novel...
; it seems also to have been the first experiment for estimating subjective probabilities.
The Design of Experiments
The Design of Experiments
The Design of Experiments is a 1935 book by the British statistician R.A. Fisher, which effectively founded the field of design of experiments. The book has been highly influential.-References:...
- Author: Fisher, RARonald FisherSir Ronald Aylmer Fisher FRS was an English statistician, evolutionary biologist, eugenicist and geneticist. Among other things, Fisher is well known for his contributions to statistics by creating Fisher's exact test and Fisher's equation...
- Publication data: 1935, Oliver and Boyd, Edinburgh
- Description: The first textbook on experimental design
- Importance: Influence
The Design and Analysis of Experiments
- Author: Oscar KempthorneOscar KempthorneOscar Kempthorne was a statistician and geneticist known for his research on randomization-analysis and the design of experiments, which had wide influence on research in agriculture, genetics, and other areas of science...
- Publication data: 1950, John Wiley & SonsJohn Wiley & SonsJohn Wiley & Sons, Inc., also referred to as Wiley, is a global publishing company that specializes in academic publishing and markets its products to professionals and consumers, students and instructors in higher education, and researchers and practitioners in scientific, technical, medical, and...
, New York (Reprinted with corrections in 1979 by Robert E. Krieger) - Description: Early exposition of the general linear model using matrix algebra (following lecture notes of George W. Brown). Bases inference on the randomization distribution objectively defined by the experimental protocol, rather than a so-called "statistical model" expressing the subjective beliefs of a statistician: The normal model is regarded as a convenient approximation to the randomization-distribution, whose quality is assessed by theorems about moments and simulation experiments.
- Importance: The first and most extensive discussion of randomation-based inference in the field of design of experiments until the recent 2-volume work by Hinkelmann and Kempthorne; randomization-based inference is called "design-based" inference in survey sampling of finite populations. Introduced the treatment-unit additivity hypothesis, which was discussed in chapter 2 of David R. Cox's book on experiments (1958) and which has influenced Donald Rubin and Paul Rosenbaum's analysis of observational data.
On the Experimental Attainment of Optimum Conditions (with discussion)
- Author: George E. P. BoxGeorge E. P. Box- External links :* from a at NIST* * * * * *** For Box's PhD students see*...
and K. B. Wilson. - Publication data: (1951) Journal of the Royal Statistical SocietyJournal of the Royal Statistical SocietyThe Journal of the Royal Statistical Society is a series of three peer-reviewed statistics journals published by Blackwell Publishing for the London-based Royal Statistical Society.- History :...
Series B 13(1):1–45. - Description: Introduced Box-Wilson central composite designCentral composite designIn statistics, a central composite design is an experimental design, useful in response surface methodology, for building a second order model for the response variable without needing to use a complete three-level factorial experiment....
for fitting a quadratic polynomial in several variables to experimental data, when an initial affine model had failed to yield a direction of ascent. The design and analysis is motivated by a problem in chemical engineering. - Importance: Introduced response surface methodologyResponse surface methodologyIn statistics, response surface methodology explores the relationships between several explanatory variables and one or more response variables. The method was introduced by G. E. P. Box and K. B. Wilson in 1951. The main idea of RSM is to use a sequence of designed experiments to obtain an...
for approximating local optima of systems with noisy observations of responses.
See also
- List of scientific journals in statistics
- List of important publications in mathematics