Statistics

Study of the collection, analysis, interpretation, and presentation of data

Statistics (from German: Statistik, orig. "description of a state, a country")^[1]^[2] is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data.^[3]^[4]^[5] In applying statistics to a scientific, industrial, or social problem, it is conventional to begin with a statistical population or a statistical model to be studied. Populations can be diverse groups of people or objects such as "all people living in a country" or "every atom composing a crystal". Statistics deals with every aspect of data, including the planning of data collection in terms of the design of surveys and experiments.^[6]

When census data cannot be collected, statisticians collect data by developing specific experiment designs and survey samples. Representative sampling assures that inferences and conclusions can reasonably extend from the sample to the population as a whole. An experimental study involves taking measurements of the system under study, manipulating the system, and then taking additional measurements using the same procedure to determine if the manipulation has modified the values of the measurements. In contrast, an observational study does not involve experimental manipulation.
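The step from a sample to a population can be illustrated with a minimal sketch. It assumes a synthetic population of heights and uses simple random sampling, which is only one of several ways to obtain a representative sample; the population, its parameters, and the variable names are all hypothetical.

```python
import random
import statistics

rng = random.Random(42)  # fixed seed so the sketch is reproducible

# Hypothetical population: 10,000 heights in cm (synthetic, for illustration only).
population = [rng.gauss(170.0, 10.0) for _ in range(10_000)]

# Simple random sample without replacement: every member is equally likely to be drawn.
sample = rng.sample(population, k=500)

population_mean = statistics.fmean(population)
sample_mean = statistics.fmean(sample)

print(f"population mean: {population_mean:.2f}")
print(f"sample mean:     {sample_mean:.2f}")
```

Under these assumptions the sample mean should land close to the population mean, which is the sense in which conclusions drawn from the sample extend to the whole population.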

Two main statistical methods are used in data analysis: descriptive statistics, which summarize data from a sample using indexes such as the mean or standard deviation, and inferential statistics, which draw conclusions from data that are subject to random variation (e.g., observational errors, sampling variation).^[7] Descriptive statistics are most often concerned with two sets of properties of a distribution (sample or population): central tendency (or location) seeks to characterize the distribution's central or typical value, while dispersion (or variability) characterizes the extent to which members of the distribution depart from its center and from each other. Inferences in mathematical statistics are made within the framework of probability theory, which deals with the analysis of random phenomena.
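The two families of descriptive measures can be sketched with Python's standard statistics module. The eight data values below are hypothetical and chosen only so that the arithmetic is easy to follow.

```python
import statistics

# Hypothetical sample of eight measurements (illustrative values only).
sample = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

# Central tendency: where the typical value lies.
mean = statistics.mean(sample)
median = statistics.median(sample)

# Dispersion: how far the values spread around the center.
pop_sd = statistics.pstdev(sample)    # divides by n (data treated as a whole population)
sample_sd = statistics.stdev(sample)  # divides by n - 1 (data treated as a sample)

print(mean, median, pop_sd, round(sample_sd, 3))
```

The choice between the two standard deviations depends on whether the data are regarded as the entire population or as a sample from it; the n - 1 divisor is the usual choice when the aim is to estimate the population's variability.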

A standard statistical procedure involves the collection of data leading to a test of the relationship between two statistical data sets, or a data set and synthetic data drawn from an idealized model. A hypothesis is proposed for the statistical relationship between the two data sets, and this is compared as an alternative to an idealized null hypothesis of no relationship between the two data sets. Rejecting or disproving the null hypothesis is done using statistical tests that quantify the sense in which the null can be proven false, given the data that are used in the test. Working from a null hypothesis, two basic forms of error are recognized: Type I errors (the null hypothesis is rejected when it is in fact true, giving a "false positive") and Type II errors (the null hypothesis fails to be rejected when it is in fact false, giving a "false negative").^[8] Multiple problems have come to be associated with this framework, ranging from obtaining a sufficient sample size to specifying an adequate null hypothesis.^[7]
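The two error types can be made concrete with a small simulation. The sketch below assumes a two-sided one-sample z-test with known standard deviation, a significance level of 0.05, samples of size 25, and a hypothetical true effect of 0.5 for the Type II case; none of these choices comes from the text above, and the function names are illustrative.

```python
import random
import statistics

def z_test_rejects(sample, mu0, sigma, alpha=0.05):
    """Two-sided one-sample z-test with known sigma; True if H0 (mean == mu0) is rejected."""
    n = len(sample)
    z = (statistics.fmean(sample) - mu0) / (sigma / n ** 0.5)
    critical = statistics.NormalDist().inv_cdf(1 - alpha / 2)
    return abs(z) > critical

def rejection_rate(true_mean, trials, n, rng):
    """Fraction of simulated samples for which H0: mean == 0 is rejected."""
    rejections = 0
    for _ in range(trials):
        sample = [rng.gauss(true_mean, 1.0) for _ in range(n)]
        if z_test_rejects(sample, mu0=0.0, sigma=1.0):
            rejections += 1
    return rejections / trials

rng = random.Random(0)  # fixed seed for reproducibility

# Type I error: H0 is true (mean really is 0) but the test rejects it.
type_i_rate = rejection_rate(true_mean=0.0, trials=2000, n=25, rng=rng)

# Type II error: H0 is false (mean is 0.5) but the test fails to reject it.
type_ii_rate = 1 - rejection_rate(true_mean=0.5, trials=2000, n=25, rng=rng)

print(f"estimated Type I rate:  {type_i_rate:.3f}")
print(f"estimated Type II rate: {type_ii_rate:.3f}")
```

In this setup the estimated Type I rate should sit near the chosen significance level, while the Type II rate depends on the sample size and on how far the true mean lies from the null value, which is why sample size features among the framework's practical problems.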

Statistical measurement processes are also prone to error with regard to the data that they generate. Many of these errors are classified as random (noise) or systematic (bias), but other types of errors (e.g., blunders, such as when an analyst reports incorrect units) can also occur. The presence of missing data or censoring may result in biased estimates, and specific techniques have been developed to address these problems.
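The difference between random and systematic error, and the bias that mishandled censoring can introduce, can be sketched with synthetic measurements. The true value, the offset, the noise level, and the detection limit below are all hypothetical, and simply dropping readings above the limit stands in for a naive treatment of censored data rather than any recommended technique.

```python
import random
import statistics

rng = random.Random(1)  # fixed seed for reproducibility
TRUE_VALUE = 10.0       # hypothetical true quantity being measured
BIAS = 0.5              # hypothetical systematic offset, e.g. a miscalibrated instrument
LIMIT = 10.5            # hypothetical detection limit of the instrument
N = 5000

# Random error only: the noise averages out as the number of readings grows.
noisy = [TRUE_VALUE + rng.gauss(0.0, 1.0) for _ in range(N)]

# Random plus systematic error: averaging does not remove the offset.
biased = [TRUE_VALUE + BIAS + rng.gauss(0.0, 1.0) for _ in range(N)]

# Naive handling of censoring: readings above the limit are discarded,
# which pulls the estimated mean below the true value.
observed = [x for x in noisy if x <= LIMIT]

mean_noisy = statistics.fmean(noisy)
mean_biased = statistics.fmean(biased)
mean_observed = statistics.fmean(observed)

print(f"noise only:       {mean_noisy:.2f}")
print(f"noise plus bias:  {mean_biased:.2f}")
print(f"after discarding: {mean_observed:.2f}")
```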


This article uses material from the Wikipedia article Statistics, and is written by contributors. Text is available under a CC BY-SA 4.0 International License; additional terms may apply. Images, videos and audio are available under their respective licenses.

[1] "statistics". Oxford English Dictionary (Online ed.). Oxford University Press. (Subscription or participating institution membership required.)

[2] "Statistik" in Digitales Wörterbuch der deutschen Sprache

[3] "Statistics". Oxford Reference. Oxford University Press. January 2008. ISBN 978-0-19-954145-4. Archived from the original on 2020-09-03. Retrieved 2019-08-14.

[4] Romijn, Jan-Willem (2014). "Philosophy of statistics". Stanford Encyclopedia of Philosophy. Archived from the original on 2021-10-19. Retrieved 2016-11-03.

[5] "Cambridge Dictionary". Archived from the original on 2020-11-22. Retrieved 2019-08-14.

[6] Dodge, Y. (2006) The Oxford Dictionary of Statistical Terms, Oxford University Press. ISBN 0-19-920613-9

[7] Lund Research Ltd. "Descriptive and Inferential Statistics". statistics.laerd.com. Archived from the original on 2020-10-26. Retrieved 2014-03-23.

[8] "What Is the Difference Between Type I and Type II Hypothesis Testing Errors?". About.com Education. Archived from the original on 2017-02-27. Retrieved 2015-11-27.

[9] Moses, Lincoln E. (1986) Think and Explain with Statistics, Addison-Wesley, ISBN 978-0-201-15619-5. pp. 1–3

[10] Hays, William Lee, (1973) Statistics for the Social Sciences, Holt, Rinehart and Winston, p. xii, ISBN 978-0-03-077945-9

[11] Moore, David (1992). "Teaching Statistics as a Respectable Subject". In F. Gordon; S. Gordon (eds.). Statistics for the Twenty-First Century. Washington, DC: The Mathematical Association of America. pp. 14–25. ISBN 978-0-88385-078-7.

[12] Chance, Beth L.; Rossman, Allan J. (2005). "Preface" (PDF). Investigating Statistical Concepts, Applications, and Methods. Duxbury Press. ISBN 978-0-495-05064-3. Archived (PDF) from the original on 2020-11-22. Retrieved 2009-12-06.

[13] Lakshmikantham, D.; Kannan, V. (2002). Handbook of stochastic analysis and applications. New York: M. Dekker. ISBN 0824706609.

[14] Schervish, Mark J. (1995). Theory of statistics (Corr. 2nd print. ed.). New York: Springer. ISBN 0387945466.

[15] Broemeling, Lyle D. (1 November 2011). "An Account of Early Statistical Inference in Arab Cryptology". The American Statistician. 65 (4): 255–257. doi:10.1198/tas.2011.10191. S2CID 123537702.

[16] Ostasiewicz, Walenty (2014). "The emergence of statistical science". Śląski Przegląd Statystyczny. 12 (18): 76–77. doi:10.15611/sps.2014.12.04.

[17] Bruneau, Quentin (2022). States and the Masters of Capital: Sovereign Lending, Old and New. Columbia University Press. ISBN 978-0231555647.

[18] Willcox, Walter (1938) "The Founder of Statistics". Review of the International Statistical Institute 5(4): 321–328. JSTOR 1400906

[19] J. Franklin, The Science of Conjecture: Evidence and Probability before Pascal, Johns Hopkins Univ Pr 2002

[20] Schneider, I. (2005). Jakob Bernoulli, Ars Conjectandi (1713). In I. Grattan-Guinness (Ed.), Landmark writings in Western Mathematics, 1640-1940 (pp. 88-103).

[21] Sylla, E. D.; Bernoulli, Jacob (2006). The Art of Conjecturing, Together with Letter to a Friend on Sets in Court Tennis (trans.). JHU Press. ISBN 978-0-8018-8235-7.

[22] Lim, M. (2021). "Gauss, Least Squares, and the Missing Planet". Actuaries Digital. Retrieved 2022-11-01.

[23] Helen Mary Walker (1975). Studies in the history of statistical method. Arno Press. ISBN 978-0405066283. Archived from the original on 2020-07-27. Retrieved 2015-06-27.

[24] Galton, F (1877). "Typical laws of heredity". Nature. 15 (388): 492–553. Bibcode:1877Natur..15..492.. doi:10.1038/015492a0.

[25] Stigler, S.M. (1989). "Francis Galton's Account of the Invention of Correlation". Statistical Science. 4 (2): 73–79. doi:10.1214/ss/1177012580.

[26] Pearson, K. (1900). "On the Criterion that a given System of Deviations from the Probable in the Case of a Correlated System of Variables is such that it can be reasonably supposed to have arisen from Random Sampling". Philosophical Magazine. Series 5. 50 (302): 157–175. doi:10.1080/14786440009463897. Archived from the original on 2020-08-18. Retrieved 2019-06-27.

[27] "Karl Pearson (1857–1936)". Department of Statistical Science – University College London. Archived from the original on 2008-09-25.

[28] Box, JF (February 1980). "R.A. Fisher and the Design of Experiments, 1922–1926". The American Statistician. 34 (1): 1–7. doi:10.2307/2682986. JSTOR 2682986.

[29] Yates, F (June 1964). "Sir Ronald Fisher and the Design of Experiments". Biometrics. 20 (2): 307–321. doi:10.2307/2528399. JSTOR 2528399.

[30] Stanley, Julian C. (1966). "The Influence of Fisher's "The Design of Experiments" on Educational Research Thirty Years Later". American Educational Research Journal. 3 (3): 223–229. doi:10.3102/00028312003003223. JSTOR 1161806. S2CID 145725524.

[31] Agresti, Alan; David B. Hitchcock (2005). "Bayesian Inference for Categorical Data Analysis" (PDF). Statistical Methods & Applications. 14 (3): 298. doi:10.1007/s10260-005-0121-y. S2CID 18896230. Archived (PDF) from the original on 2013-12-19. Retrieved 2013-12-19.

[32] OED quote: 1935 R.A. Fisher, The Design of Experiments ii. 19, "We may speak of this hypothesis as the 'null hypothesis', and the null hypothesis is never proved or established, but is possibly disproved, in the course of experimentation."

[33] Fisher (1971), Chapter II. The Principles of Experimentation, Illustrated by a Psycho-physical Experiment, Section 8. The Null Hypothesis

[34] Edwards, A.W.F. (1998). "Natural Selection and the Sex Ratio: Fisher's Sources". American Naturalist. 151 (6): 564–569. doi:10.1086/286141. PMID 18811377. S2CID 40540426.

[35] Fisher, R.A. (1915) The evolution of sexual preference. Eugenics Review (7) 184:192

[36] Fisher, R.A. (1930) The Genetical Theory of Natural Selection. ISBN 0-19-850440-3

[37] Edwards, A.W.F. (2000) Perspectives: Anecdotal, Historical and Critical Commentaries on Genetics. The Genetics Society of America (154) 1419:1426

[38] Andersson, Malte (1994). Sexual Selection. Princeton University Press. ISBN 0-691-00057-3. Archived from the original on 2019-12-25. Retrieved 2019-09-19.

[39] Andersson, M. and Simmons, L.W. (2006) Sexual selection and mate choice. Trends, Ecology and Evolution (21) 296:302

[40] Gayon, J. (2010) Sexual selection: Another Darwinian process. Comptes Rendus Biologies (333) 134:144

[41] Neyman, J (1934). "On the two different aspects of the representative method: The method of stratified sampling and the method of purposive selection". Journal of the Royal Statistical Society. 97 (4): 557–625. doi:10.2307/2342192. JSTOR 2342192.

[42] "Science in a Complex World – Big Data: Opportunity or Threat?". Santa Fe Institute. 2 December 2013. Archived from the original on 2016-05-30. Retrieved 2014-10-13.

[43] Freedman, D.A. (2005) Statistical Models: Theory and Practice, Cambridge University Press. ISBN 978-0-521-67105-7

[44] McCarney R, Warner J, Iliffe S, van Haselen R, Griffin M, Fisher P (2007). "The Hawthorne Effect: a randomised, controlled trial". BMC Med Res Methodol. 7 (1): 30. doi:10.1186/1471-2288-7-30. PMC 1936999. PMID 17608932.

[45] Rothman, Kenneth J; Greenland, Sander; Lash, Timothy, eds. (2008). "7". Modern Epidemiology (3rd ed.). Lippincott Williams & Wilkins. p. 100. ISBN 978-0781755641.

[46] Mosteller, F.; Tukey, J.W (1977). Data analysis and regression. Boston: Addison-Wesley.

[47] Nelder, J.A. (1990). The knowledge needed to computerise the analysis and interpretation of statistical information. In Expert systems and artificial intelligence: the need for information about data. Library Association Report, London, March, 23–27.

[48] Chrisman, Nicholas R (1998). "Rethinking Levels of Measurement for Cartography". Cartography and Geographic Information Science. 25 (4): 231–242. Bibcode:1998CGISy..25..231C. doi:10.1559/152304098782383043.

[49] van den Berg, G. (1991). Choosing an analysis method. Leiden: DSWO Press

[50] Hand, D.J. (2004). Measurement theory and practice: The world through quantification. London: Arnold.

[51] Mann, Prem S. (1995). Introductory Statistics (2nd ed.). Wiley. ISBN 0-471-31009-3.

[52] "Descriptive Statistics | Research Connections". www.researchconnections.org. Retrieved 2023-01-10.

[53] Upton, G., Cook, I. (2008) Oxford Dictionary of Statistics, OUP. ISBN 978-0-19-954145-4.

[54] "Basic Inferential Statistics - Purdue OWL® - Purdue University". owl.purdue.edu. Retrieved 2023-01-10.

[55] Piazza Elio, Probabilità e Statistica, Esculapio 2007

[56] Everitt, Brian (1998). The Cambridge Dictionary of Statistics. Cambridge, UK New York: Cambridge University Press. ISBN 0521593468.

[57] "Cohen (1994) The Earth Is Round (p < .05)". YourStatsGuru.com. Archived from the original on 2015-09-05. Retrieved 2015-07-20.

[58] Rubin, Donald B.; Little, Roderick J.A., Statistical analysis with missing data, New York: Wiley 2002

[59] Ioannidis, J.P.A. (2005). "Why Most Published Research Findings Are False". PLOS Medicine. 2 (8): e124. doi:10.1371/journal.pmed.0020124. PMC 1182327. PMID 16060722.

[60] Huff, Darrell (1954) How to Lie with Statistics, WW Norton & Company, Inc. New York. ISBN 0-393-31072-8

[61] Warne, R. Lazo; Ramos, T.; Ritter, N. (2012). "Statistical Methods Used in Gifted Education Journals, 2006–2010". Gifted Child Quarterly. 56 (3): 134–149. doi:10.1177/0016986212444122. S2CID 144168910.

[62] Drennan, Robert D. (2008). "Statistics in archaeology". In Pearsall, Deborah M. (ed.). Encyclopedia of Archaeology. Elsevier Inc. pp. 2093–2100. ISBN 978-0-12-373962-9.

[63] Cohen, Jerome B. (December 1938). "Misuse of Statistics". Journal of the American Statistical Association. 33 (204). JSTOR: 657–674. doi:10.1080/01621459.1938.10502344.

[64] Freund, J.E. (1988). "Modern Elementary Statistics". Credo Reference.

[65] Huff, Darrell; Irving Geis (1954). How to Lie with Statistics. New York: Norton. The dependability of a sample can be destroyed by [bias]... allow yourself some degree of skepticism.

[66] Nelder, John A. (1999). "From Statistics to Statistical Science". Journal of the Royal Statistical Society. Series D (The Statistician). 48 (2): 257–269. doi:10.1111/1467-9884.00187. ISSN 0039-0526. JSTOR 2681191. Archived from the original on 2022-01-15. Retrieved 2022-01-15.

[67] Nikoletseas, M.M. (2014) "Statistics: Concepts and Examples." ISBN 978-1500815684

[68] Anderson, D.R.; Sweeney, D.J.; Williams, T.A. (1994) Introduction to Statistics: Concepts and Applications, pp. 5–9. West Group. ISBN 978-0-314-03309-3

[69] "Journal of Business & Economic Statistics". Journal of Business & Economic Statistics. Taylor & Francis. Archived from the original on 27 July 2020. Retrieved 16 March 2020.

[70] Natalia Loaiza Velásquez, María Isabel González Lutz & Julián Monge-Nájera (2011). "Which statistics should tropical biologists learn?" (PDF). Revista Biología Tropical. 59: 983–992. Archived (PDF) from the original on 2020-10-19. Retrieved 2020-04-26.

[71] Pekoz, Erol (2009). The Manager's Guide to Statistics. Erol Pekoz. ISBN 978-0979570438.

[72] "Aims and scope". Journal of Business & Economic Statistics. Taylor & Francis. Archived from the original on 23 June 2021. Retrieved 16 March 2020.

[73] "Journal of Business & Economic Statistics". Journal of Business & Economic Statistics. Taylor & Francis. Archived from the original on 27 July 2020. Retrieved 16 March 2020.

[74] Numerous texts are available, reflecting the scope and reach of the discipline in the business world:
  Sharpe, N. (2014). Business Statistics, Pearson. ISBN 978-0134705217
  Wegner, T. (2010). Applied Business Statistics: Methods and Excel-Based Applications, Juta Academic. ISBN 0702172863
Two open textbooks are:
  Holmes, L., Illowsky, B., Dean, S. (2017). Introductory Business Statistics. Archived 2021-06-16 at the Wayback Machine.
  Nica, M. (2013). Principles of Business Statistics. Archived 2021-05-18 at the Wayback Machine.

[75] Cline, Graysen (2019). Nonparametric Statistical Methods Using R. EDTECH. ISBN 978-1-83947-325-8. OCLC 1132348139. Archived from the original on 2022-05-15. Retrieved 2021-09-16.

[76] Palacios, Bernardo; Rosario, Alfonso; Wilhelmus, Monica M.; Zetina, Sandra; Zenit, Roberto (2019-10-30). "Pollock avoided hydrodynamic instabilities to paint with his dripping technique". PLOS ONE. 14 (10): e0223706. Bibcode:2019PLoSO..1423706P. doi:10.1371/journal.pone.0223706. ISSN 1932-6203. PMC 6821064. PMID 31665191.

