This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author Journeyman08
Recipients Journeyman08, docs@python
Date 2015-02-25.18:40:45
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1424889646.1.0.160219400081.issue23522@psf.upfronthosting.co.za>
In-reply-to
Content
In the statistics module documentation, there is a note that states that 

"The mean is strongly affected by outliers and is not a robust estimator for central location: the mean is not necessarily a typical example of the data points. For more robust, although less efficient, measures of central location, see median() and mode()"

https://docs.python.org/3/library/statistics.html

While I appreciate the intention, this is quite misleading.  The implication is that the mean, median and mode are different ways to estimate one "central location", however, in reality they are very different things (albeit which refer to a similar notion).

The sample mean is an unbiased estimator of the true mean but it need not be unbiased as an estimator of the true median or modes and vice versa for the median and mode.

To make this clearer I would rephrase to 

"The mean is strongly affected by outliers and is not necessarily representative of the central tendency of the data. For cases with large outliers or very low sample size, see median() and mode()"

Apologies if this is seen as frivolous, but statistics can be hard enough to remain very clear about even when the words are used precisely.
History
Date User Action Args
2015-02-25 18:40:46Journeyman08setrecipients: + Journeyman08, docs@python
2015-02-25 18:40:46Journeyman08setmessageid: <1424889646.1.0.160219400081.issue23522@psf.upfronthosting.co.za>
2015-02-25 18:40:46Journeyman08linkissue23522 messages
2015-02-25 18:40:45Journeyman08create