Message196971
Since Python 2.1 [1], when random.shuffle was added, the documentation has said:
"""Note that for even rather small len(x), the total number of permutations of x is larger than the period of most random number generators; this implies that most permutations of a long sequence can never be generated."""
This comment is incorrect and misleading. In fact, I claim that shuffle can produce "all" permutations for any representable sequence.
To shuffle a sequence of length N requires log(N!) ~ N * log(N/e) bits of randomness [2]. The random module provides a generator with "a period of 2**19937-1", meaning you can get 2**19937 bits of randomness out of it before it starts repeating.
All of which is to say that any representable sequence, say N < 2**50, will need no more than 2**60 bits of randomness to shuffle. That is well within the period of the random number generator.
Attached is a patch that deletes the comment.
An illustration of this misconception is at [3].
1: http://docs.python.org/release/2.1/lib/module-random.html
2: https://en.wikipedia.org/wiki/Factorial#Rate_of_growth_and_approximations_for_large_n
3: http://stackoverflow.com/questions/3062741/maximal-length-of-list-to-shuffle-with-python-random-shuffle |
|
Date |
User |
Action |
Args |
2013-09-05 00:18:34 | dbenbenn | set | recipients:
+ dbenbenn, docs@python |
2013-09-05 00:18:34 | dbenbenn | set | messageid: <1378340314.25.0.0317280223826.issue18928@psf.upfronthosting.co.za> |
2013-09-05 00:18:33 | dbenbenn | link | issue18928 messages |
2013-09-05 00:18:32 | dbenbenn | create | |
|