Message 409000 - Python tracker

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author	tim.peters
Recipients	PedanticHacker, Stefan Pochmann, mark.dickinson, mcognetta, pablogsal, rhettinger, serhiy.storchaka, tim.peters
Date	2021-12-21.19:32:57
SpamBayes Score	-1.0
Marked as misclassified	Yes
Message-id	<1640115177.84.0.643458509671.issue37295@roundup.psfhosted.org>
In-reply-to

Content
Clever, Mark! Very nice. The justification for the shift count isn't self-evident, and appears to me to be an instance of the generalization of Kummer's theorem to multinomial coefficients. I think it would be clearer at first sight to rely instead on that 2i/(2j * 2k) = 2(i-j-k), which is shallow. So here's a minor rewrite doing that instead; it would add 68 bytes to the precomputed static data. import math # Largest n such that comb(n, k) fits in 64 bits for all k. Nmax = 67 # Express n! % 264 as Fodd[n] << Fntz[n] where Fodd[n] is odd. Fodd = [] # unsigned 8-byte ints Fntz = [] # unsigned 1-byte ints for i in range(Nmax + 1): f = math.factorial(i) lsb = f & -f # isolate least-significant 1 bit Fntz.append(lsb.bit_length() - 1) Fodd.append((f >> Fntz[-1]) % 264) Finv = [pow(fodd, -1, 2*64) for fodd in Fodd] # All of the above is meant to be precomputed; just static tables in C. # Fast comb for small values. def comb_small(n, k): if not 0 <= k <= n <= Nmax: raise ValueError("k or n out of range") return ((Fodd[n] Finv[k] * Finv[n-k] % 2**64) << (Fntz[n] - Fntz[k] - Fntz[n-k])) # Exhaustive test for n in range(Nmax+1): for k in range(0, n+1): assert comb_small(n, k) == math.comb(n, k) Since 99.86% of comb() calls in real life are applied to a deck of cards ;-) , it's valuable that Nmax be >= 52.

Clever, Mark! Very nice.

The justification for the shift count isn't self-evident, and appears to me to be an instance of the generalization of Kummer's theorem to multinomial coefficients. I think it would be clearer at first sight to rely instead on that 2**i/(2**j * 2**k) = 2**(i-j-k), which is shallow.

So here's a minor rewrite doing that instead; it would add 68 bytes to the precomputed static data.

    import math
    # Largest n such that comb(n, k) fits in 64 bits for all k.
    Nmax = 67

    # Express n! % 2**64 as Fodd[n] << Fntz[n] where Fodd[n] is odd.
    Fodd = [] # unsigned 8-byte ints
    Fntz = [] # unsigned 1-byte ints
    for i in range(Nmax + 1):
        f = math.factorial(i)
        lsb = f & -f # isolate least-significant 1 bit
        Fntz.append(lsb.bit_length() - 1)
        Fodd.append((f >> Fntz[-1]) % 2**64)

    Finv = [pow(fodd, -1, 2**64) for fodd in Fodd]

    # All of the above is meant to be precomputed; just static tables in C.

    # Fast comb for small values.
    def comb_small(n, k):
        if not 0 <= k <= n <= Nmax:
            raise ValueError("k or n out of range")
        return ((Fodd[n] * Finv[k] * Finv[n-k] % 2**64)
                << (Fntz[n] - Fntz[k] - Fntz[n-k]))

    # Exhaustive test
    for n in range(Nmax+1):
        for k in range(0, n+1):
            assert comb_small(n, k) == math.comb(n, k)

Since 99.86% of comb() calls in real life are applied to a deck of cards ;-) , it's valuable that Nmax be >= 52.

History
Date	User	Action	Args
2021-12-21 19:32:57	tim.peters	set	recipients: + tim.peters, rhettinger, mark.dickinson, serhiy.storchaka, PedanticHacker, mcognetta, Stefan Pochmann, pablogsal
2021-12-21 19:32:57	tim.peters	set	messageid: <1640115177.84.0.643458509671.issue37295@roundup.psfhosted.org>
2021-12-21 19:32:57	tim.peters	link	issue37295 messages
2021-12-21 19:32:57	tim.peters	create