Issue 21922: PyLong: use GMP

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

This issue has been migrated to GitHub: https://github.com/python/cpython/issues/66121

classification

Title:	PyLong: use GMP
Type:	performance	Stage:
Components:	Interpreter Core	Versions:	Python 3.5

process

Status:	closed	Resolution:	rejected
Dependencies:		Superseder:
Assigned To:		Nosy List:	Sergey.Kirpichev, casevh, h.venev, josh.r, mark.dickinson, pitrou, rhettinger, skrah, tim.peters, vstinner
Priority:	normal	Keywords:	patch

Created on 2014-07-05 12:22 by h.venev, last changed 2022-04-11 14:58 by admin. This issue is now closed.

Files
File name	Uploaded	Description	Edit
pygmp.patch	h.venev, 2014-07-05 12:51		review

Messages (22)
msg222347 - (view)	Author: Hristo Venev (h.venev) *	Date: 2014-07-05 12:22
I have implemented the PyLong interface using the GMP mpn functions. API/ABI compatibility is retained (except for longintrepr). It can be enabled by passing --enable-big-digits=gmp to ./configure. No large performance regressions have been observed for small numbers (a few operations are about 10% slower). For large numbers some operations are a lot faster. There is also int.__gcd__ which may be used by fractions.gcd. The GIL is sometimes released. Minimum number of digis for releasing GIL: - multiplication - 64 - division - 64, - modular exponentiation - 16, - base conversion - 64 (256 for binary bases) - GCD - 16 The tests for long, float, decimal, fractions, string, unicode, bytes, pickle, marshal and enum pass. The tests for int fail because the error messages are a bit different (when creating int from bytes or bytearray the value is not shown). I may have run other tests and they have not failed. I have not tested on anything but x86-64. The following testcases yield 42x performace improvement: - 16384-bit RSA on 8 threads on quad-core with HT # GIL released - Multiplying 5600000-bit ints - Dividing 6000000-bit ints - Converting 300000-character str to int(base=10) - Converting 1250000-bit int to str
msg222348 - (view)	Author: Stefan Krah (skrah) *	Date: 2014-07-05 12:30
Did you mean to upload a patch?
msg222350 - (view)	Author: STINNER Victor (vstinner) *	Date: 2014-07-05 12:43
Hi, I worked on a similar patch 6 years ago, while Python 3.0 was developped: https://mail.python.org/pipermail/python-dev/2008-November/083315.html http://bugs.python.org/issue1814 The summary is that using GMP makes Python slower because most numbers are small: fit in [-2^31; 2^31-1], and GMP allocation is expensive. There is also a license issue: GMP license is GPL which is not compatible with the Python license. If you want to work on large numbers, you can gmpy: https://code.google.com/p/gmpy/ """ The following testcases yield 42x performace improvement: - 16384-bit RSA on 8 threads on quad-core with HT # GIL released - Multiplying 5600000-bit ints - Dividing 6000000-bit ints - Converting 300000-character str to int(base=10) - Converting 1250000-bit int to str """ That's not a common use case. Run the Python benchmark suite with your patch to see if your patch has a similar overhead than my old patch. http://hg.python.org/benchmarks
msg222351 - (view)	Author: Hristo Venev (h.venev) *	Date: 2014-07-05 12:51
PyLongObject is a PyVarObject. It contains many mp_limb_t's. There is little overhead. For some operations if the result is in [-20;256] no memory will be allocated. There are special codepaths for 1-limb operations. And I just finished GDB support. Please test if it works for you.
msg222361 - (view)	Author: Stefan Krah (skrah) *	Date: 2014-07-05 13:32
Hmm, the license (LGPL) should only matter for the Windows binaries and we can just compile without --enable-big-digits=gmp. Even if the Windows binaries were built with gmp support, it would be sufficient for any redistributor to point to the external library sources for gmp that the build has actually used -- your own code does not automatically become LGPL licensed and you are under no obligation to reveal it. I haven't looked at the patch in detail, but I don't have any fundamental objection against adding a configure option, provided that the additional code is well isolated (which seems to be the case). Of course we'd have to set up buildbots for the option etc...
msg222368 - (view)	Author: Hristo Venev (h.venev) *	Date: 2014-07-05 15:42
After some minor optimizations my implementation is about 1.8% slower on pystone and about 4% slower on bm_nqueens. It's 4 times faster on bm_pidigits.
msg222369 - (view)	Author: STINNER Victor (vstinner) *	Date: 2014-07-05 15:49
Please try the Python benchmark suite.
msg222370 - (view)	Author: Mark Dickinson (mark.dickinson) *	Date: 2014-07-05 15:54
I do have an objection to adding the configure option: from that point on, it means that maintaining the GMP-based long implementation is now the responsibility of the core developers, and I think that's an unnecessary maintenance burden, for an option that few users will care about. I think having two long integer implementations in the core is worse than having one.
msg222371 - (view)	Author: STINNER Victor (vstinner) *	Date: 2014-07-05 16:00
> I think having two long integer implementations in the core is worse than having one. I agree. If the GMP implementation is accepted, the old implementation must be dropped and replaced by the GMP implementation.
msg222372 - (view)	Author: Mark Dickinson (mark.dickinson) *	Date: 2014-07-05 16:21
> If the GMP implementation is accepted, the old implementation must be > dropped and replaced by the GMP implementation. Agreed. I'm open to that, but it's critical that common use-cases (i.e., those not using 1000-digit integers!) aren't slowed down.
msg222389 - (view)	Author: Antoine Pitrou (pitrou) *	Date: 2014-07-06 02:19
Note that we could probably release the GIL in the current implementation, too - we just haven't bothered adding such an optimization.
msg222507 - (view)	Author: Hristo Venev (h.venev) *	Date: 2014-07-07 19:59
After optimization, tests on small ints (< 230) Currently only addition, subtraction, negation and ~ are a bit slower (< 5%). Most other operations are the same. Bitwise operators, //, %, and pow are faster. Converting to and from strings is a bit faster. pickle, marshal and json are faster. bm_nqueens is a bit slower. pystone is a bit faster. There are no performance regressions in other benchmarks. When I fix +,- and ~ I will reupload the patch.
msg222508 - (view)	Author: STINNER Victor (vstinner) *	Date: 2014-07-07 20:05
IMO you must discuss the GMP license on the python-dev mailing list since we wrote that if the GMP patch is accepted, it will not be optional and so affect all platforms.
msg222547 - (view)	Author: Mark Dickinson (mark.dickinson) *	Date: 2014-07-08 06:53
> IMO you must discuss the GMP license on the python-dev mailing list I think the maintenance implications of having another external dependency would also need discussion on python-dev.
msg222548 - (view)	Author: Mark Dickinson (mark.dickinson) *	Date: 2014-07-08 07:44
Hmm. Looking back at my previous comments, I should explain my negativity a bit better. 1. Like Victor's issue 1814 work, this is great stuff, and very impressive. 2. I'm really not convinced that it belongs in the Python core. To expand on 2: we already have a simple, highly portable, battle-tested implementation of big integers that's reasonably efficient for normal uses and requires little day-to-day maintenance. We'd be replacing that with something that's a lot more complicated, less thoroughly tested, and not significantly more efficient in normal use-cases. Apart from the pain of the transition (and any refactor of this size is bound to involve some pain), I'm particularly worried about future headaches involved in maintaining the external GMP dependency: keeping up with bugfixes, managing the binary builds, etc. I anticipate that that would add quite of a lot of work for the core team in general and those building releases in particular. (And that's another reason that we should have a python-dev discussion, so that those involved in the release process get a chance to weigh in.) To offset that, there needs to be a clear benefit to making this change. A couple of specific questions for Hristo Venev: - What's the current status of GMP on Windows? IIUC, one of the motivations for the MPIR unfriendly fork of GMP was better Windows support. Does your patch already build cleanly on Windows? - Whom do you see this change benefiting? Those doing serious large-number stuff on Python (SAGE users, for example) are presumably already using gmpy2 or something similar. - Do you want to initiate the python-dev discussion, or leave that to someone else? [I'm deliberately steering clear of the licensing issues; it needs discussion, but IANAL and I have nothing useful to contribute here.]
msg222550 - (view)	Author: Mark Dickinson (mark.dickinson) *	Date: 2014-07-08 08:01
Previous python-dev discussions: https://mail.python.org/pipermail/python-dev/2008-November/083315.html https://mail.python.org/pipermail/python-3000/2007-September/010718.html [Regarding the ancient mpz module, which used to be part of Python] https://mail.python.org/pipermail/python-dev/2001-December/018967.html
msg222826 - (view)	Author: Mark Dickinson (mark.dickinson) *	Date: 2014-07-12 08:23
... and if there's one person who's very well placed to comment on the ease or difficulty of keeping up with GMP/MPIR (especially on Windows), it's Case Van Horsen, who I notice has recently added himself to the nosy. @casevh: any comments?
msg222851 - (view)	Author: Case Van Horsen (casevh)	Date: 2014-07-12 16:11
Disclaimer: as Mark alluded to, I maintain gmpy2. Some comments on MPIR/GMP: For all practical purposes, building GMP on Windows requires some version of the mingw compiler toolchain. None of the performance gains of custom assembly code is available if GMP is build with the VS compiler. When compiled with mingw, GMP supports CPU detection to automatically use code optimized for the specific instruction set. This level of optimization may not be needed for Python, though. The MPIR fork of GMP can be built with VS. Assembly code is supported via the YASM assembler plugin. Only a single instruction set is supported by the lib/dll. gmpy2 currently uses MPIR. I've had no issues with its stability. The mpz type has a maximum precision. IIRC, the maximum length is 2^31 bits on a 32-bit platform and 2^37 on a 64-bit platform. The mpn interface may or may not have the same restrictions. This might impact code that runs correctly, but slowly, with Python's normal PyLong implementation. GMP does not handle out-of-memory situations gracefully. When GMP encounters a memory allocation failure (exceeding the limits above or when running our of scratch space), it will just abort. It is easy in gmpy2 to trigger an abort that will crash the Python interpreter. My main concern is tightly linking the Python interpreter to a specific version of GMP (i.e. whatever version is used for the Windows builds or the version provided by the distribution). As long as gmpy2 can continue to use another version of GMP, it shouldn't matter to me. GMP and MPIR are both licensed under LGPL v3+ (not v2+). I'll reserve any further licensing discussions for python-dev. I'll try to test the patch this weekend and that should answer some of my questions.
msg222988 - (view)	Author: Raymond Hettinger (rhettinger) *	Date: 2014-07-14 01:00
I agree with all of Mark's objections. Unless there is a compelling win, Python is better-off without the maintenance, portability, and licensing issues. I have vague memories of this having been discussed a long time ago and it is unlikely that there have been any changes to the reasons for not doing it.
msg222999 - (view)	Author: Case Van Horsen (casevh)	Date: 2014-07-14 06:36
I've successfully tested the patch. The patch works fine but there are a couple of issues: 1) The patch relies on several new low-level functions that were introduced in the latest release of GMP 6.0.0. Most Linux distributions are still providing older versions. 2) The new functions are not available in any version of MPIR so I can't try a Windows build. I've done some basic tests but I'll wait until Hristo updates the patch with his improvements before I run detailed tests.
msg223044 - (view)	Author: Mark Dickinson (mark.dickinson) *	Date: 2014-07-14 17:55
> When GMP encounters a memory allocation failure (exceeding the limits above or when running our of scratch space), it will just abort. Ouch; that's not friendly. Seems like this is part of the reason that Armin Rigo abandoned the attempt to use GMP in PyPy. https://bitbucket.org/pypy/pypy/issue/892/
msg228044 - (view)	Author: STINNER Victor (vstinner) *	Date: 2014-10-01 00:42
Can we close this issue?

History
Date	User	Action	Args
2022-04-11 14:58:05	admin	set	github: 66121
2021-03-07 09:44:41	Sergey.Kirpichev	set	nosy: + Sergey.Kirpichev
2014-10-01 00:43:42	benjamin.peterson	set	status: open -> closed resolution: rejected
2014-10-01 00:42:50	vstinner	set	messages: + msg228044
2014-07-14 17:55:34	mark.dickinson	set	messages: + msg223044
2014-07-14 06:36:02	casevh	set	messages: + msg222999
2014-07-14 01:00:30	rhettinger	set	messages: + msg222988
2014-07-12 16:11:36	casevh	set	messages: + msg222851
2014-07-12 08:23:26	mark.dickinson	set	messages: + msg222826
2014-07-12 06:52:18	casevh	set	nosy: + casevh
2014-07-08 08:01:12	mark.dickinson	set	messages: + msg222550
2014-07-08 07:44:53	mark.dickinson	set	messages: + msg222548
2014-07-08 06:53:08	mark.dickinson	set	messages: + msg222547
2014-07-07 20:05:03	vstinner	set	messages: + msg222508
2014-07-07 19:59:23	h.venev	set	messages: + msg222507
2014-07-06 02:19:25	pitrou	set	nosy: + pitrou messages: + msg222389
2014-07-05 16:21:50	mark.dickinson	set	messages: + msg222372
2014-07-05 16:00:19	vstinner	set	messages: + msg222371
2014-07-05 15:54:30	mark.dickinson	set	nosy: + tim.peters
2014-07-05 15:54:21	mark.dickinson	set	messages: + msg222370
2014-07-05 15:49:33	vstinner	set	messages: + msg222369
2014-07-05 15:42:10	h.venev	set	messages: + msg222368
2014-07-05 14:13:56	josh.r	set	nosy: + josh.r
2014-07-05 13:32:50	skrah	set	messages: + msg222361
2014-07-05 12:51:54	h.venev	set	files: + pygmp.patch keywords: + patch messages: + msg222351
2014-07-05 12:43:01	vstinner	set	nosy: + vstinner messages: + msg222350
2014-07-05 12:30:54	ezio.melotti	set	nosy: + rhettinger, mark.dickinson
2014-07-05 12:30:21	skrah	set	nosy: + skrah messages: + msg222348
2014-07-05 12:22:24	h.venev	create