Issue 5091: Segfault in PyObject_Malloc(), address out of bounds

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

This issue has been migrated to GitHub: https://github.com/python/cpython/issues/49341

classification

Title:	Segfault in PyObject_Malloc(), address out of bounds
Type:	crash	Stage:	resolved
Components:	Interpreter Core	Versions:	Python 2.7

process

Status:	closed	Resolution:	not a bug
Dependencies:		Superseder:
Assigned To:		Nosy List:	christian.heimes, neologix, rfk, terry.reedy
Priority:	normal	Keywords:

Created on 2009-01-28 14:21 by christian.heimes, last changed 2022-04-11 14:56 by admin. This issue is now closed.

Files
File name	Uploaded	Description	Edit
core.27778.bz2	rfk, 2011-03-05 12:05

Messages (9)
msg80711 - (view)	Author: Christian Heimes (christian.heimes) *	Date: 2009-01-28 14:21
One of our application recently started to segfault in PyObject_Malloc(). The cause of the problem could be tracked down to an overflowing internal cache. However I was astonished that Python was segfaulting instead of raising a memory exception. I did some post mortem debugging with gdb and found an address out of bounds problem. I think the issue is caused by the limited heap of a 32bit Python process. (gdb) bt ... #7 <signal handler called> #8 PyObject_Malloc (nbytes=40) at Objects/obmalloc.c:747 #9 0xb7edfba5 in _PyObject_GC_Malloc (basicsize=28) at Modules/gcmodule.c:1322 #10 0xb7e79867 in PyType_GenericAlloc (type=0xb7606d40, nitems=0) at Objects/typeobject.c:454 ... (gdb) up 8 #8 PyObject_Malloc (nbytes=40) at Objects/obmalloc.c:747 747 if ((pool->freeblock = (block )bp) != NULL) { (gdb) print pool $1 = (poolp) 0x17ecc000 (gdb) print pool->freeblock $2 = (block ) 0xecc778b7 <Address 0xecc778b7 out of bounds> (gdb) print bp $3 = (block *) 0xecc778b7 <Address 0xecc778b7 out of bounds> Python: 2.5.2 (32bit) OS: SuSE Linux 2.6.16.60-0.33-bigsmp
msg103246 - (view)	Author: Charles-François Natali (neologix) *	Date: 2010-04-15 19:08
It was a long time ago, but: - I think the interpreter will never be able to catch all memory allocation errors, since because of overcommitting (which Linux does), you can very well get a segmentation fault even if the memory allocation routine (be it malloc or mmap) is successful (i.e. doesn't return NULL): so basically, we don't have any way to avoid that, since we will segfault only when we first touch the allocated page - since your application was "overflowing internal cache", this might very well be what happened (or it could have been a memory corruption issue :-) Peers ?
msg112761 - (view)	Author: Terry J. Reedy (terry.reedy) *	Date: 2010-08-04 05:03
Is this reproducible? Has is occurred with 2.7 or 3.x? Or should be close this?
msg130110 - (view)	Author: Ryan Kelly (rfk)	Date: 2011-03-05 09:23
Not sure if it's caused by the same thing, but I just got a segfault on the same line in my own program. Running python 2.7.1. I will try to dig out some more useful info but it's been a long time since I chased a segfault...
msg130113 - (view)	Author: Charles-François Natali (neologix) *	Date: 2011-03-05 11:02
Do you have a coredump ? It'd be curious to see this faulting address. I didn't notice the first time, but in the OP case the address is definitely wrong: 0xecc778b7 is above PAGE_OFFSET (0xc0000000 on x86), so unless he's using a kernel with a 4G/4G split (and it's not the case since it seems to be a PAE kernel), it's definitely an invalid/corrupt address...
msg130115 - (view)	Author: Ryan Kelly (rfk)	Date: 2011-03-05 11:15
Please remind me how to obtain an appropriate coredump (as I said, it's been a long time...) Doing "print bp" shows an out-of-bounds address as for the original submitter.
msg130116 - (view)	Author: Ryan Kelly (rfk)	Date: 2011-03-05 12:05
attaching core dump from a freshly-compiled python 2.7.1 at with "-O0 -g" in CFLAGS. The code that is segfaulting is using pycrypto and sqlite3, so it may be that a bug in one of these is trampling on something. No idea how to investigate any further.
msg130117 - (view)	Author: Charles-François Natali (neologix) *	Date: 2011-03-05 12:45
> The code that is segfaulting is using pycrypto and sqlite3, so it may be that a bug in one of these is trampling on something. No idea how to investigate any further. You could try valgrind: $ valgrind --tool=memcheck -o /tmp/output.log <prog> arguments This slows down the execution, but can reveal certain types of memory corruption.
msg130195 - (view)	Author: Ryan Kelly (rfk)	Date: 2011-03-06 21:56
Thanks for the help, I have tracked this down to a bug in PyCrypto. It was increfing an object once but decrefing it twice. Sorry for the noise.

History
Date	User	Action	Args
2022-04-11 14:56:44	admin	set	github: 49341
2011-03-06 22:37:45	terry.reedy	set	status: open -> closed
2011-03-06 22:13:49	SilentGhost	set	resolution: not a bug stage: test needed -> resolved
2011-03-06 21:56:03	rfk	set	messages: + msg130195
2011-03-05 12:45:38	neologix	set	messages: + msg130117
2011-03-05 12:05:11	rfk	set	files: + core.27778.bz2 messages: + msg130116
2011-03-05 11:15:21	rfk	set	messages: + msg130115
2011-03-05 11:02:31	neologix	set	messages: + msg130113
2011-03-05 09:23:40	rfk	set	status: pending -> open nosy: + rfk messages: + msg130110
2010-08-04 05:03:16	terry.reedy	set	status: open -> pending versions: + Python 2.7, - Python 2.5 nosy: + terry.reedy messages: + msg112761
2010-04-15 19:08:22	neologix	set	nosy: + neologix messages: + msg103246
2009-01-28 14:21:06	christian.heimes	create