This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

classification
Title: cPickle can misread data type
Type: behavior Stage: resolved
Components: Library (Lib) Versions: Python 2.6
process
Status: closed Resolution: wont fix
Dependencies: Superseder:
Assigned To: alexandre.vassalotti Nosy List: ac.james, alexandre.vassalotti, georg.brandl, mark.dickinson
Priority: normal Keywords: patch

Created on 2009-06-15 23:19 by ac.james, last changed 2022-04-11 14:56 by admin. This issue is now closed.

Files
File name Uploaded Description Edit
cPicktest.py ac.james, 2009-06-22 21:01 test case
issue6290.patch mark.dickinson, 2010-07-12 15:35
Messages (9)
msg89418 - (view) Author: Alex James (ac.james) Date: 2009-06-15 23:19
When using cPickle to pickle / unpickle an object instance whose
__dict__ contains a dictionary of NumPy Arrays (on a windows32 system),
some of the array elements have the wrong type raising a ValueError:
could not convert string to float.  

On UNIX platform this error does not occur, and the data is read out in
the correct type every time.  
Forcing the caller to use pickle.py module instead removed the issue.

Statements about the imprecision of cPickle (such as issue1536, 655802),
or its deprecaition (now I can't find where that was mentioned), would
assist.  By contrast the current state of the documentation implies that
cPickle is better overall, and thus should be used preferentially.
msg89421 - (view) Author: Alexandre Vassalotti (alexandre.vassalotti) * (Python committer) Date: 2009-06-16 01:24
Could you provide a test case? The behaviour you are describing sounds
like a bug in cPickle.
msg89612 - (view) Author: Alex James (ac.james) Date: 2009-06-22 21:01
I have now pinpointed the error to a list of infinities (see attached).
When using pickle.py to read the cPickle'd data we get a different, and
more, informative error:
ValueError: invalid literal for float(): 1.#INF
msg89619 - (view) Author: Alexandre Vassalotti (alexandre.vassalotti) * (Python committer) Date: 2009-06-23 01:24
Thanks for the test case. I will take a look.
msg89620 - (view) Author: Alexandre Vassalotti (alexandre.vassalotti) * (Python committer) Date: 2009-06-23 01:52
Could you give me the output of this?

  import cPickle
  print repr(cPickle.dumps([float('+inf'), float('-inf'), float('nan')]))
  print [float('+inf'), float('-inf'), float('nan')]

By the way, are you sure this bug occurs on Python 2.6? Python 2.6 uses
a platform-independent float to string converter (i.e.,
PyOS_double_to_string) which shouldn't output stuff like "1.#INF"

Also, can you verify that the bug does not occur with pickle protocol 1
and over?
msg89638 - (view) Author: Alex James (ac.james) Date: 2009-06-23 18:40
Your test prints:
'(1p1\nF1.#INF\naF-1.#INF\naF-1.IND\na.'
[inf, -inf, nan]

My installation is Python 2.6.2 as currently distributed.

Specifying protocol 1 or 2 does circumvent the error.  
Thank you.
msg110094 - (view) Author: Mark Dickinson (mark.dickinson) * (Python committer) Date: 2010-07-12 15:20
I think this can be closed.  It should no longer be a problem in Python 2.7 or Python 3.x, and there's a workaround (use protocol 1 or 2) for Python 2.6.

In theory, it *could* still be fixed for Python 2.6.6, but changing the pickle output in a bugfix release seems like it might be dangerous.
msg110097 - (view) Author: Mark Dickinson (mark.dickinson) * (Python committer) Date: 2010-07-12 15:35
However, here's a patch.  I haven't tested it on Windows.
msg112952 - (view) Author: Alexandre Vassalotti (alexandre.vassalotti) * (Python committer) Date: 2010-08-05 07:23
It is too late for 2.6.6 now that it is released.
History
Date User Action Args
2022-04-11 14:56:50adminsetgithub: 50539
2010-08-05 07:23:49alexandre.vassalottisetstatus: open -> closed
resolution: wont fix
messages: + msg112952

stage: resolved
2010-07-16 18:03:20belopolskylinkissue9267 dependencies
2010-07-12 15:35:28mark.dickinsonsetfiles: + issue6290.patch
keywords: + patch
messages: + msg110097
2010-07-12 15:20:05mark.dickinsonsetnosy: + mark.dickinson
messages: + msg110094
2009-06-23 18:41:45ac.jamessetnosy: georg.brandl, alexandre.vassalotti, ac.james
components: + Library (Lib), - Documentation, Extension Modules, Windows
2009-06-23 18:40:57ac.jamessetnosy: georg.brandl, alexandre.vassalotti, ac.james
messages: + msg89638
components: + Documentation, Extension Modules, Windows, - Library (Lib)
2009-06-23 01:52:29alexandre.vassalottisetmessages: + msg89620
2009-06-23 01:24:26alexandre.vassalottisetassignee: georg.brandl -> alexandre.vassalotti
messages: + msg89619
components: + Library (Lib), - Documentation, Extension Modules, Windows
nosy: georg.brandl, alexandre.vassalotti, ac.james
2009-06-22 21:01:05ac.jamessetfiles: + cPicktest.py

messages: + msg89612
2009-06-16 01:24:57alexandre.vassalottisetnosy: + alexandre.vassalotti
messages: + msg89421
2009-06-15 23:19:52ac.jamescreate