This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author Dragoljub
Recipients Dragoljub
Date 2018-11-08.22:56:21
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1541717781.13.0.788709270274.issue35195@psf.upfronthosting.co.za>
In-reply-to
Content
xref: https://github.com/pandas-dev/pandas/issues/23516

Example:
import io
import pandas as pd
import numpy as np
df = pd.DataFrame(np.random.randn(1000000, 10), columns=('COL{}'.format(i) for i in range(10)))
csv = io.StringIO(df.to_csv(index=False))
df2 = pd.read_csv(csv) #3.5X slower on Python 3.7.1

pd.read_csv() reads data at 30MB/sec on Python 3.7.1 while at 100MB/sec on Python 3.6.7.

This issue seems to be only present on Windows 10 Builds both x86 & x64. 

Possibly some IO changes in Python 3.7 could have contributed to this slowdown on Windows but not on Linux?
History
Date User Action Args
2018-11-08 22:56:21Dragoljubsetrecipients: + Dragoljub
2018-11-08 22:56:21Dragoljubsetmessageid: <1541717781.13.0.788709270274.issue35195@psf.upfronthosting.co.za>
2018-11-08 22:56:21Dragoljublinkissue35195 messages
2018-11-08 22:56:21Dragoljubcreate