Hi, I have an issue which might be related to this:
I'm running the following basic code:
dfMain.to_csv('./January_filtered_International_WE.csv')
which used to run normally until yesterday. Yesterday morning I upgraded to pandas 0.25.0 while code was running, and now I cannot write a 500k-row (single-level indexed) DataFrame to a CSV. I should mention that I left Jupyter Notebook running to do some processing, so when I opened it this morning the processed DataFrame was already in memory.
Versions (using Windows 10)
Jupyter Notebook : 5.7.8 (Edit: 6.0.0 - upgraded with Administrator permissions - errors still persist)
Python : 3.6.7
Pandas : 0.25.0
I would like to save my DataFrame in a fast and efficient manner as I will load it several times in the future. I do not want to close the notebook as this will delete the dataFrame.
I tried:
- Pandas 0.24.2 (previous version used) but still getting the __init__() got an unexpected keyword argument 'tupleize_cols'
- pd.to_pickle but got a MemoryError
- pd.to_hdf but got a MemoryError

ERROR: ipython 5.8.0 has requirement prompt-toolkit<2.0.0,>=1.0.4, but you'll have prompt-toolkit 2.0.9 which is incompatible
ERROR: jupyter-console 6.0.0 has requirement prompt-toolkit<2.1.0,>=2.0.0, but you'll have prompt-toolkit 1.0.16 which is incompatible.
Edit:
With Admin privileges it managed to upgrade Jupyter.
As an alternative I went into PyCharm and took a random DataFrame.to_csv and it worked. This makes me think the issue is with Jupyter Notebook.
P.S.: Any help on how to save the DataFrame (~12 GB) is also appreciated!
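On saving the ~12 GB DataFrame from the still-running session: one option that may be worth trying is to pickle it in row chunks, so that only one slice is serialized at a time. This is a minimal sketch, not a confirmed fix for the errors above. The helper names `row_slices` and `save_in_chunks`, the `part_00000.pkl` file naming, and the default chunk size are all hypothetical choices for illustration.

```python
import os


def row_slices(n_rows, chunk_rows):
    """Yield (start, stop) row bounds that cover range(n_rows) in order."""
    if chunk_rows <= 0:
        raise ValueError("chunk_rows must be positive")
    for start in range(0, n_rows, chunk_rows):
        yield start, min(start + chunk_rows, n_rows)


def save_in_chunks(df, out_dir, chunk_rows=50_000):
    """Pickle df one row slice at a time and return the files written.

    Assumption: df is a pandas DataFrame already held in memory.
    File names and chunk size are illustrative, not a pandas convention.
    """
    os.makedirs(out_dir, exist_ok=True)
    paths = []
    for i, (start, stop) in enumerate(row_slices(len(df), chunk_rows)):
        path = os.path.join(out_dir, "part_{:05d}.pkl".format(i))
        # Only this slice is serialized, which keeps the extra memory small.
        df.iloc[start:stop].to_pickle(path)
        paths.append(path)
    return paths
```

In the notebook this would be called as `paths = save_in_chunks(dfMain, './dfMain_parts')`, and in a fresh session the pieces could be reassembled with `pd.concat([pd.read_pickle(p) for p in paths])`. Whether this avoids the MemoryError depends on how much free memory the session has left.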
As an alternative I went into PyCharm and took a random DataFrame.to_csv and it worked. This makes me think the issue is with Jupyter Notebook.
Unfortunately this sounds like more of an issue with your configuration than pandas, and it's almost impossible to advise anything there. If you get a traceback from Jupyter you can maybe try there but again will be hard to say
I do not want to close the notebook as this will delete the dataFrame.
I'm not sure you can upgrade / downgrade versions and have it take effect while a notebook is running
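A quick way to check this from inside the running notebook is to compare the version each module reports in the live process with the version currently installed on disk. The sketch below assumes Python 3.8+ for `importlib.metadata` (on older interpreters, such as the 3.6.7 mentioned above, the installed version is simply reported as `None`). The helper name `loaded_and_installed` is hypothetical.

```python
import sys

try:
    # Available in the standard library from Python 3.8 onward.
    from importlib.metadata import version, PackageNotFoundError
except ImportError:
    version = None
    PackageNotFoundError = Exception


def loaded_and_installed(name):
    """Return (version loaded in this process, version installed on disk).

    Either element is None when it cannot be determined.
    """
    mod = sys.modules.get(name)  # only set if already imported here
    loaded = getattr(mod, "__version__", None) if mod is not None else None
    installed = None
    if version is not None:
        try:
            installed = version(name)
        except PackageNotFoundError:
            installed = None
    return loaded, installed


for pkg in ("pandas", "numpy"):
    loaded, installed = loaded_and_installed(pkg)
    print(pkg, "loaded in kernel:", loaded, "| installed on disk:", installed)
```

If the two values differ for a package, the kernel is still using modules imported before the upgrade or downgrade, which could explain errors that mix old and new code paths.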
Thanks - indeed it seems it goes deeper than just pandas
I just had the same issue but for me it was an incompatibility between numpy and pandas.
- to_csv worked fine
- upgraded pandas to 0.25 for new functionality and it broke
- upgraded numpy from 1.15.0 to 1.17.0 and it worked again

I am not sure if you guys classify that as a bug
got the same error. downgraded pandas to 0.24.2 and worked again!
numpy is 1.16.3