Pandas: ENH: Support mangle_dupe_cols=False in pd.read_csv()

Created on 23 May 2016  路  7Comments  路  Source: pandas-dev/pandas

12935 added full support for duplicate column names (in header or in names) by mangling them. While this has been considered _acceptable_ by users, ideally, we would like to not have to mangle them.

Enhancement IO CSV

Most helpful comment

Depending on how difficult this is, I would personally still have it as our goal to have mangle_dupe_cols=False implemented some time.

All 7 comments

@jreback : Given what you said in #17060, is this something we should still pursue ?

Depending on how difficult this is, I would personally still have it as our goal to have mangle_dupe_cols=False implemented some time.

What is the ETA on this issue?

when / if a community pull request happens

@caniko2 : This is quite a tricky one given that duplicate column names have unusual behavior in pandas. You are more than welcome to submit a PR to implement it if you like.

Could anyone help me to check whether current pandas 0.24.2 support "mangle_dupe_cols=False"?

I find docs at http://pandas.pydata.org/pandas-docs/stable/user_guide/io.html, showing : Passing in False will cause data to be overwritten if there are duplicate names in the columns.

Thanks so much!

Still no support, as behavior of data handling has proven to be quite non-trivial when there are duplicate column names. You are welcome to give it a shot though!

Was this page helpful?
0 / 5 - 0 ratings

Related issues

matthiasroder picture matthiasroder  路  3Comments

tade0726 picture tade0726  路  3Comments

Ashutosh-Srivastav picture Ashutosh-Srivastav  路  3Comments

mfmain picture mfmain  路  3Comments

nathanielatom picture nathanielatom  路  3Comments