API: read_csv,from_csv/to_csv keyword consistency #9568

rmorgans · 2015-03-01T12:33:52Z

method naming consistency issue #577

to_csv uses line_terminator and read_csv uses lineterminator
compression kw ENH: DataFrame.to_csv support for "compression='gzip'" #7615 (ENH: added compression kw to to_csv GH7615 #11219)
sep/delimiter read_csv/to_csv sep/delimiter inconsistency #7662
from_csv differs from read_csv DataFrame.from_csv undocumented behavior of index_col #9556 (DOC: clarify purpose of DataFrame.from_csv (GH4191) #10163)
decimal Add support of 'decimal' option to Series.to_csv and Dataframe.to_csv #8448 (671c4b3)

The text was updated successfully, but these errors were encountered:

jreback · 2015-03-01T15:42:37Z

@rmorgans yep. I created a master issue for things that I recall, but I am sure there are some more. Would you like to do a full comparison of the read_csv/to_csv options to see if there any other conventions that are inconsisten (and include from_csv) here as well.

Ideally we would like to make most consistent name work and deprecate the others. Would you like to do a pull-request?

Dobatymo · 2017-11-03T09:55:25Z

all other issues referenced here are closed.
only

to_csv uses line_terminator and read_csv uses lineterminator

still applies.

jreback · 2017-11-04T18:31:37Z

@Dobatymo updated. would you like to do a PR to deprecate line_terminator in .to_csv?

Dobatymo · 2017-11-06T01:49:39Z

Not sure how to deprecate something. Should I add lineterminator and add to the docs that line_terminator is deprecated? Or directly delete it?

TomAugspurger · 2017-11-06T14:58:30Z

Not sure how to deprecate something

If you look in pandas/utils/_decorators.py you'll see a deprecate_kwarg. You can use that to and it'll emit a warning if the incorrect version is passed.

We'll also want to

update all the docs and tests to use lineterminator
Add a new tests that uses line_terminator and checks that the deprecation warning was emitted
Add a release note to whatsnew/v0.21.1.txt (@jreback we're OK with adding deprecation warnings in point releases, correct?)

jorisvandenbossche · 2017-11-06T22:35:45Z

Add a release note to whatsnew/v0.21.1.txt (@jreback we're OK with adding deprecation warnings in point releases, correct?)

Let's keep that for 0.22.0. Introducing deprecation warnings is somehow changing behaviour in the sense that it gives annoying warnings one needs to change code for to get rid of them. I would keep 0.21.1 for essential bug fixes.

But apart from that, actual question: is there a reason we would prefer lineterminator over line_terminator ? (I find the latter more readable)

jreback · 2017-11-07T19:56:59Z

most 2 word options are not _ separates in read_csv and we chose previously not to change them
so lineterminator is consistent (agree it’s not as nice)

no depreciations in point release so this for 0.22

jorisvandenbossche · 2017-11-07T20:33:29Z

most 2 word options are not _ separates in read_csv and we chose previously not to change them
so lineterminator is consistent (agree it’s not as nice)

That is not completely true (I mean we have both), eg: index_col, mangle_dupe_cols, true_values, na_values, keep_default_na, skip_blank_lines, parse_dates, infer_datetime_format, ... etc all use the _ naming scheme.
It seems that it are mainly the keyword inherited from the stdlib csv module that are not with underscores. And, lineterminator is one of those. So in read_csv we followed the csv module, in to_csv not.

Personally I find that not enough reason to choose for lineterminator in to_csv.
Another option is also to document it as line_terminator but still accept lineterminator in read_csv for compatibility with csv dialects.

jreback · 2017-11-07T20:58:07Z

yes I think we have an issue about accepting both. personally I would just do that, its really no big deal to do that.

GGordonGordon · 2018-03-05T03:34:06Z

@jreback / @jorisvandenbossche - I was thinking of picking this up and working on it but wanted to clarify is the objective for this issue to still deprecate line_terminator in to_csv and replace it with lineterminator to mirror read_csv or is the objective to allow both parameters in both locations?

mangecoeur · 2019-04-30T09:38:14Z

Bumping/adding to this: it would be good to have symmetry in the header keyword, when writing you set header=False to skip the header but when reading you set header=None (header=False is an error).

df.to_csv(header=False)

#Currently allowed
df = pd.read_csv(header=None)

#Currently an error
df = pd.read_csv(header=False)

pv8473h12 · 2019-10-12T10:53:52Z

If no-one else is working on this, I'd like to give it a go.

jbrockmendel · 2022-01-10T05:35:06Z

#35399 didn't achieve any consensus. In isolation I like line_terminator better, but for consistency with stdlib csv think we should standardize on lineterminator.

phofl · 2022-01-30T12:25:31Z

closed by #45302

jreback changed the title ~~csv line terminator inconsistency between read and write~~ API: read_csv/to_csv keyword consistency Mar 1, 2015

jreback added API Design IO CSV read_csv, to_csv labels Mar 1, 2015

jreback added this to the 0.17.0 milestone Mar 1, 2015

jreback added Good as first PR Deprecate Functionality to remove in pandas labels Mar 1, 2015

jreback changed the title ~~API: read_csv/to_csv keyword consistency~~ API: read_csv,from_csv/to_csv keyword consistency Mar 1, 2015

jorisvandenbossche mentioned this issue May 18, 2015

DOC: clarify purpose of DataFrame.from_csv (GH4191) #10163

Merged

TomAugspurger added the good first issue label Oct 11, 2017

jreback added good first issue and removed good first issue Difficulty Novice labels Dec 15, 2017

jbrockmendel added API - Consistency Internal Consistency of API/Behavior and removed API Design labels Dec 18, 2019

arw2019 mentioned this issue Jul 24, 2020

API: read_csv, to_csv line_terminator keyword inconsistency #35399

Closed

5 tasks

mroeschke added Needs Discussion Requires discussion from core team before further action and removed good first issue labels Apr 12, 2021

phofl closed this as completed Jan 30, 2022

phofl modified the milestones: Contributions Welcome, 1.5 Jan 30, 2022

phofl mentioned this issue Jan 30, 2022

DEPR: line_terminator->lineterminator GH#9569 #45302

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API: read_csv,from_csv/to_csv keyword consistency #9568

API: read_csv,from_csv/to_csv keyword consistency #9568

rmorgans commented Mar 1, 2015 •

edited by jreback

Loading

jreback commented Mar 1, 2015

Dobatymo commented Nov 3, 2017 •

edited

Loading

jreback commented Nov 4, 2017

Dobatymo commented Nov 6, 2017

TomAugspurger commented Nov 6, 2017

jorisvandenbossche commented Nov 6, 2017

jreback commented Nov 7, 2017

jorisvandenbossche commented Nov 7, 2017

jreback commented Nov 7, 2017

GGordonGordon commented Mar 5, 2018

mangecoeur commented Apr 30, 2019

pv8473h12 commented Oct 12, 2019

jbrockmendel commented Jan 10, 2022

phofl commented Jan 30, 2022

API: read_csv,from_csv/to_csv keyword consistency #9568

API: read_csv,from_csv/to_csv keyword consistency #9568

Comments

rmorgans commented Mar 1, 2015 • edited by jreback Loading

jreback commented Mar 1, 2015

Dobatymo commented Nov 3, 2017 • edited Loading

jreback commented Nov 4, 2017

Dobatymo commented Nov 6, 2017

TomAugspurger commented Nov 6, 2017

jorisvandenbossche commented Nov 6, 2017

jreback commented Nov 7, 2017

jorisvandenbossche commented Nov 7, 2017

jreback commented Nov 7, 2017

GGordonGordon commented Mar 5, 2018

mangecoeur commented Apr 30, 2019

pv8473h12 commented Oct 12, 2019

jbrockmendel commented Jan 10, 2022

phofl commented Jan 30, 2022

rmorgans commented Mar 1, 2015 •

edited by jreback

Loading

Dobatymo commented Nov 3, 2017 •

edited

Loading