-
-
Notifications
You must be signed in to change notification settings - Fork 18.5k
API: read_csv,from_csv/to_csv keyword consistency #9568
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
@rmorgans yep. I created a master issue for things that I recall, but I am sure there are some more. Would you like to do a full comparison of the Ideally we would like to make most consistent name work and deprecate the others. Would you like to do a pull-request? |
all other issues referenced here are closed.
still applies. |
@Dobatymo updated. would you like to do a PR to deprecate |
Not sure how to deprecate something. Should I add |
If you look in We'll also want to
|
Let's keep that for 0.22.0. Introducing deprecation warnings is somehow changing behaviour in the sense that it gives annoying warnings one needs to change code for to get rid of them. I would keep 0.21.1 for essential bug fixes. But apart from that, actual question: is there a reason we would prefer |
most 2 word options are not _ separates in read_csv and we chose previously not to change them no depreciations in point release so this for 0.22 |
That is not completely true (I mean we have both), eg: index_col, mangle_dupe_cols, true_values, na_values, keep_default_na, skip_blank_lines, parse_dates, infer_datetime_format, ... etc all use the _ naming scheme. Personally I find that not enough reason to choose for |
yes I think we have an issue about accepting both. personally I would just do that, its really no big deal to do that. |
@jreback / @jorisvandenbossche - I was thinking of picking this up and working on it but wanted to clarify is the objective for this issue to still deprecate |
Bumping/adding to this: it would be good to have symmetry in the df.to_csv(header=False)
#Currently allowed
df = pd.read_csv(header=None)
#Currently an error
df = pd.read_csv(header=False) |
If no-one else is working on this, I'd like to give it a go. |
#35399 didn't achieve any consensus. In isolation I like line_terminator better, but for consistency with stdlib csv think we should standardize on lineterminator. |
closed by #45302 |
method naming consistency issue #577
to_csv
uses line_terminator andread_csv
uses lineterminatorcompression
kw ENH: DataFrame.to_csv support for "compression='gzip'" #7615 (ENH: added compression kw to to_csv GH7615 #11219)sep/delimiter
read_csv/to_csv sep/delimiter inconsistency #7662from_csv
differs fromread_csv
DataFrame.from_csv undocumented behavior of index_col #9556 (DOC: clarify purpose of DataFrame.from_csv (GH4191) #10163)decimal
Add support of 'decimal' option to Series.to_csv and Dataframe.to_csv #8448 (671c4b3)The text was updated successfully, but these errors were encountered: