-
Notifications
You must be signed in to change notification settings - Fork 28
Update to_parquet/pyarrow tests #125
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
I'd like to work on this |
@datapythonista Since there are two things that need to be done (update the tests and docs), should these be done in two separate branches or just one? |
@datapythonista also, I've setup the latest version of arrow/pyarrow in my local, however I'm having trouble telling my conda environment to use the latest version of pyarrow. I've been searching for docs/tutorials that might help me with this but to no avail. Do you have any pointers regarding this? Thank you! |
@galuhsahid Try giving the pyarrow documentation a read. It has information regarding the development settings via conda. |
@TanyaaCJain Did you mean this doc? I've followed through & built pyarrow in its own environment successfully. I should've been clearer - I meant I had trouble telling my pandas-dev conda environment to refer to the pyarrow I've just built, so that my tests in pandas would pass. |
@galuhsahid, if I'm understanding correctly what you mean, I think using the
And your python interpreter should be able to import your pyarrow from master. You can also have: Let me know if this is not what you needed. |
It's fine to do it in a single branch/PR (since they are related)
You will need to install pandas and pyarrow in the same environment (technically it might be possible to point to an install in a different environment, but it is not something I would recommend). |
@galuhsahid What about trying code from this doc? Somehow, the html version does not have the same code as that in the "Developing with conda - Environment Setup and Build" topic in the pdf version. The code in here does what @jorisvandenbossche is talking about. |
@datapythonista @jorisvandenbossche @TanyaaCJain I ended up following @jorisvandenbossche's approach which works for me. Thanks a lot for the help everyone! I'll make sure to add this to our learning points as well. |
@galuhsahid would you like to try to fix https://issues.apache.org/jira/browse/ARROW-6302 as well? |
@jorisvandenbossche Sure, I'd like to fix that as well. I'll ask you if I get stuck on something (if you don't mind) |
Great ;) Don't hesitate to ask any questions! (I am no C++ expert, but I start to be a bit familiar with the Arrow codebase) |
This issue will have a bit of trickiness on setting up locally the latest (
master
branch) version of arrow/pyarrow, but should be easy other than that:pandas-dev/pandas#27955
Please add a comment to the original issue (also here) to claim it if you plan to work on it.
The text was updated successfully, but these errors were encountered: