Feature Request: .rand() to call random rows to compliment .head() and .tail() #9569

nickeubank · 2015-03-01T20:29:19Z

.head() and .tail() are great tools for quick data interrogations, but when data is sorted they are often far from representative. It would be great if there was a simple command to pull an arbitrary number of random rows and display them for a more representative way to spotcheck data.

It would behave something like:

def rand_rows(df, num_rows = 5):
    from numpy import random as rm
    subset = rm.choice(df.index.values, size = num_rows)    
    return df.loc[subset]

a_data_frame = pd.DataFrame({'col1':range(10,20), 'col2':range(20,30)})
rand_rows(a_data_frame)
rand_rows(a_data_frame, 6)

The text was updated successfully, but these errors were encountered:

TomAugspurger · 2015-03-01T20:40:31Z

We already have an issue for that: #2419
It's just a matter of someone implementing it. Give it a go if you want try! I don't think anyone has started.

TomAugspurger closed this as completed Mar 1, 2015

jbrockmendel mentioned this issue Jan 10, 2022

DEPR: line_terminator->lineterminator GH#9569 #45302

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request: .rand() to call random rows to compliment .head() and .tail() #9569

Feature Request: .rand() to call random rows to compliment .head() and .tail() #9569

nickeubank commented Mar 1, 2015

TomAugspurger commented Mar 1, 2015

Feature Request: .rand() to call random rows to compliment .head() and .tail() #9569

Feature Request: .rand() to call random rows to compliment .head() and .tail() #9569

Comments

nickeubank commented Mar 1, 2015

TomAugspurger commented Mar 1, 2015