Expand LuxGroupby Tests and add bug fixes #287

westernguy2 · 2021-02-25T08:29:30Z

Overview

This pull request expands the tests for LuxGroupby to test if a LuxDataFrame is pre_aggregated after a groupby. It also patches the edge case of LuxDataFrame having a "name" column among other bug fixes.

Changes

This pull request is mainly for adding tests to LuxGroupby that didn't check if a LuxDataFrame is pre_aggregated after a groupby in certain cases.

In addition, this PR adds the "name" column back to the metadata, which patches the bug described in the above Overview. A test has been added to ensure this doesn't change.

Finally, this PR also patches small bugs relating to groupby and LuxGroupby. For instance, it adds apply as one of the methods extended from Pandas Groupby. It also fixes an edge case where the name of a Series was being changed. This is patched by directly editing unnamed columns (named by default as 0) to a blank string.

Example Output

The expected behavior should model the expected behavior of Pandas for the different patches described above.

thyneb19

The changes look good! Just added a comment on hos the test_name_column test could be adjusted. Thanks Kunal!

thyneb19 · 2021-02-25T21:10:35Z

tests/test_columns.py

+
+def test_name_column(global_var):
+    df = pd.read_csv("lux/data/car.csv")
+    new_df = df.rename(columns={"Name": "name"})


for this test, can we do a quick check that the values of the "name" column have not all been converted to None values?

Thanks! I added a few more assert statements to check for this case!

dorisjlee · 2021-03-02T09:18:28Z

These changes looks great, thanks @westernguy2 ! (and thanks to @thyneb19 for helping with the review)

westernguy2 added 19 commits January 11, 2021 22:29

add series equality and value counts test

95bb455

black formatting

f18c8a3

fix old value counts test instead

e851c07

add pandas tests

73affb6

remove str from column group

2e9fc3f

fix merge conflicts

71c7769

Merge remote-tracking branch 'upstream/master' into pandas-tests

ec78690

save work on groupby bugs so far

a46d4e9

fix merge conflicts

6d44ece

fix merge conflict again

32d24a4

add new tests and add groupby bug fixes

431f11a

remove tests for staging

7f9f058

update series tests

910049f

add back getitem

1de238d

fix merge conflicts for staging

fe4c022

remove print statements

fc6906f

run black and fix merge conflicts

4845159

revert Makefile

9116360

add test for name column case

4a8a338

thyneb19 reviewed Feb 25, 2021

View reviewed changes

add test to ensure column is not all None

8c0f821

thyneb19 approved these changes Feb 28, 2021

View reviewed changes

thyneb19 requested a review from dorisjlee March 1, 2021 16:50

dorisjlee approved these changes Mar 2, 2021

View reviewed changes

dorisjlee closed this Mar 2, 2021

dorisjlee mentioned this pull request Mar 2, 2021

Better data type detection for pre_aggregated, indexed dataframes #61

Closed

dorisjlee reopened this Mar 2, 2021

dorisjlee merged commit c596592 into lux-org:master Mar 2, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expand LuxGroupby Tests and add bug fixes #287

Expand LuxGroupby Tests and add bug fixes #287

westernguy2 commented Feb 25, 2021

thyneb19 left a comment

thyneb19 Feb 25, 2021

westernguy2 Feb 28, 2021

dorisjlee commented Mar 2, 2021

Expand LuxGroupby Tests and add bug fixes #287

Expand LuxGroupby Tests and add bug fixes #287

Conversation

westernguy2 commented Feb 25, 2021

Overview

Changes

Example Output

thyneb19 left a comment

Choose a reason for hiding this comment

thyneb19 Feb 25, 2021

Choose a reason for hiding this comment

westernguy2 Feb 28, 2021

Choose a reason for hiding this comment

dorisjlee commented Mar 2, 2021