Staging to main: Fix bug in MAP and added new notebook programmatic execution #2035

miguelgfierro · 2023-11-07T09:46:14Z

Description

Related Issues

References

Checklist:

I have followed the contribution guidelines and code style for this project.
I have added tests covering my contributions.
I have updated the documentation accordingly.
This PR is being made to staging branch and not to main branch.

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

* Not triggering unit tests on Draft PR Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Change a PR-triggering file to test Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> --------- Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com>

@wutaomsft

* Announcement LF Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Update email Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Update README.md Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * security Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * license and contribution notice Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * update author link Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Add new code of conduct from LF Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Replacing references GRU4Rec to GRU Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Replacing references GRU4Rec to GRU Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Replacing references GRU4Rec in config files Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Update references Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Delete conda.md Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * refactor map_at_k and map to be the same as Spark's Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * list of test failing to fix Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Update readme LF feedback @wutaomsft Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Update NEWS.md Co-authored-by: Andreas Argyriou <anargyri@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Update README.md Co-authored-by: Andreas Argyriou <anargyri@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Fix test errors, Refactor column check utils to be simpler Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Rename ranking tests to be _at_k suffixed Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Change test names in the test group Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * add comment to mocked fn in a test Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * 📝 Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * remove unused input Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * 📝 Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * no need to output the logs twice Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * packages Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * skipping flaky test Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Issue with TF Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Comment out the PR gate affected tests with the upgrade to TF>2.10.1 Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Comment out the nightly builds affected tests with the upgrade to TF>2.10.1 Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * 🐛 Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Comment out the nightly builds affected tests with the upgrade to TF>2.10.1 Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * revert the breaking tests with TF 2.10.1 Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * temporary pin to TF=2.8.4 Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Update security tests Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Update expected values to not use fixture Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * list of test failing to fix Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> * Fix missing fixture error Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> --------- Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com> Co-authored-by: miguelgfierro <miguelgfierro@users.noreply.github.com> Co-authored-by: Andreas Argyriou <anargyri@users.noreply.github.com> Co-authored-by: Miguel Fierro <3491412+miguelgfierro@users.noreply.github.com>

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

miguelgfierro · 2023-11-07T09:47:28Z

@loomlike I think there are some of the tests in the nightly builds that haven't been updated:

@pytest.mark.notebooks
    @pytest.mark.parametrize(
        "size, expected_values",
        [
            (
                "1m",
                ***
                    "map": 0.060579,
                    "ndcg": 0.299245,
                    "precision": 0.[2701](https://github.com/recommenders-team/recommenders/actions/runs/6749833186/job/18351719718?pr=2035#step:3:2708)16,
                    "recall": 0.104350,
                ***,
            ),
            (
                "10m",
                ***
                    "map": 0.098745,
                    "ndcg": 0.319625,
                    "precision": 0.275756,
                    "recall": 0.154014,
                ***,
            ),
        ],
    )
    def test_sar_single_node_functional(
        notebooks, output_notebook, kernel_name, size, expected_values
    ):
        notebook_path = notebooks["sar_single_node"]
        pm.execute_notebook(
            notebook_path,
            output_notebook,
            kernel_name=kernel_name,
            parameters=dict(TOP_K=10, MOVIELENS_DATA_SIZE=size),
        )
        results = sb.read_notebook(output_notebook).scraps.dataframe.set_index("name")[
            "data"
        ]
    
        for key, value in expected_values.items():
>           assert results[key] == pytest.approx(value, rel=TOL, abs=ABS_TOL)
E           assert 0.1850985029206066 == 0.060579 ± 5.0e-02
E             comparison failed
E             Obtained: 0.1850985029206066
E             Expected: 0.060579 ± 5.0e-02

anargyri · 2023-11-07T13:30:59Z

tests/unit/recommenders/evaluation/test_python_evaluation.py

@@ -194,7 +181,7 @@ def test_python_exp_var(rating_true, rating_pred):
        rating_pred=rating_true,
        col_prediction=DEFAULT_RATING_COL,
    ) == pytest.approx(1.0, TOL)
-    assert exp_var(rating_true, rating_pred) == pytest.approx(-6.4466, TOL)
+    assert exp_var(rating_true, rating_pred) == pytest.approx(-6.4466, 0.01)


Does this mean we now get something between -6.38 or -6.51? It looks a bit strange, since nothing changed with explained variance, doesn't it?

I think this happened when I copied expected outputs values from the fixture over to each tests. This shouldn't be changed. I'll update codes back to what we had. Thank you for the good catch!

anargyri · 2023-11-07T14:00:47Z

tests/unit/recommenders/evaluation/test_spark_evaluation.py

-    evaluator2 = SparkRatingEvaluation(df_true, df_pred)
-    assert evaluator2.exp_var() == target_metrics["exp_var"]
+    evaluator = SparkRatingEvaluation(df_true, df_pred)
+    assert evaluator.exp_var() == pytest.approx(-6.4466, 0.01)


Same question here. So both spark and python evaluations now give slightly different results. Maybe something changed in the input dataframes?

this should use the same TOL for the consistency.

Signed-off-by: Simon Zhao <simonyansenzhao@gmail.com>

…uage Add missing kernelspec language

loomlike · 2023-11-13T17:21:27Z

@miguelgfierro To fix this issue, I need to change map_at_k() in the notebooks to use map() which will cause conflicts with your changes in PR #2031 where you're removing glue parts:

# Record results with papermill for tests
import scrapbook as sb
sb.glue("map", rank_eval.map_at_k())  --> rank_eval.map()
...

So I think it will be good to wait for your PR gets merged into staging first and then fix the issues if that's okay.

miguelgfierro · 2023-11-14T06:41:39Z

So I think it will be good to wait for your PR gets merged into staging first and then fix the issues if that's okay.

Got it, makes sense

Signed-off-by: Simon Zhao <simonyansenzhao@gmail.com>

…xecution_notebook Programmatic execution of notebooks

loomlike · 2023-12-18T22:01:36Z

@SimonYansenZhao Hi Simon, I found an issue at execute_notebook function I wanted you to confirm:

        if (
            "tags" in cell.metadata
            and "parameters" in cell.metadata["tags"]
            and cell.cell_type == "code"
        ):
            cell_source = cell.source
            modified_cell_source = (
                cell_source  # Initialize a variable to hold the modified source
            )
            for param, new_value in parameters.items():
                if (
                    isinstance(new_value, str)
                    and not (new_value.startswith('"') and new_value.endswith('"'))
                    and not (new_value.startswith("'") and new_value.endswith("'"))
                ):
                    # Check if the new value is a string and surround it with quotes if necessary
                    new_value = f'"{new_value}"'
                # # Check if the new value is a string and surround it with quotes if necessary
                # if isinstance(new_value, str):
                #     new_value = f'"{new_value}"'
                # Define a regular expression pattern to match parameter assignments and ignore comments
                pattern = re.compile(
                    rf"(\b{param})\s*=\s*([^#\n]+)(?:#.*$)?",
                    re.MULTILINE
                    # rf"\b{param}\s*=\s*([^\n]+)\b"
                )
                modified_cell_source = pattern.sub(rf"\1 = {new_value}", cell_source)  <-------- Here.

In the code above, modified_cell_source = pattern.sub(rf"\1 = {new_value}", cell_source) this part always replaces a new value with the original cell_source. This means that the next parameter value is updated into the original cell source string again, not the one we applied the previous value to, which is modified_cell_source. And thus modified_cell_source will only contain the last parameter value that was updated.

To test and fix this, I changed the code to apply the pattern to modified_cell_source cumulatively, and added a test case (to do so, I pull that part into a separate function, as I suggested earlier).

If you see those changes make sense or have any comments, please let me know!
The changes are in jumin/fix_nightly branch.

* Revert tests tolerance * Fix notebook parameter parsing * Add notebook utils tests to test groups * Fix notebooks * Fix notebook unit tests * Update evaluation metrics name map. Handle None for exp_var * Fix smoke tests * cleanup * Fix functional test errors * make notebook parameter update function to be private * Fix benchmark notebook bug * fix remaining bugs --------- Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com>

Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com>

…t_cell Fix benchmarks last cell to store value, not [value]

loomlike

Tests are all greeeeeeen!

miguelgfierro · 2023-12-23T07:43:56Z

Vamos!!!!

miguelgfierro and others added 20 commits October 30, 2023 22:59

Remove scrapbook and papermill deps

51a2831

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

notebook utils programmatic execution

964bca0

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

Test notebook programmatic

ab63e1c

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

Added test notebook for utils

1ac3023

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

data notebooks

6a03c42

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

Replace papermill and scrapbook for new internal function

e1c2b63

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

Replace papermill and scrapbook for new internal function

70c068e

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

Update new programmatic execution code

54c2278

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

Update new programmatic execution code

055e5f0

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

Update notebooks with new utility

e60008a

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

🐛

1174258

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

Issue with xDeepFM WIP

5a4bd35

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

🐛

ee344e3

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

🐛

922011c

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

Document the tests in programmatic notebook

f7b5fdf

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

📝

c2d9d13

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

WIP

397555c

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

WIP

58cbcef

Signed-off-by: miguelgfierro <miguelgfierro@users.noreply.github.com>

miguelgfierro requested review from gramhagen, anargyri, loomlike, wutaomsft and SimonYansenZhao as code owners November 7, 2023 09:46

anargyri reviewed Nov 7, 2023

View reviewed changes

loomlike self-assigned this Nov 8, 2023

Import missing store_metadata

ddf1b10

Signed-off-by: Simon Zhao <simonyansenzhao@gmail.com>

Add missing kernelspec language

a79bf1c

Signed-off-by: Simon Zhao <simonyansenzhao@gmail.com>

SimonYansenZhao mentioned this pull request Nov 13, 2023

Add missing kernelspec language #2037

Merged

4 tasks

Merge pull request #2037 from recommenders-team/simon/kernelspec_lang…

fb3e0cf

…uage Add missing kernelspec language

SimonYansenZhao and others added 4 commits November 14, 2023 15:12

Correct pattern matching and substitution

bd9573e

Signed-off-by: Simon Zhao <simonyansenzhao@gmail.com>

Merge multiline parameters into one line

8c6aaed

Signed-off-by: Simon Zhao <simonyansenzhao@gmail.com>

Increase timeout

4992cb6

Signed-off-by: Simon Zhao <simonyansenzhao@gmail.com>

Merge pull request #2031 from recommenders-team/miguel/programmatic_e…

b57cec2

…xecution_notebook Programmatic execution of notebooks

This was referenced Dec 18, 2023

[BUG] GPU Nightly builds failing #2041

Closed

Fix nightly test errors #2045

Merged

loomlike added 2 commits December 21, 2023 10:28

Fix benchmarks last cell to store value, not [value]

495d09d

Signed-off-by: Jun Ki Min <42475935+loomlike@users.noreply.github.com>

miguelgfierro changed the title ~~Staging to main: Fix bug in MAP~~ Staging to main: Fix bug in MAP and added new notebook programmatic execution Dec 22, 2023

Merge pull request #2046 from recommenders-team/jun/fix_benchmark_las…

5bf18a0

…t_cell Fix benchmarks last cell to store value, not [value]

loomlike approved these changes Dec 22, 2023

View reviewed changes

SimonYansenZhao approved these changes Dec 23, 2023

View reviewed changes

miguelgfierro merged commit 0d9d7c7 into main Dec 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Staging to main: Fix bug in MAP and added new notebook programmatic execution #2035

Staging to main: Fix bug in MAP and added new notebook programmatic execution #2035

miguelgfierro commented Nov 7, 2023

miguelgfierro commented Nov 7, 2023

anargyri Nov 7, 2023 •

edited

Loading

loomlike Nov 8, 2023 •

edited

Loading

anargyri Nov 7, 2023

loomlike Nov 8, 2023

loomlike commented Nov 13, 2023

miguelgfierro commented Nov 14, 2023

loomlike commented Dec 18, 2023

loomlike left a comment

miguelgfierro commented Dec 23, 2023

Staging to main: Fix bug in MAP and added new notebook programmatic execution #2035

Staging to main: Fix bug in MAP and added new notebook programmatic execution #2035

Conversation

miguelgfierro commented Nov 7, 2023

Description

Related Issues

References

Checklist:

miguelgfierro commented Nov 7, 2023

anargyri Nov 7, 2023 • edited Loading

Choose a reason for hiding this comment

loomlike Nov 8, 2023 • edited Loading

Choose a reason for hiding this comment

anargyri Nov 7, 2023

Choose a reason for hiding this comment

loomlike Nov 8, 2023

Choose a reason for hiding this comment

loomlike commented Nov 13, 2023

miguelgfierro commented Nov 14, 2023

loomlike commented Dec 18, 2023

loomlike left a comment

Choose a reason for hiding this comment

miguelgfierro commented Dec 23, 2023

anargyri Nov 7, 2023 •

edited

Loading

loomlike Nov 8, 2023 •

edited

Loading