Refactor fusing #1386

reuvenperetz · 2025-03-14T10:30:43Z

Pull Request Description:

This PR introduces handling graph fusion by encapsulating fusion metadata within a new wrapper class for the graph. This class ensures that after every access to the graph, a validation check is performed to verify that the fusion information remains consistent and that no modifications have introduced inconsistencies. Additionally, the fusion-related logic has been refactored into a new class called GraphFuser, which takes the graph along with its fusion metadata and creates a new graph where fused operations are represented as single nodes.

Checklist before requesting a review:

I set the appropriate labels on the pull request.
I have added/updated the release note draft (if necessary).
I have updated the documentation to reflect my changes (if necessary).
All function and files are well documented.
All function and classes have type hints.
There is a licenses in all file.
The function and variable names are informative.
I have checked for code duplications.
I have added new unittest (if necessary).

…tion

This reverts commit 43fbf1d.

model_compression_toolkit/core/common/fusion/fusing_info.py

ofirgo · 2025-03-17T12:26:55Z

model_compression_toolkit/core/common/fusion/graph_fuser.py

                              framework_attr={},
                              input_shape=nodes[0].input_shape,
                              output_shape=nodes[-1].output_shape,
-                              weights={},
+                              weights={}, # TODO: update with weights of all nodes


Is the todo here planned for this PR?
is it necessary actually? because you can always retrieve the original weights from the original graph

No, in the PR that handles the MP.

model_compression_toolkit/core/common/fusion/graph_fuser.py

ofirgo · 2025-03-17T13:25:59Z

model_compression_toolkit/core/common/mixed_precision/mixed_precision_search_facade.py

@@ -82,6 +83,11 @@ def search_bit_width(graph_to_search_cfg: Graph,

    # Set graph for MP search
    graph = copy.deepcopy(graph_to_search_cfg)  # Copy graph before searching
+    # TODO: The handle of mixed precision with the fused graph will be in a separate PR. Currently, the bit-width


also needs to be integrated with the BOPs virtual graph

If I understand correctly, the integration with the BOPs, is simply to build the virtual graph from the fused graph.

model_compression_toolkit/core/common/substitutions/batchnorm_reconstruction.py

model_compression_toolkit/qat/pytorch/quantization_facade.py

ofirgo

consider adding an e2e tests to verify that none of our substitutions fails the fusing metadata.

tests/pytorch_tests/function_tests/layer_fusing_test.py

tests/keras_tests/function_tests/test_activation_weights_composition_substitution.py

tests_pytest/_fw_tests_common_base/base_graph_with_fusion_metadata_test.py

tests_pytest/common_tests/unit_tests/core/test_fusion_info.py

tests_pytest/_fw_tests_common_base/base_graph_with_fusion_metadata_test.py

ofirgo · 2025-03-18T07:58:43Z

tests_pytest/_fw_tests_common_base/base_graph_with_fusion_metadata_test.py

+        This fixture defines allowed operations and fusing patterns for testing.
+        """
+        return schema.TargetPlatformCapabilities(
+            default_qco=default_quant_cfg_options,


maybe check also more complicated scenarios like operators with mixed precision?

I wanted it to be minimal to check only the fusing, maybe in the e2e?

I think that it will be complicated to test this specific attributes on a model in an e2e manner.
in these tests you verify the graph, which is more correct for verifying such edge-cases IMO.
you can leave this extension to the next PR of the e2e tests, and also extend the integration tests then.

tests_pytest/_fw_tests_common_base/base_graph_with_fusion_metadata_test.py

…graph with metadata

…moval

…n process

…ng info

model_compression_toolkit/core/common/mixed_precision/mixed_precision_search_facade.py

tests/keras_tests/function_tests/test_activation_weights_composition_substitution.py

tests_pytest/_test_util/tpc_util.py

tests_pytest/_fw_tests_common_base/fusing/base_fusing_info_generator_test.py

ofirgo · 2025-03-27T10:51:53Z

tests_pytest/_fw_tests_common_base/base_graph_with_fusion_metadata_test.py

+        This fixture defines allowed operations and fusing patterns for testing.
+        """
+        return schema.TargetPlatformCapabilities(
+            default_qco=default_quant_cfg_options,


I think that it will be complicated to test this specific attributes on a model in an e2e manner.
in these tests you verify the graph, which is more correct for verifying such edge-cases IMO.
you can leave this extension to the next PR of the e2e tests, and also extend the integration tests then.

tests_pytest/common_tests/unit_tests/core/test_fusion_info.py

…tion

reuvenp added 14 commits March 12, 2025 13:46

Fusing refactor

1cc4739

remove commented out test file

a2e4632

revert changes in keras mixin

1485cdc

remove unneeded fusion tests packages hirarchy

e08676f

revert old tests that were commented out

bb8cf01

fix wrong type hint in test_activation_weights_composition_substition

0bb6f5a

remove commented out code from runner

d7135d6

use internal graph for the final qat model instead of the fused graph

8f3b1d9

add comment for second moment correction about the fusing info correc…

4cc1533

…tion

remove old fusing data from graph

3f6fe40

update old pytorch tests

2ea3f2a

add check for graph type in torch model builder

595f4bd

add comments to fusing info

a9ae9f9

add comments to graph fuser and graph with metadata

e1cad46

github-actions bot added auto:core auto:qat auto:tests labels Mar 14, 2025

reuvenp added 2 commits March 14, 2025 15:28

Set version for onnxruntime-extensions

43fbf1d

adapt keras unit tests

5c16f03

reuvenperetz requested a review from ofirgo March 16, 2025 07:45

reuvenp added 2 commits March 17, 2025 11:22

Revert "Set version for onnxruntime-extensions"

7c52939

This reverts commit 43fbf1d.

Merge remote-tracking branch 'origin/main' into refactor-fusing

7619677

ofirgo requested changes Mar 17, 2025

View reviewed changes

reuvenp added 2 commits March 17, 2025 18:16

fix comments in fusing info

8406b2c

remove the deepcopy in get_all_fused_operations

fb0a30b

ofirgo requested changes Mar 18, 2025

View reviewed changes

reuvenp added 4 commits March 18, 2025 13:50

verify fusing info is consistent when using graph fuser

ec583f1

move function to disable activation quantization from fusing info to …

44f2256

…graph with metadata

use prefic of op id as constant

5544545

pass only fusing patterns instead of entire fqc

6e7d170

reuvenp added 13 commits March 24, 2025 12:20

rename fusing mateadata wrapper

459394e

fix old name of FusingMetadataWrapper in comments

671c58a

fix comments in fuse method of GraphFuser

34c2820

rename fuse method in graph fuser

0054571

save fusing info instead of multiple fetches

bd75774

set the graph without the fusing metadata in the pytorch back2fw

b1be93a

add missing types in BaseGraphWithFusingMetadataTest

a589333

adjust the test of activation-weight composition due to the fusion re…

30e3645

…moval

migrate old unittests

e7ac1de

tests with multiple successors/predecessors

f5c0a07

add test that checks the case of new fusion due to new node

7551d34

add test that checks a valid graph change does not fail the validatio…

98c14f5

…n process

merge changes with main

8ee3191

github-actions bot removed the auto:qat label Mar 26, 2025

reuvenp added 9 commits March 26, 2025 17:02

move tests to a designated package

d78ac46

fix batchnorm_reconstruction due to replacing list with tuple in fusi…

4c396ae

…ng info

use internal graph for mp for now

4580a98

fix missing import of removed test

3ab79d9

fix wrong path in test_cfg_candidates_filter

63a7dff

run pytest before unittests

c170f2a

add missing license

773c115

extract minimal_cfg_options from minimal_tpc

d31c53c

fix new argument in test_cfg_candidates_filter

106e694

ofirgo requested changes Mar 27, 2025

View reviewed changes

reuvenp added 6 commits April 1, 2025 14:29

replace wrapper with embedding the fusing info in the graph

a16e47a

rewrite the funsions in test_fusing_info more explicity

f485a36

return fusion to test in test_activation_weights_composition_substitu…

8a0334b

…tion

disable validation in tests of virtual graph

68b560c

fix tests

d097e77

merge from main

2011098

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor fusing #1386

Refactor fusing #1386

reuvenperetz commented Mar 14, 2025

ofirgo Mar 17, 2025

reuvenperetz Mar 17, 2025

ofirgo Mar 17, 2025

reuvenperetz Mar 17, 2025

ofirgo left a comment

ofirgo Mar 18, 2025

reuvenperetz Mar 18, 2025

ofirgo Mar 27, 2025

ofirgo Mar 27, 2025

Refactor fusing #1386

Are you sure you want to change the base?

Refactor fusing #1386

Conversation

reuvenperetz commented Mar 14, 2025

Pull Request Description:

Checklist before requesting a review:

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ofirgo left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment