-
Notifications
You must be signed in to change notification settings - Fork 5.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Core] add QKV fusion to AuraFlow and PixArt Sigma #8952
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
very nice! thank you
*args, | ||
**kwargs, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we do not need them, no?
*args, | |
**kwargs, |
@yiyixuxu because we have
I had to do: LMK if you have other ideas to counter it. Without it, we will have: |
@yiyixuxu a friendly ping. |
@sayakpaul you're right thank you! |
* add fusion support to pixart * add to auraflow. * add tests * apply review feedback. * add back args and kwargs * style
What does this PR do?
As a follow-up of #8829, I decided to give QKV fusion a try for AuraFlow and PixArt-Sigma when doing int8 quantization. Some nice gains on an H100:
PixArt
AuraFlow