Add ControlNet and IP-Adapter support to StableDiffusionXLKDiffusionPipeline #8825

chi-mf · 2024-07-10T06:36:21Z

What does this PR do?

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Linked issues: "adding IP adapter support to all ControlNet and T2I pipelines" and "SD XL K-diffusion with Controlnet"
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

I had the same requirement as described in issue 6530 above: matching the generation results of A1111. So I implemented it myself. I'm new to the diffusers code, though, and I don't know whether this approach is acceptable to the community. Could @yiyixuxu and @asomoza please review?

For our own scenario, we don't need to pass raw IP-Adapter images to the pipeline, only CLIP-Vision embeddings, since the image-to-embedding conversion can easily be done outside the pipeline. I can add it if it is required to match other pipelines.
I did not introduce a new StableDiffusionXLKDiffusionControlNetPipeline. Instead, I added a set_controlnet() function that lets the same pipeline instance switch between ControlNet-enabled and ControlNet-disabled states, which serves our use cases better. I can also split the code into a separate pipeline class if necessary.
I renamed the second parameter of the original model_fn() from 't' to 'sigma', because to my understanding it really is the sigma, not t.
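
As a rough illustration of why 'sigma' and 't' are different quantities, the sketch below derives a noise level sigma for each discrete timestep index and maps a sigma back to the nearest index. It is not code from this PR: the function names are hypothetical, and the scaled-linear beta schedule with beta_start=0.00085, beta_end=0.012 and 1000 steps is an assumption based on commonly used Stable Diffusion defaults.

```python
import math


def sd_sigmas(num_train_timesteps=1000, beta_start=0.00085, beta_end=0.012):
    """Noise level sigma_t = sqrt((1 - alpha_bar_t) / alpha_bar_t) per timestep.

    Assumes a scaled-linear beta schedule (linear in sqrt(beta)); the default
    values are assumptions, not taken from the PR.
    """
    sigmas = []
    alpha_bar = 1.0
    lo, hi = math.sqrt(beta_start), math.sqrt(beta_end)
    for i in range(num_train_timesteps):
        frac = i / (num_train_timesteps - 1)
        beta = (lo + frac * (hi - lo)) ** 2
        alpha_bar *= 1.0 - beta  # running product of (1 - beta)
        sigmas.append(math.sqrt((1.0 - alpha_bar) / alpha_bar))
    return sigmas


def sigma_to_t(sigma, sigmas):
    """Index of the discrete timestep whose log-sigma is closest to `sigma`."""
    target = math.log(sigma)
    return min(range(len(sigmas)), key=lambda i: abs(math.log(sigmas[i]) - target))


sigmas = sd_sigmas()
print(f"sigma range: {sigmas[0]:.4f} .. {sigmas[-1]:.2f}")
print("timestep index for sigma=1.0:", sigma_to_t(1.0, sigmas))
```

A k-diffusion style sampler works with the continuous sigma values, while a UNet trained on discrete timesteps expects an index like the one returned by sigma_to_t, so a parameter that carries the former reads more accurately as 'sigma'.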

yiyixuxu · 2024-07-10T23:31:40Z

I think controlnet should be a different pipeline, no?
also, want to understand a little bit why you would use k-diffusion pipeline? we have added all the popular schedulers in diffusers already

chi-mf · 2024-07-11T06:26:19Z

> I think controlnet should be a different pipeline, no?

I'm not sure about this part. In our use case, once end users have chosen a particular model (in terms of image style), they try different ways to generate the image they want. In the process, they frequently switch between using ControlNet or not (and likewise IP-Adapters). If we have a separate pipeline for ControlNet, I guess I would either have to hold two pipelines in memory or construct a new one via init (reusing the unet already on the GPU) each time they switch?

> also, want to understand a little bit why you would use k-diffusion pipeline? we have added all the popular schedulers in diffusers already

Because we still get different results from A1111, and A1111's result is usually what we want; using the k-diffusion pipeline resolves most of the differences. I see it like this: most end users are still on A1111, so to bring them to diffusers I first need to reproduce their existing results as is. After that, they can explore the other options in diffusers as they have done in A1111. I have no doubt we will eventually get better results, but that needs stairs, and to me the k-diffusion pipeline is just those stairs.

asomoza · 2024-07-18T23:54:08Z

Hi, thanks for your work, I really appreciate it. About this:

> I guess I should either hold 2 pipelines in memory or construct new ones using init

You can use from_pipe for transferring the modules from one pipeline to another one.
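
To make the idea concrete, here is a toy sketch of what transferring modules between pipelines means. It does not use diffusers: ToyPipeline and ToyControlNetPipeline are hypothetical stand-ins, and this from_pipe is a simplified imitation of the real classmethod. In diffusers itself the call would presumably look something like StableDiffusionXLControlNetPipeline.from_pipe(pipe, controlnet=controlnet), but check the library documentation for the exact arguments.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass
class ToyPipeline:
    """Hypothetical stand-in for a text-to-image pipeline (not a diffusers class)."""

    unet: Any
    vae: Any
    text_encoder: Any

    @property
    def components(self) -> Dict[str, Any]:
        # The modules that can be shared with another pipeline.
        return {"unet": self.unet, "vae": self.vae, "text_encoder": self.text_encoder}


@dataclass
class ToyControlNetPipeline(ToyPipeline):
    """Hypothetical ControlNet variant that adds one extra module."""

    controlnet: Optional[Any] = None

    @classmethod
    def from_pipe(cls, pipe: ToyPipeline, **extra: Any) -> "ToyControlNetPipeline":
        # Pass the existing component objects through unchanged;
        # only the extra modules (here, the controlnet) are new.
        return cls(**pipe.components, **extra)


base = ToyPipeline(unet=object(), vae=object(), text_encoder=object())
with_controlnet = ToyControlNetPipeline.from_pipe(base, controlnet=object())

# Both pipelines point at the very same unet object, so nothing is duplicated.
print(with_controlnet.unet is base.unet)
```

Because the second pipeline only holds references to the first one's modules, switching between the two does not require a second copy of the unet in memory, which addresses the concern quoted above.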

About using the same pipeline for controlnet, you can maybe look at it from another perspective: maybe someone after you wants to add PAG or T2I Adapters to it but doesn't need or want controlnet. If the pipeline already has the code for controlnet, it will be harder for that person to integrate them. The same goes for this one: if the pipeline had already had some extensive, complex "switchable" code, your integration would have been a lot harder. Not everyone wants to submit a PR, either; some people use it inside internal private code, so it's good to have a clean base to start from.

It is also more consistent with the rest of the code base: we have a separate controlnet pipeline for all the major archs.

In the same spirit as before, since you already added the IP Adapter functionality and we need to think about all users rather than a specific use case, it would be great if you could add the code to pass the raw IP Adapter image.

github-actions · 2024-09-14T15:07:00Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

yiyixuxu · 2024-11-16T21:51:01Z

Closing this since the K-diffusion pipeline is meant for experiments only (not meant to have a full feature set). We support all the k-schedulers in our regular pipelines too, which you can use along with controlnet and IP-adapter.

Support ControlNet and IP-Adapter in StableDiffusionXLKDiffusionPipeline

61441a8

github-actions bot added the stale label (Issues that haven't received updates) Sep 14, 2024

yiyixuxu closed this Nov 16, 2024
