Implement basic/feature parity img2img postprocess upscaling #7
Conversation
ControlNet img2img pipeline will now route images through the upscaler API endpoint before inserting them. Also changes mask code to use QImage format conversions to isolate the alpha channel.
@drhead hello. Any update on this?
Got a bit sidetracked on... several other things. The multithreading bugs are mostly resolved, and the remaining ones are much rarer since I moved to the current QTimer-based implementation. I still see occasional issues with it not creating a transparency mask; on my machine it usually affects only the first image generated, if anything, and then doesn't happen again. It is more functional than it used to be, so I am setting this as ready for review.

The old API can be retired as soon as support is added for upscaling through the official API; the postprocess upscale code can be reused for this purpose, and it may be a good opportunity to expand it to allow blended upscalers. Also, the options for transparency masks and layer groups need to be either actually reimplemented or removed.
Actually, I've found an issue, probably in the way the mask data is being set. In most cases the created mask is "smaller" than the actual inpaint mask, so the inpainted content is being clipped. If I hide the transparency mask this becomes apparent.
I do crop it to match the original selection, mainly to make sure nothing ends up in the image that was not intended to be inpainted. Can you show an example of what is being cut off and if/how it is misaligned?
Sure, this is what I found (I have been using this version while doing art and had noticed it before "in the wild"). I'm using the following image:

1. Send to inpaint; this is the result (notice how the left edge of the flower still shows the base image):
2. I manually create a transparency mask from the inpaint mask (expected result):
3. This is what I see with both the inpaint mask and the plugin's transparency mask shown; you can still see a big chunk of the inpaint mask:

Here's the file used so you can try to reproduce:
I'm able to reproduce sometimes. My guess is this has to do with the fact |
@drhead Any update? Should we merge it this way? |
I'm currently occupied with a finetuning project and will be for the next month. These changes can be merged as-is for now.
The main thing in this PR is an implementation of the same upscaling present in the old API. Images received from the img2img API are sent to the `extra-batch-images` endpoint for upscaling. While this is an extra API call that ideally shouldn't be necessary, it is plenty fast. The results from upscaling are then injected as layers and are masked off properly.

EDIT: For reference, I have only tested how this performs in my main workflow of working on obscenely high-res images. Consequently, I completely forgot that it might not be desirable to always upscale, even when the selection is lower resolution than the generation. It should still work, because the upscaler will happily downscale images when asked to. I have a couple of other people testing it to hammer out any edge cases.
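As a rough illustration of that extra round-trip, here is how a request body for the webui's extras batch endpoint could be assembled. This is a hedged sketch: the field names follow the AUTOMATIC1111 `/sdapi/v1/extra-batch-images` API schema, but the helper function itself is hypothetical and is not the plugin's actual code.

```python
import json


def build_upscale_payload(images_b64, target_width, target_height,
                          upscaler="Lanczos"):
    """Build a JSON body for the extras batch endpoint.

    resize_mode=1 requests a resize to explicit dimensions, which also
    covers the downscale case mentioned above (the upscaler will happily
    shrink images when the target is smaller than the source).
    """
    return {
        "resize_mode": 1,                   # 1 = resize to width/height
        "upscaling_resize_w": target_width,
        "upscaling_resize_h": target_height,
        "upscaler_1": upscaler,
        "imageList": [
            # Each entry pairs base64-encoded PNG data with a name.
            {"data": data, "name": f"img{i}.png"}
            for i, data in enumerate(images_b64)
        ],
    }


# The resulting dict would be POSTed as JSON to /sdapi/v1/extra-batch-images.
payload = build_upscale_payload(["<base64 png>"], 1024, 1024)
body = json.dumps(payload)
```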
Additionally, I fixed two bugs:

- `override_settings_restore_afterwards` is now True. Leaving it False was causing persistent, unwanted setting changes, like making the webui stop returning grids in its results. Everything seems to work just fine with it set to True, and it won't attempt to load back the previous model or anything, if that's why it was off.
- The process for converting the mask has been simplified to three QImage format conversions. The mask is converted to Alpha8, dropping the RGB channels, then reinterpreted as grayscale, so that converting it back to RGBA8888 places the alpha data in the RGB channels. Simple, clean, seems a bit faster, and it doesn't require any specific mask colors, so that limitation can be crossed off the list.