Question of fine tuning #64

ioky31 · 2025-02-28T05:37:32Z

Thank you for your excellent work. I am planning to fine-tune Show-o for a specific downstream task. I used a medical image-text dataset for stage 3 fine-tuning, but the generated results are unsatisfactory and show mode collapse. How can I solve this problem?

ioky31 · 2025-02-28T06:11:25Z

I wonder if it's caused by the dataset being too homogeneous.

Sierkinhane · 2025-02-28T07:43:10Z

Can you show some examples? Also, how large is the dataset you used for fine-tuning?

ioky31 · 2025-02-28T08:13:38Z

Approximately 90k cases.

Sierkinhane · 2025-02-28T08:19:53Z

Classifier-free guidance (CFG) is disabled for the online inference that runs during training. Try offline inference with CFG enabled.
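For readers unfamiliar with CFG, here is a minimal sketch of the usual logit blending. It is a general illustration, not Show-o's actual code: the function name `apply_cfg`, the toy logits, and the `(1 + w) * cond - w * uncond` convention are assumptions (some codebases use a slightly different scaling convention).

```python
def apply_cfg(cond_logits, uncond_logits, guidance_scale):
    """Blend conditional and unconditional logits (classifier-free guidance).

    Hypothetical helper for illustration; guidance_scale = 0 leaves the
    conditional logits unchanged, which corresponds to CFG being disabled.
    """
    return [
        (1.0 + guidance_scale) * c - guidance_scale * u
        for c, u in zip(cond_logits, uncond_logits)
    ]


# Toy logits over three candidate tokens (made-up values).
cond = [2.0, 0.5, -1.0]    # prediction given the text prompt
uncond = [1.0, 0.5, 0.0]   # prediction given an empty prompt

no_cfg = apply_cfg(cond, uncond, 0.0)    # same as cond
with_cfg = apply_cfg(cond, uncond, 2.0)  # pushed further along (cond - uncond)
```

With a positive scale, tokens that the prompt makes more likely are boosted and the others are suppressed, which is why samples produced without CFG can look noticeably weaker than offline samples with it.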

Besides, the image in the first row is predicted in a single step from a high mask ratio (0.87), which significantly degrades generation quality.
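To illustrate why a single step is hard, here is a rough sketch of iterative mask decoding with a cosine schedule, as used in MaskGIT-style decoders. It assumes a cosine schedule and a hypothetical count of 256 image tokens; the function name `masked_token_counts` is made up, and the exact schedule in a given setup may differ.

```python
import math


def masked_token_counts(num_tokens, num_steps):
    """Tokens still masked after each decoding step under a cosine schedule.

    Illustrative only: assumes mask_ratio(t) = cos(pi/2 * t / T).
    """
    counts = []
    for step in range(1, num_steps + 1):
        ratio = math.cos(math.pi / 2 * step / num_steps)
        counts.append(max(0, math.floor(ratio * num_tokens)))
    return counts


single_step = masked_token_counts(256, 1)   # everything committed at once
multi_step = masked_token_counts(256, 16)   # tokens committed gradually
```

In the single-step case every masked token is committed at once with no chance to condition on earlier predictions, whereas with more steps each round fills in only part of the image and the later rounds see that context.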

ioky31 changed the title ~~Question of fine turning~~ Question of fine tuning Feb 28, 2025
