Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question of fine tuning #64

Open
ioky31 opened this issue Feb 28, 2025 · 4 comments
Open

Question of fine tuning #64

ioky31 opened this issue Feb 28, 2025 · 4 comments

Comments

@ioky31
Copy link

ioky31 commented Feb 28, 2025

Thank you for your excellent work. I am planning to fine-tune show-o for a specific downstream task, I used medical image text dataset for stage3 fine-tuning, but I found that the generated result is not satisfactory, and the mode collapse, how can I solve this problem?

@ioky31 ioky31 changed the title Question of fine turning Question of fine tuning Feb 28, 2025
@ioky31
Copy link
Author

ioky31 commented Feb 28, 2025

I wonder if it's caused by the dataset being too homogeneous in terms of data

@Sierkinhane
Copy link
Collaborator

Can you show some examples and what's the dataset size you use to finetune?

@ioky31
Copy link
Author

ioky31 commented Feb 28, 2025

Image
Approximately 90k cases of data

@Sierkinhane
Copy link
Collaborator

The cfg is disabled when online inference during training. You'd try offline inference and enable the cfg.

Besides, the image on the first row is predicted in one step with a high mask ratio (0.87), which significantly degrades the generation quality.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants