-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Difference in reliability w/ SubtitleEdit+Tesseract? #14
Comments
I have this issue as well where some dialogue is missing from the ripped SRT file. I'm not sure if it's related but it seems to happen with segments that are in a different position from the center. @ratoaq2 Here is a sample containing the original SUP file, the ripped pgsrip SRT file, and a Subtitle Edit version (which does not have any dialogue missing) that may help diagnose the issue. |
I tested out an older version of pgsrip (v0.1.4) and the issue does not happen there and all dialogue is extracted properly as expected. So it must be a recent change that causes this. |
Good observation. On another media:
|
Thanks for reporting it and providing information to reproduce it. I'm doing a release with the fix |
Hi, thanks for making this great tool! I am running into similar issues where SubtitleEd I attached a sample sup file where I encountered the issue Help would be appreciated |
Hi!
Really excited to see this tool - fits amazingly as a tdarr plugin too!
I noticed while parsing my first English PGS (1h15min) that it detected 530 strings, whereas SubtitleEdit + Tesseract 5.3.0 detected 783 strings (and had great accuracy on them). I felt a bit surprised considering that both use Tesseract 5 and the performance should theoretically be really good regardless of the dataset given it's just working with bare English, black on white, straight. I noted that when subtitles are missed, the previous subtitles would stick around for a long time.
Do you have any ideas from your experience?
The text was updated successfully, but these errors were encountered: