-
Notifications
You must be signed in to change notification settings - Fork 73
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Batch normalization --training parameter #11
Comments
@fmehmetun Thanks for reporting this. After a little digging, this seems to be due to different weight offsets (16 vs 20) for different major/minor versions. So, yolov2-tiny, yolov3-tiny and yolov3 seem to require an offset of 20 instead of 16. If not set properly, this can corrupt the converted TF weights (ckpt), which likely caused the Fortunately someone fixed this for darkflow in this PR. From a quick test, it seems to resolve your issue. I'll run some more tests and push the fix shortly. |
@fmehmetun - give it a try and let me know if you see any other issues. |
Thanks for the fix. I tried now and its working with no problem. After opening issue I tried darkflow though, it's worked with no problem too. It's good to know I have another option for conversion. Thanks. |
Hi, I wanted to use YOLOv3-tiny model. Downloaded cfg and weights from official website.
With this code below i successfully built .pb and .meta files.
python main.py --cfg ../yolov3-tiny/yolov3-tiny.cfg --weights ../yolov3-tiny/yolov3-tiny.weights --output ../yolov3-tiny/ --prefix "YOLO/"
With this script below I could load graph and weights.
Tried to get output from last convolutional13 layer, I got array with full of nan values:
Outputs:
However when i tried same conversion with
python main.py --training --cfg ../yolov3-tiny/yolov3-tiny.cfg --weights ../yolov3-tiny/yolov3-tiny.weights --output ../yolov3-tiny/ --prefix "YOLO/
Same script outputs:
I believe this is because batch-normalization, --training parameter. And I want to use this model for transfer learning.
Also when I tried to get output from earlier layers like convolutional2 (without --training parameter), values were like:
Is this a problem about code or am I missing something about like image input?
The text was updated successfully, but these errors were encountered: