Improvement/img detection extended #130

aljazkonec1 · 2024-11-13T08:03:25Z

This PR adapts ImgDetectionsExtended to use dai.RotatedRect object for storing information about the (rotated) bounding box. This simplifies porting to depthai in the future and reduces code duplication. Examples were also updated to take advantage of this new definition.

Some smaller updates were also made due to the use of dai v3 alpha6.

codecov-commenter · 2024-11-13T08:04:45Z

Codecov Report

Attention: Patch coverage is 76.00000% with 6 lines in your changes missing coverage. Please review.

Project coverage is 32.84%. Comparing base (37a2d92) to head (19e7f25).
Report is 5 commits behind head on main.

✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
depthai_nodes/ml/messages/img_detections.py	77.77%	4 Missing ⚠️
depthai_nodes/ml/messages/creators/detection.py	83.33%	1 Missing ⚠️
...pthai_nodes/ml/parsers/mediapipe_palm_detection.py	0.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #130      +/-   ##
==========================================
- Coverage   33.40%   32.84%   -0.56%     
==========================================
  Files          68       68              
  Lines        3739     3711      -28     
==========================================
- Hits         1249     1219      -30     
- Misses       2490     2492       +2

klemen1999

Generally LGTM, missing getter and setter for label_name. And let's also update README for messages with the new definition of ImgDetectionExtended

aljazkonec1 · 2024-11-13T10:28:55Z

> Generally LGTM, missing getter and setter for label_name. And let's also update README for messages with the new definition of ImgDetectionExtended

I'll update this now and add some other functionality before opening up for review.

jkbmrz

LGTM

depthai_nodes/ml/messages/creators/detection.py

examples/visualization/visualizers/detection.py

kkeroo · 2024-11-13T15:47:48Z

I think the user experience is now worse with ImgDetectionExtended because we add another "layer" called RotatedRect. There is no simple way to access x1, y1, ... or the angle directly from the message. Previously you could write x_center = detection.x_center; now you have to write x_center = detection.rotated_rect.center.x. Users will not find it straight away. The same goes for functions: previously it was xmin, ymin, xmax, ymax = detection.get_xyxy_bbox(); now it is detection.rotated_rect.getOuterRect().

If we keep RotatedRect, I would like us to add properties to ImgDetectionExtended, such as x_center returning detection.rotated_rect.center.x, along with the equivalent helper functions.
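As a minimal sketch of this suggestion (not the code that was merged), the properties could simply forward to the wrapped rectangle. To keep the example runnable without depthai, Point2f, Size2f and RotatedRect below are hypothetical stand-in dataclasses that only mimic the center/size/angle layout mentioned in this thread, and DetectionSketch is an illustrative name rather than the real ImgDetectionExtended class.

```python
from dataclasses import dataclass, field


@dataclass
class Point2f:
    """Stand-in for a 2D point (hypothetical, not dai.Point2f)."""
    x: float = 0.0
    y: float = 0.0


@dataclass
class Size2f:
    """Stand-in for a 2D size (hypothetical, not dai.Size2f)."""
    width: float = 0.0
    height: float = 0.0


@dataclass
class RotatedRect:
    """Stand-in for a rotated rectangle (hypothetical, not dai.RotatedRect)."""
    center: Point2f = field(default_factory=Point2f)
    size: Size2f = field(default_factory=Size2f)
    angle: float = 0.0


class DetectionSketch:
    """Illustrative detection message that stores its box as a rotated rect."""

    def __init__(self, rotated_rect: RotatedRect):
        self.rotated_rect = rotated_rect

    # Read-only shortcuts so callers do not need to know about the nesting.
    @property
    def x_center(self) -> float:
        return self.rotated_rect.center.x

    @property
    def y_center(self) -> float:
        return self.rotated_rect.center.y

    @property
    def width(self) -> float:
        return self.rotated_rect.size.width

    @property
    def height(self) -> float:
        return self.rotated_rect.size.height

    @property
    def angle(self) -> float:
        return self.rotated_rect.angle
```

With this shape, detection.x_center and detection.rotated_rect.center.x return the same value, so both access styles stay available.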

aljazkonec1 · 2024-11-14T08:14:48Z

I see your point @kkeroo but there are two things to consider:

1. DAI already has RotatedRect and uses it in a lot of functions (like ImageManipNodeV2). Therefore, for the user to access these functionalities, he would have to convert to RotatedRect object. Why not do it for them then?
2. When updating examples (and in some other code), I found the usage of RotatedRect to be easier than having to manually write x_center, y_center. I could just do rect = detection.rotated_rect and then focus on it as a whole rectangle instead of focusing on every single coordinate/shape on its own.

I do, however, still agree that we should have util functions for converting between "depthai objects" (RotatedRect, Point2f, Size2f, etc.) and numpy arrays. That can be done in another PR where we clearly define these functions and keep the syntax as close to depthai syntax as possible.

kkeroo · 2024-11-14T10:34:25Z

> DAI already has RotatedRect and uses it in a lot of functions (like ImageManipNodeV2). Therefore, for the user to access these functionalities, he would have to convert to RotatedRect object. Why not do it for them then?

Valid point.

> When updating examples (and in some other code), I found the usage of RotatedRect to be easier than having to manually write x_center, y_center. I could just do rect = detection.rotated_rect and then focus on it as a whole rectangle instead of focusing on every single coordinate/shape on its own.

Depends on the visualization. I see you used cv2.polylines:

rect = detection.rotated_rect
points = rect.getPoints()
bbox = np.array([[point.x, point.y] for point in points])
bbox = bbox.astype(int)
cv2.polylines(frame, [bbox], isClosed=True, color=(255, 0, 0), thickness=2)

If we just add a method to ImgDetectionExtended, the call becomes shorter and more intuitive. Inside the method we explicitly use rect.getPoints():

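def get_bbox(self):
    # Read the rect from the detection itself (self), not an outer variable.
    rect = self.rotated_rect
    points = rect.getPoints()
    # Corner points as an integer array, ready for cv2.polylines.
    bbox = np.array([[point.x, point.y] for point in points])
    bbox = bbox.astype(int)
    return bbox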

The user will then just call

bbox = detection.get_bbox()
cv2.polylines(...)

The same applies to rect.getOuterRect(). This just adds the option of getting the right bbox coordinates in one line, and we keep RotatedRect for the cases you described.
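For reference, the geometry behind the two helpers discussed here can be sketched without depthai. The functions below are hypothetical illustrations, not the library's implementation: rotated_rect_corners computes the four corners from a center, size and angle, and outer_rect derives the axis-aligned box enclosing them. The angle unit (degrees), the rotation direction and the corner ordering are assumptions and may differ from what RotatedRect actually returns.

```python
import math


def rotated_rect_corners(cx, cy, width, height, angle_deg):
    """Return the four corners of a rectangle rotated about its center.

    Assumes the angle is in degrees and that corners start at the
    unrotated top-left and go around in image coordinates (y grows down).
    """
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    half_w, half_h = width / 2.0, height / 2.0
    # Corner offsets from the center before rotation.
    offsets = [
        (-half_w, -half_h),
        (half_w, -half_h),
        (half_w, half_h),
        (-half_w, half_h),
    ]
    # Apply a standard 2D rotation to each offset, then translate.
    return [
        (cx + dx * cos_t - dy * sin_t, cy + dx * sin_t + dy * cos_t)
        for dx, dy in offsets
    ]


def outer_rect(corners):
    """Return (xmin, ymin, xmax, ymax) of the box enclosing the corners."""
    xs = [x for x, _ in corners]
    ys = [y for _, y in corners]
    return min(xs), min(ys), max(xs), max(ys)
```

A one-line wrapper on the detection message would then only need to pass its own center, size and angle to helpers like these, which is the convenience being asked for in this comment.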

aljazkonec1 · 2024-11-14T11:07:15Z

Okay I agree with this style now.

aljazkonec1 added 3 commits November 12, 2024 19:40

update to detections class

065b7ce

docstring update

2dc31c1

another docstrng update

4b8c1e5

github-actions bot assigned aljazkonec1 Nov 13, 2024

github-actions bot added labels messages (Changes affecting ml.messages) and parsers (Changes affecting ml.parsers) Nov 13, 2024

klemen1999 reviewed Nov 13, 2024

aljazkonec1 and others added 3 commits November 13, 2024 11:13

Merge remote-tracking branch 'origin/main' into improvement/ImgDetectionExtended

8abf852

depthai version update to alpha6 in CI

3dde8e5

Merge branch 'main' into improvement/ImgDetectionExtended

09509fc

aljazkonec1 added 2 commits November 13, 2024 12:12

docstring update, segmentation visualizer fix

065bd6b

Merge branch 'improvement/ImgDetectionExtended' of https://github.com/luxonis/depthai-nodes into improvement/ImgDetectionExtended

93e3d7e

aljazkonec1 marked this pull request as ready for review November 13, 2024 13:18

aljazkonec1 requested review from tersekmatija, jkbmrz and kkeroo as code owners November 13, 2024 13:18

klemen1999 approved these changes Nov 13, 2024

jkbmrz approved these changes Nov 13, 2024

aljazkonec1 added 3 commits November 14, 2024 08:29

Merge remote-tracking branch 'origin/main' into improvement/ImgDetectionExtended

6b36637

implemented suggested changes

2cc7f25

smaller update to draw.

19e7f25

aljazkonec1 merged commit d738aa5 into main Nov 14, 2024
10 checks passed

aljazkonec1 deleted the improvement/ImgDetectionExtended branch November 14, 2024 11:38
