🎁 octavia-cli: add telemetry #11896

alafanechere · 2022-04-11T22:50:03Z

What

We want to add telemetry to octavia-cli in order to track the adoption of the tool and detect common errors.

How

Use Segment's Python package: I created two Segment sources with their write keys (octavia-cli prod and dev).
Track usage of commands by defining a custom OctaviaCommand subclass of click's Command. I patched the parent's make_context and invoke methods to collect and send telemetry on both success and error.
Set the user agent of octavia-cli, in case this helps measure the traffic sent to the API from the CLI.
Update the README to explain what data is collected and how to disable telemetry.
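For readers who want to see the shape of the second point, here is a minimal sketch of a click Command subclass that reports the outcome of make_context and invoke. It is not the code from this PR: `TelemetryRecorder`, `recorder` and the `apply` command are hypothetical stand-ins, and the recorder keeps events in memory instead of calling Segment.

```python
import click
from click.testing import CliRunner


class TelemetryRecorder:
    """Hypothetical stand-in for a Segment client: keeps events in memory."""

    def __init__(self):
        self.events = []

    def send(self, command_name, error=None):
        # Store the command name and the error class name only.
        error_type = type(error).__name__ if error is not None else None
        self.events.append((command_name, error_type))


recorder = TelemetryRecorder()


class OctaviaCommand(click.Command):
    """Sketch of a Command subclass that reports success or failure."""

    def make_context(self, info_name, args, parent=None, **extra):
        try:
            return super().make_context(info_name, args, parent=parent, **extra)
        except click.exceptions.Exit:
            # Raised by --help and similar eager options; not an error.
            raise
        except Exception as error:
            # Parsing failures (for example an unknown option) land here.
            recorder.send(info_name, error)
            raise

    def invoke(self, ctx):
        try:
            result = super().invoke(ctx)
        except click.exceptions.Exit:
            raise
        except Exception as error:
            recorder.send(ctx.info_name, error)
            raise
        recorder.send(ctx.info_name)
        return result


# Hypothetical command used only to exercise the subclass.
@click.command(cls=OctaviaCommand, name="apply")
@click.option("--fail", is_flag=True, help="Simulate a failing run.")
def apply(fail):
    if fail:
        raise RuntimeError("simulated failure")
    click.echo("applied")


if __name__ == "__main__":
    runner = CliRunner()
    runner.invoke(apply, [])
    runner.invoke(apply, ["--fail"])
    print(recorder.events)
```

Both hooks re-raise after recording, so click's own error handling and exit codes are left untouched.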

🚨 User Impact 🚨

This should be transparent for users

ChristopheDuong · 2022-04-12T14:19:58Z

octavia-cli/octavia_cli/telemetry.py

+        user_id = ctx.obj.get("WORKSPACE_ID")
+        anonymous_id = None if user_id else str(uuid.uuid1())
+
+        segment_context = {"app": {"name": "octavia-cli", "version": ctx.obj.get("OCTAVIA_VERSION")}}


Tracking the AIRBYTE_VERSION of the API that is being interacted with could also help, but we'd need an API endpoint to know what version the server is running?

It looks like there's no endpoint in the API that gives the airbyte version information 🤦
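To make the identity logic in the hunk above easier to try out, here is a small sketch that wraps the same three lines in a function. The function name `build_identity` and the plain dict standing in for `ctx.obj` are assumptions for illustration. Note that `uuid.uuid1()` derives its value from the host ID and the current time.

```python
import uuid


def build_identity(ctx_obj):
    """Hypothetical helper mirroring the diff: pick a user id or an anonymous id."""
    # The workspace id is used as the Segment user id when it is known.
    user_id = ctx_obj.get("WORKSPACE_ID")
    # Fall back to a freshly generated anonymous id otherwise.
    anonymous_id = None if user_id else str(uuid.uuid1())
    segment_context = {
        "app": {"name": "octavia-cli", "version": ctx_obj.get("OCTAVIA_VERSION")}
    }
    return user_id, anonymous_id, segment_context
```

Exactly one of `user_id` and `anonymous_id` is set for any given call, which is what Segment needs to attribute an event.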

ChristopheDuong · 2022-04-12T14:36:13Z

octavia-cli/README.md

+* Success or failure of the command run and the error type.
+* The workspace id. It is unique to each Airbyte instance, but we can't match it to a username or email address.
+
+You can disable telemetry by setting the `OCTAVIA_ENABLE_TELEMETRY` environment to `false` or using the `--disable-telemetry` flag.


Maybe the "default" telemetry behavior could also follow the tracking setting of the Airbyte instance?

The boolean for tracking or not is the TRACKING_STRATEGY env var. Unfortunately, the API only returns the anonymousDataCollection field, which does not mean the user has turned off analytics on their instance; it just means they want to avoid mapping their workspace id to their email. Our default is the "anonymous data collection" as we don't handle emails.
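As a rough illustration of the opt-out described in the README hunk, the sketch below combines the `OCTAVIA_ENABLE_TELEMETRY` environment variable with the `--disable-telemetry` flag. The function name `telemetry_enabled` is hypothetical, and treating `0` and `no` as false values is an assumption; the README only mentions `false`.

```python
def telemetry_enabled(environ, disable_flag=False):
    """Hypothetical helper: decide whether telemetry should be sent.

    environ is any mapping of environment variables (for example os.environ);
    disable_flag is the parsed value of --disable-telemetry.
    """
    # The command-line flag always wins.
    if disable_flag:
        return False
    # Telemetry stays on unless the variable is set to a false-like value.
    raw = environ.get("OCTAVIA_ENABLE_TELEMETRY", "true")
    # Assumption: "0" and "no" are accepted in addition to "false".
    return raw.strip().lower() not in {"false", "0", "no"}
```

Passing the environment in as a mapping keeps the function easy to test without touching the real process environment.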

alafanechere · 2022-04-13T16:58:01Z

airbyte-tests/src/acceptanceTests/java/io/airbyte/test/acceptance/AcceptanceTests.java

  public void testUpdateConnectionWhenWorkflowUnreachable() throws Exception {
-    // This test only covers the specific behavior of updating a connection that does not have an underlying temporal workflow.
-    // This case only occurs with the new scheduler, so the entire test is inside the feature flag conditional.
-    // Also, this test doesn't verify correctness of the schedule update applied, as adding the ability to query a workflow for its current
-    // schedule is out of scope for the issue (https://github.com/airbytehq/airbyte/issues/11215). This test just ensures that the underlying workflow
+    // This test only covers the specific behavior of updating a connection that does not have an
+    // underlying temporal workflow.
+    // This case only occurs with the new scheduler, so the entire test is inside the feature flag
+    // conditional.
+    // Also, this test doesn't verify correctness of the schedule update applied, as adding the ability
+    // to query a workflow for its current
+    // schedule is out of scope for the issue (https://github.com/airbytehq/airbyte/issues/11215). This
+    // test just ensures that the underlying workflow


This was not formatted on master and broke the build.

lmossman

Had several small comments but I don't think any of them are blocking. Otherwise LGTM

octavia-cli/octavia_cli/base_commands.py

lmossman · 2022-04-13T23:09:50Z

octavia-cli/README.md

+This CLI has some telemetry tooling to send Airbyte some data about the usage of this tool.
+We use this data to measure the tool's adoption and detect common errors users encounter to improve it.
+The telemetry sends data about:
+* Which command was run (not the arguments or options used).


(not the arguments or options used)

Just curious, why are we not tracking the arguments/options used in a request? Is this because we expect the arguments to be different for each user, so it will not be helpful in the aggregate user tracking we want to get out of this?

I wanted to add the least amount of user data to this telemetry. As arguments or options are free user inputs, I did not want to risk exposing sensitive information in the telemetry. I'm also thinking that to grasp which resources are managed with Octavia, we can rely on the existing telemetry from the core platform. This is why I added a dedicated user agent for Octavia. We can find Octavia-related traffic if the user agent is available in the API telemetry. We can also join the workspace id from Octavia telemetry with the workspace id from the API telemetry to get a bit more insight.
I wanted to focus Octavia telemetry on getting a sense of which commands are used and which commands cause errors, that's all.
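The reasoning above can be sketched as a payload builder that never sees the arguments at all. The function name and the property keys (`command`, `success`, `error_type`) are hypothetical; only the list of collected fields comes from the README.

```python
def build_event_properties(command_name, error=None):
    """Hypothetical helper: build the properties sent with a tracking event.

    Arguments and options are deliberately not parameters of this function,
    so free-form user input cannot leak into the payload. Only the class
    name of the error is kept, never its message.
    """
    return {
        "command": command_name,
        "success": error is None,
        "error_type": type(error).__name__ if error is not None else None,
    }
```

Keeping the error message out matters as much as keeping the arguments out, since exception messages often echo user input.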

octavia-cli/octavia_cli/telemetry.py

octavia-cli/setup.py

implement telemetry

0e7a5ae

alafanechere force-pushed the augustin/octavia-cli/track branch from c629d04 to 0e7a5ae Compare April 12, 2022 13:53

alafanechere marked this pull request as ready for review April 12, 2022 14:00

alafanechere added the area/octavia-cli label Apr 12, 2022

alafanechere self-assigned this Apr 12, 2022

alafanechere requested review from cgardens, lmossman, marcosmarxm and ChristopheDuong April 12, 2022 14:04

ChristopheDuong reviewed Apr 12, 2022

alafanechere added 3 commits April 12, 2022 16:52

format

c1971ba

update readme

2feb7e5

format AcceptanceTests.java

4efc055

alafanechere temporarily deployed to more-secrets April 12, 2022 15:14 Inactive

alafanechere commented Apr 13, 2022

lmossman approved these changes Apr 14, 2022

alafanechere added 6 commits April 14, 2022 09:41

implement Lake's suggestions

979bf2c

Merge branch 'master' into augustin/octavia-cli/track

325badb

update to latest Airbyte version

5131783

update python version in setup.py

8c8d8b3

update changelog

db4aab0

update write key

0d1d34d

alafanechere merged commit 0964c83 into master Apr 14, 2022

alafanechere deleted the augustin/octavia-cli/track branch April 14, 2022 10:11

octavia-squidington-iii mentioned this pull request Apr 14, 2022

Bump Airbyte version from 0.35.67-alpha to 0.35.68-alpha #12015

Merged

suhomud pushed a commit that referenced this pull request May 23, 2022

🎁 octavia-cli: add telemetry (#11896)

ec449c2
