You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The GitHub connector ran into a 404 error attempting to sync a mostly empty repo. The repo had only a single commit in it (e.g. README.md, template code, etc from a scaffolding tool). The commit was performed using an email address that is not associated with any GitHub account.
The connector hit a 404 error attempting to pull the /contributors?per_page=100 request and aborts further actions.
The API docs aren't particularly clear when the 404 would occur, I assumed only on a missing repository.
Annoyingly, I get the same response from this scaffolded repo as I get on a properly missing repo. When I run the same request against a purely empty repo, I successfully get a list of contributors. This may ultimately be a GitHub API bug. 😕
The 404 error should be handled and the connector should continue as if no contributors are present.
Logs
LOG
2022-02-23 00:19:00 source > SourceGithub runtimes:
2022-02-23 00:19:00 source > Syncing stream: collaborators
2022-02-23 00:19:00 source > Undefined error while reading records: Not Found
2022-02-23 00:19:00 source > Encountered an exception while reading stream SourceGithub
Traceback (most recent call last):
File "/usr/local/lib/python3.7/site-packages/airbyte_cdk/sources/abstract_source.py", line 108, in read
internal_config=internal_config,
File "/usr/local/lib/python3.7/site-packages/airbyte_cdk/sources/abstract_source.py", line 141, in _read_stream
for record in record_iterator:
File "/usr/local/lib/python3.7/site-packages/airbyte_cdk/sources/abstract_source.py", line 213, in _read_full_refresh
for record in records:
File "/airbyte/integration_code/source_github/streams.py", line 150, in read_records
raise e
File "/airbyte/integration_code/source_github/streams.py", line 97, in read_records
yield from super().read_records(stream_slice=stream_slice, **kwargs)
File "/usr/local/lib/python3.7/site-packages/airbyte_cdk/sources/streams/http/http.py", line 366, in read_records
response = self._send_request(request, request_kwargs)
File "/usr/local/lib/python3.7/site-packages/airbyte_cdk/sources/streams/http/http.py", line 333, in _send_request
return backoff_handler(user_backoff_handler)(request, request_kwargs)
File "/usr/local/lib/python3.7/site-packages/backoff/_sync.py", line 94, in retry
ret = target(*args, **kwargs)
File "/usr/local/lib/python3.7/site-packages/backoff/_sync.py", line 94, in retry
ret = target(*args, **kwargs)
File "/usr/local/lib/python3.7/site-packages/airbyte_cdk/sources/streams/http/http.py", line 297, in _send
response.raise_for_status()
File "/usr/local/lib/python3.7/site-packages/requests/models.py", line 960, in raise_for_status
raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 404 Client Error: Not Found for url: https://api.github.com/repos/my_org/my_repo/collaborators?per_page=100
2022-02-23 00:19:00 source > Finished syncing SourceGithub
2022-02-23 00:19:00 source > SourceGithub runtimes:
2022-02-23 00:19:00 source > 404 Client Error: Not Found for url: https://api.github.com/repos/my_org/my_repo/collaborators?per_page=100
Traceback (most recent call last):
File "/airbyte/integration_code/main.py", line 13, in <module>
launch(source, sys.argv[1:])
File "/usr/local/lib/python3.7/site-packages/airbyte_cdk/entrypoint.py", line 127, in launch
for message in source_entrypoint.run(parsed_args):
File "/usr/local/lib/python3.7/site-packages/airbyte_cdk/entrypoint.py", line 118, in run
for message in generator:
File "/usr/local/lib/python3.7/site-packages/airbyte_cdk/sources/abstract_source.py", line 112, in read
raise e
File "/usr/local/lib/python3.7/site-packages/airbyte_cdk/sources/abstract_source.py", line 108, in read
internal_config=internal_config,
File "/usr/local/lib/python3.7/site-packages/airbyte_cdk/sources/abstract_source.py", line 141, in _read_stream
for record in record_iterator:
File "/usr/local/lib/python3.7/site-packages/airbyte_cdk/sources/abstract_source.py", line 213, in _read_full_refresh
for record in records:
File "/airbyte/integration_code/source_github/streams.py", line 150, in read_records
raise e
File "/airbyte/integration_code/source_github/streams.py", line 97, in read_records
yield from super().read_records(stream_slice=stream_slice, **kwargs)
File "/usr/local/lib/python3.7/site-packages/airbyte_cdk/sources/streams/http/http.py", line 366, in read_records
response = self._send_request(request, request_kwargs)
File "/usr/local/lib/python3.7/site-packages/airbyte_cdk/sources/streams/http/http.py", line 333, in _send_request
return backoff_handler(user_backoff_handler)(request, request_kwargs)
File "/usr/local/lib/python3.7/site-packages/backoff/_sync.py", line 94, in retry
ret = target(*args, **kwargs)
File "/usr/local/lib/python3.7/site-packages/backoff/_sync.py", line 94, in retry
ret = target(*args, **kwargs)
File "/usr/local/lib/python3.7/site-packages/airbyte_cdk/sources/streams/http/http.py", line 297, in _send
response.raise_for_status()
File "/usr/local/lib/python3.7/site-packages/requests/models.py", line 960, in raise_for_status
raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 404 Client Error: Not Found for url: https://api.github.com/repos/my_org/my_repo/collaborators?per_page=100
2022-02-23 00:19:31 destination > 2022-02-23 00:19:31 INFO i.a.i.b.FailureTrackingAirbyteMessageConsumer(close):65 - Airbyte message consumer: succeeded.
Steps to Reproduce
Create a new GitHub repo with a single commit using a dummy email address not associated with any GitHub account. (maybe?)
Set up the connector with a destination and include the collaborators table.
Run it.
Are you willing to submit a PR?
Yes. Currently disabling the collaborators functionality to do an initial load of all our data. Can come back to this later.
The text was updated successfully, but these errors were encountered:
Environment
Current Behavior
The GitHub connector ran into a 404 error attempting to sync a mostly empty repo. The repo had only a single commit in it (e.g.
README.md
, template code, etc from a scaffolding tool). The commit was performed using an email address that is not associated with any GitHub account.The connector hit a 404 error attempting to pull the
/contributors?per_page=100
request and aborts further actions.The API docs aren't particularly clear when the 404 would occur, I assumed only on a missing repository.
Annoyingly, I get the same response from this scaffolded repo as I get on a properly missing repo. When I run the same request against a purely empty repo, I successfully get a list of contributors. This may ultimately be a GitHub API bug. 😕
Expected Behavior
The 404 error should be handled and the connector should continue as if no contributors are present.
Logs
LOG
Steps to Reproduce
collaborators
table.Are you willing to submit a PR?
Yes. Currently disabling the
collaborators
functionality to do an initial load of all our data. Can come back to this later.The text was updated successfully, but these errors were encountered: