Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] Attempt to reduce 502s in queue-proxy due to connect errors #5506

Closed
wants to merge 4 commits into from

Conversation

tcnghia
Copy link
Contributor

@tcnghia tcnghia commented Sep 12, 2019

Proposed Changes

Release Note

NONE

@knative-prow-robot knative-prow-robot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Sep 12, 2019
@googlebot googlebot added the cla: yes Indicates the PR's author has signed the CLA. label Sep 12, 2019
@knative-prow-robot knative-prow-robot added the size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. label Sep 12, 2019
Copy link
Contributor

@knative-prow-robot knative-prow-robot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@tcnghia: 0 warnings.

In response to this:

Proposed Changes

  • Also retry on 'connection refused' errors, instead of just 'connect timeout'.

Release Note

NONE

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@tcnghia
Copy link
Contributor Author

tcnghia commented Sep 12, 2019

/test pull-knative-serving-istio-1.1-mesh

@tcnghia
Copy link
Contributor Author

tcnghia commented Sep 12, 2019

/test pull-knative-serving-istio-1.2-mesh

@knative-prow-robot knative-prow-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Sep 12, 2019
@tcnghia
Copy link
Contributor Author

tcnghia commented Sep 12, 2019

/test pull-knative-serving-istio-1.1-mesh
/test pull-knative-serving-istio-1.2-mesh

@tcnghia tcnghia changed the title [WIP] Also retry on 'connection refused' errors. Attempt to reduce 502 in queue-proxy due to connect errors Sep 12, 2019
@knative-prow-robot knative-prow-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Sep 12, 2019
@tcnghia tcnghia changed the title Attempt to reduce 502 in queue-proxy due to connect errors Attempt to reduce 502s in queue-proxy due to connect errors Sep 12, 2019
@vagababov
Copy link
Contributor

/lgtm
/hold

@knative-prow-robot knative-prow-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Sep 12, 2019
@knative-prow-robot knative-prow-robot added the lgtm Indicates that a PR is ready to be merged. label Sep 12, 2019
Copy link
Contributor

@markusthoemmes markusthoemmes left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm
/approve

@knative-prow-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: markusthoemmes, tcnghia

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@tcnghia
Copy link
Contributor Author

tcnghia commented Sep 12, 2019

Still hitting 502s 4 times

kubelogs.go:158: I 06:30:43.513 [kpa-class-podautoscaler-controller] [serving-tests/autoscale-sustaining-qgicvgkz-c852f] Reconcile succeeded. Time taken: 473.352µs.
autoscale_test.go:140: Status = 502, want: 200
autoscale_test.go:141: Response: status: 502, body: dial tcp 127.0.0.1:8080: connect: connection refused
    , headers: map[Content-Length:[53] Content-Type:[text/plain; charset=utf-8] Date:[Thu, 12 Sep 2019 06:30:43 GMT] Server:[istio-envoy] X-Content-Type-Options:[nosniff] X-Envoy-Upstream-Service-Time:[4] Zipkin_trace_id:[dbc7d1e6eb91c83f7906320e03104e33]]
spoof.go:122: Spoofing autoscale-sustaining-qgicvgkz.serving-tests.35.188.4.244.xip.io:80 -> 35.188.4.244:80
spoof.go:122: Spoofing autoscale-sustaining-qgicvgkz.serving-tests.35.188.4.244.xip.io:80 -> 35.188.4.244:80
spoof.go:122: Spoofing autoscale-sustaining-qgicvgkz.serving-tests.35.188.4.244.xip.io:80 -> 35.188.4.244:80
autoscale_test.go:140: Status = 502, want: 200
autoscale_test.go:141: Response: status: 502, body: dial tcp 127.0.0.1:8080: connect: connection refused
    , headers: map[Content-Length:[53] Content-Type:[text/plain; charset=utf-8] Date:[Thu, 12 Sep 2019 06:30:43 GMT] Server:[istio-envoy] X-Content-Type-Options:[nosniff] X-Envoy-Upstream-Service-Time:[2] Zipkin_trace_id:[70239972f43721741312dec5f07b70aa]]
autoscale_test.go:140: Status = 502, want: 200
autoscale_test.go:141: Response: status: 502, body: dial tcp 127.0.0.1:8080: connect: connection refused
    , headers: map[Content-Length:[53] Content-Type:[text/plain; charset=utf-8] Date:[Thu, 12 Sep 2019 06:30:43 GMT] Server:[istio-envoy] X-Content-Type-Options:[nosniff] X-Envoy-Upstream-Service-Time:[3] Zipkin_trace_id:[dcf7441c5bbcb44b2f8fb3bb29698646]]
autoscale_test.go:140: Status = 502, want: 200
autoscale_test.go:141: Response: status: 502, body: dial tcp 127.0.0.1:8080: connect: connection refused
    , headers: map[Content-Length:[53] Content-Type:[text/plain; charset=utf-8] Date:[Thu, 12 Sep 2019 06:30:43 GMT] Server:[istio-envoy] X-Content-Type-Options:[nosniff] X-Envoy-Upstream-Service-Time:[2] Zipkin_trace_id:[2923b60162cd7c9aab5aa3a91f2e1482]]
spoof.go:122: Spoofing autoscale-sustaining-qgicvgkz.serving-tests.35.188.4.244.xip.io:80 -> 35.188.4.244:80

@knative-prow-robot
Copy link
Contributor

New changes are detected. LGTM label has been removed.

@knative-prow-robot knative-prow-robot removed lgtm Indicates that a PR is ready to be merged. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Sep 12, 2019
@knative-prow-robot knative-prow-robot added the size/S Denotes a PR that changes 10-29 lines, ignoring generated files. label Sep 12, 2019
@tcnghia tcnghia changed the title Attempt to reduce 502s in queue-proxy due to connect errors [WIP] Attempt to reduce 502s in queue-proxy due to connect errors Sep 12, 2019
@knative-prow-robot knative-prow-robot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Sep 12, 2019
@tcnghia
Copy link
Contributor Author

tcnghia commented Sep 12, 2019

Looks like either net.Error#Temporary() doesn't include connection refused or the retries didn't help.

@tcnghia
Copy link
Contributor Author

tcnghia commented Sep 12, 2019

/test pull-knative-serving-istio-1.1-mesh
/test pull-knative-serving-istio-1.2-mesh

@knative-metrics-robot
Copy link

The following is the coverage report on pkg/.
Say /test pull-knative-serving-go-coverage to re-run this coverage report

File Old Coverage New Coverage Delta
pkg/network/transports.go 96.0% 95.7% -0.3

@knative-prow-robot
Copy link
Contributor

@tcnghia: The following tests failed, say /retest to rerun them all:

Test name Commit Details Rerun command
pull-knative-serving-istio-1.1-mesh 2492232 link /test pull-knative-serving-istio-1.1-mesh
pull-knative-serving-istio-1.2-mesh 2492232 link /test pull-knative-serving-istio-1.2-mesh

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@tcnghia
Copy link
Contributor Author

tcnghia commented Sep 12, 2019

Looks like retrying won't fix

    autoscale_test.go:141: Response: status: 502, body: timed out dialing```

@tcnghia
Copy link
Contributor Author

tcnghia commented Sep 12, 2019

/close

@knative-prow-robot
Copy link
Contributor

@tcnghia: Closed this PR.

In response to this:

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. area/networking cla: yes Indicates the PR's author has signed the CLA. do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. size/S Denotes a PR that changes 10-29 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants