Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

sql-agent dont respond #11

Open
kilin-s opened this issue Jan 24, 2018 · 7 comments
Open

sql-agent dont respond #11

kilin-s opened this issue Jan 24, 2018 · 7 comments

Comments

@kilin-s
Copy link

kilin-s commented Jan 24, 2018

Hi.

I use sql-agent on docker with prometheus-sql. Periodicaly sql-agent stuck, only restart is helps.
No errors in log

15.01.2018 18:12:592018/01/15 14:12:59 main.go:107: * Listening on 0.0.0.0:5000...
16.01.2018 11:04:082018/01/16 07:04:08 main.go:107: * Listening on 0.0.0.0:5000...
16.01.2018 11:41:172018/01/16 07:41:17 main.go:107: * Listening on 0.0.0.0:5000...
17.01.2018 11:46:452018/01/17 07:46:45 main.go:107: * Listening on 0.0.0.0:5000...
22.01.2018 11:25:162018/01/22 07:25:16 main.go:107: * Listening on 0.0.0.0:5000...
23.01.2018 11:04:592018/01/23 07:04:59 main.go:107: * Listening on 0.0.0.0:5000...
23.01.2018 11:44:362018/01/23 07:44:36 main.go:107: * Listening on 0.0.0.0:5000...
23.01.2018 18:11:312018/01/23 14:11:31 main.go:107: * Listening on 0.0.0.0:5000...
23.01.2018 22:30:202018/01/23 18:30:20 main.go:107: * Listening on 0.0.0.0:5000...
24.01.2018 14:23:532018/01/24 10:23:53 main.go:107: * Listening on 0.0.0.0:5000...

docker-compose.yml

version: '2'
services:
  sql-agent:
    image: dbhi/sql-agent
    tty: true
    ports:
    - 5000:5000/tcp
    labels:
      io.rancher.container.pull_image: always
@bruth
Copy link
Contributor

bruth commented Jan 24, 2018

Sorry you are having trouble. Can you describe what you mean by getting stuck? What is the behavior you observe before you restart it?

@kilin-s
Copy link
Author

kilin-s commented Jan 25, 2018

i see errors in prometheus-sql log

[metric_name] 2018/01/25 09:37:56 Post http://sql-agent.sql-agent.renv-0043.rancher-test.domain.ru:5000: net/http: request canceled (Client.Timeout exceeded while awaiting headers)
[metric_name] 2018/01/25 09:37:56 Backing off for 5m0s

When i try send request by curl i don't receive any responce.

@bruth
Copy link
Contributor

bruth commented Jan 25, 2018

Have you tried curling against sql-agent directly? Can you confirm is it is not a DNS/port issue?

@kilin-s
Copy link
Author

kilin-s commented Jan 26, 2018

yes i tried curling directly and don't get responce, but port is open and i can connect to it by telnet. I think problem between sql-agent and postgres.

@kilin-s
Copy link
Author

kilin-s commented Jan 26, 2018

requests from prometheus-sql reach the sql-agent but not reach the postgres.

sometimes sql-agent start works without restart container.

And i forgot to say that postgres and prometheus-sql placed in one network, sql-agent in other.

@bruth
Copy link
Contributor

bruth commented Jan 26, 2018

Just to relay the request flow, prometheus-sql sends a request to sql-agent with db connection info and the query. sql-agent then attempts to open a connection to the database. So if sql-agent can't talk to postgres then it should still respond with an error in the response noting the problem, such as a connection issue. Here is the request handler. In case you only tried curling with GET, it only takes POST requests.

@kilin-s
Copy link
Author

kilin-s commented Jan 26, 2018

i see miscommunication here. I'm sorry, English is not my native language

i know that "prometheus-sql sends a request to sql-agent with db connection info and the query. sql-agent then attempts to open a connection to the database." and "only takes POST requests"

And again

1 when applications start, everything is fine, but after random time interval sql-agent dont send requests to the prometheus. In prometheus-sql logs i see

[metric_name] 2018/01/25 09:37:56 Post http://sql-agent.sql-agent.renv-0043.rancher-test.domain.ru:5000: net/http: request canceled (Client.Timeout exceeded while awaiting headers)
[metric_name] 2018/01/25 09:37:56 Backing off for 5m0s

in sql-agent logs i see only that application is started

2 for restore working capacity i restart container with sql-agent, but somtimes sql-agent restores working capacity without restart the container.

3 when i curling i get responce instantly, when i curling in problem moment i gen nothing

* About to connect() to sql-agent.sql-agent.renv-0043.rancher-test.domain.ru port 5000 (#0)
*   Trying 10.6.109.231... connected
* Connected to sql-agent.sql-agent.renv-0043.rancher-test.domain.ru (10.6.109.231) port 5000 (#0)
> POST / HTTP/1.1
> User-Agent: curl/7.19.7 (x86_64-redhat-linux-gnu) libcurl/7.19.7 NSS/3.15.3 zlib/1.2.3 libidn/1.18 libssh2/1.4.2
> Host: sql-agent.sql-agent.renv-0043.rancher-test.domain.ru:5000
> Accept: */*
> Content-Length: 253
> Content-Type: application/x-www-form-urlencoded
>
^C

and no errors

4 in problem time with tcpdump i don't see packets from sql-agent to postgres.
5 postgres and prometheus-sql placed in one network sql-agent in other.

Can you create release for sql-agent, maybe when applications be in one network all will be work fine.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants