Hot fixes to minimize and close idle database connections #547

aufdenkampe · 2022-01-06T16:25:07Z

As described in #543 (comment), the old code that inserted data from device POSTs, based on a set of convoluted Django models, was leaving open an ever increasing number of idle connections to the PostgreSQL database server. This resulted in every increasing CPU utilization that would max out in less than a week unless the database server was rebooted as seen below:

This general issue is well-described by:

This PR replaces most of the data insert Django model code with code based on SQLAchemy. When deployed to production as a hot fix at around 16:48 UTC on Jan. 5., the base CPU load decreased and has stayed low, as shown here:

Quick additional testing showed that:

API traffic response times are about 1/3 of what they were on the django models dependent code.
Database CPU utilization spikes were at 1/4 to 1/2 what it was previously immediately after a reboot, and stayed that way for >24 hours rather than steadily increasing.

This PR fixes:

This implements a threaded approach to the datastream view, which should increase performance by insert data in parallel.

Performance profiling showed that the Django models used in the view end point for 'api/data-stream' were acting as a bottle neck, and also did not allow for asynchronous support. This commit is a patch which replaces those models with direct SQL, which should be more performant and also supports multithreading.

My previous commit (7612930) replace django models with customized queries. These queries leverages the DO operation and some Postgres IF logic. While this executes fine in Postgres, it doesn't appear to function with SQLAlchemy. This commit replace those queries with more simple logic that is supported by SQLAlchemy.

aufdenkampe · 2022-01-06T16:29:10Z

Note that these hot fixes required us to set Gunicorn to only have 1 worker with 8 threads, to avoid the potential problems described in Connection Pooling — SQLAlchemy 1.4 Documentation.

This simple work-around leaves us we plenty of room to further enhance performance by properly configuring SQLAlchemy to use Multiprocessing.

ptomasula added 9 commits December 23, 2021 16:49

Revised DataStream API

25ca268

This implements a threaded approach to the datastream view, which should increase performance by insert data in parallel.

Separate out update method from threaded approach

107ef55

Add Profiler Method

f2c6763

Missing change of engine pool size

b4f6c40

Restore set sensor deployment date logic

64f2636

Add error handling to set_deployment_date method.

bd21170

Move timeseries_result_values error handling

17a0ce7

aufdenkampe added high priority tech-debt labels Jan 6, 2022

aufdenkampe added this to the v0.12.1 Hotfixes & Tweaks to AWS release milestone Jan 6, 2022

aufdenkampe merged commit b460063 into develop Jan 6, 2022

This was referenced Jan 6, 2022

MMW slow response time via browser #543

Closed

Tests over weekend 2021Dec17 a lot of timeouts on 5 and 7secs #542

Closed

Release 0.12.1 Hotfixes #548

Merged

neilh10 mentioned this pull request Feb 13, 2023

Not getting a 201 on POST and loosing data #641

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hot fixes to minimize and close idle database connections #547

Hot fixes to minimize and close idle database connections #547

aufdenkampe commented Jan 6, 2022 •

edited

Loading

aufdenkampe commented Jan 6, 2022

Hot fixes to minimize and close idle database connections #547

Hot fixes to minimize and close idle database connections #547

Conversation

aufdenkampe commented Jan 6, 2022 • edited Loading

aufdenkampe commented Jan 6, 2022

aufdenkampe commented Jan 6, 2022 •

edited

Loading