0ct0pu5/healthchecks

Author	SHA1	Message	Date
Pēteris Caune	28af3720f4	Increase outgoing webhook timeout from 10 to 30 seconds Also simplify the retry logic: each retry attempt is now allowed to use the full 30 seconds. This means, a single webhook delivery can take up to 3*30=90 seconds.	2024-09-11 12:37:40 +03:00
Pēteris Caune	13217af304	Add --pool parameter in `manage.py sendalerts` If sendalerts receives this parameter, it reconfigures settings.DATABASES to enable db connection pooling (using psycopg_pool with default parameters). This lets us use many concurrent worker threads but not run out of database connections. For example, with `--num-workers 100 --pool`, up to 100 worker threads can run concurrently, but only 3 threads can get a database connection from the pool, the rest have to wait. When a worker thread gives up a connection (by calling `close_old_connections`), another thread can continue. A worker thread can give up a db connection before it is fully finished if it anticipates a long network IO operation ahead. The Webhook transport does this before making a curl call. psycopg_pool's default pool size is 4 connections. One connection is used up by the main thread, so 3 connections are available for the worker threads.	2024-09-10 14:58:24 +03:00
Pēteris Caune	8eecece0bb	Add db migration for the updated msteams name	2024-09-10 14:45:48 +03:00
Pēteris Caune	fd0c428e29	Update sqlite settings to avoid "Database is locked" errors Fixes: #1057 "PRAGMA busy_timeout" configures the database to wait when a database is locked instead of giving up immediately. "transaction_mode IMMEDIATE" starts transactions in read/write mode, required to make busy_timeout work. Reference: https://gcollazo.com/optimal-sqlite-settings-for-django/	2024-09-09 10:11:22 +03:00
Pēteris Caune	6bf588d984	Remove unused import	2024-09-04 10:49:09 +03:00
Pēteris Caune	5a19f9658a	Update changelog for v3.6 release	2024-09-04 10:18:56 +03:00
Pēteris Caune	4097cdee61	Bump Django to 5.1.1	2024-09-04 09:27:51 +03:00
Pēteris Caune	a72f3adc45	Update requirements to require only pure-python psycopg ... and install psycopg-c using instuctions in Dockerfile. This way, getting a development environment or CI environment ready is quick and easy, but Docker images still get the C optimizations.	2024-09-03 16:10:53 +03:00
Pēteris Caune	fea767723c	Upgrade to psycopg 3	2024-09-03 11:30:43 +03:00
Pēteris Caune	9d4fc031aa	Fix sendalerts to check the self.shutdown flag more often	2024-09-03 10:30:18 +03:00
Pēteris Caune	3275e0ffaa	Update notify() to return logs instead of printing them	2024-09-03 10:23:15 +03:00
Pēteris Caune	8c56ca6dde	Update sendalerts to mark flip as processed on thread Previously this was done in process_one_flip (so on the main thread). The advantage of doing this way is the flip gets marked as processed only when the thread has started and has acquired a db connection. There is now a smaller pause between a sendalerts process claiming a flip, and actually starting work on it.	2024-09-01 15:28:48 +03:00
Pēteris Caune	fd75049e0c	Fix type warnings	2024-08-31 19:23:10 +03:00
Pēteris Caune	a463daa775	Update Webhook transport to close db connection before network IO Webhook requests can take 20+ seconds. During that time we hold on to a database connection. With this commit, the Webhook transport closes its DB connection before making a curl call. With psycopg2 this does not have much effect. But with psycopg 3 & connection pooling we will be able to use more sendalerts workers than we have database connections. While one worker is busy making a slow curl call, another worker can grab its freed up connection and do some work. Django's test runner is not happy with connections closed mid-test, so I patched out close_old_connections() in affected tests.	2024-08-31 19:18:17 +03:00
Pēteris Caune	9803d77a1d	Set explicit max_workers value for ThreadPoolExecutor This is a tricky one: the default value for max_workers is None. But it doesn't mean "unlimited", in Python 3.8+ it means "min(32, os.cpu_count() + 4)" For example on 8-core CPU the effective value would be 8 + 4 = 12, and passing anything above 12 to `--max-workers` would have no effect.	2024-08-31 19:11:39 +03:00
Pēteris Caune	4cd677536d	Remove sent notification counter The counter was slightly wrong (it counted lost races as sent notifications). Rather than complicating code to make it correct, let's rather just remove it :-)	2024-08-31 19:07:25 +03:00
Pēteris Caune	faa1a2c99f	Add logging for exceptions thrown inside notify()	2024-08-31 19:04:41 +03:00
Pēteris Caune	7641f2a9a1	Switch to using close_old_connections() instead of connection.close()	2024-08-31 19:02:11 +03:00
Pēteris Caune	d76dc53e49	Increase Signal send timeout to 60 seconds	2024-08-31 11:07:17 +03:00
Pēteris Caune	b1b0a57033	Tweak sendalerts log format	2024-08-30 17:00:30 +03:00
Pēteris Caune	8a3a9b2a7e	Fix code comments	2024-08-29 16:30:28 +03:00
Pēteris Caune	029881f3b9	Refactor sendalerts * Remove the --no-loop and --no-threads arguments * Use a threadpool to do multiple sends concurrently * Add a new `--num-workers` argument. It limits how many flips we grab from the database and process concurrently. * Do not prioritize flips with historically low send times any more (not as important now with concurrent sending, and simpler this way) * Workers close db connections when they finish (to keep the number of idle connections low) Note: concurrent.futures.ThreadPoolExecutor internally has an unbounded queue, it will accept any amount of jobs and keep them queued. We don't want that. We only want to grab a flip, and commit to processing it, if we know there's a free worker for it. Therefore we're tracking the number of jobs in flight using a semaphore (`self.seats`).	2024-08-29 16:20:36 +03:00
Pēteris Caune	3968a4f9e0	Update MS Teams Connector EOL date	2024-08-27 16:34:59 +03:00
Pēteris Caune	320a7c7733	Fix the Docker healthcheck script to supply correct Host header Commit `8fed685f12` added a HEALTHCHECK instruction in the Dockerfile. The healthcheck script calls http://localhost:8000/api/v3/status/, which fails if localhost is not in ALLOWED_HOSTS. With this change, the healthcheck script is now a Django management command. It reads Django's ALLOWED_HOSTS setting, grabs the first element, and uses it in the "Host:" HTTP header when making a HTTP request. cc: #1051	2024-08-21 15:52:19 +03:00
Pēteris Caune	027fcc1097	Simplify and eliminate assert	2024-08-20 14:39:11 +03:00
Pēteris Caune	0a4f038987	Simplify and eliminate assert	2024-08-20 14:13:58 +03:00
Pēteris Caune	b27ffe07a6	Update email_form to use more precise type annotation	2024-08-20 13:58:52 +03:00
Pēteris Caune	6d15c45b21	Update CHANGELOG for v3.5.1 release	2024-08-20 13:46:09 +03:00
Pēteris Caune	6f11b9c0dd	Remove unneeded bits	2024-08-20 13:27:28 +03:00
Pēteris Caune	79b9aae660	Update Dockerfile to install recent rustc (needed to build cryptography) * Healthchecks depends on python library "fido2" * fido2 depends on python library "cryptography" * building cryptography requires recent (1.65+) rustc * cryptography has prebuilt binary wheels for most architectures but not for arm/v7 * Dockerfile uses bookworm as base, which ships rustc 1.63 * So we now install rust using rustup This is all terrible.	2024-08-20 13:11:29 +03:00
Pēteris Caune	ca75c7e984	Update CHANGELOG for v3.5 release	2024-08-20 11:20:34 +03:00
Pēteris Caune	001ba8b69b	Fix type warnings	2024-08-20 11:06:55 +03:00
Pēteris Caune	5e051bfc30	Fix AJAX views to better handle user logging out Rather than redirecting to login page, return HTTP 403 Forbidden	2024-08-20 10:57:36 +03:00
Pēteris Caune	15e1a988c8	Upgrade docker-compose.yml to use postgres 16, add upgrade instructions	2024-08-19 11:00:37 +03:00
Pēteris Caune	8fed685f12	Update Dockerfile to report container health in `docker ps` This commit adds a HEALTHCHECK instruction in Dockerfile. The HEALTHCHECK instruction calls /docker/fetchstatus.sh which in turn makes a HTTP request to http://localhost:8000/api/v3/status/ This endpoint makes a test database query and returns non-200 response if the query fails. So, in short, if the Healthchecks container for any reason is unable to query database, `docker ps` will now show the container as "unhealthy". cc: #1045	2024-08-19 10:17:05 +03:00
Pēteris Caune	70b55a777b	Add migration which updates Channel.kind values This is to go with `8054191be3`, and should have been in there :-) cc: #1050	2024-08-17 12:12:47 +03:00
Pēteris Caune	d3ae4e7fac	Add support for $SLUG placeholder in webhook payloads Fixes: #1049	2024-08-16 13:24:12 +03:00
Pēteris Caune	cda744d0c1	Implement search by slug in the checks list cc: #1048	2024-08-15 14:17:28 +03:00
Pēteris Caune	56bac98816	Update the "Set Password" page to reject very weak passwords	2024-08-15 12:04:28 +03:00
Pēteris Caune	5d63057e78	Improve password quality meter for very weak passwords Previously, if the user enters a weak password like "qwerty", the score is 0, the password strength bar is empty (all gray). It is easy to not notice the password strength bar at all. Now, the lowest score for a non-empty password is 1, meaning the user will see one red bar. This will hopefully draw more attention to the password strength bar. Users are still allowed to choose weak passwords.	2024-08-15 11:10:14 +03:00
Pēteris Caune	81515e3ed2	Fix selectize optgroup separator in dark mode	2024-08-13 14:54:08 +03:00
Pēteris Caune	3fbba0c2f0	Update timezone dropdowns to show frequently used timezones at the top	2024-08-13 13:57:52 +03:00
Pēteris Caune	b859a71920	Rename "sign in" to "log in" I like "sign in" better, but users from time to time confuse "sign in" and "sign up" forms. To reduce confusion potential, I'm renaming "sign in" to "log in".	2024-08-12 15:09:58 +03:00
Pēteris Caune	56862a1c49	Update NotificationsAdmin to use __ lookup in list_display	2024-08-07 17:39:17 +03:00
Pēteris Caune	f7876f67d7	Remove unused code	2024-08-07 17:38:43 +03:00
Pēteris Caune	bd5582872a	Upgrade to Django 5.1	2024-08-07 17:24:27 +03:00
Pēteris Caune	a3bc9f3b37	Upgrade to Django 5.0.8	2024-08-07 17:20:17 +03:00
Joel Pérez	28168a5651	Fix django version in self hosted documentation (#1034 ) * Update self_hosted.md * Update self_hosted.html-fragment	2024-07-30 19:24:31 +03:00
Pēteris Caune	aa2bd8cf66	Fix a testcase not correctly using sample values	2024-07-29 10:36:29 +03:00
Pēteris Caune	26ed70eccd	Bump package versions	2024-07-29 10:31:06 +03:00

1 2 3 4 5 ...

3197 commits