redash

mirror of https://github.com/getredash/redash.git synced 2025-12-19 17:37:19 -05:00

Author	SHA1	Message	Date
Andrii Chubatiuk	ec1c4d07de	Removed pseudojson class, converted all options and other json columns to jsonb ones (#6687 ) Co-authored-by: Andrew Chubatiuk <andrew.chubatiuk@motional.com>	2024-01-12 09:02:00 +10:00
Jun	9b2f635692	format code by black and isort (#6167 ) Signed-off-by: Ye Sijun <junnplus@gmail.com>	2023-07-11 19:13:54 +10:00
Jesse	67263e1b0f	Fixes issue #5445 : Scheduled query not working (#5448 ) * use 'query_id' everywhere instead of 'Query ID' * some black while we're at it Co-authored-by: Omer Lachish <omer@rauchy.net>	2021-04-08 13:32:34 -05:00
Omer Lachish	9fdf1f341d	Reset failure counter on adhoc success (#5394 ) * reset failure counter when query completes successfully via adhoc * Use "query_id" in metadata, but still allow "Query ID" for transition/legacy support	2021-03-12 12:02:29 -08:00
Omer Lachish	90024ebc92	Delete locks for cancelled queries (#5006 ) * delete locks for cancelled queries * test that query cancellations do not prevent reenqueues	2020-06-29 13:09:01 +03:00
Omer Lachish	a9cb87d4b3	refresh_queries shouldn't break because of a single query having a bad schedule object (#4163 ) * move filtering of invalid schedules to the query * simplify retrieved_at assignment and wrap in a try/except block to avoid one query blowing up the rest * refactor refresh_queries to use simpler functions with a single responsibility and add try/except blocks to avoid one query blowing up the rest * avoid blowing up when job locks point to expired Job objects. Enqueue them again instead * there's no need to check for the existence of interval - all schedules have intervals * disable faulty schedules * reduce FP style in refresh_queries * report refresh_queries errors to Sentry (if it is configured) * avoid using exists+fetch and use exceptions instead	2020-03-01 11:02:46 +02:00
Omer Lachish	329e85987c	Execute Queries in RQ (#4413 ) * enforce hard limits on non-responsive work horses by workers * move differences from Worker to helper methods to help make the specialization clearer * move HardLimitingWorker to redash/tasks * move schedule.py to /tasks * explain the motivation for HardLimitingWorker * pleasing CodeClimate * pleasing CodeClimate * port query execution to RQ * get rid of argsrepr * avoid star imports * allow queries to be cancelled in RQ * return QueryExecutionErrors as job results * fix TestTaskEnqueue and QueryExecutorTests * remove Celery monitoring * get rid of QueryTask and use RQ jobs directly (with a job serializer) * Revert "remove Celery monitoring" This reverts commit `37a74ea403`. * reduce occurences of the word 'task' * use Worker, Queue and Job instead of spreading names that share behavior details * remove locks for failed jobs as well * did I not commit that colon? oh my * push the redis connection to RQ's stack on every request to avoid verbose connection setting * use a connection context for tests * black it up * run RQ on all queues when running in Cypress	2019-12-30 14:11:01 +02:00
Arik Fraimovich	0aa176e2e5	Don't update query's updated_at when updating schedule_failures counter (#4488 )	2019-12-25 16:25:16 +02:00
Arik Fraimovich	2dff8b9a00	Black support for the Python codebase (#4297 ) * Apply black formatting * Add auto formatting when committing to master * Update CONTRIBUTING.md re. Black & Prettier	2019-12-11 13:54:29 +02:00
Omer Lachish	5a5fdecdde	Replace Celery with RQ (except for execute_query tasks) (#4093 ) * add rq and an rq_worker service * add rq_scheduler and an rq_scheduler service * move beat schedule to periodic_jobs queue * move version checks to RQ * move query result cleanup to RQ * use timedelta and DRY up a bit * move custom tasks to RQ * do actual schema refreshes in rq * rename 'period_jobs' to 'periodic', as it obviously holds jobs * move send_email to rq * DRY up enqueues * ditch and use a partially applied decorator * move subscribe to rq * move check_alerts_for_query to rq * move record_event to rq * make tests play nicely with rq * 👋 beat * rename rq_scheduler to plain scheduler, now that there's no Celery scheduler entrypoint * add some color to rq-worker's output * add logging context to rq jobs (while keeping execute_query context via get_task_logger for now) * move schedule to its own module * cancel previously scheduled periodic jobs. not sure this is a good idea. * rename redash.scheduler to redash.schedule * allow custom dynamic jobs to be added decleratively * add basic monitoring to rq queues * add worker monitoring * pleasing the CodeClimate overlords * adjust cypress docker-compose.yml to include rq changes * DRY up Cypress docker-compose * add rq dependencies to cypress docker-compose service * an odd attempt at watching docker-compose logs when running with Cypress * Revert "an odd attempt at watching docker-compose logs when running with Cypress" This reverts commit `016bd1a93e`. * show docker-compose logs at Cypress shutdown * Revert "DRY up Cypress docker-compose" This reverts commit `43abac7084`. * minimal version for binding is 3.2 * remove unneccesary code reloads on cypress * add a command which errors if any of the workers running inside the current machine haven't been active in the last minute * SCHEMAS_REFRESH_QUEUE is no longer a required setting * split tasks/queries.py to execution.py and maintenance.py * fix tests after query execution split * pleasing the CodeClimate overlords * rename worker to celery_worker and rq_worker to worker * use /rq_status instead of /jobs * show started jobs' time ago according to UTC * replace all spaces in column names * fix query tests after execution split * exit with an int * general lint * add an entrypoint for rq_healthcheck * fix indentation * delete all existing periodic jobs before scheduling them * remove some unrequired requires * move schedule example to redash.schedule * add RQ integration to Sentry's setup * pleasing the CodeClimate overlords * remove replication settings from docker-compose - a proper way to scale using docker-compose would be the --scale CLI option, which will be described in the knowledge based * revert to calling a function in dynamic settings to allow periodic jobs to be scheduled after app has been loaded * don't need to depend on context when templating failure reports * set the timeout_ttl to double the interval to avoid job results from expiring and having periodic jobs not reschedule * whoops, bad merge * describe custom jobs and don't actually schedule them * fix merge	2019-10-15 23:59:22 +03:00
Arik Fraimovich	204447a9f5	Add interface to abstract query result persistence (#4147 ) * Add interface to implement custom persistence for QueryResult data Co-authored-by: Omer Lachish <omer@rauchy.net> * Deserialize query results data in the model * Change order of mixins. * Make DBPersistence.data setter in sycn with getter + tests	2019-10-10 10:39:55 +03:00
Omer Lachish	7fb33e3ebb	Failed Scheduled Queries Report (#3798 ) * initial work on e-mail report for failed queries * send failure report only for scheduled queries and not for adhoc queries * add setting to determine if to send failure reports * add setting to determine interval of aggregated e-mail report * html templating of scheduled query failure report * break line * support timeouts for failure reports * aggregate errors within message and warn if approaching threshold * handle errors in QueryExecutor.run instead of on_failure * move failure report to its own module * indicate that failure count is since last report * copy changes * format with <code> * styling, copy and add a link to the query instead of the query text * separate reports with <hr> * switch to UTC * move <h2> to actual e-mail subject * add explicit message for SoftTimeLimitExceeded * fix test to use soft time limits * default query failure threshold to 100 * use base_url from utils * newlines. newlines everywhere. * remove redundant import * apply new design for failure report * use jinja to format the failure report * don't show comment block if no comment is provided * don't send emails if, for some reason, there are no available errors * subtract 1 from failure count, because the first one is represented by 'Last failed' * don't show '+X more failures' if there's only one * extract subject to variable * format as text, while we're at it * allow scrolling for long exception messages * test that e-mails are scheduled only when beneath limit * test for indicating when approaching report limits + refactor * test that failures are aggregated * test that report counts per query and reason * test that the latest failure occurence is reported * force sending reports for testing purposes * Update redash/templates/emails/failures.html Co-Authored-By: Ran Byron <ranbena@gmail.com> * Update redash/templates/emails/failures.html Co-Authored-By: Ran Byron <ranbena@gmail.com> * Update redash/tasks/failure_report.py * add org setting for email reports * remove logo from failure report email * correctly use the organization setting for sending failure reports * use user id as key for failure reports data structure * Update redash/tasks/failure_report.py Co-Authored-By: Arik Fraimovich <arik@arikfr.com> * build comments while creating context for e-mail templates * figure out the base url when creating the e-mail * no need to expire pending failure report keys as they are deleted anyway when sent * a couple of CodeClimate changes * refactor key creationg to a single location * refactor tests to send e-mail from a single function * use beat to schedule a periodic send_aggregated_errors task instead of using countdown per email * remove pending key as it is no longer required when a periodic task picks up the reports to send * a really important blank line. REALLY important. * Revert "a really important blank line. REALLY important." This reverts commit `c7d8ed8972`. * a really important blank line. REALLY important. It is the best blank line. * don't send failure emails to disabled users	2019-07-28 12:40:54 +03:00
Omer Lachish	5b30d081d7	Dynamic query time limits (#3702 ) * extract time limit decisions to a dynamic settings function * introduce environment variable for scheduled query time limits * pass in org_id to query_time_limit * add an interaction test that verifies that time limits are applied to jobs * really important newlines according to CodeClimate	2019-04-15 12:06:37 +03:00
Omer Lachish	dd477d49ec	Sharing embeds with safe parameters (#3495 ) * change has_access and require_access signatures to work with the objects that require access, instead of their groups * change has_access and require_access signatures to work with the objects that require access, instead of their groups * use the textless endpoint (/api/queries/:id/results) for pristine queriest * Revert "use the textless endpoint (/api/queries/:id/results) for pristine" This reverts commit `cd2cee7738`. * go to textless /api/queries/:id/results by default * change `run_query`'s signature to accept a ParameterizedQuery instead of constructing it inside * raise HTTP 400 when receiving invalid parameter values. Fixes #3394 * support querystring params * extract coercing of numbers to function, along with a friendlier implementation * wire embeds to textless endpoint * allow users with view_only permissions to execute queries on the textless endpoint, as it only allows safe queries to run * enqueue jobs for ApiUsers * add parameters component for embeds * include existing parameters in embed code * fetch correct values for json requests * remove previous embed parameter code * rename `id` to `user_id` * support executing queries using Query api_keys by instantiating an ApiUser that would be able to execute the specific query * bring back ALLOW_PARAMETERS_IN_EMBEDS (with link on deprecation coming up) * show deprecation messages for ALLOW_PARAMETERS_IN_EMBEDS. Also, move other message (email not verified) to use the same mechanism * add link to forum message on setting deprecation * rephrase deprecation message * add link to forum message regarding embed deprecation * change API to /api/queries/:id/dropdowns/:dropdown_id * split to 2 different dropdown endpoints and implement the second * add test cases for /api/queries/:id/dropdowns/:id * use new /dropdowns endpoint in frontend * first e2e test for sharing embeds * Pleasing the CodeClimate overlords * All glory to CodeClimate * change has_access and require_access signatures to work with the objects that require access, instead of their groups * split has_access between normal users and ApiKey users * remove residues from bad rebase * allow access to safe queries via api keys * rename `object` to `obj` * support both objects and group dicts in `has_access` and `require_access` * simplify permission tests once `has_access` accepts groups * change has_access and require_access signatures to work with the objects that require access, instead of their groups * rename `object` to `obj` * support both objects and group dicts in `has_access` and `require_access` * simplify permission tests once `has_access` accepts groups * fix bad rebase * send embed parameters through POST data * no need to log `is_api_key` * move query fetching by api_key to within the Query model * fetch user by adding a get_by_id function on the User model * pass parameters as POST data (fixes test failure introduced by switching from query string parameters to POST data) * test the right thing - queries with safe parameters should be embeddable * introduce cy.clickThrough * add another Cypress test to make sure unsafe queries cannot be embedded * serialize Parameters into query string * set is_api_key as the last parameter to (hopefully) avoid backward-dependency problems * Update redash/models/parameterized_query.py Co-Authored-By: rauchy <omer@rauchy.net> * attempt to fix empty percy snapshots * snap percies after DOM is fully loaded	2019-04-02 11:45:38 +03:00
Jannis Leidel	8456bbf762	Revert "Schema Viewer Drawer (#3291 )" (#3585 ) This reverts commit `cb4d81d6ad`.	2019-03-14 10:51:30 +02:00
Marina Samuel	cb4d81d6ad	Schema Viewer Drawer (#3291 ) * Process extra column metadata for a few sql-based data sources. * Add Table and Column metadata tables. * Periodically update table and column schema tables in a celery task. * Fetching schema returns data from table and column metadata tables. * Add tests for backend changes. * Front-end shows extra table metadata and uses new schema response. * Delete datasource schema data when deleting a data source. * Process and store data source schema when a data source is first created or after a migration. * Tables should have a unique name per datasource. * Addressing review comments. * Update migration file for mixins. * Appease PEP8 * Upgrade migration file for rebase. * Cascade delete. * Adding org_id * Remove redundant column and table prefixes. * Non-existing tables and columns should be filtered out on the server side not client side. * Fetching table samples should be optional and should happen in a separate task per table. * Allow users to force a schema refresh. * Use updated_at to help prune old schema metadata periodically. * Using settings.SCHEMAS_REFRESH_QUEUE	2019-03-13 18:08:00 +01:00
Arik Fraimovich	26f0ce0749	New Celery/Queries Execution Status API (#3057 ) * Remove QueryTaskTracker * Remove scheudling of cleanup_tasks * Add Celery introspection tools * First iteration of updating the admin API. * Show more details * Add option to skip building npm in Dockerfile * Show started_at * update the refresh schedule, as it's too fast * Update Celery monitor to report on all active tasks. * Update task parsing for new format * WIP: improved celery status screen * Fix property name. * Update counters * Update tab name * Update counters names * Move component to its own file and fix lint issues * Add migratin to remove Redis keys * Improve columns layout * Remove skip_npm_build arg as it's not used anymore. * Convert query from SQL to Python * Simplify column definition. * Show alert on error.	2019-03-10 11:19:31 +02:00
Jannis Leidel	44dff83046	Add "Active at" column to user list. (#3026 ) * add last_active_at to users page * Use our JSON encoder as the SQLAlchemy JSON serializer. * Fixed some inconsistencies in the user query class methods. * Minor cosmetic fixes. * Add some make tasks for easier development. * Add user detail sync system based on Redis backend. There is a periodic Celery task that updates a new “details” JSONB column in the “user” table with the data from Redis. Currently this is only used for tracking the date of last activity of a user but can be extended with other user information later. Updates a few dependencies. * Normalize a few Flask extension API names. * Reduce implementation complexity of JSONEncoder. * Use request_started signal to make sure we have a request context. Otherwise loading the user based on the request won’t work. * Fix test that checks if disabled users can login. This correctly uses a URL path that includes the current organization and checks for the error message. The previous test seems to have been a red herring. * Minor cosmetic fixes. * Remove needs_sync in favor of just deleting things. * Misc review fixes. * Ignore line length. * Split redash.models import several modules. * Move walrus UTC DateTimeField into redash.models.types. * Restore distinctly loading dashboards. * Simplify default values for user details. * Define __repr__ methods generically. * Consistently have underscore methods at the top of model methods. * Fix tests. * Split redash.models import several modules. * Update to latest walrus and redis-py. * Update kombu to 4.2.2 for redis-py 3.x compatibility. * Remove redis-cli container after running Make task. * Move buffer condition after datetime/time conditions. * Update walrus to 0.7.1. * Refactor some query APIs. This uses the flask-sqlalchemy helpers consistently and makes more use of mixins. * Post rebase fixes. * Use correct kombu version * Fix migration down revision	2019-01-07 10:30:42 +02:00
Marina Samuel	cdd2259d08	Closes #2396 : Add finer-grained scheduling. (#2426 ) * Closes #187: Add finer-grained scheduling - backend. * Closes #2396 - Add finer-grained scheduling - frontend. * Fix linting issues * Rename ScheduleDialgo to .jsx	2019-01-06 10:59:50 +02:00
Alison	341a68c7d4	Propagate query execution errors from Celery tasks properly (#2713 ) Refs https://github.com/mozilla/redash/issues/458	2018-08-23 20:33:43 +02:00
Arik Fraimovich	c054731794	Change: close metadata database connection early in the execute query Celery task. This to prevent the task holding an idle connection for a long period of time, while waiting for the query to finish.	2018-03-08 11:06:15 +02:00
Arik Fraimovich	d8a0885953	Fix: tests were using old method signature	2017-03-06 21:22:29 +02:00
Arik Fraimovich	83e6b6f50c	Tests use the same session as the tested code, and we can't use the same objects after the tested code calls commit() without disabling expire on commit. It seems like a safe thing in our case.	2017-03-06 13:49:29 +02:00
Allen Short	2407b115e4	Exponential backoff for failing queries	2017-02-22 10:29:08 -06:00
Arik Fraimovich	4459c464ca	use Query#query_text instead of Query#query	2016-12-07 02:13:20 -06:00
Arik Fraimovich	7159f0beb0	Remove potnetially concurrency not safe code form enqueue_query. This might have been causing the behavior described in #1097.	2016-06-09 16:53:29 +03:00
Arik Fraimovich	b67f412f58	Add test for enqueue_query	2016-06-08 20:00:59 +03:00
Arik Fraimovich	b2e2277d0b	Fix: old task trackers were not really removed	2016-05-19 09:58:30 +03:00
Arik Fraimovich	d38ab20c45	Feature: running queries (tasks) monitor - Refactored tasks module into a package. - Add new admins screens (running queries & outdated queries).	2016-04-18 13:46:31 +03:00

29 Commits