action #110677

closed

coordination #80142: [saga][epic] Scale out: Redundant/load-balancing deployments of openQA, easy containers, containers on kubernetes

coordination #92854: [epic] limit overload of openQA webUI by heavy requests

Investigation page shouldn't involve blocking long-running API routes size:M

Added by livdywan over 2 years ago. Updated over 2 years ago.

Status: Resolved
Priority: Normal
Assignee:
Category: Feature requests
Target version:
Start date: 2022-02-03
Due date:
% Done: 0%
Estimated time:
Description

Observation

The AJAX request can be very slow, e.g. https://openqa.opensuse.org/tests/2326528/investigation_ajax takes several minutes to respond.
See #106759#note-36

Acceptance criteria

  • AC1: /tests/*/investigation_ajax requests are no longer able to block the prefork worker process for more than 400 seconds
  • AC2: The user does not see any timeouts, even if loading of the "Investigation" tab takes a while

Suggestions

  • Move the investigation code into a Minion job and add a route for polling its result
    • Very easy to store results as job results, but performance regressions need to be considered
  • As a further optimization, the investigation results could be cached explicitly and/or for longer
    • Generate results even before the job finishes:
      • compute the git analysis once and save it to a file
      • save to an asset
      • possibly a bit wasteful (as we run a lot of jobs but don't need the investigation for every job)
    • Trigger caching of data upon first request:
      • return "no data yet" on the first attempt -> trigger a Minion job
      • check for an existing Minion job for the same job (e.g. user refreshing the page)
      • next step: save to an asset internally
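
The "trigger on first request, then poll" suggestion can be sketched as follows. This is only an illustration in plain Python, not openQA or Minion code: the thread pool stands in for the Minion job queue, and the names InvestigationCache, poll and wait_for_result are hypothetical. The point is that each request returns immediately, a refresh does not enqueue a second task for the same job, and the finished result is kept in memory for later requests.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor


class InvestigationCache:
    """Hypothetical stand-in for 'background job + polling route'."""

    def __init__(self, compute, max_workers=2):
        self._compute = compute  # slow function: job_id -> result
        self._pool = ThreadPoolExecutor(max_workers=max_workers)
        self._futures = {}       # job_id -> Future (pending or finished)
        self._lock = threading.Lock()

    def poll(self, job_id):
        """Return at once; start the work on the first call only."""
        with self._lock:
            future = self._futures.get(job_id)
            if future is None:
                # First request for this job: enqueue exactly one task.
                future = self._pool.submit(self._compute, job_id)
                self._futures[job_id] = future
        if not future.done():
            return {"status": "pending"}  # the "no data yet" reply
        error = future.exception()
        if error is not None:
            return {"status": "failed", "error": str(error)}
        return {"status": "done", "result": future.result()}

    def shutdown(self):
        self._pool.shutdown(wait=True)


def wait_for_result(cache, job_id, interval=0.05, timeout=5.0):
    """Client side: many short requests instead of one long one."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        reply = cache.poll(job_id)
        if reply["status"] != "pending":
            return reply
        time.sleep(interval)
    return {"status": "pending"}
```

In this sketch a failed computation is cached like a successful one, so a real implementation would likely want a retry or expiry policy, and would persist results (e.g. as a file or asset, as suggested above) rather than hold them in process memory.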

Related issues 3 (1 open, 2 closed)

  • Related to openQA Project (public) - action #97190: Limit size of initial requests everywhere, e.g. /, /tests, etc., over webUI and API (New, 2022-05-30)
  • Related to openQA Project (public) - action #128591: [openqa_logwarn] logwarn reports the same entry over and over size:M (Resolved, tinita, 2023-05-03)
  • Blocks openQA Project (public) - action #106759: Worker xyz has no heartbeat (400 seconds), restarting repeatedly reported on o3 size:M (Resolved, livdywan, 2022-02-03)