Project

General

Profile

action #128417

Updated by nicksinger about 1 year ago

## Observation 

 On 2023-04-28 16:30 the partition usage of w5-xen skyrocketed to >90% (https://stats.openqa-monitor.qa.suse.de/d/GDopenqaw5-xen/dashboard-for-openqaw5-xen?orgId=1&viewPanel=65090&from=1682657429086&to=1682699823248) and quickly after a alert was fired. Someone or something cleaned up a short time after to a reasonable 40% usage. This raises the question if we need to adjust our timings for that alert. 

 ## Suggestions 
 * Check with e.g. @okurz if this was maybe a one-time thing because somebody moved around stuff manually 
 * Manual cleanup of files in /var/lib/libvirt/images, ask in #eng-testing what the stuff is needed for 
 * Plug in more SSDs. Likely we have some spare in FC Basement shelves 
 * Check virsh XMLs to crosscheck openQA jobs before deleting anything for good 
 * Adjust the alert to allow longer periods over the threshold 

Back