action #139145

Updated by okurz about 1 year ago

## Observation 

 Grafana alert triggered for "Failed systemd services".

 ``` 
 root@monitor:~ # systemctl status dehydrated 
 . dehydrated.service - Certificate Update Runner for Dehydrated 
      Loaded: loaded (/usr/lib/systemd/system/dehydrated.service; static) 
      Active: failed (Result: exit-code) since Mon 2023-11-06 00:21:21 CET; 9h ago 
 TriggeredBy: . dehydrated.timer 
    Main PID: 19901 (code=exited, status=1/FAILURE) 

 Nov 06 00:19:10 monitor systemd[1]: Starting Certificate Update Runner for Dehydrated... 
 Nov 06 00:19:10 monitor dehydrated[19901]: # INFO: Using main config file /etc/dehydrated/config 
 Nov 06 00:19:10 monitor dehydrated[19901]: # INFO: Using additional config file /etc/dehydrated/config.d/suse-ca.sh 
 Nov 06 00:19:10 monitor dehydrated[19901]: # INFO: Running /usr/bin/dehydrated as dehydrated/dehydrated 
 Nov 06 00:19:11 monitor sudo[19901]:       root : PWD=/ ; USER=dehydrated ; GROUP=dehydrated ; COMMAND=/usr/bin/dehydrated --cron 
 Nov 06 00:21:21 monitor dehydrated[19997]: EXPECTED value GOT EOF 
 Nov 06 00:21:21 monitor systemd[1]: dehydrated.service: Main process exited, code=exited, status=1/FAILURE 
 Nov 06 00:21:21 monitor systemd[1]: dehydrated.service: Failed with result 'exit-code'. 
 Nov 06 00:21:21 monitor systemd[1]: Failed to start Certificate Update Runner for Dehydrated. 
 ``` 

 As of 2023-11-13 the service is no longer in state "failed", so this was apparently a sporadic problem.

 ## Suggestions 
 * Research the error messages upstream
 * Try a manual retry. If it succeeds, consider changing the systemd service to restart automatically on failure
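
 The second suggestion could be sketched roughly as below. This is only an illustration and not something that was applied on the host: the helper name `write_restart_dropin`, the drop-in file name `restart.conf` and the `RestartSec=5min` delay are assumptions. It also assumes that, if dehydrated.service is a oneshot unit, the installed systemd is 244 or newer, since older versions reject `Restart=on-failure` for oneshot services.

```shell
# Hypothetical helper: write a drop-in that makes a unit retry after a failure.
# $1 = unit name, $2 = base directory (defaults to the system drop-in location).
write_restart_dropin() {
    unit="$1"
    base="${2:-/etc/systemd/system}"
    dir="$base/$unit.d"
    mkdir -p "$dir" || return 1
    # RestartSec value is an illustrative assumption, not taken from the ticket.
    cat > "$dir/restart.conf" <<EOF
[Service]
Restart=on-failure
RestartSec=5min
EOF
}

# Possible usage on the monitor host, as root:
#   systemctl start dehydrated.service        # manual retry first
#   write_restart_dropin dehydrated.service   # only if the retry succeeded
#   systemctl daemon-reload
```

 With a delay this long the default start rate limit would likely never trigger, so a persistent failure would retry indefinitely; whether that is acceptable, or whether the limit should be tightened, is left open here.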

 ## Rollback actions 
 * Remove silence for "failed systemd services"
