action #41867
closed
[devops][tools] Replace get-metrics script by telegraf
Added by szarate about 6 years ago.
Updated over 5 years ago.
Estimated time:
(Total: 0.00 h)
Description
Currently we have the get-metrics script that collects many of the stats on all workers.
Since we settled now with telegraf it is time to replace the get-metrics script and its components (systemd-timer, salt, script itself).
- Target version changed from Ready to Current Sprint
- Project changed from openQA Project (public) to openQA Infrastructure (public)
- Category deleted (
168)
- Subject changed from [devops] Phase out get-metrics script to [devops][functional][u] Phase out get-metrics script
- Target version changed from Current Sprint to Milestone 20
- Checklist item changed from to [x] Custom collectd plugin to get data from openQA worker instances (is it running?, can talk to webui? what was the last job that ran here)
- Checklist item changed from to [ ] Custom collectd plugin to get data from openQA worker instances (is it running?, can talk to webui? what was the last job that ran here)
- Assignee deleted (
szarate)
- Target version changed from Milestone 20 to future
- Checklist item changed from [ ] Update collectd to >= 5.5, we're using the cpu plugin, where data could be aggregated but is only available on 5.5 upwards, [ ] Create templates for the dashbads so that they are auto generated, [ ] Edit dashboards to pick data from collectd, [ ] Custom collectd plugin to get data from openQA worker instances (is it running?, can talk to webui? what was the last job that ran here), [ ] Retire the systemd timmer and the script from salt states to [x] Create templates for the dashbads so that they are auto generated, [ ] Edit dashboards to pick data from collectd, [ ] Custom collectd plugin to get data from openQA worker instances (is it running?, can talk to webui? what was the last job that ran here), [ ] Retire the systemd timmer and the script from salt states, [x] Edit dashboards to pick data from telegraf
- Subject changed from [devops][functional][u] Phase out get-metrics script to [devops][functional][u] Replace get-metrics script by telegraf
- Status changed from New to Workable
- Checklist item changed from [x] Create templates for the dashbads so that they are auto generated, [ ] Edit dashboards to pick data from collectd, [ ] Custom collectd plugin to get data from openQA worker instances (is it running?, can talk to webui? what was the last job that ran here), [ ] Retire the systemd timmer and the script from salt states, [x] Edit dashboards to pick data from telegraf to [x] Create templates for the dashbads so that they are auto generated, [x] Edit dashboards to pick data from telegraf, [ ] Retire the systemd timmer and the script from salt states, [ ] Find a way to monitor more openQA-worker stats (Is it running? Connected to a webui? Last job time/result, etc)
- Description updated (diff)
- Subject changed from [devops][functional][u] Replace get-metrics script by telegraf to [devops][tools] Replace get-metrics script by telegraf
- Status changed from Workable to Resolved
- Assignee set to nicksinger
I think this is already done, some time ago
Also available in: Atom
PDF