Project

General

Profile

Wiki » History » Version 83

okurz, 2020-11-21 18:38
SUSE QE Tools: Add high-level and top-level view of our backlog, i.e. epics and higher and saga and higher

1 27 okurz
{{toc}}
2
3
# Test results overview
4 18 okurz
* Latest report based on openQA test results http://s.qa.suse.de/test-status , SLE12: http://s.qa.suse.de/test-status-sle12 , SLE15: http://s.qa.suse.de/test-status-sle15
5 36 okurz
* only "blocker" or "shipstopper" bugs on "interesting products" for SLE: http://s.qa.suse.de/qa_sle_bugs_sle , SLE15: http://s.qa.suse.de/qa_sle_bugs_sle15_all, SLE12: http://s.qa/qa_sle_bugs_sle12_2
6 1 mgriessmeier
7 64 okurz
# QE tools - Team description
8 1 mgriessmeier
9 27 okurz
## Team responsibilities
10 1 mgriessmeier
11 27 okurz
* Develop and maintain upstream openQA
12
* Administration of openqa.suse.de and workers (But not physical hardware, as these belong to the departments that purchased them and we merely facilitate)
13
* Helps administrating and maintaining openqa.opensuse.org, including coordination of efforts aiming at solving problems affecting o3
14 81 okurz
* Develop and maintain internal maintenance QA tools (SMELT, template generator, MTUI, openQA QAM bot, etc, e.g. from https://confluence.suse.com/display/maintenanceqa/QAM+Toolchain)
15 27 okurz
* Support colleagues, team members and open source community
16 1 mgriessmeier
17 27 okurz
## Out of scope
18
19
* Maintenance of individual tests
20
* Maintenance of physical hardware
21
* Maintenance of special worker addendums needed for tests, e.g. external hypervisor hosts for s390x, powerVM
22
* Ticket triaging of http://progress.opensuse.org/projects/openqatests/
23
* Feature development within the backend for single teams (commonly provided by teams themselves)
24
25
## How we work
26
27 64 okurz
The QE Tools team is following the DevOps approach working using a lightweight Agile approach. We plan and track our works using tickets on https://progress.opensuse.org . We pick tickets based on priority and planning decisions. We use weekly meetings as checkpoints for progress and also track cycle and lead times to crosscheck progress against expectations.
28 27 okurz
29 83 okurz
* [tools team - backlog](https://progress.opensuse.org/issues?query_id=230): The complete backlog of the team
30
* [tools team - backlog, high-level view](https://progress.opensuse.org/issues?query_id=526): A high-level view of the backlog, all epics and higher
31
* [tools team - backlog, high-level view](https://progress.opensuse.org/issues?query_id=524): A top-level view of the backlog, only sagas and higher
32 67 okurz
* [tools team - what members of the team are working on](https://progress.opensuse.org/issues?query_id=400): To check progress and know what the team is currently occupied with
33 1 mgriessmeier
34 67 okurz
Be aware: Custom queries in the right-hand sidebar of individual projects, e.g. https://progress.opensuse.org/projects/openqav3/issues , show queries with the same name but are limited to the scope of the specific projects so can show only a subset of all relevant tickets.
35 1 mgriessmeier
36 32 okurz
### Common tasks for team members
37
38
This is a list of common tasks that we follow, e.g. reviewing daily based on individual steps in the DevOps Process ![DevOps Process](devops-process_25p.png)
39
40
* **Plan**:
41
 * State daily learning and planned tasks in internal chat room
42
 * Review backlog for time-critical, triage new tickets, pick tickets from backlog; see https://progress.opensuse.org/projects/qa/wiki#How-we-work-on-our-backlog
43
* **Code**:
44 1 mgriessmeier
 * See project specific contribution instructions
45 72 okurz
 * Provide peer-review following https://github.com/notifications based on projects within the scope of https://github.com/os-autoinst/ with the exception of test code repositories, especially https://github.com/os-autoinst/openQA, https://github.com/os-autoinst/os-autoinst, https://github.com/os-autoinst/scripts, https://github.com/os-autoinst/os-autoinst-distri-openQA, https://github.com/os-autoinst/openqa-trigger-from-obs, https://github.com/os-autoinst/openqa_review as well as other projects like https://gitlab.suse.de/qa-maintenance/openQABot/
46 32 okurz
* **Build**:
47
 * See project specific contribution instructions
48
* **Test**:
49
 * Monitor failures on https://travis-ci.org/ relying on https://build.opensuse.org/package/show/devel:openQA/os-autoinst_dev for os-autoinst (email notifications)
50
 * Monitor failures on https://app.circleci.com/pipelines/github/os-autoinst/openQA?branch=master relying on https://build.opensuse.org/project/show/devel:openQA:ci for openQA (email notifications)
51
* **Release**:
52
 * By default we use the rolling-release model for all projects unless specified otherwise
53
 * Monitor https://build.opensuse.org/project/show/devel:openQA (all packages and all subprojects) for failures, ensure packages are published on http://download.opensuse.org/repositories/devel:/openQA/
54
 * Monitor http://jenkins.qa.suse.de/view/openQA-in-openQA/ for the openQA-in-openQA Tests and automatic submissions of os-autoinst and openQA to openSUSE:Factory through https://build.opensuse.org/project/show/devel:openQA:tested
55
* **Deploy**:
56
 * o3 is automatically deployed (daily), see https://progress.opensuse.org/projects/openqav3/wiki/Wiki#Automatic-update-of-o3
57
 * osd is automatically deployed (weekly), monitor https://gitlab.suse.de/openqa/osd-deployment/pipelines and watch for notification email to openqa@suse.de
58
* **Operate**:
59
 * Apply infrastructure changes from https://gitlab.suse.de/openqa/salt-states-openqa (osd) or manually over sshd (o3)
60 37 okurz
 * Monitor for backup, see https://gitlab.suse.de/qa-sle/backup-server-salt
61 32 okurz
config changes in salt (osd), backups, job group configuration changes
62 61 okurz
 * Ensure old unused/non-matching needles are cleaned up (osd+o3), see #73387
63 32 okurz
* **Monitor**:
64 48 okurz
 * React on alerts from https://stats.openqa-monitor.qa.suse.de/alerting/list?state=not_ok (emails on osd-admins@suse.de . You need to be logged in to reach the alert list by the provided URL)
65 32 okurz
 * Look for incomplete jobs or scheduled not being worked on o3 and osd (API or webUI)
66 44 okurz
 * React on alerts from https://gitlab.suse.de/openqa/auto-review/, https://gitlab.suse.de/openqa/openqa-review/, https://gitlab.suse.de/openqa/monitor-o3 (subscribe to projects for notifications)
67 49 okurz
 * Be responsive on #opensuse-factory (irc://chat.freenode.net/opensuse-factory) for help, support and collaboration (Unless you have a better solution it is suggested to use [Element.io](https://app.element.io/#/room/%23freenode_%23opensuse-factory:matrix.org) for a sustainable presence; you also need a [registered IRC account](https://freenode.net/kb/answer/registration))
68 1 mgriessmeier
 * Be responsive on [#testing](https://chat.suse.de/channel/testing) for help, support and collaboration
69 50 okurz
 * Be responsive on mailing lists opensuse-factory@opensuse.org and openqa@suse.de (see https://en.opensuse.org/openSUSE:Mailing_lists_subscription)
70 31 okurz
71 27 okurz
### How we work on our backlog
72 1 mgriessmeier
73 27 okurz
* "due dates" are only used as exception or reminders
74
* every team member can pick up tickets themselves
75
* everybody can set priority, PO can help to resolve conflicts
76 67 okurz
* consider the [ready, not assigned/blocked/low](https://progress.opensuse.org/issues?query_id=490) query as preferred
77 60 livdywan
* ask questions in tickets, even potentially "stupid" questions, oftentimes descriptions are unclear and should be improved
78 62 okurz
* There are "low-level infrastructure tasks" only conducted by some team members, the "DevOps" aspect does not include that but focusses on the joint development and operation of our main products
79 74 okurz
* Consider tickets with the subject keyword or tag "learning" as good learning opportunities for people new to a certain area. Experts in the specific area should prefer helping others but not work on the ticket
80 75 okurz
* For tickets which are out of the scope of the team remove from backlog, delegate to corresponding teams or persons but be nice and supportive, e.g. [SUSE-IT](https://sd.suse.com/), [EngInfra](https://infra.nue.suse.com/), [test maintainer](https://progress.opensuse.org/projects/openqatests/), QE-LSG PrjMgr/mgmt
81 78 okurz
* Refactoring and general improvements are conduct while we work on features or regression fixes
82
* For every regression or bigger issue that we encounter try to come up with at least two improvements, e.g. the actual issue is fixed and similar cases are prevented in the feature with better tests and optionally also monitoring is improved
83 27 okurz
84 55 okurz
#### Definition of DONE
85
86
Also see http://www.allaboutagile.com/definition-of-done-10-point-checklist/ and https://www.scrumalliance.org/community/articles/2008/september/what-is-definition-of-done-%28dod%29
87
88
* Code changes are made available via a pull request on a version control repository, e.g. github for openQA
89
* [Guidelines for git commits](http://chris.beams.io/posts/git-commit/) have been followed
90
* Code has been reviewed (e.g. in the github PR)
91
* Depending on criticality/complexity/size/feature: A local verification test has been run, e.g. post link to a local openQA machine or screenshot or logfile
92
* Potentially impacted package builds have been considered, e.g. openSUSE Tumbleweed and Leap, Fedora, etc.
93
* Code has been merged (either by reviewer or "mergify" bot or reviewee after 'LGTM' from others)
94
* Code has been deployed to osd and o3 (monitor automatic deployment, apply necessary config or infrastructure changes)
95
96 56 okurz
#### Definition of READY for new features
97 55 okurz
98
The following points should be considered before a new feature ticket is READY to be implemented:
99
100
* Follow the ticket template from https://progress.opensuse.org/projects/openqav3/wiki/#Feature-requests
101
* A clear motivation or user expressing a wish is available
102
* Acceptance criteria are stated (see ticket template)
103
* add tasks as a hint where to start
104
105 1 mgriessmeier
#### WIP-limits (reference "Kanban development")
106 28 okurz
107 79 livdywan
* global limit of 10 tickets, and 3 tickets per person respectively [In Progress](https://progress.opensuse.org/issues?query_id=505)
108
* limit of 20 tickets per person in [Feedback](https://progress.opensuse.org/issues?query_id=520)
109 27 okurz
110 1 mgriessmeier
#### Target numbers or "guideline", "should be", in priorities
111
112 67 okurz
1. *New, untriaged openQA:* [0 (daily)](https://progress.opensuse.org/projects/openqav3/issues?query_id=475) . Every ticket should have a target version, e.g. "Ready" for QE tools team, "future" if unplanned, others for other teams
113 64 okurz
1. *Untriaged "tools" tagged:* [0 (daily)](https://progress.opensuse.org/issues?query_id=481) . Every ticket should have a target version, e.g. "Ready" for QE tools team, "future" if unplanned, others for other teams
114 67 okurz
1. *Workable (properly defined):* [~40 (20-50)](https://progress.opensuse.org/issues?query_id=478) . Enough tickets to reflect a proper plan but not too many to limit unfinished data (see "waste")
115 82 okurz
1. *Overall backlog length:* [ideally less than 100](https://progress.opensuse.org/issues?query_id=230) . Similar as for "Workable". Enough tickets to reflect a proper roadmap as well as give enough flexibility for all unfinished work but limited to a feasible number that can still be overlooked by the team without loosing overview. One more reason for a maximum of 100 are that pagination in redmine UI allows to show only up to 100 issues on one page at a time, same for redmine API access.
116 71 okurz
1. *Within due-date:* [0 (daily/weekly)](https://progress.opensuse.org/issues?query_id=514) . We should take due-dates serious, finish tickets fast and at the very least update tickets with an explanation why the due-date could not be hold and update to a reasonable time in the future based on usual cycle time expectations
117 27 okurz
118
#### SLOs (service level objectives)
119
120
* for picking up tickets based on priority, first goal is "urgency removal":
121 67 okurz
 * **immediate openQA**: [<1 day](https://progress.opensuse.org/projects/openqav3/issues?utf8=%E2%9C%93&set_filter=1&f%5B%5D=priority_id&op%5Bpriority_id%5D=%3D&v%5Bpriority_id%5D%5B%5D=7&f%5B%5D=status_id&op%5Bstatus_id%5D=o&f%5B%5D=subproject_id&op%5Bsubproject_id%5D=%3D&v%5Bsubproject_id%5D%5B%5D=125&f%5B%5D=updated_on&op%5Bupdated_on%5D=%3Ct-&v%5Bupdated_on%5D%5B%5D=1&f%5B%5D=&c%5B%5D=subject&c%5B%5D=project&c%5B%5D=status&c%5B%5D=assigned_to&c%5B%5D=fixed_version&c%5B%5D=due_date&c%5B%5D=priority&c%5B%5D=updated_on&c%5B%5D=category&group_by=priority)
122
 * **urgent openQA**: [<1 week](https://progress.opensuse.org/projects/openqav3/issues?utf8=%E2%9C%93&set_filter=1&f%5B%5D=priority_id&op%5Bpriority_id%5D=%3D&v%5Bpriority_id%5D%5B%5D=6&f%5B%5D=status_id&op%5Bstatus_id%5D=o&f%5B%5D=subproject_id&op%5Bsubproject_id%5D=%3D&v%5Bsubproject_id%5D%5B%5D=125&f%5B%5D=updated_on&op%5Bupdated_on%5D=%3Ct-&v%5Bupdated_on%5D%5B%5D=7&f%5B%5D=&c%5B%5D=subject&c%5B%5D=project&c%5B%5D=status&c%5B%5D=assigned_to&c%5B%5D=fixed_version&c%5B%5D=due_date&c%5B%5D=priority&c%5B%5D=updated_on&c%5B%5D=category&group_by=status)
123
 * **high openQA**: [<1 month](https://progress.opensuse.org/projects/openqav3/issues?utf8=%E2%9C%93&set_filter=1&f%5B%5D=status_id&op%5Bstatus_id%5D=o&f%5B%5D=priority_id&op%5Bpriority_id%5D=%3D&v%5Bpriority_id%5D%5B%5D=5&f%5B%5D=subproject_id&op%5Bsubproject_id%5D=%3D&v%5Bsubproject_id%5D%5B%5D=125&f%5B%5D=updated_on&op%5Bupdated_on%5D=%3Ct-&v%5Bupdated_on%5D%5B%5D=30&f%5B%5D=&c%5B%5D=subject&c%5B%5D=project&c%5B%5D=status&c%5B%5D=assigned_to&c%5B%5D=fixed_version&c%5B%5D=due_date&c%5B%5D=priority&c%5B%5D=updated_on&c%5B%5D=category&group_by=status)
124
 * **normal openQA**: [<1 year](https://progress.opensuse.org/projects/openqav3/issues?utf8=%E2%9C%93&set_filter=1&f%5B%5D=priority_id&op%5Bpriority_id%5D=%3D&v%5Bpriority_id%5D%5B%5D=4&f%5B%5D=status_id&op%5Bstatus_id%5D=o&f%5B%5D=subproject_id&op%5Bsubproject_id%5D=%3D&v%5Bsubproject_id%5D%5B%5D=125&f%5B%5D=updated_on&op%5Bupdated_on%5D=%3Ct-&v%5Bupdated_on%5D%5B%5D=365&f%5B%5D=&c%5B%5D=subject&c%5B%5D=project&c%5B%5D=status&c%5B%5D=assigned_to&c%5B%5D=fixed_version&c%5B%5D=due_date&c%5B%5D=priority&c%5B%5D=updated_on&c%5B%5D=category&group_by=status)
125
 * **low openQA**: undefined
126 1 mgriessmeier
127
* aim for cycle time of individual tickets (not epics or sagas): 1h-2w
128 31 okurz
129 54 mkittler
#### Backlog prioritization
130 47 okurz
131
When we prioritize tickets we assess:
132
1. What the main use cases of openQA are among all users, be it SUSE QA engineers, other SUSE employees, openSUSE contributors as well as any other outside user of openQA
133
2. We try to understand how many persons and products are affected by feature requests as well as regressions (or "concrete bugs" as the ticket category is called within the openQA Project) and prioritize issues affecting more persons and products and use cases over limited issues
134
3. We prioritize regressions higher than work on (new) feature requests
135
4. If a workaround or alternative exists then this lowers priority. We prioritize tasks that need deep understanding of the architecture and an efficient low-level implementation over convenience additions that other contributors are more likely to be able to implement themselves.
136
137 38 okurz
### Team meetings
138
139 77 livdywan
* **Daily:** Use (internal) chat actively, e.g. formulate your findings or achievements and plans for the day, "think out loud" while working on individual problems. Optionally join the call every day 1030 CET/CEST
140 58 livdywan
  * *Goal*: Quick support on problems, feedback on plans, collaboration and self-reflection (compare to [Daily Scrum](https://www.scrumguides.org/scrum-guide.html#events-daily))
141 63 livdywan
* **Weekly coordination:** Every Friday 1115-1145 CET/CEST in [m.o.o/suse_qa_tools](https://meet.opensuse.org/suse_qa_tools) ([fallback](https://meet.jit.si/suse_qa_tools)). Community members and guests are particularly welcome to join this meeting.
142 58 livdywan
  * *Goal*: Team backlog coordination and design decisions of bigger topics (compare to [Sprint Planning](https://www.scrumguides.org/scrum-guide.html#events-planning)).
143 63 livdywan
* **Fortnightly Retrospective:** Friday 1145-1215 CET/CEST every even week, same room as the weekly meeting. On these days the weekly has hard time limit of 1115-1145. At the start of the week a game on retrospected.com is started which can be filled in all week. Specific actions are recorded as tickets at the end of the week.
144 58 livdywan
  * *Goal*: Inspect and adapt, learn and improve (compare to [Sprint Retrospective](https://www.scrumguides.org/scrum-guide.html#events-retro))
145
* **Virtual coffee talk:** Weekly every Thursday 1100-1120 CET/CEST, same room as the weekly.
146
  * *Goal*: Connect and bond as a team, understand each other (compare to [Informal Communication in an all-remote environment](https://about.gitlab.com/company/culture/all-remote/informal-communication))
147
* **extension on-demand:** Optional meeting on invitation in the suggested time slot Thursday 1000-1200 CET/CEST, in the same room as the weekly, on-demand or replacing the *Virtual coffee talk*.
148
  * *Goal*: Introduce, research and discuss bigger topics, e.g. backlog overview, processes and workflows
149 31 okurz
150 59 livdywan
Note: Meetings concerning the whole team are moderated by the scrum master by default, who should join the call early and verify that the meeting itself and any tools used are working or e.g. advise the use of the fallback option.
151
152 73 okurz
### Team
153
154
The team is comprised of engineers from different teams, some only partially available:
155
* Xiaojing Liu (Jane, [QA APAC 1](https://geekos.prv.suse.net/team/5b08104d7d795700204993df))
156
* Marius Kittler
157
* Nick Singer
158
* Sebastian Riedel
159
* Oliver Kurz (acting Product Owner)
160
* Tina Müller (Part time, now mainly working for OBS, [QAM3](https://geekos.prv.suse.net/team/5b7d24a17cf60423d2523485))
161
* Christian Dywan (Scrum Master, [QEM1](https://geekos.prv.suse.net/team/5b08104b7d795700204993d1))
162
* Ivan Lausuch (QEM3)
163
* Ondřej Súkup 
164
* Jan Baier (Part time)
165
* Vasileios Anastasiadis
166
167 45 okurz
### Alert handling
168
169
#### Best practices
170
171
* "if it hurts, do it more often": https://www.martinfowler.com/bliki/FrequencyReducesDifficulty.html
172
* Reduce [Mean-time-to-Detect (MTTD)](https://searchitoperations.techtarget.com/definition/mean-time-to-detect-MTTD) and [Mean-time-to-Recovery](https://raygun.com/blog/what-is-mttr/)
173
174
#### Process
175
176
* React on any alert
177
* For each failing grafana alert
178 51 okurz
 * Create a ticket for the issue (with a tag "alert"; create ticket unless the alert is trivial to resolve and needs no improvement)
179 45 okurz
 * Link the corresponding grafana panel in the ticket
180
 * Respond to the notification email with a link to the ticket
181 1 mgriessmeier
 * Optional: Inform in chat
182 51 okurz
 * Optional: Add "annotation" in corresponding grafana panel with a link to the corresponding ticket 
183 46 okurz
 * Pause the alert if you think further alerting the team does not help (e.g. you can work on fixing the problem, alert is non-critical but problem can not be fixed within minutes)
184 45 okurz
* If you consider an alert non-actionable then change it accordingly
185
* If you do not know how to handle an alert ask the team for help
186
* After resolving the issue add explanation in ticket, unpause alert and verify it going to "ok" again, resolve ticket
187
188
#### References
189
190
* https://nl.devoteam.com/en/blog-post/monitoring-reduce-mean-time-recovery-mttr/
191
192 31 okurz
### Historical
193
194 64 okurz
Previously the former QA tools team used target versions "Ready" (to be planned into individual milestone periods or sprints), "Current Sprint" and "Done". However the team never really did use proper time-limited sprints so the distinction was rather vague. After having tickets "Resolved" after some time the PO or someone else would also update the target version to "Done" to signal that the result has been reviewed. This was causing a lot of ticket update noise for not much value considering that the [Definition-of-Done](https://progress.opensuse.org/projects/openqav3/wiki/#ticket-workflow) when properly followed already has rather strict requirements on when something can be considered really "Resolved" hence the team eventually decided to not use the "Done" target version anymore. Since about 2019-05 (and since okurz is doing more backlog management) the team uses priorities more as well as the status "Workable" together with an explicit team member list for "What the team is working on" to better visualize what is making team members busy regardless of what was "officially" planned to be part of the team's work. So we closed the target version. On 2020-07-03 okurz subsequently closed "Current Sprint" as also this one was in most cases equivalent to just picking an assignee for a ticket or setting to "In Progress". We can just distinguish between "(no version)" meaning untriaged, "Ready" meaning tools team should consider picking up these issues and "future" meaning that there is no plan for this to be picked up. Everything else is defined by status and priority.
195 62 okurz
In 2020-10-27 we discussed together to find out the history of the team. We clarified that the team started out as a not well defined "Dev+Ops" team. "team responsibilities" have been mainly unchanged since at least beginning of 2019. We agreed that learning from users and production about our "Dev" contributions is good, so this part of "Ops" is responsibility of everyone.
196 27 okurz
197 69 tjyrinki_suse
# QE Core and QE Yast - Team descriptions
198 1 mgriessmeier
199 70 tjyrinki_suse
(this chapter has seen changes in 2020-11 regarding QSF -> QE Core / QE Yast change)
200 68 tjyrinki_suse
201 70 tjyrinki_suse
**QE Core** (formerly QSF, QA SLE Functional) and **QE Yast** are squads focusing on Quality Engineering of the core and yast functionality of the SUSE SLE products. The squad is comprised of members of QE Integration - [SUSE QA SLE Nbg](https://wiki.suse.net/index.php/SUSE-Quality_Assurance/Organization/Members_and_Responsibilities#QA_SLE_NBG_Team), including [SUSE QA SLE Prg](https://wiki.suse.net/index.php/SUSE-Quality_Assurance/Organization/Members_and_Responsibilities#QA_SLE_PRG_Team) - and QE Maintenance people (formerly "QAM"). The [SLE Departement](https://wiki.suse.net/index.php/SUSE-Quality_Assurance/SLE_Department#QSF_.28QA_SLE_Functional.29) page describes our QA responsibilities. We focus on our automatic tests running in [openQA](https://openqa.suse.de) under the job groups "Functional" as well as "Autoyast" for the respective products, for example [SLE 15 / Functional](https://openqa.suse.de/group_overview/110) and [SLE 15 / Autoyast](https://openqa.suse.de/group_overview/129). We back our automatic tests with exploratory manual tests, especially for the product milestone builds. Additionally we care about corresponding openSUSE openQA tests (see as well https://openqa.opensuse.org).
202 7 szarate
203 1 mgriessmeier
* long-term roadmap: http://s.qa.suse.de/qa-long-term
204
* overview of current openQA SLE12SP5 tests with progress ticket references: https://openqa.suse.de/tests/overview?distri=sle&version=12-SP5&groupid=139&groupid=142
205
* fate tickets for SLE12SP5 feature testing: based on http://s.qa.suse.de/qa_sle_functional_feature_tests_sle12sp5 new report based on all tickets with milestone before SLE12SP5 GM, http://s.qa.suse.de/qa_sle_functional_feature_tests_sle15sp1 for SLE15SP1
206
* only "blocker" or "shipstopper" bugs on "interesting products" for SLE15 http://s.qa.suse.de/qa_sle_functional_bug_query_sle15_2, http://s.qa/qa_sle_bugs_sle12_2 for SLE12
207 3 szarate
* Better organization of planned work can be seen at the [SUSE QA](https://progress.opensuse.org/projects/suseqa) project (which is not public).
208 1 mgriessmeier
209 27 okurz
## Test plan
210 1 mgriessmeier
211
When looking for coverage of certain components or use cases keep the [openQA glossary](http://open.qa/docs/#concept) in mind. It is important to understand that "tests in openQA" could be a scenario, for example a "textmode installation run", a combined multi-machine scenario, for example "a remote ssh based installation using X-forwarding", or a test module, for example "vim", which checks if the vim editor is correctly installed, provides correct rendering and basic functionality. You are welcome to contact any member of the team to ask for more clarification about this.
212
213 19 okurz
In detail the following areas are tested as part of "SLE functional":
214
215 1 mgriessmeier
* different hardware setups (UEFI, acpi)
216
* support for localization
217
* openSUSE: virtualization - some "virtualization" tests are active on o3 with reduced set compared to SLE coverage (on behalf of QA SLE virtualization due to team capacity constraints, clarified in QA SLE coordination meeting 2018-03-28)
218
* openSUSE: migration - comparable to "virtualization", a reduced set compared to SLE coverage is active on o3 (on behalf of QA SLE migration due to team capacity constraints, clarified in QA SLE coordination meeting 2018-04)
219 26 riafarov
220
221 69 tjyrinki_suse
### QE Yast
222 18 okurz
223 69 tjyrinki_suse
Squad focuses on testing YaST components, including installer and snapper.
224 1 mgriessmeier
225 18 okurz
Detailed test plan for SLES can be found here: [SLES_Integration_Level_Testplan.md](https://gitlab.suse.de/qsf-y/qa-sle-functional-y/blob/master/SLES_Integration_Level_Testplan.md)
226 1 mgriessmeier
227
* Latest report based on openQA test results SLE12: http://s.qa.suse.de/test-status-sle12-yast , SLE15: http://s.qa.suse.de/test-status-sle15-yast
228 2 mgriessmeier
229
230 69 tjyrinki_suse
### QE Core
231 1 mgriessmeier
232
"Testing is the future, and the future starts with you"
233
234
* basic operations (firefox, zypper, logout/reboot/shutdown)
235
* boot_to_snapshot
236 18 okurz
* functional application tests (kdump, gpg, ipv6, java, git, openssl, openvswitch, VNC)
237
* NIS (server, client)
238 1 mgriessmeier
* toolchain (development module)
239
* systemd
240 6 okurz
* "transactional-updates" as part of the corresponding SLE server role, not CaaSP
241
242
* Latest report based on openQA test results SLE12: http://s.qa.suse.de/test-status-sle12-functional , SLE15: http://s.qa.suse.de/test-status-sle15-functional
243 1 mgriessmeier
244 6 okurz
245 69 tjyrinki_suse
## In new organization also qovered by QE Core and others
246 1 mgriessmeier
247 69 tjyrinki_suse
* quarterly updated media: former QA Maintenance (QAM) is now part of the various QE squads. However, QU media does happen together with Maintenance Coordination that is not part of these squads.
248 1 mgriessmeier
249
250 27 okurz
## What we do
251 1 mgriessmeier
252
We collected opinions, personal experiences and preferences starting with the following four topics: What are fun-tasks ("new tests", "collaborate", "do it right"), what parts are annoying ("old & sporadic issues"), what do we think is expected from qsf-u ("be quick", "keep stuff running", "assess quality") and what we should definitely keep doing to prevent stakeholders becoming disappointed ("build validation", "communication & support").
253 12 okurz
254 27 okurz
### How we work on our backlog
255 12 okurz
256
* no "due date"
257
* we pick up tickets that have not been previously discussed
258 1 mgriessmeier
* more flexible choice
259 14 okurz
* WIP-limits:
260
 * global limit of 10 tickets "In Progress"
261
262
* target numbers or "guideline", "should be", in priorities:
263 12 okurz
 1. New, untriaged: 0
264
 2. Workable: 40
265 69 tjyrinki_suse
 3. New, assigned to [qe-core] or [qe-yast]: ideally less than 200 (should not stop you from triaging)
266 1 mgriessmeier
267
* SLAs for priority tickets - how to ensure to work on tickets which are more urgent?
268
 * "taken": <1d: immediate -> looking daily
269
 * 2-3d: urgent
270 12 okurz
 * first goal is "urgency removal": <1d: immediate, 1w: urgent
271 1 mgriessmeier
272 12 okurz
* our current "cycle time" is 1h - 1y (maximum, with interruptions)
273 1 mgriessmeier
274
* everybody should set priority + milestone in obvious cases, e.g. new reproducible test failures in multiple critical scenarios, in general case the PO decides
275
276 27 okurz
### How we like to choose our battles
277 1 mgriessmeier
278
We self-assessed our tasks on a scale from "administrative" to "creative" and found in the following descending order: daily test review (very "administrative"), ticket triaging, milestone validation, code review, create needles, infrastructure issues, fix and cleanup tests, find bugs while fixing failing tests, find bugs while designing new tests, new automated tests (very "creative"). Then we found we appreciate if our work has a fair share of both sides. Probably a good ratio is 60% creative plus 40% administrative tasks. Both types have their advantages and we should try to keep the healthy balance.
279
280
281 27 okurz
### What "product(s)" do we (really) *care* about?
282 1 mgriessmeier
283
Brainstorming results:
284
285
* openSUSE Krypton -> good example of something that we only remotely care about or not at all even though we see the connection point, e.g. test plasma changes early before they reach TW or Leap as operating systems we rely on or SLE+packagehub which SUSE does not receive direct revenue from but indirect benefit. Should be "community only", that includes members from QSF though
286
* openQA -> (like OBS), helps to provide ROI for SUSE
287
* SLE(S) (in development versions)
288
* Tumbleweed
289
* Leap, because we use it
290
* SLES HA
291
* SLE migration
292
* os-autoinst-distri-opensuse+backend+needles
293
294
From this list strictly no "product" gives us direct revenue however most likely SLE(S) (as well as SLES HA and SLE migration) are good examples of direct connection to revenue (based on SLE subscriptions). Conducting a poll in the team has revealed that 3 persons see "SLE(S)" as our main product and 3 see "os-autoinst-distri-opensuse+backend+needles" as the main product. We mainly agreed that however we can not *own* a product like "SLE" because that product is mainly not under our control.
295
296
Visualizing "cost of testing" vs. "risk of business impact" showed that both metrics have an inverse dependency, e.g. on a range from "upstream source code" over "package self-tests", "openSUSE Factory staging", "Tumbleweed", "SLE" we consider SLE to have the highest business risk attached and therefore defines our priority however testing at upstream source level is considered most effective to prevent higher cost of bugs or issues. Our conclusion is that we must ensure that the high-risk SLE base has its quality assured while supporting a quality assurance process as early as possible in the development process. package self-tests as well as the openQA staging tests are seen as a useful approach in that direction as well as "domain specfic specialist QA engineers" working closely together with according in-house development parties.
297
298 27 okurz
## Documentation
299 1 mgriessmeier
300
This documentation should only be interesting for the team QA SLE functional. If you find that some of the following topics are interesting for other people, please extract those topics to another wiki section.
301
302
### QA SLE functional Dashboards
303
304
In room 3.2.15 from Nuremberg office are two dedicated laptops each with a monitor attached showing a selected overview of openQA test resuls with important builds from SLE and openSUSE.
305 4 szarate
Such laptops are configured with a root account with the default password for production machines. First point of contact: [slindomansilla.suse.com](mailto:slindomansilla@suse.com), (okurz@suse.de)[mailto:okurz@suse.de]
306 1 mgriessmeier
307
* ''dashboard-osd-3215.suse.de'': Showing current view of openqa.suse.de filtered for some job group results, e.g. "Functional"
308
* ''dashboard-o3-3215.suse.de'': Showing current view of openqa.opensuse.org filtered for some job group results which we took responsibility to review and are mostly interested in
309
310 24 dheidler
### dashboard-osd-3215
311 1 mgriessmeier
312
* OS: openSUSE Tumbleweed
313
* Services: ssh, mosh, vnc, x2x
314
* Users:
315
** root
316
** dashboard
317
* VNC: `vncviewer dashboard-osd-3215`
318
* X2X: `ssh -XC dashboard@dashboard-osd-3215 x2x -west -to :0.0`
319
** (attaches the dashboard monitor as an extra display to the left of your screens. Then move the mouse over and the attached X11 server will capture mouse and keyboard)
320
321
#### Content of /home/dashboard/.xinitrc
322
323 3 szarate
```
324 1 mgriessmeier
#
325
# Source common code shared between the
326
# X session and X init scripts
327
#
328
. /etc/X11/xinit/xinitrc.common
329
330
xset -dpms
331
xset s off
332
xset s noblank
333
[...]
334
#
335
# Add your own lines here...
336
#
337
$HOME/bin/osd_dashboard &
338 3 szarate
```
339 1 mgriessmeier
340
#### Content of /home/dashboard/bin/osd_dashboard
341
342 3 szarate
```
343 1 mgriessmeier
#!/bin/bash
344
345
DISPLAY=:0 unclutter &
346
347
DISPLAY=:0 xset -dpms
348
DISPLAY=:0 xset s off
349
DISPLAY=:0 xset s noblank
350
351
url="${url:-"https://openqa.suse.de/?group=SLE+15+%2F+%28Functional%7CAutoyast%29&default_expanded=1&limit_builds=3&time_limit_days=14&show_tags=1&fullscreen=1#"}"
352 20 dheidler
DISPLAY=:0 chromium --kiosk "$url"
353 3 szarate
```
354 1 mgriessmeier
355
#### Cron job:
356
357 3 szarate
```
358 1 mgriessmeier
Min     H       DoM     Mo      DoW     Command
359 23 dheidler
*	*	*	*	*	/home/dashboard/bin/reload_chromium
360 3 szarate
```
361 1 mgriessmeier
362 21 dheidler
#### Content of /home/dashboard/bin/reload_chromium
363 1 mgriessmeier
364 3 szarate
```
365 1 mgriessmeier
#!/bin/bash
366
367
DISPLAY=:0 xset -dpms
368
DISPLAY=:0 xset s off
369
DISPLAY=:0 xset s noblank
370
371 22 dheidler
DISPLAY=:0 xdotool windowactivate $(DISPLAY=:0 xdotool search --class Chromium)
372 21 dheidler
DISPLAY=:0 xdotool key F5
373
DISPLAY=:0 xdotool windowactivate $(DISPLAY=:0 xdotool getactivewindow)
374 3 szarate
```
375 1 mgriessmeier
376
#### Issues:
377
378
* ''When the screen shows a different part of the web page''
379
** a simple mouse scroll through vnc or x2x may suffice.
380
* ''When the builds displayed are freeze without showing a new build, it usually means that midori, the browser displaying the info on the screen, crashed.''
381
** you can try to restart midori this way:
382
*** ps aux | grep midori
383
*** kill $pid
384
*** /home/dashboard/bin/osd_dashboard
385
** If this also doesn't work, restart the machine.
386 25 dheidler
387
388
### dashboard-o3
389
390
* Raspberry Pi 3B+
391
* IP: `10.160.65.207`
392
393
#### Content of /home/tux/.xinitrc
394
```
395
#!/bin/bash
396
397
unclutter &
398
openbox &
399
xset s off
400
xset -dpms
401
sleep 5
402
url="https://openqa.opensuse.org?group=openSUSE Tumbleweed\$|openSUSE Leap [0-9]{2}.?[0-9]*\$|openSUSE Leap.\*JeOS\$|openSUSE Krypton|openQA|GNOME Next&limit_builds=2&time_limit_days=14&&show_tags=1&fullscreen=1#build-results"
403
chromium --kiosk "$url" &
404
405
while sleep 300 ; do
406
        xdotool windowactivate $(xdotool search --class Chromium)
407
        xdotool key F5
408
        xdotool windowactivate $(xdotool getactivewindow)
409
done
410
```
411
412
#### Content of /usr/share/lightdm/lightdm.conf.d/50-suse-defaults.conf
413
```
414
[Seat:*]
415
pam-service = lightdm
416
pam-autologin-service = lightdm-autologin
417
pam-greeter-service = lightdm-greeter
418
xserver-command=/usr/bin/X
419
session-wrapper=/etc/X11/xdm/Xsession
420
greeter-setup-script=/etc/X11/xdm/Xsetup
421
session-setup-script=/etc/X11/xdm/Xstartup
422
session-cleanup-script=/etc/X11/xdm/Xreset
423
autologin-user=tux
424
autologin-timeout=0
425
```