action #111473

openQA Project - coordination #80142: [saga][epic] Scale out: Redundant/load-balancing deployments of openQA, easy containers, containers on kubernetes

action #97862: More openQA worker hardware for OSD size:M

Get replacements for imagetester and openqaworker1 size:M

Added by okurz about 1 month ago. Updated 1 day ago.

Status: Workable
Priority: High
Assignee: -
Target version:
Start date: 2022-05-23
Due date:
% Done: 0%
Estimated time:

Description

Motivation

The move to co-location is delayed, so we need to keep the O3+OSD infrastructure within NUE SRV1 up-to-date.

Acceptance criteria

  • AC1: Updated the two oldest workers in NUE SRV1 (imagetester+openqaworker1)

Suggestions

  • We need to make physical space for new machines (might be possible to remove uno+rebel or openqaworker2+openqaworker3 to make room)
  • Get the quote from Nick or Oli

  • DONE: Get a quote, e.g. from https://www.deltacomputer.com/ for supermicro machines to replace at least imagetester+openqaworker1, potentially also uno+rebel or openqaworker2+openqaworker3 -> https://www.deltacomputer.com/media/pdf/delta_d10z-m2-zm_0465.pdf

  • Bring the order forward to Lee Martin, Nick Singer and Matthias Grießmeier to crosslink with the QE budget

  • Coordinate with SUSE-IT EngInfra to prepare for new machines replacing existing ones

  • Make sure machines are ordered to SUSE Nbg Maxtorhof

  • Ask SUSE-IT EngInfra if it's ok if we sponsor a new top-of-the-rack switch with 10G SFP to replace the existing one https://racktables.nue.suse.com/index.php?page=object&object_id=163 in NUE-SRV1-D-6

Further details

Suggested by nsinger:

  • Chassis 1x SuperMicro 825BTQC-R1K23LPB
  • RAM 8x Micron MTA36ASF8G72PZ-3G2: 512 GB
  • Disk 2x Samsung PM1643a 3.8 TB SSD
  • Controller 1x Broadcom 9500-8i
  • Network 1x Intel X710-DA2, 2 Ports, 10GbE, SFP+
  • M.2 NVMe, 2x Micron 7400 MAX, 400 GB, SSD
  • CPU 1x AMD EPYC 7763, 64 cores per CPU, 2.45 GHz

https://confluence.suse.com/display/qasle/2022-05-23+Meeting+notes

delta_d10z-m2-zm_9430.pdf (549 KB) okurz, 2022-06-03 12:32

Related issues

Copied to openQA Infrastructure - action #111986: Ensure uno.openqanet.opensuse.org is properly used - Feedback - 2022-07-15

History

#1 Updated by okurz about 1 month ago

  • Description updated (diff)

#2 Updated by okurz about 1 month ago

  • Tracker changed from action to coordination

#3 Updated by okurz about 1 month ago

  • Tracker changed from coordination to action
  • Subject changed from [epic] Get replacement for existing machines, e.g. imagetester, openqaworker1 (potentially more) to Get replacement for existing machines, e.g. imagetester, openqaworker1 (potentially more)
  • Priority changed from Normal to High

#4 Updated by cdywan about 1 month ago

  • Subject changed from Get replacement for existing machines, e.g. imagetester, openqaworker1 (potentially more) to Get replacements for imagetester and openqaworker1 size:M
  • Description updated (diff)
  • Status changed from New to Workable

#5 Updated by okurz 28 days ago

  • Description updated (diff)

#6 Updated by okurz 28 days ago

  • Copied to action #111986: Ensure uno.openqanet.opensuse.org is properly used added

#7 Updated by okurz 28 days ago

  • Priority changed from High to Urgent

To ensure we can make use of assigned budget we should expedite the ordering process, raising prio.

#8 Updated by okurz 28 days ago

Seems like I missed something in the previous PDF, e.g. missing NVMe etc.
Configured a new one together with nsinger to be sure we come up with the right stuff. https://www.deltacomputer.com/media/pdf/delta_d10z-m2-zm_9430.pdf is the link to the order, not sure if it's really persistent. So also attaching.

We decided that we don't need to ask for any "extended support", so the configuration is as included in the PDF. The cost comes to roughly 9600 EUR * 1.19 = 11424 EUR including tax, or, using the SUSE-internal EUR-to-USD conversion factor of 1.21, ~13823 USD.
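
For anyone who wants to re-check the numbers, here is a minimal sketch of the calculation. It assumes the 9600 figure is the net price in EUR and that 1.19 and 1.21 are the tax factor and the internal EUR-to-USD factor from the comment above; the function and constant names are made up for illustration.

```python
# Rough cost check for the quote. All figures come from the comment above;
# the names below are hypothetical and only serve this illustration.
NET_PRICE_EUR = 9600   # approximate net price (assumed to be in EUR)
VAT_FACTOR = 1.19      # tax factor used in the comment
EUR_TO_USD = 1.21      # SUSE-internal conversion factor used in the comment


def gross_eur(net_eur: float, vat_factor: float = VAT_FACTOR) -> float:
    """Return the price including tax, in EUR."""
    return net_eur * vat_factor


def gross_usd(net_eur: float,
              vat_factor: float = VAT_FACTOR,
              eur_to_usd: float = EUR_TO_USD) -> float:
    """Return the price including tax, converted to USD."""
    return gross_eur(net_eur, vat_factor) * eur_to_usd


if __name__ == "__main__":
    print(f"Gross: {round(gross_eur(NET_PRICE_EUR))} EUR")
    print(f"Gross: {round(gross_usd(NET_PRICE_EUR))} USD")
```

If the quote changes, only `NET_PRICE_EUR` needs to be adjusted.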

#9 Updated by okurz 28 days ago

  • Description updated (diff)

Moved "uno" related task to #111986

#10 Updated by cdywan 24 days ago

  • Status changed from Workable to Feedback

Email with "Anfrage" in the subject sent today

#11 Updated by okurz 22 days ago

  • Description updated (diff)

#13 Updated by okurz 22 days ago

  • Status changed from Feedback to In Progress
  • Assignee changed from nicksinger to okurz

#14 Updated by okurz 21 days ago

email sent to vendor, awaiting response.

#15 Updated by okurz 21 days ago

  • Due date set to 2022-06-24
  • Status changed from In Progress to Feedback

two quotes received, created ticket to receive approved POs. Approved by mgriessmeier as sponsor representative, waiting for PO approval.

#16 Updated by okurz 14 days ago

  • Due date changed from 2022-06-24 to 2022-07-01

Talked with mbach. She is aware of the urgency. Waiting for approval.

#17 Updated by okurz 9 days ago

  • Due date changed from 2022-07-01 to 2022-09-02

Approval was received. I forwarded the order to the vendor and received a confirmation for both.

#18 Updated by okurz 8 days ago

  • Priority changed from Urgent to Low

Waiting for delivery

#19 Updated by okurz 1 day ago

  • Due date deleted (2022-09-02)
  • Status changed from Feedback to Workable
  • Assignee deleted (okurz)
  • Priority changed from Low to High

According to https://www.ids-logistik.de/de/sendungsverfolgung?tracking=90461_2009043476268005 the Nbg machines have arrived at Frankencampus.

Next steps can be coordinated, e.g. create a SUSE-IT EngInfra ticket, get the server hardware moved to Nbg Maxtorhof SRV1, have it connected replacing imagetester and openqaworker1, and have imagetester and openqaworker1 moved to QA cold storage or put somewhere in SRV2 where there is space.
