action #150830
closed
Two new ARM servers 2023-11 for openqa.suse.de bare-metal testing size:M
Added by okurz about 1 year ago.
Updated 12 months ago.
Description
Motivation
afaerber, as coordinator for ARM SUSE development and testing, has two new ARM machines ready to be integrated as bare-metal test hosts. We should take over those machines, mount them in the FC Basement, bring them into OSD production as bare-metal test machines, and ensure that testing-related squads follow up with specific testing, e.g. by running the default scenario(s) on each specific host.
Acceptance criteria
- AC1: Two new ARM servers from 2023-11 are used in production in openqa.suse.de as bare-metal test hosts
- AC2: Our inventory management system is up-to-date
Suggestions
- Subject changed from Two new ARM servers 2023-11 for openqa.suse.de bare-metal testing to Two new ARM servers 2023-11 for openqa.suse.de bare-metal testing size:M
- Tags changed from infra, arm, fc-basement, next-frankencampus-visit to infra, arm, fc-basement
- Due date set to 2023-12-07
- Status changed from New to Feedback
- Status changed from Feedback to In Progress
Moved both machines to FC Basement with help from mgriessmeier. Machines are in rack but not yet connected.
- Status changed from In Progress to Workable
- Status changed from Workable to In Progress
Connected power and IPMI. squidbilly runs Fedora with root/root. Its IPMI address is 10.168.195.218, and ipmitool -Ilanplus -H 10.168.195.218 -U admin -P admin
works fine. Connected FCs for both, but it seems the switch does not have those ports activated yet.
Same for squidward, which has IPMI at 10.168.194.235, but SOL does not show anything.
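The IPMI checks described above can be sketched as a small wrapper. This is only an illustration: the helper names `bmc_ip` and `ipmi` and the `DRY_RUN` switch are hypothetical, and reusing the admin/admin credentials for squidward is an assumption, since the comment only states them for squidbilly. By default the sketch prints the commands instead of contacting the BMCs.

```shell
# Map the host names from this ticket to their BMC addresses.
bmc_ip() {
    case "$1" in
        squidbilly) echo 10.168.195.218 ;;
        squidward)  echo 10.168.194.235 ;;
        *) echo "unknown host: $1" >&2; return 1 ;;
    esac
}

# usage: ipmi <host> <ipmitool arguments...>
# Credentials for squidward are assumed to match squidbilly's.
ipmi() {
    host=$1; shift
    ip=$(bmc_ip "$host") || return 1
    if [ "${DRY_RUN:-1}" = 1 ]; then
        # Dry run (the default): only show what would be executed.
        echo ipmitool -I lanplus -H "$ip" -U admin -P admin "$@"
    else
        ipmitool -I lanplus -H "$ip" -U admin -P admin "$@"
    fi
}

for h in squidbilly squidward; do
    ipmi "$h" chassis power status
done
```

With `DRY_RUN=0` the same wrapper would run the real commands, for example `ipmi squidward sol activate` to open the serial-over-LAN console that showed nothing here.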
- Due date deleted (2023-12-07)
- Status changed from In Progress to Blocked
- Status changed from Blocked to In Progress
- Due date set to 2024-01-12
- Status changed from In Progress to Feedback
qa> configure
Entering configuration mode
{master:5}[edit]
qa# set interfaces xe-4/0/1 unit 0 family ethernet-switching interface-mode access
{master:5}[edit]
qa# set interfaces xe-4/0/1 unit 0 family ethernet-switching vlan members VL192
{master:5}[edit]
qa# set interfaces xe-4/0/0 unit 0 family ethernet-switching interface-mode access
{master:5}[edit]
qa# set interfaces xe-4/0/0 unit 0 family ethernet-switching vlan members VL192
{master:5}[edit]
qa# commit
configuration check succeeds
fpc1:
commit complete
fpc2:
commit complete
fpc3:
commit complete
fpc4:
commit complete
commit complete
{master:5}[edit]
martchus@openqa:~> for w in squidward squidbilly ; do sudo openqa-clone-job --skip-download --parental-inheritance --within-instance https://openqa.suse.de/tests/13004244 _GROUP=0 WORKER_CLASS="$w" {BUILD,TEST}+=-$w-poo150830 ; done
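The `{BUILD,TEST}+=-$w-poo150830` argument in the loop above relies on bash brace expansion, which happens before variable expansion and yields one `KEY+=value` argument per setting. A minimal sketch to preview the resulting arguments without cloning anything (the function name `clone_args` is hypothetical; as far as I understand, openqa-clone-job treats `+=` as appending to the original job's value):

```shell
#!/bin/bash
# Print the settings arguments the loop would pass for one worker class.
clone_args() {
    local w=$1
    echo WORKER_CLASS="$w" {BUILD,TEST}+=-$w-poo150830
}

for w in squidward squidbilly; do
    clone_args "$w"
done
```

Each clone therefore carries the worker class and ticket number as a suffix on both BUILD and TEST, which keeps the verification jobs easy to find.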
- Due date changed from 2024-01-12 to 2024-01-19
- Related to action #152887: Setup of Ampere Altra Q32-17 for bare-metal tests in openQA size:M added
- Status changed from Feedback to In Progress
- Tags changed from infra, arm, fc-basement to infra, arm, fc-basement, next-frankencampus-visit
Nope, failed the same way. It seems no network connection is available. I booted both squidward and squidbilly over the web remote control interface, selected to boot into the UEFI menu over the pre-installed GRUB, and there configured the boot order to try booting over the network before trying other storage devices. It took me some time to also find an additional network setting to disable/enable the fibre network interfaces, which I changed on squidward. Regardless, the booted Fedora systems don't show a carrier on the fibre network devices. I guess I need to check again in person.
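The missing carrier mentioned above can be checked from the booted Fedora system through sysfs. A minimal sketch, assuming the standard Linux `/sys/class/net/<device>/carrier` attribute (the function name `carrier_report` and its optional root-directory argument are hypothetical, the latter only there to allow testing against a fake tree):

```shell
# Print "<device> <carrier>" for every network device; 1 means link detected,
# 0 means no carrier, "unknown" means the attribute could not be read
# (the kernel refuses the read while the interface is administratively down).
carrier_report() {
    root=${1:-/sys/class/net}
    for dev in "$root"/*; do
        [ -e "$dev/carrier" ] || continue
        state=$(cat "$dev/carrier" 2>/dev/null) || state=unknown
        printf '%s %s\n' "${dev##*/}" "${state:-unknown}"
    done
}

carrier_report
```

A fibre interface reporting 0 here even though the switch port is configured points at cabling or transceivers rather than at the OS.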
I checked the connections physically with the help of dheidler. The SFP+ modules were both upside down and not properly seated: both cables, both ends. We turned them around and ensured that the cables are properly seated.
In addition to the above network switch configuration by mkittler, we also ran set protocols rstp interface xe-3/2/1 edge
with the corresponding switch ports, and we ensured that the network cables get a proper network setup using squiddlydiddly, see #152887. Then, over the remote control interface for squidbilly, I could at least initially get a successful iPXE boot, but only the success message showed up, not an interactive menu. Will crosscheck with squidward.
- Due date deleted (2024-01-19)
- Status changed from In Progress to Resolved
squidward looks good now and successfully boots SLE installation media, see https://openqa.suse.de/tests/13214720 . squidbilly somehow always reverts to booting from storage, but I am sure this can be fixed with the right settings in the UEFI menu.
Handed over to kernel squad: #153277