blob: e9cd4636aef312a7495bd6b382e4c177ad41c75f [file] [log] [blame]
Zack Williams9026f532020-11-30 11:34:32 -07001..
2 SPDX-FileCopyrightText: © 2020 Open Networking Foundation <support@opennetworking.org>
3 SPDX-License-Identifier: Apache-2.0
4
5Troubleshooting
6===============
7
Zack Williams5fd7a232020-12-03 12:45:56 -07008Unable to access a system
9-------------------------
10
11If it's a system behind another system (ex: the compute nodes behind a
12management server) and you're trying to interactively login to it, make sure
13that you've enabled SSH Agent Forwarding in your ``~/.ssh/config`` file::
14
15 Host mgmtserver1.prod.site.aetherproject.net
16 ForwardAgent yes
17
18If you still have problems after verifying that this is set up, run ssh with
19the ``-v`` option, which will print out all the connection details and
20whether an agent is used on the second ssh::
21
22 onfadmin@mgmtserver1:~$ ssh onfadmin@node2.mgmt.prod.site.aetherproject.net
23 debug1: client_input_channel_open: ctype auth-agent@openssh.com rchan 2 win 65536 max 16384
24 debug1: channel 1: new [authentication agent connection]
25 debug1: confirm auth-agent@openssh.com
26 Welcome to Ubuntu 18.04.5 LTS (GNU/Linux 5.4.0-56-generic x86_64)
27 ...
28 onfadmin@node2:~$
29
30Problems with OS installation
31-----------------------------
32
33OS installs, but doesn't boot
34"""""""""""""""""""""""""""""
35
36If you've completed the installation but the system won't start the OS, check
37these BIOS settings:
38
39- If the startup disk is nVME, under ``Advanced -> PCIe/PCI/PnP Configuration``
40 the option ``NVMe Firmware Source`` should be set to ``AMI Native Support``,
41 per `this Supermicro FAQ entry
42 <https://supermicro.com/support/faqs/faq.cfm?faq=28248>`_.
43
Zack Williams9026f532020-11-30 11:34:32 -070044Unknown MAC addresses
45---------------------
46
47Sometimes it's hard to find out all the MAC addresses assigned to network
48cards. These can be found in a variety of ways:
49
501. On servers, the BMC webpage will list the built-in network card MAC
51 addresses.
52
532. If you login to a server, ``ip link`` or ``ip addr`` will show the MAC
54 address of each interface, including on add-in cards.
55
563. If you can login to a server but don't know the BMC IP or MAC address for
57 that server, you can find it with ``sudo ipmitool lan print``.
58
594. If you don't have a login to the server, but can get to the management
60 server, ``ip neighbor`` will show the arp table of MAC addresses known to
61 that system. It's output is unsorted - ``ip neigh | sort`` is easier to
Zack Williams5fd7a232020-12-03 12:45:56 -070062 read. This can be useful for determining if there's a cabling problem -
63 a device plugged into the wrong port of the management switch could show up
64 in the DHCP pool range for a different segment.
Zack Williams9026f532020-11-30 11:34:32 -070065
66Cabling issues
67--------------
68
69The system may not come up correctly if cabling isn't connected properly.
70If you don't have hands-on with the cabling, here are some ways to check on the
71cabling remotely:
72
731. On servers you can check which ports are connected with ``ip link show``::
74
75 $ ip link show
76 ...
77 3: eno1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq state UP mode DEFAULT group default qlen 1000
78 link/ether 3c:ec:ef:4d:55:a8 brd ff:ff:ff:ff:ff:ff
79 ...
80 5: eno2: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN mode DEFAULT group default qlen 1000
81 link/ether 3c:ec:ef:4d:55:a9 brd ff:ff:ff:ff:ff:ff
82
83 Ports that are up will show ``state UP``
84
852. You can determine which remote ports are connected with LLDP, assuming that
86 the remote switch supports LLDP and has it enabled. This can be done with
87 ``networkctl lldp``, which shows both the name and the MAC address of the
88 connected switch on a per-link basis::
89
90 $ networkctl lldp
91 LINK CHASSIS ID SYSTEM NAME CAPS PORT ID PORT DESCRIPTION
92 eno1 10:4f:58:e7:d5:60 Aruba-2540-24…PP ..b........ 10 10
93 eno2 10:4f:58:e7:d5:60 Aruba-2540-24…PP ..b........ 1 1