Zack Williams | 9026f53 | 2020-11-30 11:34:32 -0700 | [diff] [blame] | 1 | .. |
| 2 | SPDX-FileCopyrightText: © 2020 Open Networking Foundation <support@opennetworking.org> |
| 3 | SPDX-License-Identifier: Apache-2.0 |
| 4 | |
| 5 | Troubleshooting |
| 6 | =============== |
| 7 | |
Zack Williams | 5fd7a23 | 2020-12-03 12:45:56 -0700 | [diff] [blame] | 8 | Unable to access a system |
| 9 | ------------------------- |
| 10 | |
| 11 | If it's a system behind another system (ex: the compute nodes behind a |
| 12 | management server) and you're trying to interactively login to it, make sure |
| 13 | that you've enabled SSH Agent Forwarding in your ``~/.ssh/config`` file:: |
| 14 | |
| 15 | Host mgmtserver1.prod.site.aetherproject.net |
| 16 | ForwardAgent yes |
| 17 | |
| 18 | If you still have problems after verifying that this is set up, run ssh with |
| 19 | the ``-v`` option, which will print out all the connection details and |
| 20 | whether an agent is used on the second ssh:: |
| 21 | |
| 22 | onfadmin@mgmtserver1:~$ ssh onfadmin@node2.mgmt.prod.site.aetherproject.net |
| 23 | debug1: client_input_channel_open: ctype auth-agent@openssh.com rchan 2 win 65536 max 16384 |
| 24 | debug1: channel 1: new [authentication agent connection] |
| 25 | debug1: confirm auth-agent@openssh.com |
| 26 | Welcome to Ubuntu 18.04.5 LTS (GNU/Linux 5.4.0-56-generic x86_64) |
| 27 | ... |
| 28 | onfadmin@node2:~$ |
| 29 | |
| 30 | Problems with OS installation |
| 31 | ----------------------------- |
| 32 | |
| 33 | OS installs, but doesn't boot |
| 34 | """"""""""""""""""""""""""""" |
| 35 | |
| 36 | If you've completed the installation but the system won't start the OS, check |
| 37 | these BIOS settings: |
| 38 | |
| 39 | - If the startup disk is nVME, under ``Advanced -> PCIe/PCI/PnP Configuration`` |
| 40 | the option ``NVMe Firmware Source`` should be set to ``AMI Native Support``, |
| 41 | per `this Supermicro FAQ entry |
| 42 | <https://supermicro.com/support/faqs/faq.cfm?faq=28248>`_. |
| 43 | |
Zack Williams | 9026f53 | 2020-11-30 11:34:32 -0700 | [diff] [blame] | 44 | Unknown MAC addresses |
| 45 | --------------------- |
| 46 | |
| 47 | Sometimes it's hard to find out all the MAC addresses assigned to network |
| 48 | cards. These can be found in a variety of ways: |
| 49 | |
| 50 | 1. On servers, the BMC webpage will list the built-in network card MAC |
| 51 | addresses. |
| 52 | |
| 53 | 2. If you login to a server, ``ip link`` or ``ip addr`` will show the MAC |
| 54 | address of each interface, including on add-in cards. |
| 55 | |
| 56 | 3. If you can login to a server but don't know the BMC IP or MAC address for |
| 57 | that server, you can find it with ``sudo ipmitool lan print``. |
| 58 | |
| 59 | 4. If you don't have a login to the server, but can get to the management |
| 60 | server, ``ip neighbor`` will show the arp table of MAC addresses known to |
| 61 | that system. It's output is unsorted - ``ip neigh | sort`` is easier to |
Zack Williams | 5fd7a23 | 2020-12-03 12:45:56 -0700 | [diff] [blame] | 62 | read. This can be useful for determining if there's a cabling problem - |
| 63 | a device plugged into the wrong port of the management switch could show up |
| 64 | in the DHCP pool range for a different segment. |
Zack Williams | 9026f53 | 2020-11-30 11:34:32 -0700 | [diff] [blame] | 65 | |
| 66 | Cabling issues |
| 67 | -------------- |
| 68 | |
| 69 | The system may not come up correctly if cabling isn't connected properly. |
| 70 | If you don't have hands-on with the cabling, here are some ways to check on the |
| 71 | cabling remotely: |
| 72 | |
| 73 | 1. On servers you can check which ports are connected with ``ip link show``:: |
| 74 | |
| 75 | $ ip link show |
| 76 | ... |
| 77 | 3: eno1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq state UP mode DEFAULT group default qlen 1000 |
| 78 | link/ether 3c:ec:ef:4d:55:a8 brd ff:ff:ff:ff:ff:ff |
| 79 | ... |
| 80 | 5: eno2: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN mode DEFAULT group default qlen 1000 |
| 81 | link/ether 3c:ec:ef:4d:55:a9 brd ff:ff:ff:ff:ff:ff |
| 82 | |
| 83 | Ports that are up will show ``state UP`` |
| 84 | |
| 85 | 2. You can determine which remote ports are connected with LLDP, assuming that |
| 86 | the remote switch supports LLDP and has it enabled. This can be done with |
| 87 | ``networkctl lldp``, which shows both the name and the MAC address of the |
| 88 | connected switch on a per-link basis:: |
| 89 | |
| 90 | $ networkctl lldp |
| 91 | LINK CHASSIS ID SYSTEM NAME CAPS PORT ID PORT DESCRIPTION |
| 92 | eno1 10:4f:58:e7:d5:60 Aruba-2540-24…PP ..b........ 10 10 |
| 93 | eno2 10:4f:58:e7:d5:60 Aruba-2540-24…PP ..b........ 1 1 |