blob: 2c908a432a22a45c11f336b3e9652d4d03b9597e [file] [log] [blame]
.. SPDX-FileCopyrightText: 2021 Open Networking Foundation <info@opennetworking.org>
.. SPDX-License-Identifier: Apache-2.0
.. _onos_network_config:
Network Configuration
=====================
SD-Fabric uses several different types of network configurations.
We only focus on ``devices`` and ``ports`` configuration in this section, which
are used to provide basic L2/L3 connectivity.
See :ref:`advanced-features` for advanced feature configurations.
Device Configuration
--------------------
Each switch in SD-Fabric requires a device config.
.. code-block:: json
{
"devices" : {
"device:leaf1" : {
"segmentrouting" : {
"ipv4NodeSid" : 101,
"ipv4Loopback" : "192.168.0.201",
"ipv6NodeSid" : 111,
"ipv6Loopback" : "2000::c0a8:0201",
"routerMac" : "00:00:00:00:02:01",
"isEdgeRouter" : true,
"adjacencySids" : []
},
"basic" : {
"name": "Leaf1",
"managementAddress": "grpc://10.128.100.51:9339?device_id=1",
"driver": "stratum-tofino",
"pipeconf": "org.stratumproject.fabric-spgw-int.montara_sde_9_5_0"
}
}
}
}
- ``device:leaf1``: arbitrary globally unique identifier of the device, must always be prefixed with ``device:``.
- ``ipv4NodeSid``: IPv4 node segment ID, which is used as an MPLS label in
forwarding IPv4 traffic. Can be arbitrary and should be globally unique.
- ``ipv4Loopback``: IPv4 loopback address. Can be arbitrary, should be globally
unique and should not be part of the same subnet(s) defined on the data plane
ports (see port config).
- ``ipv6NodeSid``: IPv6 node segment ID, which is used as an MPLS label in
forwarding IPv6 traffic. Can be arbitrary and should be globally unique. Only
required when using IPv6.
- ``ipv6Loopback``: IPv6 loopback address. Can be arbitrary, should be globally
unique and should not be part of the same subnet(s) defined on the data plane
ports (see port config). Only required when using IPv6.
- ``routerMac``: Router MAC address. Can be arbitrary and should be globally
unique. This MAC address will be used to reply the ARP request for the
loopback IP or the Interface IP that will be introduced later. (We recommend
using the MAC address of the device's management interface as the router
MAC.)
- ``isEdgeRouter``: True for leaf switches. False for spine switches.
- ``adjacencySids``: Deprecated. Always use an empty array.
- ``name``: Human friendly name used in the ONOS UI.
- ``managementAddress``: gRPC endpoint of the Stratum device and the P4Runtime
internal device ID associated to the ASIC (usually ``1``),
in the format of ``grpc://[device_addr]?device_id=[P4Runtime device ID]``
The IP address can be replaced by a domain name.
- ``driver``: ``stratum-bmv2`` or ``stratum-tofino``, depending on which switch this is.
- ``pipeconf``: the P4 program to deploy on this switch. A list of available
pipeconfs can be dumped by running ``pipeconfs`` in the ONOS CLI. When running
with Tofino-based devices, we provide pre-installed pipeconfs with ID
``org.stratumproject.<profile>.<device-type>_<bf-sde-version>``:
- The available *profiles* are:
- ``fabric``: for basic L2/L3 capabilities
- ``fabric-spgw``: with 4G/5G mobile user plane support
- ``fabric-int``: with INT support
- ``fabric-spgw-int``: with SPGW and INT support
- The supported *device-types*: Use the command ``weutil`` in the BMC mode to get the product name.
- ``montara``: for dual-pipe Tofino ASIC SKUs (Product name is 100BF-32X)
- ``mavericks``: for quad-pipe Tofino ASIC SKUs (Product name is 100BF-32QS)
- The Intel/Barefoot SDE version used in Stratum, e.g., ``sde_9_5_0``
.. caution::
You should avoid using reserved MPLS labels for ``ipv4NodeSid`` and
``ipv6NodeSid``. Please check here for the reserved values:
http://www.iana.org/assignments/mpls-label-values/mpls-label-values.xhtml
.. note::
Most of the SD-Fabric configurations support dynamic updates. Unfortunately,
we currently **do not support dynamic device configuration updates**. You
will have to restart (reboot) the switch if the corresponding device
configuration changes.
Having said that, when introducing a completely new device in the network,
the device configurations pushed before ONOS connects to the switch for the
first time should be applied correctly.
Port IDs for Tofino-based devices
---------------------------------
Before describing the ONOS netcfg, it is worth nothing how we refer to ports for
Tofino-based devices. Netcfg uses the format ``device:<name>/<port-number>``.
``<port-number>`` is a special value that is usually different than the number
shown in the switch front panel. It is the same number used for P4 table
programming and depends on the specific Tofino ASIC SKU (e.g., dual-pipe vs.
quad-pipe) and switch vendor/platform. In Stratum this is often referred to as
the *SDK port ID*, as this is the number used for all Tofino SDK calls. In Intel
documentation this is referred to as the ``DP_ID``. We plan to remove this
dependency on such a low level detail in future releases, but for now, to find
out the mapping between front-panel ports and ``DP_ID`` you have the following
options:
- Ask your switch vendor
- Use the command `pm.show` on the BF shell of a running Stratum instance
(see :ref:`troubleshooting_guide`)
Bridging and Unicast Routing
----------------------------
In the following we illustrate how to enable basic bridging and routing on a
per-port basis.
.. attention::
- VLAN **4094** is reserved for unconfigured ports (e.g. spine facing ports)
- VLAN **4090** is reserved for pseudowire transport flow rules on the spines
Access Ports
^^^^^^^^^^^^
The necessary but minimum configuration for an access port is simply a VLAN.
.. code-block:: json
{
"ports" : {
"device:leaf1/12" : {
"interfaces" : [{
"name" : "serverA-intf",
"vlan-untagged": 10
}]
},
"device:leaf1/16" : {
"interfaces" : [{
"name" : "serverB-intf",
"vlan-untagged": 10
}]
}
}
}
The example above shows two ports (12 and 16) on switch ``leaf1`` that have
been assigned to VLAN 10 using the ``vlan-untagged`` keyword.
It simply means that packets come in and leave out of these switches untagged,
but internally they are assigned VLAN 10 and they belong to the bridging domain
defined for VLAN 10.
``name`` is used to associate the interface with a globally unique, user
friendly name. It can be omitted.
With the configuration shown above, the packets will always be bridged, but
they cannot be routed out of the VLAN (e.g. to other subnets). To add the
capability to route out of VLAN 10, we need to add a subnet/gateway IP (similar
to `interface-vlans or SVIs in traditional networks
<https://www.youtube.com/watch?v=bUXpmiJpGb0>`_).
.. code-block:: json
{
"ports" : {
"device:leaf1/12" : {
"interfaces" : [{
"name" : "serverA-intf",
"ips" : [ "10.0.1.254/24"],
"vlan-untagged": 10
}]
},
"device:leaf1/16" : {
"interfaces" : [{
"name" : "serverB-intf",
"ips" : [ "10.0.1.254/24"],
"vlan-untagged": 10
}]
}
}
}
In this example, VLAN 10 is associated with subnet ``10.0.1.0/24``, and the
gateway IP for hosts in this subnet is ``10.0.1.254/32``.
When the desire is to route out of a VLAN, this assignment is currently
necessary on all ports configured in the same VLAN.
.. note::
Typically we only expect a single subnet for a VLAN. Similar to traditional
networks, for us, a subnet == VLAN. Different VLANs should be configured in
different subnets.
In certain use-cases, it may be necessary to configure multiple subnets in
the same VLAN. This is possible by adding more subnet/gateway IPs in the
``ips`` array.
.. attention::
The same subnet cannot be configured on multiple leaf switches.
We usually configure one subnet for all the ports on the same leaf switch.
Tagged Ports
^^^^^^^^^^^^
Tagged port configuration is similar.
.. code-block:: json
{
"ports" : {
"device:leaf1/24" : {
"interfaces" : [{
"name" : "serverA-intf",
"ips" : [ "10.0.2.254/24", "10.0.4.254/24" ],
"vlan-tagged" : [ 20, 40 ]
}]
}
}
}
The configuration above for port 24 on switch ``leaf1`` shows two VLANs 20 and
40 configured on that port, with corresponding subnets and gateway IPs.
Note that there is no specific ordering required in the ``ips`` or
``vlan-tagged`` arrays to correlate the VLANs to their corresponding subnets.
In a future release, we will correlate VLAN and subnets configuration in a more
readable way.
Native VLAN on Tagged Ports
^^^^^^^^^^^^^^^^^^^^^^^^^^^
An additional configuration ``vlan-native`` possible on tagged ports includes
the ability to specify a VLAN (and thus a bridging domain) for incoming
untagged packets.
Typically, such configuration in trunk ports in traditional networks is
referred to a native VLAN.
.. code-block:: json
{
"ports" : {
"device:leaf1/24" : {
"interfaces" : [ {
"name" : "serverA-intf",
"ips" : [ "10.0.2.254/24", "10.0.4.254/24", "10.0.1.254/24" ],
"vlan-tagged" : [ 20, 40 ],
"vlan-native" : 10
}]
}
}
}
Note that it is also necessary to configure the subnet/gateway IP corresponding
to the native VLAN if you wish to route out of that VLAN.
Configuring interface for IPv6 [#f1]_
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
It is similar to configure IPv6 routing. Simply replace the addresses in
``ips`` with IPv6 addresses. For example:
.. code-block:: json
{
"ports" : {
"device:leaf1/24" : {
"interfaces" : [ {
"name" : "serverA-intf",
"ips" : [ "10.0.2.254/24", "2000::1ff/120" ],
"vlan-tagged" : [ 20, 40 ]
}]
}
}
}
.. note::
There is a known issue that breaks dynamic VLAN configuration.
Until the issue get resolved, you need to restart the switch agent to reinstall the flows.
IPv6 Router Advertisement [#f1]_
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Router Advertisement overview
"""""""""""""""""""""""""""""
Router advertisement application is for enabling **Router Advertisement** and
**Router Solicitation** functionalities supported by IPv6 routers.
More details are available in `RFC 4861 <https://tools.ietf.org/html/rfc4861>`_.
Application identifies which IPv6 interfaces are currently configured in the
system and it will try to send out **unsolicited Router Advertisement** (RA)
messages from these interfaces.
Each such RA message will have two mandatory options named **Source link-layer
address** and **MTU**.
Additional RA option **prefix** can be enabled using component configuration
**raGlobalPrefixConfStatus**.
Application also processes **Router Solicitations** (RS) sent from hosts. Upon
receiving RS on a particular interface application stops RA transmission in
that interface and immediately sends RA targeted to the solicited host. After
that application continues unsolicited RA transmission on that interface.
Activate and configure RA
"""""""""""""""""""""""""
RA application can be activated from CLI by running
.. code-block:: console
onos> app activate routeradvertisement
Behavior of RA application is controlled by ONOS component configuration
subsystem and following are possible configuration options.
- ``raThreadDelay``: Delay between consecutive RA transmissions
- ``raPoolSize``: Capacity of thread pool to be used for RA transmissions
- ``raFlagMbitStatus``: RA flag “Managed address configuration”
enabled/disabled
- ``raFlagObitStatus``: RA flag “Other configuration” enabled/disabled
- ``raOptionPrefixStatus``: RA Option “prefix” is enabled/disabled. Router
prefixes will be available in RA only if this flag is “true”
- ``raGlobalPrefixConfStatus``: Enable switch level global prefix
configuration.
Once ``raGlobalPrefixConfStatus`` is enabled, RA prefix option is generated
from port configuration of device, see for more details.
To set the options, following the command (example for ``raOptionPrefixStatus``)
.. code-block:: console
onos> cfg set org.onosproject.ra.RouterAdvertisementManager raOptionPrefixStatus true
Prefix details are picked up from network interface configuration.
RA app will filter out link-local IPs while preparing prefixes.
For example, in following configuration, Prefix will include only
**2001:0558:FF10:04C9::2:1ff/120**.
.. code-block:: json
{
"ports": {
"device:leaf2/16": {
"interfaces": [{
"ips": [ "192.168.114.1/24", "2001:0558:FF10:04C9::2:1ff/120", "FE80::4EA8:2AFF:FE24:8E5F/120" ],
"vlan-untagged": "11",
"name": "18-15"
}]
}
}
}
Global prefix configuration
"""""""""""""""""""""""""""
In some cases, users may want to have a set of global prefix **advertised on
all edge interfaces**.
Such prefixes can be configured in **devices** section of network configuration
in the following way.
.. code-block:: json
{
"devices": {
"device:leaf2": {
"routeradvertisement" : {
"prefixes": [ "2001:0558:FF10:04C9::3:1ff/120"]
}
}
}
}
.. note::
When global prefix is configured, RA app will ignore any prefixes
configured on switch interfaces.
Notes about interface config
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
There is no need to configure ports on switches that are meant to connect to
other switches.
The VLAN (untagged or tagged) configuration is only meant for ports that are
connected to hosts (edge ports).
.. image:: ../images/config-vlan.png
Furthermore, note that the same VLAN can be configured on multiple ToRs - e.g.
VLAN 20 in the figure above.
However this does not mean that the ports are in the same bridging domain,
because in the fabric, the communication between ToRs is through a routed
network.
In other words, a host on VLAN 20 (untagged or tagged) connected to one ToR can
communicate with another host on VLAN 20 (untagged or tagged) connected to a
different ToR, but the MAC addresses will change as the traffic goes through a
routed network.
Please do not use this feature to connect switches in unsupported topologies as
shown in the example below.
The fabric is not designed to be one big Ethernet fabric. The bridging domain
is restricted to one ToR.
If the bridging domain is extended across two ToRs directly linked to each
other, there is a chance of loops.
In other words, the ToRs/Leafs are not standalone 802.1Q bridges, and should
not be used as such.
.. image:: ../images/config-vlan-invalid.png
.. rubric:: Footnotes
.. [#f1] IPv6 support on the data plane (P4 program) is still work-in-progress.