advanced/qos.rst - sdfabric-docs - Gitiles

 QoS and Slicing
 ===============

 .. _qos_configuration:

 Configuration
 -------------
 .. note:: QoS and slicing configuration is currently statically configured at switch startup.
    Dynamic configuration will be supported in a next SD-Fabric release.

 QoS and Slicing is configured via the ``vendor_config`` portion of the Stratum Chassis Config (see :ref:`stratum_chassis_config`),
 where the queues and schedulers can be configured.
 We provide a convenient `script <https://github.com/stratum/fabric-tna/blob/main/util/gen-stratum-qos-config.py>`_
 to generate the configuration starting from a higher-level description provided via a YAML file.
 This file allows to configure the parameters for the traffic classes listed in the previous sections.

 Here's a list of parameters that you can configure via the YAML QoS configuration file:

 * ``max_cells``: Maximum number of buffer cells, depends on the ASIC SKU/revision.

 * ``pool_allocations``: Percentage of buffer cells allocated to each traffic class.
   The sum should be 100. Usually, we leave a portion of the buffer ``unassigned``
   for queues that do not have a pool (yet).
   Example of such queues are those for the recirculation port, CPU port, etc.

   .. code-block:: yaml

     pool_allocations:
       control: 1
       realtime: 9
       elastic: 80
       besteffort: 9
       unassigned: 1

 * **Control** Traffic Class: The available bandwidth dedicated to Control traffic is divided in *slots*.
   Each slot has a maximum rate and burst (in packets of the given MTU).
   A slice can use one or more slots by appropriately configuring meters in the fabric ingress pipeline.

   * ``control_slot_count``: Number of slots.
   * ``control_slot_rate_pps``: Packet per second rate of each slot.
   * ``control_slot_burst_pkts``: Number of packets per burst of each slot.
   * ``control_mtu_bytes``: MTU of packets for the PPS and burst values.

   .. code-block:: yaml

     control_slot_count: 50
     control_slot_rate_pps: 100
     control_slot_burst_pkts: 10
     control_mtu_bytes: 1500

 * **Real-Time** Traffic Class Configuration:

   * ``realtime_max_rates_bps``: List of maximum shaping rates for Real-Time queues,
     one per slice requesting such service.

   * ``realtime_max_burst_s``: Maximum amount of time that a Real-Time queue can
     burst at the port speed. This parameter is used to limit delay for Elastic
     queues.

   .. code-block:: yaml

     realtime_max_rates_bps:
       - 45000000 # 45 Mbps
       - 30000000 # 30 Mbps
       - 25000000 # 25 Mbps
     realtime_max_burst_s: 0.005 # 5 ms

 * **Elastic** Traffic Class Configuration:

   * ``elastic_min_rates_bps``: List of minimum guaranteed rates for Elastic queues,
     one per slice requesting such service.

   .. code-block:: yaml

     elastic_min_rates_bps:
       - 100000000 # 100 Mbps
       - 200000000 # 200 Mbps

 * ``port_templates`` section: List of switch port for which we want to configure
   queues.

   Every ``port_templates`` element contains:

     * ``descr``: Description of the port purpose.

     * ``rate_bps``: Port speed in bit per second.

     * ``is_shaping_enabled``: ``true`` if the rate is enforced using shaping,
       ``false`` if the rate is the channel speed.

     * ``shaping_burst_bytes``: Burst size in bytes, meaningful only if port speed
       is shaped (when ``is_shaping_enabled: true``).

     * ``queue_count``: Number of queues assigned to the port.

     * ``port_ids``: List of Stratum port IDs (:ref:`singleton_port` from Stratum Chassis Config),
       using this port template. Used for port that corresponds to switch front-panel ports.

       Mutually exclusive with ``sdk_port_ids`` field.

     * ``sdk_port_ids``: List of SDK port numbers (i.e., Tofino DP_ID) using this port template.
       Used for internal ports (e.g., recirculation ports).

       Mutually exclusive with ``port_ids`` field.

   .. code-block:: yaml

     port_templates:
       - descr: "Base station"
         rate_bps: 1000000000 # 1 Gbps
         is_shaping_enabled: true
         shaping_burst_bytes: 18000 # 2x jumbo frames
         queue_count: 16
         port_ids:
           - 100
       - descr: "Servers"
         port_ids:
           - 200
         rate_bps: 40000000000 # 40 Gbps
         is_shaping_enabled: false
         queue_count: 16
       - descr: "Recirculation"
         sdk_port_ids:
           - 68
         rate_bps: 100000000000 # 100 Gbps
         is_shaping_enabled: false
         queue_count: 16

 An example of a complete QoS and Slicing configuration can be found `here <https://github.com/stratum/fabric-tna/blob/main/util/sample-qos-config.yaml>`_.
	QoS and Slicing
	===============

	.. _qos_configuration:

	Configuration
	-------------
	.. note:: QoS and slicing configuration is currently statically configured at switch startup.
	Dynamic configuration will be supported in a next SD-Fabric release.

	QoS and Slicing is configured via the ``vendor_config`` portion of the Stratum Chassis Config (see :ref:`stratum_chassis_config`),
	where the queues and schedulers can be configured.
	We provide a convenient `script <https://github.com/stratum/fabric-tna/blob/main/util/gen-stratum-qos-config.py>`_
	to generate the configuration starting from a higher-level description provided via a YAML file.
	This file allows to configure the parameters for the traffic classes listed in the previous sections.

	Here's a list of parameters that you can configure via the YAML QoS configuration file:

	* ``max_cells``: Maximum number of buffer cells, depends on the ASIC SKU/revision.

	* ``pool_allocations``: Percentage of buffer cells allocated to each traffic class.
	The sum should be 100. Usually, we leave a portion of the buffer ``unassigned``
	for queues that do not have a pool (yet).
	Example of such queues are those for the recirculation port, CPU port, etc.

	.. code-block:: yaml

	pool_allocations:
	control: 1
	realtime: 9
	elastic: 80
	besteffort: 9
	unassigned: 1

	* Control Traffic Class: The available bandwidth dedicated to Control traffic is divided in slots.
	Each slot has a maximum rate and burst (in packets of the given MTU).
	A slice can use one or more slots by appropriately configuring meters in the fabric ingress pipeline.

	* ``control_slot_count``: Number of slots.
	* ``control_slot_rate_pps``: Packet per second rate of each slot.
	* ``control_slot_burst_pkts``: Number of packets per burst of each slot.
	* ``control_mtu_bytes``: MTU of packets for the PPS and burst values.

	.. code-block:: yaml

	control_slot_count: 50
	control_slot_rate_pps: 100
	control_slot_burst_pkts: 10
	control_mtu_bytes: 1500

	* Real-Time Traffic Class Configuration:

	* ``realtime_max_rates_bps``: List of maximum shaping rates for Real-Time queues,
	one per slice requesting such service.

	* ``realtime_max_burst_s``: Maximum amount of time that a Real-Time queue can
	burst at the port speed. This parameter is used to limit delay for Elastic
	queues.

	.. code-block:: yaml

	realtime_max_rates_bps:
	- 45000000 # 45 Mbps
	- 30000000 # 30 Mbps
	- 25000000 # 25 Mbps
	realtime_max_burst_s: 0.005 # 5 ms

	* Elastic Traffic Class Configuration:

	* ``elastic_min_rates_bps``: List of minimum guaranteed rates for Elastic queues,
	one per slice requesting such service.

	.. code-block:: yaml

	elastic_min_rates_bps:
	- 100000000 # 100 Mbps
	- 200000000 # 200 Mbps

	* ``port_templates`` section: List of switch port for which we want to configure
	queues.

	Every ``port_templates`` element contains:

	* ``descr``: Description of the port purpose.

	* ``rate_bps``: Port speed in bit per second.

	* ``is_shaping_enabled``: ``true`` if the rate is enforced using shaping,
	``false`` if the rate is the channel speed.

	* ``shaping_burst_bytes``: Burst size in bytes, meaningful only if port speed
	is shaped (when ``is_shaping_enabled: true``).

	* ``queue_count``: Number of queues assigned to the port.

	* ``port_ids``: List of Stratum port IDs (:ref:`singleton_port` from Stratum Chassis Config),
	using this port template. Used for port that corresponds to switch front-panel ports.

	Mutually exclusive with ``sdk_port_ids`` field.

	* ``sdk_port_ids``: List of SDK port numbers (i.e., Tofino DP_ID) using this port template.
	Used for internal ports (e.g., recirculation ports).

	Mutually exclusive with ``port_ids`` field.

	.. code-block:: yaml

	port_templates:
	- descr: "Base station"
	rate_bps: 1000000000 # 1 Gbps
	is_shaping_enabled: true
	shaping_burst_bytes: 18000 # 2x jumbo frames
	queue_count: 16
	port_ids:
	- 100
	- descr: "Servers"
	port_ids:
	- 200
	rate_bps: 40000000000 # 40 Gbps
	is_shaping_enabled: false
	queue_count: 16
	- descr: "Recirculation"
	sdk_port_ids:
	- 68
	rate_bps: 100000000000 # 100 Gbps
	is_shaping_enabled: false
	queue_count: 16

	An example of a complete QoS and Slicing configuration can be found `here <https://github.com/stratum/fabric-tna/blob/main/util/sample-qos-config.yaml>`_.