..
   SPDX-FileCopyrightText: © 2020 Open Networking Foundation <support@opennetworking.org>
   SPDX-License-Identifier: Apache-2.0

Overview
========

There are many ways to deploy Aether, depending on the requirements of the edge
site. The Reliability, Availability, and Serviceability (RAS) of each set of
equipment will differ depending on the characteristics of each edge.

This document provides several hardware deployment options and explains the
differences between them.

Deployment Options
------------------

Development Environments
""""""""""""""""""""""""

For users looking for a development or fully software-simulated environment,
there is ``Aether-in-a-Box (AiaB)``. Instructions for running it can be
found in :doc:`Setting Up Aether-in-a-Box </developer/aiab>`. AiaB
is only suitable for testing and developing software, and can't connect to
physical hardware, but is a good choice for learning about the different
software components within Aether.

Production Environments
"""""""""""""""""""""""

Deploying Aether on hardware is required for both production deployments and
hardware testing. Before deploying Aether, a detailed plan including the
network topology, hardware, and all cabling needs to be created.

For redundancy of workloads running in Kubernetes, at least 3 compute nodes
must be available. A single compute node or a pair of compute nodes can be
used, but the software would need to be configured without High Availability
(HA) enabled.

The topologies below are *simplified physical topologies* to show the equipment
needed and the minimal connectivity between devices. Within these topologies,
multiple VLANs, routing, and other network-level configuration are required to
make a functional Aether edge.

There are also possible RAS improvements that can be made at the topology
level. For example, fabric switch connections can be made with two cables and
configured to tolerate the failure or replacement of one cable or NIC port,
which is recommended especially for inter-switch links.

Edge Connectivity
-----------------

Aether is a managed service, and Aether Edges require a constant connection
via VPN to the 4G and 5G core in Aether Central for managing subscriber
information.

The edge site must provide internet access to the Aether edge, specifically the
Management Server. The traffic required is:

* VPN connection (ESP protocol, Ports UDP/500 and UDP/4500) to Aether Central

* SSH (TCP/22), used for installation, troubleshooting, and updating the site.

* General outgoing internet access used for installation of software and other
  components from ONF and public (Ubuntu) software repositories.

The open ports can be restricted to specific internet addresses which are used
for Aether.
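
As a rough illustration, the traffic requirements above can be written down as
data and compared against a site's firewall allow-list. This is only a sketch:
``Flow``, ``REQUIRED_FLOWS``, and ``missing_flows`` are hypothetical names
invented for this example and are not part of Aether or its tooling.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass(frozen=True)
class Flow:
    """One kind of traffic the edge site must allow (hypothetical helper type)."""

    protocol: str        # "esp", "udp", or "tcp"
    port: Optional[int]  # None for protocols without port numbers (ESP)


# Traffic listed in this section: the VPN (ESP, UDP/500, UDP/4500) and SSH (TCP/22).
REQUIRED_FLOWS = frozenset(
    {
        Flow("esp", None),
        Flow("udp", 500),
        Flow("udp", 4500),
        Flow("tcp", 22),
    }
)


def missing_flows(allowed: Iterable[Flow]) -> List[Flow]:
    """Return the required flows that are absent from an allow-list."""
    missing = REQUIRED_FLOWS - set(allowed)
    # Sort for stable output; a missing port (ESP) sorts as 0.
    return sorted(missing, key=lambda f: (f.protocol, f.port or 0))


if __name__ == "__main__":
    # Example allow-list that only permits SSH.
    for flow in missing_flows([Flow("tcp", 22)]):
        print(flow.protocol, flow.port)
```

The check only models the VPN and SSH entries; general outgoing internet
access is not something a fixed port list can capture.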

The Management Server needs to have an IP address assigned to it, which can be
either:

* A public static IP address

* An address behind NAT, with the ports listed above forwarded to the
  Management Server

In either case, the Management Server's IP address should be assigned using
a DHCP reservation if possible, which eases the installation process.

BESS-based Network Topology
---------------------------

The :doc:`Software-only BESS UPF
</edge_deployment/bess_upf_deployment>` can be used for deployments that
do not have P4 switching hardware.

.. image:: images/edge_mgmt_only.svg
   :alt: BESS network topology

`BESS <https://github.com/NetSys/bess>`_ runs on an x86 compute server, and is
deployed using Kubernetes. In production it requires an SR-IOV capable network
card and specific K8s CNIs.

The Management Server and Switch must be configured with multiple VLANs and
subnets, with routing, as required for the BESS UPF.

P4-based Network Topology
-------------------------

If only a single P4 switch is used, the :doc:`Simple
<trellis:supported-topology>` topology can be used, but provides no network
redundancy:

.. image:: images/edge_single.svg
   :alt: Single Switch Topology

If another switch is added, the "Paired Leaves" (aka :doc:`Paired Switches
<trellis:supported-topology>`) topology can be used, which can tolerate the
loss of a leaf switch and still retain connections for all dual-homed devices.
Single-homed devices on the failed leaf would lose their connections (the
single-homed server is shown for reference, and is not required). If HA is
needed for single-homed devices, one option would be to deploy multiple of
those devices in a way that provides that redundancy - for example, multiple
eNBs where some are connected to each leaf and have overlapping radio coverage:

.. image:: images/edge_paired_leaves.svg
   :alt: Paired Leaves Topology

For larger deployments, a 2x2 fabric can be configured (aka :doc:`Single-Stage
Leaf-Spine <trellis:supported-topology>`), which provides Spine redundancy, but
does not support dual-homing of devices.

.. image:: images/edge_2x2.svg
   :alt: 2x2 Fabric Topology

Other topologies as described in the :doc:`Trellis Documentation
<trellis:supported-topology>` can possibly be used, but are not actively being
tested at this time.

Additionally, the P4-based topologies can support running both the BESS UPF and
P4 UPF on the same hardware at the same time if desired (for testing, or
simultaneous 4G/5G support).

Hardware Descriptions
---------------------

Fabric Switch
"""""""""""""

To use the P4 UPF, you must use fabric switches based on the `Intel (previously
Barefoot) Tofino chipset
<https://www.intel.com/content/www/us/en/products/network-io/programmable-ethernet-switch/tofino-series.html>`_.
There are two variants of this switching chipset, with different resources and
capabilities.

Aether currently supports these P4 switch models:

* `EdgeCore Wedge100BF-32X
  <https://www.edge-core.com/productsInfo.php?cls=1&cls2=180&cls3=181&id=335>`_,
  a Dual Pipe Tofino ASIC

* `EdgeCore Wedge100BF-32QS
  <https://www.edge-core.com/productsInfo.php?cls=1&cls2=180&cls3=181&id=770>`_,
  a Quad Pipe Tofino ASIC, which has more chip resources and a faster embedded
  system with more memory and storage.

The P4 UPF and SD-Fabric features run within the constraints of the Dual Pipe
system for production deployments, but for development of features in P4, the
larger capacity of the Quad Pipe is desirable.

These switches feature 32 QSFP+ ports capable of running in 100GbE, 40GbE, or
4x 10GbE mode (using a split DAC or fiber cable) and have a 1GbE management
network interface.

See also :ref:`Rackmount of Equipment
<edge_deployment/site_planning:rackmount of equipment>` for how the Fabric
switches should be rackmounted to ensure proper airflow within a rack.

Compute Server
""""""""""""""

These servers run Kubernetes, Aether connectivity apps, and edge applications.

Minimum hardware specifications:

* AMD64 (aka x86-64) architecture

* 8 CPU Cores (minimum), 16+ recommended

* 32GB of RAM (minimum), 128GB+ recommended

* 250GB of storage (SSD preferred), 1TB+ recommended

* 2x 40GbE or 100GbE Ethernet network card to P4 switches, with DPDK support

* 1x 1GbE management network port, with PXE boot support. 2x required for BESS
  UPF.
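
The CPU, RAM, and storage figures above lend themselves to a quick sanity
check when evaluating candidate servers. The following is a minimal sketch
under stated assumptions: the function and dictionary key names are
hypothetical, 1TB is treated as 1000GB, and the NIC and architecture
requirements are not checked.

```python
# Figures from the list above; 1TB is taken as 1000GB for comparison.
MINIMUM = {"cpu_cores": 8, "ram_gb": 32, "storage_gb": 250}
RECOMMENDED = {"cpu_cores": 16, "ram_gb": 128, "storage_gb": 1000}


def classify_compute_server(spec: dict) -> str:
    """Classify a server as 'below minimum', 'minimum', or 'recommended'."""
    # Any single figure under the minimum disqualifies the server.
    if any(spec[key] < floor for key, floor in MINIMUM.items()):
        return "below minimum"
    # Every figure must reach the recommended level to earn that label.
    if all(spec[key] >= target for key, target in RECOMMENDED.items()):
        return "recommended"
    return "minimum"


if __name__ == "__main__":
    candidate = {"cpu_cores": 12, "ram_gb": 64, "storage_gb": 500}
    print(classify_compute_server(candidate))  # prints "minimum"
```

A server is reported as "recommended" only when every figure reaches the
recommended level, so a machine with plenty of RAM but 8 cores is still
classed as "minimum".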

Optional but highly recommended:

* Lights out management support, with either a shared or separate NIC and
  support for HTML5 console access.

Management Server
"""""""""""""""""

One management server is required, which must have at least two 1GbE network
ports, and runs a variety of network services to bootstrap and support the
edge.

In current Aether deployments, the Management Server also functions as a router
and VPN gateway back to Aether Central.

Minimum hardware specifications:

* AMD64 (aka x86-64) architecture

* 4 CPU cores, or more

* 8GB of RAM, or more

* 120GB of storage (SSD preferred), or more

* 2x 1GbE Network interfaces (one for WAN, one to the management switch) with
  PXE boot support.

Optional:

* 10GbE or 40GbE network card with DPDK support to connect to fabric switch

* Lights out management support, with either a shared or separate NIC and
  support for HTML5 console access.

Management Switch
"""""""""""""""""

A managed L2/L3 management switch is required to provide connectivity within
the cluster for bootstrapping equipment. It is configured with multiple VLANs
to separate the management plane, fabric, and the out-of-band and lights out
management connections on the equipment.

Minimum requirements:

* 8x 1GbE Copper Ethernet ports (adjust to provide a sufficient number for
  every copper 1GbE port in the system)

* 2x 10GbE SFP+ or 40GbE QSFP interfaces (only required if management server
  does not have a network card with these ports)

* Managed via SSH or web interface

* LLDP protocol support, for debugging cabling issues

* Capable of supporting VLANs on each port, with both tagged and untagged
  traffic sharing a port.

Optional:

* PoE+ support, which can power eNB and monitoring hardware, if using
  Management switch to host these devices.

eNB Radio
"""""""""

The LTE eNB used in most deployments is the `Sercomm P27-SCE4255W Indoor CBRS
Small Cell
<https://www.sercomm.com/contpage.aspx?langid=1&type=prod3&L1id=2&L2id=1&L3id=107&Prodid=751>`_.

While this unit ships with a separate power brick, it also supports PoE+ power
on the WAN port, which provides deployment location flexibility. Either a PoE+
capable switch or PoE+ power injector should be purchased.

If connecting directly to the fabric switch through a QSFP to 4x SFP+ split
cable, a 10GbE SFP+ to 1GbE Copper media converter should be purchased. The `FS
UMC-1S1T <https://www.fs.com/products/101476.html>`_ has been used for this
purpose successfully.

Alternatively, the Fabric's 10GbE SFP+ could be connected to another switch
(possibly the Management Switch), which would adapt the speed difference and
provide PoE+ power and power control for remote manageability.

Testing Hardware
----------------

The following hardware is used to test the network and determine uptime of
edges. It is currently required in order to validate that an edge site is
functioning properly.

Monitoring Raspberry Pi and CBRS dongle
"""""""""""""""""""""""""""""""""""""""

One Raspberry Pi paired with an LTE dongle that supports the CBRS band is
required to monitor the connectivity service at the edge.

The Raspberry Pi model used in Aether is a `Raspberry Pi 4 Model B/2GB
<https://www.pishop.us/product/raspberry-pi-4-model-b-2gb/>`_, which is
configured with:

* Raspberry Pi case (HiPi is recommended for PoE Hat)

* A power source, one of:

  * PoE Hat used with a PoE switch (recommended, allows remote power control)

  * USB-C Power Supply

* MicroSD Card with Raspbian - 16GB

One LTE dongle model supported in Aether is the `Sercomm Adventure Wingle
<https://www.sercomm.com/contpage.aspx?langid=1&type=prod3&L1id=2&L2id=2&L3id=110&Prodid=767>`_.

Example BoMs
------------

To help provision a site, a few example Bills of Materials (BoMs) are given
below, which reference the hardware descriptions given above.

Some quantities are dependent on other quantities - for example, the number of
DAC cables frequently depends on the number of servers in use.
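
The dependent quantities in the P4 UPF BoMs in this section can be sketched as
a small function. This is an illustration only: the function name and topology
keys are hypothetical, ``servers`` is taken to mean the number of servers
cabled to the fabric, and sizing split cables at one per four eNBs is an
assumption (the tables only say "1 (or more)").

```python
import math

# (P4 fabric switches, 100GbE inter-switch DAC cables) per topology,
# taken from the P4 UPF BoM tables in this section.
FABRIC = {
    "single": (1, 0),
    "paired_leaves": (2, 2),
    "2x2": (4, 8),
}


def p4_bom(topology: str, servers: int, enbs: int) -> dict:
    """Return the count-dependent BoM lines for a P4 UPF deployment."""
    switches, inter_switch_cables = FABRIC[topology]
    return {
        "P4 Fabric Switch": switches,
        "100GbE QSFP DAC cable": inter_switch_cables,
        # The tables list 2x #Server 40GbE DAC cables.
        "40GbE QSFP DAC cable": 2 * servers,
        # Assumption: each split cable serves up to four eNBs.
        "QSFP to 4x SFP+ DAC": math.ceil(enbs / 4),
        # The tables list one media converter and one injector per eNB.
        "10GbE to 1GbE Media converter": enbs,
        "PoE+ Injector": enbs,
    }


if __name__ == "__main__":
    for item, quantity in p4_bom("paired_leaves", servers=3, enbs=2).items():
        print(f"{quantity:>3}  {item}")
```

The media converter and PoE+ injector lines can be dropped when a PoE+ capable
switch sits between the fabric and the eNBs, as the tables note.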

These BoMs do not include UE devices. It's recommended that the testing
hardware given above be added to every BoM for monitoring purposes.

BESS UPF Testing BOM
""""""""""""""""""""

The following is the minimum BoM required to run Aether with the BESS UPF.

============ ===================== ===============================================
Quantity     Type                  Purpose
============ ===================== ===============================================
1            Management Switch     Must be Layer 2/3 capable for BESS VLANs
1            Management Server
1-3          Compute Servers       Recommended at least 3 for Kubernetes HA
1 (or more)  eNB
1x #eNB      PoE+ Injector         Required unless using a PoE+ Switch
Sufficient   Cat6 Network Cabling  Between all equipment
============ ===================== ===============================================

P4 UPF Testing BOM
""""""""""""""""""

============ ===================== ===============================================
Quantity     Type                  Description/Use
============ ===================== ===============================================
1            P4 Fabric Switch
1            Management Switch     Must be Layer 2/3 capable
1            Management Server     At least 1x 40GbE QSFP ports recommended
1-3          Compute Servers       Recommended at least 3 for Kubernetes HA
2x #Server   40GbE QSFP DAC cable  Between Compute, Management, and Fabric Switch
1            QSFP to 4x SFP+ DAC   Split cable between Fabric and eNB
1 (or more)  eNB
1x #eNB      10GbE to 1GbE Media   Required unless using switch to convert from
             converter             fabric to eNB
1x #eNB      PoE+ Injector         Required unless using a PoE+ Switch
Sufficient   Cat6 Network Cabling  Between all equipment
============ ===================== ===============================================

P4 UPF Paired Leaves BOM
""""""""""""""""""""""""

============ ===================== ===============================================
Quantity     Type                  Description/Use
============ ===================== ===============================================
2            P4 Fabric Switch
1            Management Switch     Must be Layer 2/3 capable
1            Management Server     2x 40GbE QSFP ports recommended
3            Compute Servers
2            100GbE QSFP DAC cable Between Fabric switches
2x #Server   40GbE QSFP DAC cable  Between Compute, Management, and Fabric Switch
1 (or more)  QSFP to 4x SFP+ DAC   Split cable between Fabric and eNB
1 (or more)  eNB
1x #eNB      10GbE to 1GbE Media   Required unless using switch to convert from
             converter             fabric to eNB
1x #eNB      PoE+ Injector         Required unless using a PoE+ Switch
Sufficient   Cat6 Network Cabling  Between all equipment
============ ===================== ===============================================

P4 UPF 2x2 Leaf Spine Fabric BOM
""""""""""""""""""""""""""""""""

============ ===================== ===============================================
Quantity     Type                  Description/Use
============ ===================== ===============================================
4            P4 Fabric Switch
1            Management Switch     Must be Layer 2/3 capable
1            Management Server     2x 40GbE QSFP ports recommended
3            Compute Servers
8            100GbE QSFP DAC cable Between Fabric switches
2x #Server   40GbE QSFP DAC cable  Between Compute, Management, and Fabric Switch
1 (or more)  QSFP to 4x SFP+ DAC   Split cable between Fabric and eNB
1 (or more)  eNB
1x #eNB      10GbE to 1GbE Media   Required unless using switch to convert from
             converter             fabric to eNB
1x #eNB      PoE+ Injector         Required unless using a PoE+ Switch
Sufficient   Cat6 Network Cabling  Between all equipment
============ ===================== ===============================================