blob: f44ae51fc310ac9f34f3a1e82cb1a8eb7d590ffd [file] [log] [blame]
Hung-Wei Chiu77c969e2020-10-23 18:13:07 +00001..
2 SPDX-FileCopyrightText: © 2020 Open Networking Foundation <support@opennetworking.org>
3 SPDX-License-Identifier: Apache-2.0
4
Hung-Wei Chiu77c969e2020-10-23 18:13:07 +00005Overview
6========
Zack Williams9026f532020-11-30 11:34:32 -07007
Zack Williams1ae109e2021-07-27 11:17:04 -07008There are many ways to deploy Aether, depending on the requirements of the edge
9site. The Reliability, Availability, and Serviceability (RAS) of each set of
10equipment will differ depending on the characteristics of each edge.
Zack Williams9026f532020-11-30 11:34:32 -070011
Zack Williams1ae109e2021-07-27 11:17:04 -070012This document provides several hardware deployment options and explains the
13differences between them.
Zack Williams9026f532020-11-30 11:34:32 -070014
Zack Williams1ae109e2021-07-27 11:17:04 -070015Deployment Options
16------------------
17
18Development Environments
19""""""""""""""""""""""""
20
21For users looking for a development or fully software-simulated environment,
22there is ``Aether-in-a-Box (AiaB)`` - instructions for running this can be
Andy Bavierf73c3d22021-08-30 10:29:06 -070023found in the :doc:`Setting Up Aether-in-a-Box </developer/aiab>`. AiaB
Zack Williams1ae109e2021-07-27 11:17:04 -070024is only suitable for testing and developing software, and can't connect to
25physical hardware, but is a good choice for learning about the different
Zack Williams589ac232021-08-19 08:52:18 -070026software components within Aether.
Zack Williams1ae109e2021-07-27 11:17:04 -070027
28Production Environments
29"""""""""""""""""""""""
30
31Deploying Aether on hardware is required for both production deployments and
Zack Williams589ac232021-08-19 08:52:18 -070032hardware testing. Before deploying Aether, a detailed plan including the
33network topology, hardware, and all cabling needs to be created.
Zack Williams1ae109e2021-07-27 11:17:04 -070034
35For redundancy of workloads running in Kubernetes, at least 3 compute nodes
36must be available. A single or pair of compute nodes can be used, but software
37would need to be configured without High Availability (HA) enabled.
Zack Williamse8c3b2c2021-02-01 12:47:28 -070038
Zack Williams589ac232021-08-19 08:52:18 -070039The topologies below are *simplified physical topologies* to show the equipment
40needed and the minimal connectivity between devices. Within these topologies,
41multiple VLANs, routing, and other network-level configuration is required to
42make a functional Aether edge.
43
44There are also possible RAS improvements that can be done at a topology level -
Zack Williams2dfb6242021-08-24 09:19:00 -070045for example, fabric switch connections can be made with two cables, and
Zack Williams589ac232021-08-19 08:52:18 -070046configured to tolerate the failure or replacement of one cable or NIC port,
47which is recommended especially for inter-switch links.
48
49Edge Connectivity
50-----------------
51
52Aether's is a managed service, and Aether Edges require a constant connection
53via VPN to the 4G and 5G core in Aether Central for managing subscriber
54information.
55
Zack Williams2dfb6242021-08-24 09:19:00 -070056The edge site must provide internet access to the Aether edge, specifically the
57Management Server. The traffic required is:
Zack Williams589ac232021-08-19 08:52:18 -070058
Zack Williams2dfb6242021-08-24 09:19:00 -070059* VPN connection (ESP protocol, Ports UDP/500 and UDP/4500) to Aether Central
60
61* SSH (TCP/22). used for installation, troubleshooting, and updating the site.
62
63* General outgoing internet access used for installation of software and other
64 components from ONF and public (Ubuntu) software repositories.
65
66The open ports can be restricted to specific internet addresses which are used
67for Aether.
68
69The Management Server needs to have an IP address assigned to it, which can be either:
70
71* A public static IP address
72
73* Behind NAT with port forwarding with the ports listed above forwarded to the
74 Management Server
75
76In either case, the Management Server's IP address should be assigned using
77a reserved DHCP if possible, which eases the installation process.
Zack Williams589ac232021-08-19 08:52:18 -070078
79BESS-based Network Topology
80---------------------------
81
Zack Williams5d2d6782021-08-26 13:08:44 -070082The :doc:`Software-only BESS UPF </edge_deployment/bess_upf_deployment>`, is
83supported for production use in the Aether 1.5 release. This UPF can be used
84for deployments that do not have P4 switching hardware.
Zack Williams589ac232021-08-19 08:52:18 -070085
86.. image:: images/edge_mgmt_only.svg
87 :alt: BESS network topology
88
89
Zack Williams2dfb6242021-08-24 09:19:00 -070090`BESS <https://github.com/NetSys/bess>`_ runs on an x86 compute server, and is
91deployed using Kubernetes. In production it requires an SR-IOV capable network
92card, and specific K8s CNIs to be used.
Zack Williams589ac232021-08-19 08:52:18 -070093
Zack Williams2dfb6242021-08-24 09:19:00 -070094The Management Server and Switch must be configured with multiple VLANs and
95subnets with routing required for the BESS UPF.
Zack Williams589ac232021-08-19 08:52:18 -070096
97P4-based Network Topology
98-------------------------
Zack Williams9026f532020-11-30 11:34:32 -070099
Zack Williams5d2d6782021-08-26 13:08:44 -0700100.. note::
101
102 The P4-based SD-Fabric UPF is an advanced feature and is of beta quality in
103 the Aether 1.5 release. It requires one or more P4-capable switches using
104 the Tofino chipset.
105
Zack Williams9026f532020-11-30 11:34:32 -0700106If only a single P4 switch is used, the :doc:`Simple
Zack Williams1ae109e2021-07-27 11:17:04 -0700107<trellis:supported-topology>` topology can be used, but provides no network
108redundancy:
Zack Williams9026f532020-11-30 11:34:32 -0700109
Zack Williams1ae109e2021-07-27 11:17:04 -0700110.. image:: images/edge_single.svg
Zack Williams589ac232021-08-19 08:52:18 -0700111 :alt: Single Switch Topology
Zack Williams9026f532020-11-30 11:34:32 -0700112
Zack Williams2dfb6242021-08-24 09:19:00 -0700113If another switch is added, the "Paired Leaves" (aka :doc:`Paired Switches
114<trellis:supported-topology>`) topology can be used, which can tolerate the
115loss of a leaf switch and still retain connections for all dual-homed devices.
116Single homed devices on the failed leaf would lose their connections (the
117single-homed server is shown for reference, and not required). If HA is needed
118for single-homed devices, one option would be to deploying multiple of those
119devices in a way that provides that redundancy - for example, multiple eNBs
120where some are connected to each leaf and have overlapping radio coverage:
Zack Williams9026f532020-11-30 11:34:32 -0700121
Zack Williams1ae109e2021-07-27 11:17:04 -0700122.. image:: images/edge_paired_leaves.svg
Zack Williams589ac232021-08-19 08:52:18 -0700123 :alt: Paired Leaves Topology
Zack Williamse8c3b2c2021-02-01 12:47:28 -0700124
Zack Williams1ae109e2021-07-27 11:17:04 -0700125For larger deployments, a 2x2 fabric can be configured (aka :doc:`Single-Stage
126Leaf-Spine <trellis:supported-topology>`), which provide Spine redundancy, but
127does not support dual-homing of devices.
128
129.. image:: images/edge_2x2.svg
Zack Williams589ac232021-08-19 08:52:18 -0700130 :alt: 2x2 Fabric Topology
Zack Williams1ae109e2021-07-27 11:17:04 -0700131
Zack Williams2dfb6242021-08-24 09:19:00 -0700132Other topologies as described in the :doc:`Trellis Documentation
Zack Williams589ac232021-08-19 08:52:18 -0700133<trellis:supported-topology>` can possibly be used, but are not actively being
134tested at this time.
Zack Williams1ae109e2021-07-27 11:17:04 -0700135
Zack Williams589ac232021-08-19 08:52:18 -0700136Additionally, the P4-based topologies can support running both the BESS UPF and
137P4 UPF on the same hardware at the same time if desired (for testing, or
138simultaneous 4G/5G support).
Zack Williamse8c3b2c2021-02-01 12:47:28 -0700139
Zack Williams589ac232021-08-19 08:52:18 -0700140Hardware Descriptions
141---------------------
Zack Williamse8c3b2c2021-02-01 12:47:28 -0700142
Zack Williams589ac232021-08-19 08:52:18 -0700143Fabric Switch
144"""""""""""""
Charles Chan770bb612022-03-02 12:34:04 -0800145See :ref:`Switch Hardware Selection <sdfabric:deployment:switch hardware selection>`
Zack Williams589ac232021-08-19 08:52:18 -0700146
147Compute Server
148""""""""""""""
149
Zack Williams1ae109e2021-07-27 11:17:04 -0700150These servers run Kubernetes, Aether connectivity apps, and edge applications.
Zack Williams9026f532020-11-30 11:34:32 -0700151
Zack Williams1ae109e2021-07-27 11:17:04 -0700152Minimum hardware specifications:
Zack Williams9026f532020-11-30 11:34:32 -0700153
154* AMD64 (aka x86-64) architecture
Zack Williams9026f532020-11-30 11:34:32 -0700155
Zack Williams2dfb6242021-08-24 09:19:00 -0700156* 8 CPU Cores (minimum), 16+ recommended
Zack Williams9026f532020-11-30 11:34:32 -0700157
Zack Williams589ac232021-08-19 08:52:18 -0700158* 32GB of RAM (minimum), 128GB+ recommended
159
160* 250 GB of storage (SSD preferred), 1TB+ recommended
161
162* 2x 40GbE or 100GbE Ethernet network card to P4 switches, with DPDK support
163
Zack Williams2dfb6242021-08-24 09:19:00 -0700164* 1x 1GbE management network port, with PXE boot support. 2x required for BESS
165 UPF.
Zack Williams589ac232021-08-19 08:52:18 -0700166
167Optional but highly recommended:
168
169* Lights out management support, with either a shared or separate NIC and
170 support for HTML5 console access.
Zack Williams9026f532020-11-30 11:34:32 -0700171
172Management Server
173"""""""""""""""""
174
175One management server is required, which must have at least two 1GbE network
Zack Williams1ae109e2021-07-27 11:17:04 -0700176ports, and runs a variety of network services to bootstrap and support the
177edge.
Zack Williams9026f532020-11-30 11:34:32 -0700178
Zack Williams1ae109e2021-07-27 11:17:04 -0700179In current Aether deployments, the Management Server also functions as a router
180and VPN gateway back to Aether Central.
Zack Williams9026f532020-11-30 11:34:32 -0700181
Zack Williams1ae109e2021-07-27 11:17:04 -0700182Minimum hardware specifications:
Zack Williams9026f532020-11-30 11:34:32 -0700183
Zack Williams1ae109e2021-07-27 11:17:04 -0700184* AMD64 (aka x86-64) architecture
Zack Williams589ac232021-08-19 08:52:18 -0700185
186* 4 CPU cores, or more
187
188* 8GB of RAM, or more
189
190* 120GB of storage (SSD preferred), or more
191
Zack Williams2dfb6242021-08-24 09:19:00 -0700192* 2x 1GbE Network interfaces (one for WAN, one to the management switch) with
193 PXE boot support.
Zack Williams1ae109e2021-07-27 11:17:04 -0700194
195Optional:
196
Zack Williams589ac232021-08-19 08:52:18 -0700197* 10GbE or 40GbE network card with DPDK support to connect to fabric switch
198
199* Lights out management support, with either a shared or separate NIC and
200 support for HTML5 console access.
Zack Williams9026f532020-11-30 11:34:32 -0700201
202Management Switch
203"""""""""""""""""
204
Zack Williams1ae109e2021-07-27 11:17:04 -0700205A managed L2/L3 management switch is required to provide connectivity within
206the cluster for bootstrapping equipment. It is configured with multiple VLANs
207to separate the management plane, fabric, and the out-of-band and lights out
208management connections on the equipment.
Zack Williams9026f532020-11-30 11:34:32 -0700209
Zack Williams1ae109e2021-07-27 11:17:04 -0700210Minimum requirements:
211
Zack Williams2dfb6242021-08-24 09:19:00 -0700212* 8x 1GbE Copper Ethernet ports (adjust to provide a sufficient number for
213 every copper 1GbE port in the system)
Zack Williams1ae109e2021-07-27 11:17:04 -0700214
215* 2x 10GbE SFP+ or 40GbE QSFP interfaces (only required if management server
216 does not have a network card with these ports)
217
218* Managed via SSH or web interface
219
Zack Williams2dfb6242021-08-24 09:19:00 -0700220* LLDP protocol support, for debugging cabling issues
Zack Williams589ac232021-08-19 08:52:18 -0700221
Zack Williams1ae109e2021-07-27 11:17:04 -0700222* Capable supporting VLANs on each port, with both tagged and untagged traffic
223 sharing a port.
224
Zack Williams589ac232021-08-19 08:52:18 -0700225
Zack Williams1ae109e2021-07-27 11:17:04 -0700226Optional:
227
Zack Williams589ac232021-08-19 08:52:18 -0700228* PoE+ support, which can power eNB and monitoring hardware, if using
229 Management switch to host these devices.
Zack Williams1ae109e2021-07-27 11:17:04 -0700230
231eNB Radio
232"""""""""
233
234The LTE eNB used in most deployments is the `Sercomm P27-SCE4255W Indoor CBRS
Zack Williams589ac232021-08-19 08:52:18 -0700235Small Cell
236<https://www.sercomm.com/contpage.aspx?langid=1&type=prod3&L1id=2&L2id=1&L3id=107&Prodid=751>`_.
Zack Williams1ae109e2021-07-27 11:17:04 -0700237
Zack Williams589ac232021-08-19 08:52:18 -0700238While this unit ships with a separate power brick, it also supports PoE+ power
239on the WAN port, which provides deployment location flexibility. Either a PoE+
240capable switch or PoE+ power injector should be purchased.
Zack Williams1ae109e2021-07-27 11:17:04 -0700241
Zack Williams2dfb6242021-08-24 09:19:00 -0700242If connecting directly to the fabric switch through a QSFP to 4x SFP+ split
Zack Williams589ac232021-08-19 08:52:18 -0700243cable, a 10GbE SFP+ to 1GbE Copper media converter should be purchased. The `FS
244UMC-1S1T <https://www.fs.com/products/101476.html>`_ has been used for this
245purpose successfully.
246
247Alternatively, the Fabric's 10GbE SFP+ could be connected to another switch
248(possibly the Management Switch) which would adapt the speed difference, and
249provide PoE+ power, and power control for remote manageability.
250
251
252Testing Hardware
Zack Williams1ae109e2021-07-27 11:17:04 -0700253----------------
254
255The following hardware is used to test the network and determine uptime of
Zack Williams2dfb6242021-08-24 09:19:00 -0700256edges. It is currently required, to properly validate that an edge site is
Zack Williams1ae109e2021-07-27 11:17:04 -0700257functioning properly.
Zack Williams9026f532020-11-30 11:34:32 -0700258
Hyunsun Moon2b626762021-03-26 16:21:14 -0700259Monitoring Raspberry Pi and CBRS dongle
260"""""""""""""""""""""""""""""""""""""""
261
Zack Williams1ae109e2021-07-27 11:17:04 -0700262One pair of Raspberry Pi and CBRS band supported LTE dongle is required to
263monitor the connectivity service at the edge.
Hyunsun Moon2b626762021-03-26 16:21:14 -0700264
Zack Williams2dfb6242021-08-24 09:19:00 -0700265The Raspberry Pi model used in Aether is a `Raspberry Pi 4 Model B/2GB
Hyunsun Moon2b626762021-03-26 16:21:14 -0700266<https://www.pishop.us/product/raspberry-pi-4-model-b-2gb/>`_
267
268Which is configured with:
269
Zack Williams2dfb6242021-08-24 09:19:00 -0700270* Raspberry Pi case (HiPi is recommended for PoE Hat)
Zack Williams589ac232021-08-19 08:52:18 -0700271
Zack Williams2dfb6242021-08-24 09:19:00 -0700272* A power source, either one of:
Zack Williams589ac232021-08-19 08:52:18 -0700273
274 * PoE Hat used with a PoE switch (recommended, allows remote power control)
275
276 * USB-C Power Supply
277
Hyunsun Moon2b626762021-03-26 16:21:14 -0700278* MicroSD Card with Raspbian - 16GB
279
Zack Williams1ae109e2021-07-27 11:17:04 -0700280One LTE dongle model supported in Aether is the `Sercomm Adventure Wingle
Hyunsun Moon2b626762021-03-26 16:21:14 -0700281<https://www.sercomm.com/contpage.aspx?langid=1&type=prod3&L1id=2&L2id=2&L3id=110&Prodid=767>`_.
Zack Williams589ac232021-08-19 08:52:18 -0700282
283
284Example BoMs
285------------
286
287To help provision a site, a few example Bill of Materials (BoM) are given
288below, which reference the hardware descriptions given above.
289
290Some quantities are dependent on other quantities - for example, the number of
291DAC cables frequently depends on the number of servers in use.
292
Zack Williams5d2d6782021-08-26 13:08:44 -0700293These BoMs do not include UE devices. It's recommended that the testing
Zack Williams589ac232021-08-19 08:52:18 -0700294hardware given above be added to every BoM for monitoring purposes.
295
296
Zack Williams5d2d6782021-08-26 13:08:44 -0700297BESS UPF Testing BoM
Zack Williams589ac232021-08-19 08:52:18 -0700298""""""""""""""""""""
299
300The following is the minimum BoM required to run Aether with the BESS UPF.
301
302============ ===================== ===============================================
303Quantity Type Purpose
304============ ===================== ===============================================
3051 Management Switch Must be Layer 2/3 capable for BESS VLANs
3061 Management Server
3071-3 Compute Servers Recommended at least 3 for Kubernetes HA
3081 (or more) eNB
3091x #eNB PoE+ Injector Required unless using a PoE+ Switch
310Sufficient Cat6 Network Cabling Between all equipment
311============ ===================== ===============================================
312
Zack Williams5d2d6782021-08-26 13:08:44 -0700313P4 UPF Testing BoM
Zack Williams589ac232021-08-19 08:52:18 -0700314""""""""""""""""""
315
316============ ===================== ===============================================
317Quantity Type Description/Use
318============ ===================== ===============================================
3191 P4 Fabric Switch
3201 Management Switch Must be Layer 2/3 capable
3211 Management Server At least 1x 40GbE QSFP ports recommended
3221-3 Compute Servers Recommended at least 3 for Kubernetes HA
3232x #Server 40GbE QSFP DAC cable Between Compute, Management, and Fabric Switch
Zack Williams2dfb6242021-08-24 09:19:00 -07003241 QSFP to 4x SFP+ DAC Split cable between Fabric and eNB
Zack Williams589ac232021-08-19 08:52:18 -07003251 (or more) eNB
3261x #eNB 10GbE to 1GbE Media Required unless using switch to convert from
327 converter fabric to eNB
3281x #eNB PoE+ Injector Required unless using a PoE+ Switch
329Sufficient Cat6 Network Cabling Between all equipment
330============ ===================== ===============================================
331
Zack Williams5d2d6782021-08-26 13:08:44 -0700332P4 UPF Paired Leaves BoM
Zack Williams589ac232021-08-19 08:52:18 -0700333""""""""""""""""""""""""
334
335============ ===================== ===============================================
336Quantity Type Description/Use
337============ ===================== ===============================================
3382 P4 Fabric Switch
3391 Management Switch Must be Layer 2/3 capable
3401 Management Server 2x 40GbE QSFP ports recommended
3413 Compute Servers
3422 100GbE QSFP DAC cable Between Fabric switches
3432x #Server 40GbE QSFP DAC cable Between Compute, Management, and Fabric Switch
Zack Williams2dfb6242021-08-24 09:19:00 -07003441 (or more) QSFP to 4x SFP+ DAC Split cable between Fabric and eNB
Zack Williams589ac232021-08-19 08:52:18 -07003451 (or more) eNB
3461x #eNB 10GbE to 1GbE Media Required unless using switch to convert from
347 converter fabric to eNB
3481x #eNB PoE+ Injector Required unless using a PoE+ Switch
349Sufficient Cat6 Network Cabling Between all equipment
350============ ===================== ===============================================
351
352
Zack Williams5d2d6782021-08-26 13:08:44 -0700353P4 UPF 2x2 Leaf Spine Fabric BoM
Zack Williams589ac232021-08-19 08:52:18 -0700354""""""""""""""""""""""""""""""""
355
356============ ===================== ===============================================
357Quantity Type Description/Use
358============ ===================== ===============================================
3594 P4 Fabric Switch
3601 Management Switch Must be Layer 2/3 capable
3611 Management Server 2x 40GbE QSFP ports recommended
3623 Compute Servers
3638 100GbE QSFP DAC cable Between Fabric switches
3642x #Server 40GbE QSFP DAC cable Between Compute, Management, and Fabric Switch
Zack Williams2dfb6242021-08-24 09:19:00 -07003651 (or more) QSFP to 4x SFP+ DAC Split cable between Fabric and eNB
Zack Williams589ac232021-08-19 08:52:18 -07003661 (or more) eNB
3671x #eNB 10GbE to 1GbE Media Required unless using switch to convert from
368 converter fabric to eNB
3691x #eNB PoE+ Injector Required unless using a PoE+ Switch
370Sufficient Cat6 Network Cabling Between all equipment
371============ ===================== ===============================================
372