Let's take a detailed look at the networking architecture and technical system of the computing power network.
The architecture of the computing power network
The role of the computing power network is to provide users with computing power resource services. It completely "integrates" computing resources into the communication network, and provides computing resources services that best meet the needs of users in a more holistic form.
This is a highly flexible, highly intelligent network. The core feature of the computing power network is that it realizes the full takeover of computing resources and network resources through computing power, allowing the network to perceive the computing power needs of users and its own computing power status in real-time. After analysis, the computing power network can schedule different locations and different types of computing resources to serve users.
We can understand it as a "universal faucet". It can always provide the water flow with the right strength, the right temperature, and the right pH according to your needs so that you can have the best water experience. Then, we are naturally curious, what kind of "water net" is behind this "universal faucet"? How does it perform resource control and scheduling?
Next, let's take a look at the architecture of the computing power network. The computing power network is still a technology under research and has not been fully finalized. Therefore, what I give below is only the computing power network architecture (assumption) diagram of each operator and equipment manufacturer, and there will be certain differences between them. Moreover, it is very likely that changes will occur in the future, so please be aware.
The following pictures are the representative computing network architecture (assumption) diagrams of operators and equipment manufacturers:
(From "Cloud-Network Convergence 2030 Technology White Paper")
These pictures are a bit complicated, and I believe that everyone is not very understanding. I drew a simple diagram, everyone should be able to understand:
To put it simply, it is still the layered idea of information communication. Looking at the layers, the ideas can be clarified. In order to reduce the pressure of reading, you can pass the green low-carbon and safety protection on the right first.
We look at it from the bottom up, layer by layer.
Computing network infrastructure
The lowest layer of the computing power network is still the computing network infrastructure layer, which cannot be escaped. In the computing power network, the infrastructure is no longer just computing resources (CPU, storage, GPU, etc.), but also network resources.
In terms of computing resources, the integration of computing infrastructure is still centered on cloud technology, including cloud-native, virtualization, and containers.
Here, some underlying new technology innovations related to computing power are also worthy of attention, such as heterogeneous computing, serverless computing, integration of storage and computing, and computing power offloading. You can leave it alone for now and introduce it later.
On the network side, the changes are relatively large. The previous articles have introduced to you that the network changes under the trend of cloud-network integration can be explained in one sentence, that is, to make the closed network more open (decoupled), so that it becomes like a computer, which can be accessed through software way (software-defined), to give it a "command".
The bottom layer of the network (physical layer, data link layer), on the transmission network side, is still light (only light can guarantee basic hard indicators such as communication bandwidth and capacity). The optical underlying technology, such as OTN, OXC, ROADM, C+L frequency band, 400G / 800G, etc., are still them, and there is no difference.
On the access network side, 5G NR and PON (50G-PON) is nothing special (except that the edge computing power will be somewhat related).
The big changes are in the upper layers of the network. We'll talk about that later.
Computing and network integration (computing resource management, network resource management)
Going up, things start to look different. Because the computing power network needs to schedule computing power and network resources, it is necessary to "preprocess" and "connect" the resources. (This layer, in fact, is attributed to the orchestration management in many architectures. I think it will be easier to understand if it is taken out separately.)
At this layer, the most worthy of attention is the routing of computing power.
The biggest difference between the computing power network and the traditional network is that it has three characteristics. The first major feature is computing power routing.
Before computing power, let’s talk about computing power perception.
Computing power perception is the network's comprehensive perception of the deployment location, real-time status, load information, and business needs of computing resources and computing services. China Mobile's computing power perception network white paper is analyzed from the perspective of computing power perception.
(From China Mobile's "Computer Power Sensing Network CAN Technology White Paper 2021")
On the one hand, each computing power node models the computing network information metrics and releases them uniformly, and the network aggregates the computing network information reported by multiple nodes to build a global unified computing network status view. On the other hand, the network completes the unified analysis of business computing network requirements, realizes a comprehensive perception of the business, and provides a guarantee for computing power scheduling based on business requirements.
(From "Computer Power Sensing Network CAN Technology White Paper 2021")
To put it bluntly, the purpose of computing power perception is to find out how many resources are on hand and what are the needs of users.
Perceived objects definitely need to be measured and identified. Therefore, there is computing power and a computing power logo. These two pieces, operators are currently attaching great importance.
There is diversity in computing power, and there are also heterogeneous hardware chips. Therefore, a unified measurement system and standard have not yet been formed. This will cause trouble to establish a unified computing power model. Very simple, the units of measurement are different, how to evaluate and bill the computing power?
If the computing power is identified, it is equivalent to the ID card of computing power. The user indicates the required service through the computing power identifier, and the network obtains the target computing power service, computing power demand, and other information by analyzing the computing power identifier, which provides the basis for computing power scheduling. This hash rate identification must be globally unified and verifiable.
Well, the next step is computing power routing.
In the computing power perception network architecture diagram of China Mobile just now, in addition to the computing power resource layer, a separate computing power routing layer is also designated.
The computing power routing layer is the core functional layer of the computing power perception network. It supports the perception and notification of multi-dimensional resources and services such as network, computing, and storage, and realizes the joint scheduling of "network + computing".
The computing power routing layer, including computing power routing control technology and computing power routing forwarding technology, realizes the on-demand scheduling of business requests at the routing layer.
When it comes to computing power routing, we must talk about CFN (Computing First Networking).
CFN is sometimes used in a broad sense to refer to the entire computing network. However, here, CFN is a distributed routing protocol that schedules and balances computing power on demand between different edge clouds.
The CFN thing is actually very interesting. The routing we traditionally understand is the routing of data packets. The routing here in CFN is to use the computing resource status and network resource status as routing information and publish it to the network. Then, the network routes the computing task packets to the most appropriate computing nodes based on the virtual service ID.
At present, the CFN protocol mainly carries the computing power information in the form of extended byte information in the BGP packet header of the service route, so as to spread the load of the computing nodes in the network to the whole network in real-time.
Obviously, computing power routing and CFN are the prerequisites for the upper layer to perform computing power scheduling.
Further up is the core key part of the computing power network - orchestration management.
Students who have worked on core network NFV (Network Element Function Virtualization) must be very familiar with orchestration. In the NFV architecture, there is orchestration. This is a very interesting word, which has "collected and arranged" you. To put it bluntly, it is full control and scheduling.
The second major feature of the computing power network is computing power scheduling.
Computing power scheduling is the intelligent arrangement and flexible scheduling of computing resources of the entire network based on the computing network brain.
More specifically, it is complete computing power operation and computing power service arrangement, complete the management of computing power resources and network resources, including the perception, measurement, and OAM management of computing power resources, etc.; realize computing network operation for end users, and the management of the computing power routing layer and the network resource layer.
After the computing resources are perceived, identified, and perceived, in order to manage them, there is also a logical process of computing resources management (also called separately: computing management). It mainly includes computing power registration, computing power OAM (performance monitoring, fault management), and computing power operation (billing management, service contracts, etc.).
(From "Computer Power Sensing Network CAN Technology White Paper 2021")
This function division is very troublesome. In some operator architecture diagrams, this block is defined as the computing power management layer, which exists alone. In some architecture diagrams, there is no such layer, and it is assigned to the orchestration management layer. Therefore, each understanding is different, and it is very big. Anyway, everyone should know that there is such a functional group.
To achieve it, it must be inseparable from a "nerve center" that controls the network.
The core control part of the scheduling and orchestration network, some operators call it "the brain of the computing network".
This "computer brain" has four important functions:
Obtain global real-time computing, network, and data resources, as well as cloud, edge, and terminal distribution, and build a global situational awareness map.
Cross-domain collaborative scheduling, which intelligently and automatically decomposes multi-domain collaborative scheduling tasks to each enabling platform to realize resource scheduling of computing, network, and data.
Multi-domain fusion orchestration, according to the needs of multi-domain fusion services, based on the atomic capabilities of computing, network, and data, it can be combined and arranged flexibly as needed.
Intelligent auxiliary decision-making, based on factors such as SLA requirements of different services, overall network load, and available computing resource pool distribution, intelligently and dynamically calculates the optimal collaborative strategy for computing, network, and data.
The computing network management and orchestration layer maintains the global static computing power, service, and network topology information, and synchronizes it to each entry computing power routing node. The computing power routing node maintains the topology information of computing power services and the real-time status information of computing power resources and network resources and performs computing network coordination scheduling through distributed computing power routing nodes.
The "Computer Network Brain" is the nerve center of the computing power network, and its importance is self-evident for coordinating global resources.
The introduction of AI artificial intelligence into the "Computer Network Brain" is certainly not to be said. In addition to AI, now, operators and equipment vendors also plan to put "digital twins" and "intent engines" in them to try their best to improve the intelligence level of the "Computer Network Brain".
To be honest, looking closely at the management and orchestration layer, I have a feeling that the computing power network is the upgrade and integration of SDN and NFV technologies, and there are shadows of these two technologies everywhere. However, many other technologies are added to the surface to form a more complete and larger system.
Above the computing power brain, is there no more? No, don't forget, we also have a service operations layer. The service operation layer, to put it bluntly, is to provide upper-layer applications with an interface to manage and control the computing power network. The brain of the network is not a real boss, but a "big housekeeper". It is the upper-layer users and applications that really control the network.
The purpose of the service operation layer is to open up the ability to provide the computing power network. Key technologies include: computing power transaction, computing power grid connection, computing power packaging, intention perception, etc. The third major feature of the computing power network is computing power trading.
Routing, scheduling, and transactions are the three major features of the computing power network. Computing power trading is an important innovation of the computing power network.
Before the integration of computing power and the network, the user will pay whoever uses the cloud computing, and whoever uses the communication network will pay. After the integration of the computing network, computing power is everywhere, and scheduling comes and goes. Who should you give money to?
Computing power trading is to solve this problem. In the computing power trading scenario, there will be a trusted ubiquitous computing power trading platform.
From "Hashpower Network - Network Architecture and Key Technologies in the Era of Cloud-Network Convergence 2.0"
The general process is as follows:
First, the computing power consumer puts forward detailed computing power requirements. Then, the computing power trading platform integrates the available resources according to the demand to generate a computing power network resource table. According to factors such as delay and cost, there will be corresponding quotations. Next, the computing power consumer chooses a "package" that is most suitable for him according to the resource table. Then, the transaction contract is signed on the computing power trading platform. Then, the trading platform schedules resources through the computing power network according to the contract and updates the resource information. Finally, the contract ends, the service is terminated, and resources are released.
(From "Integrating Multi-party Computing Networks to Achieve Optimal Utilization of Computing Resources", Leibo)
In the 21st century, whenever a transaction model is mentioned, a word must be involved - blockchain.
That’s right, the computing power trading platform also uses blockchain technology to realize functions such as distributed ledgers and anonymous transactions.
To sum up, the computing power network closely focuses on the computing power needs of users, covering the entire life cycle of computing power generation, scheduling, transaction, and consumption. What the computing power network wants to achieve is to move with people and with "demand", so as to achieve the perfect match between the supply and demand of computing power, without any delay or waste.
Standard progress of computing power network
In recent years, the computing power network has become very popular in China. Both operators and equipment manufacturers attach great importance to this concept. Especially domestic operators are crazy about the computing power network.
The reason is very simple. Foreign operators rarely touch the Internet. Basically, they are engaged in communication, and at most, they are engaged in the cloudification of traditional communication. Domestic operators, on the other hand, have both the network and the cloud, and especially hope to integrate and revitalize the resources of the network and the cloud. Moreover, our country attaches great importance to infrastructure construction, and the computing power network is an important digital infrastructure. Therefore, operators, as state-owned enterprises, also pay special attention to this accumulation.
On the one hand, in order to master the initiative of technical standards, and on the other hand, for brand promotion and image building, the three major operators have begun to roll up on the computing power network.
2019 is recognized as the first year of computing power networks. On November 1 of this year, China Unicom took the lead and released the "China Unicom Hashpower Network White Paper" during the Beijing PT Exhibition. This is the first white paper related to computing power networks in China.
Since then, domestic operators have scrambled to release a number of white papers one after another, and actively promoted the formulation of standards for computing power networks.
From the actual situation, the computing power network is still a long way away from us. At present, we are in the advanced stage of cloud-network integration, and the integration of computing and network or computing power network is still under study.
Nonetheless, there is no doubt that the computing power network will be the future development trend of computing power and connection power. The integration of computing and network is inevitable, and it is also the way back for the entire communications industry. As a "pipe", traditional communication cannot enhance its own value. Only by fully investing in the embrace of the cloud is the best way out.