Static Evaluation of a Midimew Connected Torus Network for Next Generation Supercomputers

Al-Shammari, Maryam; Haque, Asrar; Rahman, M M Hafizur

doi:10.3390/su15086766

by Maryam Al-Shammari ¹, Asrar Haque ¹ and M M Hafizur Rahman ²,*

¹ Department of Computer Science, CCSIT, King Faisal University, Al Ahsa 31982, Saudi Arabia

² Department of Computer Networks & Communications, CCSIT, King Faisal University, Al Ahsa 31982, Saudi Arabia

* Author to whom correspondence should be addressed.

Sustainability 2023, 15(8), 6766; https://doi.org/10.3390/su15086766

Submission received: 16 January 2023 / Revised: 1 April 2023 / Accepted: 4 April 2023 / Published: 17 April 2023

(This article belongs to the Special Issue To the Future: Adoption of Artificial Intelligence and Blockchain in Agriculture and Healthcare from a Sustainability Perspective)

Abstract:

Many artificially intelligent systems solve complex health- and agriculture-related problems that require great computational power. Such systems are used for tracking medical records, genome sequence analysis, image-based plant disease detection, food supply chain traceability, and photosynthesis simulation. Massively parallel computers (MPCs) are among those used to solve these computation-intensive problems. MPCs can comprise as many as a million nodes; connecting such a large number of nodes is a daunting task. Therefore, hierarchical interconnection networks (HINs) have been introduced to solve this problem. A midimew-connected torus network (MTN) is an HIN whose basic modules (BMs) are torus networks connected hierarchically by midimew links. This paper presents the performance of MTNs in terms of static topological parameters and cost-effectiveness, as measured through simulations. An MTN was compared with other networks, including mesh, torus, TESH, TTN, MMN, and TFBN. The results showed that our MTN had a low diameter with a high bisection width and arc connectivity. In addition, our MTN had a high cost–performance trade-off factor (CPTF), a high cost-effective factor (CEF), low packing density, and moderate message-traffic density with marginally higher costs, as compared to other networks, due to wire complexity. However, our MTN provided better bandwidth with higher static fault tolerance. Therefore, MTNs are suggested for further evaluation of the effective implementation of MPCs.

Keywords:

massively parallel computers; hierarchical interconnection networks; static network performance; cost-effective analysis; MTN; cost-performance trade-off factor; cost-effective factor; artificial intelligence; sustainability

1. Introduction

One of the challenging issues in implementing supercomputers is the topology used to connect a massive number of nodes. The enormous computational power of these supercomputers is necessary for progress towards a more sustainable world, one with better healthcare, better use of natural resources, sustainable agriculture, and energy-efficient machinery and industry. Achieving these goals requires advances in complex modeling, comprehensive simulations, and big data analysis, which are all computation-intensive tasks. Massively parallel computers (MPCs) have been broadly utilized to perform these tasks.

The role of MPCs in building sustainable applications is of great interest, as they could lead to a next-generation agricultural revolution with the assistance of artificial intelligence (AI). MPCs utilize resources to provide important functionalities, such as image-based plant disease detection, plague control, pesticide design, understanding pesticide effects, and food supply-chain traceability [1,2]. Massive amounts of seed data have been analyzed to identify plants with the Rubisco enzyme, which aids in the conversion of plant carbon into biomass [3]. Agricultural water management (AWM) was able to integrate multidisciplinary models that used MPCs [4]. Workflows for agricultural and livestock-farming applications have been designed through hybrid data analytics [5]. Enormous computational power is required to implement deep-learning networks for the price forecasting of agricultural products, which has a significant impact on the profitability of agricultural products [6].

AI has been applied in several approaches to climate pattern prediction and weather forecasting [7,8] to increase agricultural productivity and sustainability [9]. The use of models for assessing future risks that are related to climate change and its impact on agriculture is critical [10]. The genome sequencing of bacterial species to enhance plant growth in order to overcome climate challenges and limited resources has been conducted using supercomputers [11].

The role of MPCs is crucial in providing sustainable healthcare services. In recent years, they have advanced in order to handle the unusually large and distributed medical and non-medical data related to COVID-19 [12] by enabling the tracking of infected people and effectively monitoring whole communities [13,14]. Supercomputers were used to analyze the genome sequence of the virus in order to identify an appropriate vaccine [15,16,17]. Another important domain of supercomputers in healthcare is telemedicine, which includes the diagnosis, the examination, and the medical assessment. Telemedicine relies on information systems with massive storage, and requires the precise operation of tasks by high-performance computing and communications (HPCC) strategies [18]. Researchers used next-generation AI and robot core technologies to simulate the human brain [19,20]. To control the spread of serious communicable diseases, integrating epidemiology with the rapid computing power of MPCs was proposed [21]. Deep-learning models were implemented on MPCs as accelerators to speed-up medical applications for large biomedical databases [22]. Cambridge-1 was used for building and hosting powerful AI language models in molecular chemistry [23]. Supercomputers have also contributed to everyday activities such as fraudulent insurance claim detection by analyzing massive claim data within a short period of time [24].

The performance of MPCs continues to increase in order to provide the massive computational power required for various applications. According to the TOP500 organization, the first true exascale supercomputer was the Frontier system at the Oak Ridge National Laboratory (ORNL) in the U.S. [25]. It achieved the top rank in November 2022, with a performance exceeding 1.102 Exaflop/s. MPCs can support high accuracy applications because they are driven by matrix multiplication [26]. For example, AI applications rely on supercomputers for complex modeling tasks, such as seismic imaging, climate/environmental geospatial predictions, and computational astronomy [27]. Therefore, supercomputers have been shown to be ideal for training deep neural networks (DNNs) by processing the large datasets required for fast and accurate DNN results [28,29]. Supercomputers have been linked directly to grand challenges and their solutions [30]. Therefore, many research centers are rapidly adopting supercomputer resources due to these benefits.

In this paper, we address a serious concern in the implementation of supercomputers, which is the topology connecting massive numbers of nodes. This topology directly impacts the performance of MPCs [31]. Therefore, hierarchical interconnection networks (HINs) have received much attention from the research community. HINs enable the connection of large numbers of nodes with reasonable delays, as compared to conventional networks [32]. Supercomputer connections have reached the capacity of one million nodes, making conventional networks infeasible [33]. In our previous work, we proposed an HIN called an MTN, which used 2D torus modules connected by midimew links to build higher levels in order to include greater numbers of nodes [31]. The current research aimed to evaluate that network topology in terms of static parameters and cost effectiveness. The evaluation of topological properties is crucial for a hierarchical network to be considered for further development. MTNs provide favorable features for static and cost-effective performance. This is shown in a comparative analysis of static network performance and cost-effective parameters, with the diameter and average distance evaluated by computer simulation. MTNs are compared to six other networks, two of which are conventional topologies (2D mesh and 2D torus). The conventional networks were included in the evaluation because of their current usage by supercomputers [34]. In addition, an MTN is compared to other HINs, including a midimew-connected mesh network (MMN) [35], a tori-connected mesh network (TESH) [36], a tori-connected flattened butterfly network (TFBN) [37], and a tori-connected torus network (TTN) [38].

The results of the simulated MTN showed a diameter similar to that of the MMN, lower than those of the mesh and TESH but higher than those of the other networks. MTNs have a lower average distance than almost all networks; a node degree which is lower than TFBN but higher than conventional networks; moderate bisection width; high arc connectivity; and higher costs as a result of wire complexity. However, MTNs also have better fault tolerance and bandwidth availability. In addition, in terms of static performance, MTNs have advantages in CPTF, a moderate time–cost-effective factor (TCEF), moderate message traffic density (MTD), and low packing density. However, MTNs have higher costs when compared to other networks, although they are still lower than mesh networks. This is a result of their improved bandwidth availability and high fault tolerance. Therefore, practical implementations of MPCs should consider adopting the most relevant features for the intended system. For example, high fault handling and sufficient bandwidth may make an MTN ideal for certain applications; however, its static performance and overall costs should also be weighed.

The remaining sections are organized as follows: Section 2 illustrates MTN architecture, followed by Section 3, which discusses the routing algorithm. Section 4 provides a comparative analysis of the static network performance. Section 5 shows the cost-effective analysis of the network. Finally, Section 6 presents the main findings and recommendations.

2. Network Architecture of MTNs

In this section, the architecture of MTNs is illustrated. Relative to our previous paper [31], only the layout of the horizontal links was updated, as seen in Figure 1. This difference provides a low diameter and a low average distance in the network. The main goal of HINs is to achieve low diameter and average distance values, as described in Section 4. These two properties were not evaluated in our previous paper. This study therefore evaluated the modified network in terms of its static performance and cost effectiveness.

The basic building block of a midimew-connected torus network is the basic module (BM), which was a $2^{m} \times 2^{m}$ torus, where $m$ is a positive integer. The hierarchical connections at the higher levels linked the $2^{m} \times 2^{m}$ torus BMs using midimew links. The 2D torus block represented level 1, the lowest level. Higher levels were the result of midimew connections in level 1. The packet flow in the network links was different at the lowest level (level 1), as compared to that at higher levels. All of the levels used bidirectional links for flexible connections. The following definitions illustrate the network topology:

Definition 1.

The 2D torus (level 1) was a $2^{m} \times 2^{m}$ BM containing $2^{2m}$ nodes created by $2^{m}$ rows and $2^{m}$ columns.

Definition 2.

An MTN $(m, L, q)$ had a $2^{m} \times 2^{m}$ BM, where $L$ is the level of hierarchy and $q$ is the inter-level connectivity. The connections that formed the higher levels used $4 \times 2^{q}$ of each of the BM’s free ports, $2(2^{q})$ for the horizontal connections and $2(2^{q})$ for the diagonal connections, where $q \in \{0, 1, \dots, m\}$. The minimal level of inter-connectivity was when $q = 0$, while maximum was when $q = m$. In this paper, $m = 2$ for better granularity.

Definition 3.

The highest level that could be obtained by the MTNs $(m, L, q)$ was $L_{max} = 2^{m - q} + 1$ level.
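Definitions 1 to 3 can be checked numerically with a few lines of Python. The following is a minimal sketch rather than the authors' code; the function names `bm_nodes`, `ports_per_level`, and `max_level` are hypothetical and introduced only for illustration.

```python
def bm_nodes(m: int) -> int:
    """Definition 1: a 2^m x 2^m torus BM contains 2^(2m) nodes."""
    return 2 ** (2 * m)


def ports_per_level(q: int) -> dict:
    """Definition 2: free BM ports used to form one higher level."""
    per_direction = 2 * 2 ** q  # 2(2^q) horizontal, 2(2^q) diagonal
    return {
        "horizontal": per_direction,
        "diagonal": per_direction,
        "total": 4 * 2 ** q,
    }


def max_level(m: int, q: int) -> int:
    """Definition 3: L_max = 2^(m - q) + 1."""
    if not 0 <= q <= m:
        raise ValueError("q must satisfy 0 <= q <= m")
    return 2 ** (m - q) + 1


if __name__ == "__main__":
    m = 2  # the value used in the paper
    print("nodes per BM:", bm_nodes(m))
    for q in range(m + 1):
        print(f"q={q}: L_max={max_level(m, q)}, ports={ports_per_level(q)['total']}")
```

With $m = 2$, raising $q$ from 0 to 2 uses more free ports per level (4, 8, 16) while lowering the highest reachable level (5, 3, 2), which is the trade-off implied by the definitions.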

Definition 4.

The wire complexity, $W_{L}$, was the total number of wires used to link the network elements. Therefore, the $W_{L}$ of MTNs was found using Equation (1), where $W_{BM}$ is the number of links in a single BM.

$$W_{L} = W_{BM} \times 2^{m \times 2 (L - 1)} + \sum_{x = 2}^{L} 2 (2^{q}) \times 2^{m \times 2 (L - 1)}$$

(1)
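As a rough illustration, Equation (1) can be evaluated directly. The sketch below transcribes the equation exactly as printed (including a summand that does not depend on the index $x$) and has not been checked against Table 1. The helper names are hypothetical, and the default $W_{BM}$ assumes a torus BM with $m \geq 2$, where every node has four links and each link is shared by two nodes.

```python
def torus_bm_links(m: int) -> int:
    """Links in a 2^m x 2^m torus BM, assuming m >= 2 (degree 4 at every node)."""
    if m < 2:
        raise ValueError("wrap-around links are only distinct for m >= 2")
    return 2 * 2 ** (2 * m)  # (4 links per node * 2^(2m) nodes) / 2


def wire_complexity(m: int, L: int, q: int, w_bm=None) -> int:
    """Evaluate Equation (1) term by term, as printed in the text."""
    if w_bm is None:
        w_bm = torus_bm_links(m)
    scale = 2 ** (m * 2 * (L - 1))
    # Sum over x = 2..L of 2(2^q) * 2^(m * 2(L - 1)); empty when L = 1.
    higher_level = sum(2 * 2 ** q * scale for _x in range(2, L + 1))
    return w_bm * scale + higher_level


if __name__ == "__main__":
    print(wire_complexity(2, 1, 0))  # a single 4 x 4 torus BM -> 32
    print(wire_complexity(2, 2, 0))  # MTN(2,2,0) -> 544
```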

The following figures are provided to illustrate these definitions. Figure 1 represents an MTN (2,2,0). Figure 2 shows the overall network connections forming level 2. Further details of each level are shown in Table 1.

The modification to the network, as shown in Figure 1 and based on the previous version [31], did not change the total number of nodes, free ports, or wire complexity, as shown in Table 1. This indicated that both versions supported the same number of nodes for supercomputers. However, different connections were needed to achieve better values for diameter and average distance, as shown in Section 4.

3. Routing Algorithm of MTNs

This section discusses the simulation of the proposed topology. The MTN was simulated in Java to generate the diameter and average distance values. The algorithm for the routing protocol is presented here, and it generated the message paths in the network with respect to its layout. The routing algorithm identified the methods by which the network chose the best path, which in this study was the shortest. This protocol, followed by the packets from their source to their destination, was critical. Its incorrect implementation would severely impact the performance efficiency, since the main goal of the routing algorithm was to select the most cost-effective path. A supercomputer relies on the routing protocol used to connect its nodes. In HINs, these procedures are based on the network topology. For example, the main phases of specific HINs, such as STTN, TFBN, and MMN, may be adapted by applying similar routing procedures, as detailed in this section. However, due to the different layouts of the network (i.e., the topology), the result of the hop-distance, as represented by diameter and average distance, would not be the same.

The MTN routing algorithm began at the higher levels before moving to the lower levels based on the hierarchy interconnectivity. The packet dimensions were compared to determine whether its destination was on the same level. If its destination was on the same level, then the routing used an inter-BM connection. The network would follow the shortest path while transferring the packet via assigned outer nodes in each BM. In a similar manner, if the packet was traveling to another level, the routing would use the identified outer nodes to complete the packet transfer. Figure 1 shows the outer nodes for each level. For example, if a level-2 packet had a level-5 destination, then the shortest-path routing would use the bottom-right BM to exit that level. At level 5, the packet would enter through the top-left BM.

In the simulation, the MTN was programmed to generate the shortest path between any two distinct nodes of the network, so that the simulation would be able to generate all of the required paths in order to calculate the diameter and average distance. The implementation generated the paths using a simple procedure suited to evaluating the proposed layout. The simulation was conducted using deterministic routing due to its simplicity and clarity. In such protocols, the hop-distance or the shortest path was counted directly, depending on the proposed topology. The simulation followed dimensional order routing to choose paths based on the cost and dimensionality of the nodes. To move from one node to another, the packet would first travel horizontally along the x-axis, and then, when it could go no further in this direction, it would switch to moving vertically along the y-axis. Therefore, the hop-distance was counted directly as the distance between the packet source and the destination to evaluate the MTN topology in terms of low hop-counts and, thus, better performance.
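The x-then-y procedure described above can be sketched for a single torus BM. This is an illustrative Python sketch, not the authors' Java simulator: it covers only one $k \times k$ torus (level 1) and does not model the midimew links of higher levels. The function names are hypothetical.

```python
from itertools import product


def step_toward(a: int, b: int, k: int) -> int:
    """Move one hop from a toward b along the shorter way round a ring of size k."""
    forward = (b - a) % k
    return (a + 1) % k if forward <= k - forward else (a - 1) % k


def dor_path(src, dst, k):
    """Dimension-order route: correct the x offset first, then the y offset."""
    x, y = src
    path = [(x, y)]
    while x != dst[0]:
        x = step_toward(x, dst[0], k)
        path.append((x, y))
    while y != dst[1]:
        y = step_toward(y, dst[1], k)
        path.append((x, y))
    return path


def static_metrics(k):
    """Diameter and average distance over all ordered pairs of distinct nodes."""
    nodes = list(product(range(k), repeat=2))
    hops = [len(dor_path(s, d, k)) - 1 for s in nodes for d in nodes if s != d]
    return max(hops), sum(hops) / len(hops)


if __name__ == "__main__":
    print(dor_path((0, 0), (3, 2), 4))  # uses the wrap-around link in x
    diameter, average = static_metrics(4)
    print(diameter, round(average, 4))
```

For the $4 \times 4$ torus BM ($m = 2$), this enumeration gives a diameter of 4 hops; the same all-pairs enumeration is what the text describes for the full network.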

Figure 3 shows the steps followed to apply the MTN topology. The network treated each source-and-destination pair in terms of their BMs, and it considered the levels of both BMs containing the source and destination nodes. It would then choose higher or lower levels based on where each BM node was located. The MTN used an outer BM for moving from one level to another, as shown in Figure 1. Inside the level, it checked whether the BM contained the destination node in order to route the packet to that exact node inside the BM. Shortest-path routes followed dimensional order routing.

Figure 3 illustrates the steps which were followed to code the MTN to calculate the diameter and average distance. A supercomputer contains many interconnected nodes. Since the main goal of the routing algorithm was to simulate the MTN performance for such systems, it considered the path of all pairs of nodes in the network in order to generate diameter and average distance values. Therefore, the intent was to obtain the routing results from one node to another. The addresses and the representations of the nodes are shown in Figure 4. The layout of the BM was also presented as a reference for the numbering format used for naming, such as “level.BM.node.” From source to destination, we found the following:

Source.BM.level/destination.BM.level: This referred to the level where the source/destination BM was located. In Figure 4, the source and destination were both on level 2. Therefore, Source.BM.level and destination.BM.level = 2.

Source.BM/destination.BM: This referred to the exact source/destination BM on the same level. In Figure 4, Source.BM = (3,3), destination.BM = (2,1).

Source.BM.node/destination.BM.node: This referred to the source/destination node inside a particular BM. In Figure 4, Source.BM.node = (2,2), destination.BM.node = (2,2).
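The "level.BM.node" addressing above can be represented as a small record, from which the first routing decision (stay inside a BM, cross BMs on the same level, or change level) follows by comparing fields. This is a hedged sketch: the `Address` type, the `routing_phase` function, and the phase labels are hypothetical names chosen for illustration, not taken from the paper's flow chart.

```python
from typing import NamedTuple, Tuple


class Address(NamedTuple):
    """A node address in the 'level.BM.node' format."""
    level: int
    bm: Tuple[int, int]
    node: Tuple[int, int]


def routing_phase(src: Address, dst: Address) -> str:
    """Classify which part of the hierarchy a packet must traverse first."""
    if src.level != dst.level:
        return "inter-level"  # leave the level through an outer BM
    if src.bm != dst.bm:
        return "inter-BM"     # same level, different BM
    return "intra-BM"         # dimension-order routing inside one BM


if __name__ == "__main__":
    # The example values quoted from Figure 4.
    source = Address(level=2, bm=(3, 3), node=(2, 2))
    destination = Address(level=2, bm=(2, 1), node=(2, 2))
    print(routing_phase(source, destination))  # inter-BM
```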

The simulation was performed by coding the BM and the higher levels in order to obtain the paths between a pair of nodes in the network. The flow chart of the routing algorithm is shown in Figure 3, following the protocol and using dimensional order routing and controlled routing flow to obtain the network paths. In the next section, we evaluate the MTN via the results from this simulation for diameter and average distance, while the other parameters were derived using a graph model.

4. Comparative Static Network Performance Evaluation

To evaluate the performance of MTNs, it is beneficial to analyze the network’s static parameters. MPCs require an implementation with a low constant degree, low cost, good connectivity, high fault tolerance, and high scalability [39]. Such features could be derived from the static performance evaluation, as shown in this section. Our MTN was compared with two conventional networks, mesh and torus, since such networks are still widely used in supercomputer applications [34]. In addition, several proposed HINs were included in the comparison to evaluate their capabilities in connecting nodes while maintaining the required characteristics.

4.1. Node Degree

Node degree refers to the maximum number of links exiting a node. Its impact was directly linked to the network cost: the higher the node degree, the higher the cost due to the router I/O and physical links [40]. In addition, MPCs favor a constant node degree because the configuration of the router interfaces does not change as the number of nodes increases, which makes scaling the network simple and cost effective. In contrast, our MTN was more computationally demanding. However, it provided high bandwidth availability for low congestion and minimal network latency, both for a reasonable cost increase [41]. As shown in Table 2, the MTN had a moderate node degree, as compared to a high degree in the TFBN and the low values found in conventional networks. In addition, Figure 5, below, is used to show the time complexity of various networks compared to MTNs when the network size increases.

4.2. Diameter

The diameter was the length of the longest of the shortest paths between all pairs of nodes. It was a main parameter that contributed to the dynamic performance of the network, as it significantly affected network latency despite regular throughput. A high diameter resulted in high delays of message transmissions. In addition, higher diameters lowered the message-passing bandwidth and, thus, degraded the overall performance of the network [42]. Conventional networks, such as mesh and torus, as shown in Table 2, suffer from this specific challenge, making them infeasible for next-generation MPCs. This motivated our proposal of this novel HIN to maintain a low diameter while supporting a large number of node connections. The MTN diameter was generated according to the routing protocol described in Section 3. The MTN had a low diameter, although it was larger than that of the TFBN. However, it was considered low as compared to conventional topologies, as their diameter increased as the number of network nodes increased.

4.3. Average Distance

The average distance was the mean shortest-path length over all distinct pairs of nodes. A lower average distance indicated lower buffer usage, which yielded lower contention. Similar to the diameter, it had a direct impact on the latency of the network: a lower average distance resulted in lower latency. However, it considered the network under no-load conditions, while the diameter considered the packet flow saturation [43]. As with the diameter, it was generated using the routing protocol described in Section 3. To promote optimal inter-node communication in terms of average message-transfer latency, the average distance needed to be low, as demonstrated by the MTN, as compared to all the other networks, with the exception of the TFBN.

4.4. Arc Connectivity

Arc connectivity was the minimum number of links whose removal divides the network into two separate parts. It was a critical value, as it indicated the fault tolerance of the network. The fault tolerance had to be prepared in advance due to the high frequency of faults [44]. Therefore, a high value in arc connectivity indicated a more robust network. The static fault tolerance of the HIN was identified by the ratio between the arc connectivity and the node degree [45]. The BMs of the MTN provided the maximum fault tolerance that a network could have due to the equality between the node degree and the arc connectivity. In addition, the arc connectivity indicated the alternative paths that could be taken in the event of a failure. The MTN had optimal arc connectivity, similar to most of the other networks. However, the BM had a maximum fault tolerance that enabled robust implementation of the underlying architecture. In addition, Figure 6, below, shows the arc connectivity of various networks compared to MTN when the network size increases.
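The claim that a torus BM has equal node degree and arc connectivity can be verified by brute force for the $4 \times 4$ case. The sketch below is an assumption-laden illustration, not the paper's graph model: it computes arc (edge) connectivity as the minimum unit-capacity maximum flow from one fixed node to every other node, using a plain breadth-first augmenting-path search, and it covers a single BM only. All function names are hypothetical.

```python
from collections import deque


def torus_links(k):
    """Undirected links of a k x k torus (k >= 3 so wrap-around links are distinct)."""
    links = set()
    for x in range(k):
        for y in range(k):
            links.add(frozenset({(x, y), ((x + 1) % k, y)}))
            links.add(frozenset({(x, y), (x, (y + 1) % k)}))
    return links


def max_flow(links, s, t):
    """Number of link-disjoint s-t paths (unit capacities, BFS augmentation)."""
    cap, adj = {}, {}
    for link in links:
        u, v = link
        cap[(u, v)] = cap[(v, u)] = 1
        adj.setdefault(u, []).append(v)
        adj.setdefault(v, []).append(u)
    flow = 0
    while True:
        parent = {s: None}
        queue = deque([s])
        while queue and t not in parent:
            u = queue.popleft()
            for v in adj[u]:
                if v not in parent and cap[(u, v)] > 0:
                    parent[v] = u
                    queue.append(v)
        if t not in parent:
            return flow
        v = t
        while parent[v] is not None:  # push one unit along the path found
            u = parent[v]
            cap[(u, v)] -= 1
            cap[(v, u)] += 1
            v = u
        flow += 1


def arc_connectivity(links):
    """Minimum number of links whose removal disconnects the network."""
    nodes = sorted({n for link in links for n in link})
    return min(max_flow(links, nodes[0], t) for t in nodes[1:])


def node_degree(links):
    """Maximum number of links attached to any node."""
    degree = {}
    for link in links:
        for n in link:
            degree[n] = degree.get(n, 0) + 1
    return max(degree.values())


if __name__ == "__main__":
    bm = torus_links(4)
    print(arc_connectivity(bm), node_degree(bm))  # 4 4
    print(arc_connectivity(bm) / node_degree(bm))  # static fault tolerance ratio
```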

4.5. Bisection Width

Bisection width was the minimum number of links whose removal divides the network into two equal halves. In a parallel processing environment, this was vital due to the regular use of a divide-and-conquer approach, where the network is divided into two halves that are then each divided further to enable the network to perform multiple tasks simultaneously. If the value of the bisection width was low, this resulted in a lower bandwidth used to merge the two halves, leading to more congestion in the packet flow. However, a large value could not be viable without a significant increase in wiring, resulting in a more costly and complex implementation [46]. Therefore, moderate values were ideal for parallel processing, which was demonstrated in the MTN, as compared to conventional networks that were more complex and expensive. Also, Figure 7, below, shows the bisection width of the MTN in comparison with other networks as the network size increases.
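To make the definition concrete, the sketch below counts the links crossing one particular balanced cut (left half versus right half) of a single $k \times k$ torus BM. A count over one chosen cut is only an upper bound on the bisection width in general; for an even-sided 2D torus it coincides with the standard value of $2k$. The function names are hypothetical, and the higher-level midimew links of an MTN are not modeled.

```python
def torus_links(k):
    """Undirected links of a k x k torus (k >= 3 so wrap-around links are distinct)."""
    links = set()
    for x in range(k):
        for y in range(k):
            links.add(frozenset({(x, y), ((x + 1) % k, y)}))
            links.add(frozenset({(x, y), (x, (y + 1) % k)}))
    return links


def half_cut_size(k):
    """Links with exactly one endpoint in the left half (x < k/2) of the torus."""
    if k % 2:
        raise ValueError("k must be even for two equal halves")
    left = {(x, y) for x in range(k // 2) for y in range(k)}
    return sum(1 for link in torus_links(k) if len(link & left) == 1)


if __name__ == "__main__":
    # Each of the 4 rows is cut twice: once in the middle, once at the wrap-around.
    print(half_cut_size(4))  # 8
```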

4.6. Wire Complexity

Wire complexity refers to the total number of wires required to implement the network topology. It depended on the node degree, since the node degree identified the wires exiting a node. Although wire complexity was not an actual cost of the network configuration, which also depends on the physical routers and on the length and types of wire used, it provided a reliable indicator of resource demand. The greater the number of wires that were needed, the more cost was required for physical resources. However, these costs could be justified by their contributions to low latency and high fault tolerance; that is, they could elicit a better performance in terms of the connectivity between nodes and hierarchical levels. The MTN had better performance on this parameter, as compared to the TFBN, due to having lower wire complexity for physical links, improving its scalability and affordability. Finally, Figure 8, below, is used to show the wire complexity of various networks in comparison with MTN when the network size increases.

This evaluation indicated the most important parameters for the implementation of a practical supercomputer. The best adaptation of the parallel processing environment would require some compromises in terms of robustness versus costs. For example, given the high probability of faults when scaling is regularly conducted, the reasonable costs of the flexible MTN make it a better choice than the TFBN, which could become quite expensive under these circumstances. However, a TTN could provide simpler and less costly scaling, as compared to an MTN. In contrast, MTNs have better inter-node communication with respect to average message transfer latency due to the lower average distance. According to the above comparison, MTNs have better fault tolerance, enhanced cost-efficient scaling, improved inter-node communication, and fewer physical wires, as compared to other HINs with similar features.

5. Cost Effectiveness Analysis of MTNs

Static costs were used to identify the feasibility of the network prior to implementation. The hardware used, including routers, processing elements, and wiring for the topology adaptation, contributed to the actual cost. However, static costs, along with the other parameters presented in this section, were good indicators for network assessment. A cost-effective study was performed, since the overall costs were a major factor when choosing a topology for adaptation. The performance of a supercomputer was identified by the topology used to connect its nodes. Therefore, evaluating the cost of the topology was essential to clarify the characteristics of a network prior to its implementation.

The network costs of MTNs were illustrated by six parameters: cost, cost performance trade-off factor, cost-effective factor, time–cost-effective factor, packing density, and message traffic density. These factors were chosen to summarize the relationships between the costs and the most essential parameters of MPCs. As with the static parameters in Section 4, the six factors were assessed for all of the other networks and compared with those of the MTN. A tabular comparison of the exact values is given in Table 3. The following explanation elaborates on each factor and its calculations.

5.1. Cost

Cost was determined as the product of diameter and node degree. These two main parameters represented the capability of the hardware. For example, a higher node degree resulted in higher routing expenses in either installation or maintenance. Message traffic density, the distance between nodes, network bandwidth, latency, and fault tolerance all depended on the diameter and node degree. Systems with a large diameter and node degree would be expensive, with low bandwidth and poor scalability. Cost ($C$) was calculated using Equation (2), where $D$ is diameter and $N_{d}$ is node degree.

$$C = N_{d} \times D$$

(2)

Our MTN had a good cost outcome, as compared to conventional networks such as mesh, as shown in Table 3. It was slightly more expensive than the other networks; however, this slight difference could be justified by its better bandwidth and enhanced network latency.

5.2. Cost Performance Trade-Off Factor

The CPTF enabled the assessment of the network by providing a crucial static parameter. The CPTF summarized four of the most important parameters: diameter, node degree, wire complexity, and total number of nodes. This provided a rapid comparison regarding costs, as opposed to comparing each characteristic individually. The diameter indicated the upper latency and bandwidth. The node degree was related to the router costs and scalability. The wire complexity was directly proportional to the wire costs through the layout links. The number of nodes was the actual processing elements that were required for the execution of the programs. The $CPTF$ was calculated using Equation (3).

$$CPTF = \frac{N_{d} \times G_{p}}{D}, \quad G_{p} = \frac{W_{L}}{\text{total number of nodes}}$$

(3)

The CPTF of the MTN was higher than for the TESH, the MMN, the torus, and the mesh. It was slightly lower than the TTN, and notably different from the TFBN. Therefore, the MTN provided a good trade-off between performance and costs.
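Equations (2) and (3) are simple enough to evaluate by hand, but a short sketch makes the roles of the four inputs explicit. The example values are for a standalone $4 \times 4$ torus (node degree 4, diameter 4, 32 links, 16 nodes); they are not the Table 3 values for the MTN, and the function names are hypothetical.

```python
def cost(node_degree: int, diameter: int) -> int:
    """Equation (2): C = N_d * D."""
    return node_degree * diameter


def cptf(node_degree: int, diameter: int, wire_complexity: int, total_nodes: int) -> float:
    """Equation (3): CPTF = (N_d * G_p) / D, with G_p = W_L / total nodes."""
    g_p = wire_complexity / total_nodes
    return node_degree * g_p / diameter


if __name__ == "__main__":
    # Standalone 4 x 4 torus, used purely as a worked example.
    print(cost(4, 4))          # 16
    print(cptf(4, 4, 32, 16))  # 2.0
```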

5.3. Cost-Effective Factor

Supercomputers provide fast computational power for various complex simulations. Therefore, speed, as a result of connecting large numbers of nodes, is critical. In addition, the level of efficiency while using these nodes contributed to the fast response. Therefore, speed and efficiency were indicated using CEF. In addition, the number of links used to connect the nodes depended on the topology configuration. These links had to be considered when analyzing system costs. The $CEF$ was expressed as Equation (4), where $\rho = \frac{\text{cost of } W_{L}}{\text{cost of nodes}}$, $0 \leq \rho \leq 1$; in our evaluation, $\rho = 0.1$.

$$CEF = \frac{1}{1 + \rho \times G_{p}}$$

(4)

The CEF of the MTN was almost equal to that for the six compared networks. However, the MTN’s efficiency resulted in a good CEF.

5.4. Time–Cost-Effective Factor

This factor was included to show the static topology performance. MPCs conduct complex experiments that require results within a relatively short period of time. The efficiency of this process is affected by the speed of the processing nodes used, the problem-solving steps, and the routing algorithm via topology connections. However, TCEF was important for assessing the time required for large programs used by MPCs. The TCEF for the MTN was evaluated according to the final simplification using Equation (5), where $p$ is the total number of nodes:

$$TCEF = \frac{2}{1 + \rho G_{p} + \frac{1}{p}}$$

(5)

The TCEF of the MTN was similar to that of the TTN, torus, and mesh, with a slight difference when compared with TESH, MMN, and TFBN. The time comparison considered nodes and links for performing parallel tasks, resulting in a reasonable factor for use in an MPC.
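Equations (4) and (5) share the term $\rho G_{p}$, so they can be sketched together. The following minimal Python sketch uses $\rho = 0.1$ as stated in the text and, as a worked example only, the link and node counts of a standalone $4 \times 4$ torus (32 links, 16 nodes); these are not the Table 3 values. The function names are hypothetical.

```python
def g_p(wire_complexity: int, total_nodes: int) -> float:
    """G_p = W_L / total number of nodes (from Equation (3))."""
    return wire_complexity / total_nodes


def cef(wire_complexity: int, total_nodes: int, rho: float = 0.1) -> float:
    """Equation (4): CEF = 1 / (1 + rho * G_p)."""
    return 1.0 / (1.0 + rho * g_p(wire_complexity, total_nodes))


def tcef(wire_complexity: int, total_nodes: int, rho: float = 0.1) -> float:
    """Equation (5): TCEF = 2 / (1 + rho * G_p + 1/p), with p = total nodes."""
    return 2.0 / (1.0 + rho * g_p(wire_complexity, total_nodes) + 1.0 / total_nodes)


if __name__ == "__main__":
    print(round(cef(32, 16), 4))
    print(round(tcef(32, 16), 4))
```

Because $1/p$ vanishes as the network grows, the TCEF approaches twice the CEF for large node counts, so the two factors rank large networks in the same order.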

5.5. Packing Density

In order to show that MTNs could be used for very-large-scale integration (VLSI), the packing density was measured. This was the ratio between the total number of nodes and the cost, which indicated that a higher value for packing density required less area for the VLSI chip. The packing density of the MTN was evaluated using Equation (6).

p a c k i n g d e n s i t y = \frac{t o t a l n o d e s}{C}

(6)

The packing density of the MTN (2.5098 in Table 3) was lower than the values found for the torus, MMN, TFBN, TESH, and TTN. It was close to that of the mesh (2.1333), with a slightly higher result.
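Equation (6) is simple enough to verify directly. The sketch below takes the cost $C$ of each 256-node network from Table 3 and lists the networks from the highest to the lowest packing density. The names are hypothetical, and the code is only an arithmetic check, not the authors' simulation.

```python
# Cost C of each 256-node network, copied from Table 3.
COST = {
    "Mesh": 120, "Torus": 64, "TESH": 84, "TTN": 90,
    "TFBN": 80, "MMN": 68, "MTN": 102,
}
TOTAL_NODES = 256


def packing_density(total_nodes, cost):
    """Equation (6): total number of nodes divided by the cost C."""
    return total_nodes / cost


density = {name: packing_density(TOTAL_NODES, c) for name, c in COST.items()}

# Rank the networks from highest to lowest packing density.
ranking = sorted(density, key=density.get, reverse=True)
for name in ranking:
    print(f"{name:5s} packing density = {density[name]:.4f}")
```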

5.6. Message Traffic Density

This factor was related to the alternative paths that were available from a single node to another. Multiple paths enabled the system to consider alternative choices in the event of faulty links. In other words, this factor indicated whether the traffic distribution of the network was efficient. The traffic distribution relied on the availability of multiple paths for a packet. The MTN was evaluated using Equation (7), along with other networks for comparison.

\mathrm{MTD} = \text{average distance} \times \frac{\text{total nodes}}{W_{L}}

(7)

The MTD of the MTN (3.2611 in Table 3) was lower than those of the TESH, mesh, MMN, torus, and TTN; only the TFBN (1.84) had a lower MTD.
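As a worked check of Equation (7), the sketch below recomputes the MTD of each 256-node network from the average distance and wire complexity listed in Table 2. The names are hypothetical. The results agree with the MTD column of Table 3 to within the last printed digit.

```python
# (average distance, wire complexity W_L) per network, copied from Table 2.
STATIC_PARAMS = {
    "Mesh": (10.67, 480), "Torus": (8.0, 512), "TESH": (10.47, 416),
    "TTN": (7.44, 544), "TFBN": (5.75, 800), "MMN": (9.07, 416),
    "MTN": (6.93, 544),
}
TOTAL_NODES = 256


def mtd(average_distance, wire_complexity, total_nodes=TOTAL_NODES):
    """Equation (7): average distance times total nodes over W_L."""
    return average_distance * total_nodes / wire_complexity


mtd_by_network = {
    name: mtd(avg, w_l) for name, (avg, w_l) in STATIC_PARAMS.items()
}

# A lower MTD means less traffic carried per link on average.
for name in sorted(mtd_by_network, key=mtd_by_network.get):
    print(f"{name:5s} MTD = {mtd_by_network[name]:.4f}")
```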

The MTN provided satisfactory costs with respect to various performance parameters. Therefore, using MTNs for supercomputing is feasible, with better bandwidth, latency, fault tolerance, time, and costs, as compared to conventional topologies. Compared with the other hierarchical interconnection networks, the MTN showed many similarities and some differences. For supercomputing, MTNs are more expensive; however, they provide better performance, with better diameter, node degree, and fault tolerance, due to the wrap-around links of the torus BMs, which are supported by hierarchical connections via midimew links.

6. Conclusions

This paper analyzed the proposed HIN, which targets the large numbers of nodes needed for exascale and zettascale computational power. In this study, the static performance evaluation was carried out by simulating the network. The simulation of the network-level connections resulted in low diameter and average distance values. The MTN was compared with the conventional mesh and torus networks, as well as with hierarchical networks such as TESH, TTN, TFBN, and MMN. We used simulations to evaluate the MTN via static parameters, including the node degree, the diameter, the average distance, the bisection width, the arc connectivity, and the wire complexity. In addition, a cost-effectiveness analysis of the network was conducted based on the following parameters: MTD, CEF, CPTF, TCEF, packing density, and cost. Our results provide a solid foundation for further research and implementation.

This study showed that MTNs have good features with respect to network connectivity, fault tolerance, and bandwidth. The diameter of the MTN (17) was lower than that of the conventional mesh (30); it was higher than that of the TFBN (10) but not as high as that of the TESH (21). The MTN had a lower average distance than all of the other networks, except for the TFBN. In addition, in terms of static performance, the MTN had a high CPTF; moderate TCEF and MTD; and low packing density. However, the MTN had a high cost factor, as compared to other networks, although it was still lower than that of the mesh. This higher cost was the price of its better bandwidth and high fault tolerance. A practical implementation of a supercomputer may therefore weigh the features required for its specific tasks to determine an appropriate compromise.

The MTN was shown to be a promising HIN. Future work could therefore include dynamic performance evaluations [36,38]. In addition, since this research evaluated the network with a single configuration, namely MTNs (2,2,0), additional configurations should be assessed to determine the best layout. Nevertheless, the current MTN was assessed in terms of static topological parameters and several cost factors, and these results may support fast-emerging technological advances.

Author Contributions

Conceptualization, M.M.H.R. and A.H.; methodology, M.M.H.R. and A.H.; software, M.A.-S.; validation, M.M.H.R. and A.H.; formal analysis, M.A.-S.; investigation, M.M.H.R. and A.H.; resources, M.M.H.R.; data curation, M.A.-S.; writing—original draft preparation, M.A.-S.; writing—review and editing, M.M.H.R. and A.H.; visualization, M.A.-S.; supervision, M.M.H.R. and A.H.; project administration, A.H.; funding acquisition, M.A.-S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Deanship of Scientific Research, Vice Presidency for Graduate Studies and Scientific Research, King Faisal University, Saudi Arabia [Grant No. 3214].

Institutional Review Board Statement

Not applicable.

Acknowledgments

The authors extend their appreciation to the Deanship of Scientific Research, Vice Presidency for Graduate Studies and Scientific Research, King Faisal University, Saudi Arabia [Grant No. 3214]. The authors would like to thank the anonymous reviewers for their insightful scholastic comments and suggestions, which improved the quality and clarity of the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

Zhong, R.; Xu, X.; Wang, L. Food Supply Chain Management: Systems, Implementations, and Future Research. Ind. Manag. Data Syst. 2017, 117, 2085–2114. [Google Scholar] [CrossRef]
Liu, J.; Wang, X. Plant Diseases and Pests Detection Based on Deep Learning: A Review. Plant Methods 2021, 17, 22. [Google Scholar] [CrossRef] [PubMed]
Randall, R. How Supercomputers and Plant Genetics Provide Farmers Better Seeds. Genetic Literacy Project. 2015. Available online: https://geneticliteracyproject.org/2015/10/02/how-supercomputers-and-plant-genetics-provide-farmers-better-seeds/ (accessed on 1 December 2022).
Haacker, E.M.K.; Sharda, V.; Cano, A.M.; Hrozencik, R.A.; Núñez, A.; Zambreski, Z.; Nozari, S.; Smith, G.E.B.; Moore, L.; Sharma, S.; et al. Transition Pathways to Sustainable Agricultural Water Management: A Review of Integrated Modeling Approaches. JAWRA J. Am. Water Resour. Assoc. 2019, 55, 6–23. [Google Scholar] [CrossRef]
Georgiou, Y.; Zhou, N.; Zhong, L.; Hoppe, D.; Pospieszny, M.; Papadopoulou, N.; Nikas, K.; Nikolos, O.L.; Kranas, P.; Karagiorgou, S.; et al. Converging HPC, Big Data and Cloud Technologies for Precision Agriculture Data Analytics on Supercomputers. In Proceedings of the High Performance Computing; Jagode, H., Anzt, H., Juckeland, G., Ltaief, H., Eds.; Springer: Cham, Switzerland, 2020; pp. 368–379. [Google Scholar]
Luo, J.; Zhao, C.; Chen, Q.; Li, G. Using Deep Belief Network to Construct the Agricultural Information System Based on Internet of Things. J. Supercomput. 2022, 78, 379–405. [Google Scholar] [CrossRef]
Agrawal, N.; Das, A.; Modani, M. Scalability Analysis of Weather Research Forecast Model on NVIDIA Ampere Based Dense GPU Cluster. In Proceedings of the 2022 International Conference on Computing, Communication, Security and Intelligent Systems (IC3SIS), Kochi, India, 23–25 June 2022; IEEE: New York, NY, USA, 2022; pp. 1–6. [Google Scholar]
Supercomputing|NASA Center for Climate Simulation. Available online: https://www.nccs.nasa.gov/services/high-performance-computing (accessed on 29 December 2022).
Streich, J.; Romero, J.; Gazolla, J.G.F.M.; Kainer, D.; Cliff, A.; Prates, E.T.; Brown, J.B.; Khoury, S.; Tuskan, G.A.; Garvin, M.; et al. Can Exascale Computing and Explainable Artificial Intelligence Applied to Plant Biology Deliver on the United Nations Sustainable Development Goals? Curr. Opin. Biotechnol. 2020, 61, 217–225. [Google Scholar] [CrossRef]
Jägermeyr, J.; Müller, C.; Ruane, A.; Elliott, J.; Balkovič, J.; Castillo, O.; Faye, B.; Foster, I.; Folberth, C.; Franke, J.; et al. Climate Impacts on Global Agriculture Emerge Earlier in New Generation of Climate and Crop Models. Nat. Food 2021, 2, 873–885. [Google Scholar] [CrossRef]
Franco-Sierra, N.D.; Posada, L.F.; Santa-María, G.; Romero-Tabarez, M.; Villegas-Escobar, V.; Álvarez, J.C. Bacillus Subtilis EA-CB0575 Genome Reveals Clues for Plant Growth Promotion and Potential for Sustainable Agriculture. Funct. Integr. Genom. 2020, 20, 575–589. [Google Scholar] [CrossRef]
Chandrashekhar, B.N.; Sanjay, H.A. High-Performance Computing with Artificial Intelligence Benefits for the Civilization Impacted by the COVID-19 Pandemic. In Emerging IT/ICT and AI Technologies Affecting Society; Chaurasia, M.A., Juang, C.-F., Eds.; Lecture Notes in Networks and Systems; Springer Nature: Singapore, 2023; Volume 478, pp. 107–118. ISBN 978-981-19294-0-3. [Google Scholar]
Munayco, C.V.; Tariq, A.; Rothenberg, R.; Soto-Cabezas, G.G.; Reyes, M.F.; Valle, A.; Rojas-Mezarina, L.; Cabezas, C.; Loayza, M.; Chowell, G. Early Transmission Dynamics of COVID-19 in a Southern Hemisphere Setting: Lima-Peru: 29 February–30 March 2020. Infect. Dis. Model. 2020, 5, 338–345. [Google Scholar] [CrossRef]
Pathak, Y.; Shukla, P.K.; Arya, K.V. Deep Bidirectional Classification Model for COVID-19 Disease Infected Patients. IEEE/ACM Trans. Comput. Biol. Bioinform. 2021, 18, 1234–1241. [Google Scholar] [CrossRef]
Gharizadeh, B.; Yue, J.; Yu, M.; Liu, Y.; Zhou, M.; Lu, D.; Zhang, J. Navigating the Pandemic Response Life Cycle: Molecular Diagnostics and Immunoassays in the Context of COVID-19 Management. IEEE Rev. Biomed. Eng. 2021, 14, 30–47. [Google Scholar] [CrossRef]
Gao, M.; Coletti, M.; Davidson, R.B.; Prout, R.; Abraham, S.; Hernández, B.; Sedova, A. Proteome-Scale Deployment of Protein Structure Prediction Workflows on the Summit Supercomputer. In Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), Lyon, France, 30 May–3 June 2022; IEEE: New York, NY, USA, 2022; pp. 206–215. [Google Scholar]
Cortés, U.; Cortés, A.; Garcia-Gasulla, D.; Pérez-Arnal, R.; Álvarez-Napagao, S.; Àlvarez, E. The Ethical Use of High-Performance Computing and Artificial Intelligence: Fighting COVID-19 at Barcelona Supercomputing Center. AI Ethics 2022, 2, 325–340. [Google Scholar] [CrossRef]
Ackerman, M.J.; Howe, S.E.; Masys, D.R. Don Lindberg, High Performance Computing and Communications, and Telemedicine. Inf. Serv. Use 2022, 42, 117–127. [Google Scholar] [CrossRef]
Yamazaki, T.; Igarashi, J.; Yamaura, H. Human-Scale Brain Simulation via Supercomputer: A Case Study on the Cerebellum. Neuroscience 2021, 462, 235–246. [Google Scholar] [CrossRef]
Dipietro, L.; Elkin-Frankston, S.; Ramos-Estebanez, C.; Wagner, T. Methodologies and Applications of Supercomputing, 1st ed.; IGI Global: Hershey, PA, USA, 2021; pp. 290–300. ISBN 978-1-79987-156-9. [Google Scholar]
Brown, N.; Nash, R.; Poletti, P.; Guzzetta, G.; Manica, M.; Zardini, A.; Flatken, M.; Vidal, J.; Gueunet, C.; Belikov, E.; et al. Utilising Urgent Computing to Tackle the Spread of Mosquito-Borne Diseases. In Proceedings of the 2021 IEEE/ACM HPC for Urgent Decision Making (UrgentHPC), St. Louis, MO, USA, 19 November 2021; IEEE: New York, NY, USA, 2021; pp. 36–44. [Google Scholar]
Barakat, C.; Fritsch, S.; Riedel, M.; Brynjólfsson, S. An HPC-Driven Data Science Platform to Speed-up Time Series Data Analysis of Patients with the Acute Respiratory Distress Syndrome. In Proceedings of the 2021 44th International Convention on Information, Communication and Electronic Technology (MIPRO), Opatija, Croatia, 27 September–1 October 2021; IEEE: New York, NY, USA, 2021; pp. 311–316. [Google Scholar]
May, A. Why Healthcare Needs a Supercomputer. AIMed Artificial Intelligence Medicine. 2021. Available online: https://ai-med.io/analysis/technology/why-healthcare-needs-a-supercomputer/ (accessed on 1 December 2022).
Improving Insurance Fraud Detection Using AI Running on Cloud-Based GPU-Accelerated Systems. Available online: https://www.hpcwire.com/solution_content/microsoft-nvidia/improving-insurance-fraud-detection-using-ai-running-on-cloud-based-gpu-accelerated-systems/ (accessed on 1 December 2022).
November 2022|TOP500. Available online: https://www.top500.org/lists/top500/2022/11/ (accessed on 3 January 2023).
Kindratenko, V.; Trancoso, P. Trends in High-Performance Computing. Comput. Sci. Eng. 2011, 13, 92–95. [Google Scholar] [CrossRef]
Kudo, S.; Nitadori, K.; Ina, T.; Imamura, T. Prompt Report on Exa-Scale HPL-AI Benchmark. In Proceedings of the 2020 IEEE International Conference on Cluster Computing (CLUSTER), Kobe, Japan, 14–17 September 2020; IEEE: New York, NY, USA, 2020. [Google Scholar]
Li, M.; Chen, J.; Xiao, Q.; Wang, F.; Jiang, Q.; Zhao, X.; Lin, R.; An, H.; Liang, X.; He, L. Bridging the Gap between Deep Learning and Frustrated Quantum Spin System for Extreme-Scale Simulations on New Generation of Sunway Supercomputer. IEEE Trans. Parallel Distrib. Syst. 2022, 33, 2846–2859. [Google Scholar] [CrossRef]
Liu, R.; Jia, J.; Zhou, Y.; Zhou, Y.; Liu, Y. Training Deep Neural Networks with Large-Scale Datasets on Sunway High Performance Computer. In Proceedings of the 2022 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA), Dalian, China, 24–26 June 2022; IEEE: New York, NY, USA, 2022; pp. 466–471. [Google Scholar]
Siegel, H.J.; Abraham, S.; Bain, W.L.; Batcher, K.E.; Casavant, T.L.; DeGroot, D.; Dennis, J.B.; Douglas, D.C.; Feng, T.-Y.; Goodman, J.R.; et al. Report of the Purdue Workshop on Grand Challenges in Computer Architecture for the Support of High Performance Computing. J. Parallel Distrib. Comput. 1992, 16, 199–211. [Google Scholar] [CrossRef]
Al-Shammari, M.M.; Haque, A.; Rahman, M.M.H. Midimew Connected Torus Network for Next Generation Massively Parallel Computer System. Procedia Comput. Sci. 2021, 179, 590–597. [Google Scholar] [CrossRef]
Moudi, M.; Othman, M.; Lun, K.Y.; Rahiman, A.R.A. X-Folded TM: An Efficient Topology for Interconnection Networks. J. Netw. Comput. Appl. 2016, 73, 27–34. [Google Scholar] [CrossRef]
Al Faisal, F.; Rahman, M.M.H.; Inoguchi, Y. HFBN: An Energy Efficient High Performance Hierarchical Interconnection Network for Exascale Supercomputer. IEEE Access 2021, 10, 3088–3104. [Google Scholar] [CrossRef]
Kurokawa, M. The K Computer: 10 Peta-FLOPS Supercomputer. In Proceedings of the 10th International Conference on Optical Internet (COIN2012), Kanagawa, Japan, 29–31 May 2012; IEEE: New York, NY, USA, 2012; p. 1. [Google Scholar]
Awal, M.R.; Rahman, M.M.; Mohd Nor, R.; Sembok, T.; Akhand, M.A.H. Architecture and Network-on-Chip Implementation of a New Hierarchical Interconnection Network. J. Circuits Syst. Comput. 2014, 24, 1540006. [Google Scholar] [CrossRef]
Rahman, M.M.H.; Inoguchi, Y.; Sato, Y.; Miura, Y.; Horiguchi, S. Dynamic Communication Performance of a TESH Network under the Nonuniform Traffic Patterns. In Proceedings of the 2008 11th International Conference on Computer and Information Technology, Khulna, Bangladesh, 24–27 December 2008; IEEE: New York, NY, USA, 2008; pp. 365–370. [Google Scholar]
Rahman, M.M.H.; Al-Naeem, M.; Ali, M.N.M.; Sufian, A. TFBN: A Cost Effective High Performance Hierarchical Interconnection Network. Appl. Sci. 2020, 10, 8252. [Google Scholar] [CrossRef]
Rahman, M.M.; Inoguchi, Y.; Sato, Y.; Horiguchi, S. TTN: A High Performance Hierarchical Interconnection Network for Massively Parallel Computers. IEICE Trans. Inf. Syst. 2009, E92D. [Google Scholar] [CrossRef]
Faisal, F.A.; Rahman, M.M.H.; Inoguchi, Y. A New Power Efficient High Performance Interconnection Network for Many-Core Processors. J. Parallel Distrib. Comput. 2017, 101, 92–102. [Google Scholar] [CrossRef]
Chaintreau, A.; Mtibaa, A.; Massoulié, L.; Diot, C. The Diameter of Opportunistic Mobile Networks. In Proceedings of the 2007 ACM Conference on Emerging Network Experiment and Technology, New York, NY, USA, 10–13 December 2007; ACM: New York, NY, USA, 2007; p. 12. [Google Scholar]
Ali, M.N.M.; Rahman, M.M.H.; Ibrahim, A.A.; Al-Naeem, M.; Hossain, E. A High Static Performance Hierarchical Three-Dimensional Shifted Completely Connected Network. IEEE Access 2022, 10, 43812–43836. [Google Scholar] [CrossRef]
Sohaini, M.H.; Hafizur Rahman, M.M.; Nor, R.M.; Sembok, T.M.T.; Akhand, M.A.H.; Inoguchi, Y. A Low Hop Distance Hierarchical Interconnection Network. In Proceedings of the 2015 2nd International Conference on Electrical Information and Communication Technologies (EICT), Khulna, Bangladesh, 10–12 December 2015; pp. 39–43. [Google Scholar]
Parhami, B.; Rakov, M. Perfect Difference Networks and Related Interconnection Structures for Parallel and Distributed Systems. IEEE Trans. Parallel Distrib. Syst. 2005, 16, 714–724. [Google Scholar] [CrossRef]
Bossard, A.; Kaneko, K. Cluster-Fault Tolerant Routing in a Torus. Sensors 2020, 20, 3286. [Google Scholar] [CrossRef]
Faisal, F.A.; Rahman, M.M.H. Symmetric Tori Connected Torus Network. In Proceedings of the 2009 12th International Conference on Computers and Information Technology, Dhaka, Bangladesh, 21–23 December 2009; IEEE: New York, NY, USA, 2009; pp. 174–179. [Google Scholar]
Krishnamoorthy, A.V.; Ho, R.; Zheng, X.; Schwetman, H.; Lexau, J.; Koka, P.; Li, G.; Shubin, I.; Cunningham, J.E. Computer Systems Based on Silicon Photonic Interconnects. Proc. IEEE 2009, 97, 1337–1361. [Google Scholar] [CrossRef]

Figure 1. Hierarchy of MTNs (2,2,0).

Figure 2. Complete level 2 connection of MTNs (2,2,0).

Figure 3. Simplified routing steps of MTN.

Figure 4. Example of source and destination pairs in an MTN.

Figure 5. Node degree of various networks when increasing the network size.

Figure 6. Arc connectivity of various networks when increasing the network size.

Figure 7. Bisection width of various networks when increasing the network size.

Figure 8. Wire complexity of various networks when increasing the network size.

Table 1. All levels of MTNs (2, L, 0).

Level	Consists of	Total Nodes	Free Ports Used	Wire Complexity
Level 2	16 of 4 × 4 torus	256	4	544
Level 3	16 of 4 × 4 level 2	4096	8	8736
Level 4	16 of 4 × 4 level 3	65,536	12	139,808
Level 5	16 of 4 × 4 level 4	1,048,576	16	2,236,960

Table 2. Comparison of static network performance parameters.

Network (256 nodes)	Diameter	Average Distance	Node Degree	Bisection Width	Arc Connectivity	Wire Complexity
16 $\times$ 16 Mesh	30	10.67	4	16	2	480
16 $\times$ 16 Torus	16	8	4	32	4	512
TESH (2,2,0)	21	10.47	4	8	2	416
TTN (2,2,0)	15	7.44	6	8	4	544
TFBN (2,2,0)	10	5.75	8	8	4	800
MMN (2,2,0)	17	9.07	4	8	2	416
MTNs (2,2,0)	17	6.93	6	8	4	544

Table 3. Static network cost comparison.

Network (256 nodes)	Cost	Cost Performance Trade-Off Factor	Cost Effective Factor	Time Cost Effective Factor	Packing Density	Message Traffic Density
16 $\times$ 16 Mesh	120	0.2500	0.8421	1.67868	2.1333	5.69066
16 $\times$ 16 Torus	64	0.5	0.8333	1.66125	4.0000	4.000
TESH (2,2,0)	84	0.309	0.8602	1.71466	3.0476	6.4430
TTN (2,2,0)	90	0.850	0.8247	1.64418	2.8444	3.5011
TFBN (2,2,0)	80	2.5	0.7619	1.51928	3.2000	1.84
MMN (2,2,0)	68	0.382	0.8602	1.71466	3.7647	5.5815
MTNs (2,2,0)	102	0.75	0.8247	1.64418	2.5098	3.2611

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

