Article

Evolutionary Dynamics of Division of Labor Games for Underwater Searching Tasks

by Minglei Xiong 1,*,†,‡ and Guangming Xie 1,2,*,‡

1 State Key Laboratory for Turbulence and Complex Systems, Intelligent Biomimetic Design Lab, College of Engineering, Peking University, Beijing 100871, China
2 Peng Cheng Laboratory, Shenzhen 518055, China
* Authors to whom correspondence should be addressed.
† Current address: Boya Gongdao (Beijing) Robot Technology Co., Ltd., Beijing 100176, China.
‡ These authors contributed equally to this work.
Symmetry 2022, 14(5), 941; https://doi.org/10.3390/sym14050941
Submission received: 27 March 2022 / Revised: 23 April 2022 / Accepted: 2 May 2022 / Published: 5 May 2022
(This article belongs to the Topic Dynamical Systems: Theory and Applications)

Abstract
Division of labor in self-organized groups is a problem of both theoretical significance and practical value, since many real-world applications require efficient task allocation. We propose a model that combines bio-inspiration with evolutionary game theory to analyze the problem of target search in unknown areas by multi-robot systems. When the robots operate underwater, the problem becomes more complicated because information sharing is restricted. The model drives strategy updates with the commonly used Fermi function and calculates the fixation probabilities of the relevant strategies. For the two-player game, we estimate the fixation probability under arbitrary selection intensity, as well as the fixation probability and fixation time under weak selection. For the multi-player game, we obtain the corresponding results under weak selection, which is conducive to the coexistence of the two strategies. Simulations confirm our analysis. These results help in understanding and designing effective mechanisms in which self-organizing collective dynamics appears in the form of maximizing the benefits of multi-agent systems in the case of the asymmetric game.

1. Introduction

Studying cooperative behavior in finite populations is a long-standing problem spanning many disciplines, and its practical motivation is simple: many scenarios require multiple robots to perform tasks together, and cooperative robots are widely deployed in society and engineering, where we need them to form efficient groups. In many cases, the optimal strategy for an individual conflicts with the optimal decision for the group [1,2,3,4]. This poses a major challenge for evolutionary models, as many of them predict that cooperation collapses when free riders can exploit it. Evolutionary game theory is widely used to study such problems [5,6,7,8].
A variety of specific game models describe conflicts of interest between individuals and groups [9,10]. A widely used example is the prisoner’s dilemma [11]. Another is the public goods game, a common model of intragroup interaction [12,13] and a concise description of the cooperative conflict between group and individual interests [14,15]. Studies combining evolutionary game theory and cooperation have also identified many factors that promote cooperation [16,17].
Real game scenarios usually involve a limited number of players. To analyze this case, researchers mostly employ dynamic evolution analysis methods for finite populations based on stochastic processes [18]. Common examples include the Moran [19], Wright-Fisher [19], local update [20], and pairwise comparison processes. In these studies, it is essential to calculate and analyze the fixation probability and fixation time, which refer, respectively, to the probability that a strategy succeeds and takes over the whole population and the time this takes. Such studies use selection intensity to measure the impact of game payoffs on fitness, which directly affects the difference in fitness. Weak selection is a common assumption, implying that payoff differences between strategies play a relatively minor role in the evolutionary process.
Labor division is highly representative among the specific applications of evolutionary games [21,22,23]. In real societies, scenarios of labor division, in which groups of individuals repeatedly and non-randomly perform specific tasks, are common [24,25]. In these scenarios, groups show a need for labor division, that is, an unequal distribution of work among or within specific tasks [26,27,28,29]. The cooperative search of multiple robots is a typical problem of this kind [30,31,32,33]: searching for targets within an unknown environment is a real problem with broad application backgrounds, in which multiple robots search for targets at unknown positions in an unknown area [34,35]. In this scenario, the question is whether individuals cooperate. Cooperation is an altruistic strategy that means sharing information with other individuals; in contrast, selfishness means not sharing information with others.
Sharing information implies that individuals must spend a certain cost to store and transmit it [36,37,38]. This cost arises because the computing and communication units consume energy, affecting the robot’s running time and speed. The impact is relatively negligible for land robots but profound for unmanned vehicles, especially underwater ones [39,40].
When designing task allocation algorithms, we find that low-cost swarm robots may be a better solution in some specific scenarios, but determining what kind of swarm can best perform complex tasks is a difficult problem. We wish to propose a framework for exploring whether individuals in a group cooperate when performing tasks, while recognizing that no single framework can fit all scenarios. The motivation of this study is to develop a strategy evolution model of the labor division game based on an underwater multi-robot search problem and to investigate its evolutionary dynamics in a finite population. The study assumes two types of strategies, representing two categories of robots, A and B; each player chooses one of them, and performing each task brings corresponding benefits and costs. Tasks A and B correspond to selfish and cooperative individual strategies, respectively. We study both two- and multi-player games. The results theoretically explain the impact of different selection intensities on the fixation probabilities of the strategies and yield the fixation probability and fixation time under weak selection. Simulations check the effectiveness of the theoretical analysis.
The main goal of our work is to establish a connection between theoretical analysis and real applications. When we use multiple robots to perform a task, selecting or designing a suitable robot swarm is a difficult problem, as is task allocation within the swarm. It is well known that self-organizing task allocation is a basic attribute of an intelligent group. Collaboration among individuals can improve the efficiency with which the swarm performs tasks; however, it makes the design of the swarm more complex and may also increase the cost of the robots. Here, we propose a model from the perspective of information sharing by introducing game theory. Information sharing undoubtedly promotes cooperation among individuals and may facilitate the execution of tasks, but it may also affect the efficiency with which individuals perform tasks. We hope to find an optimal balance between information sharing and individual selfishness in a specific scenario. We expect that our results can be applied to actual search tasks in unknown areas. This is one of the main differences from previous works.
The organizational structure of this paper is as follows. Section 2 describes the main model and theoretical results of the two-player game. Section 3 presents the fixation probability and fixation time under weak selection. Section 4 explains the results of a model involving multiple players in a game. Section 5 offers the results of computer simulations. Finally, Section 6 concludes the paper.

2. Two-Player Game with Dual-Strategy Task

We consider two types of robot players: those using strategy A and those using strategy B. Each player chooses a task as its strategy, gaining the task’s benefits and bearing its costs. In this case, A robots search only by themselves without sharing information, while B robots share information with other individuals, regardless of those individuals’ strategies. Because of this difference, their benefits, $b_A$ and $b_B$, differ, as do their costs, $c_A$ and $c_B$. Because of the altruistic behavior of B robots, that is, their sharing of information with other individuals, their opponents obtain additional synergy benefits: as an opponent of B, an A-player gets $\alpha$ and a B-player gets $\beta$. The following payoff matrix defines this scenario:
$$
\begin{array}{c|cc}
 & A & B \\ \hline
A & b_A - c_A & b_A - c_A + \alpha \\
B & b_B - c_B & b_B - c_B + \beta
\end{array}.
$$
Let $b_A - c_A = x$ and $b_B - c_B = y$, and let $\sigma = x - y$ represent the net income difference between tasks A and B. The simplified payoff matrix is as follows:
$$
\begin{array}{c|cc}
 & A & B \\ \hline
A & x & x + \alpha \\
B & y & y + \beta
\end{array}.
$$
Suppose $N$ is the size of the investigated population, in which $j$ A-players and $N-j$ B-players coexist. Equation (1) shows $\pi_A$, the expected payoff of a player performing task A,
$$\pi_A = \frac{j-1}{N-1}\,x + \frac{N-j}{N-1}\,(x+\alpha) = x + \frac{N-j}{N-1}\,\alpha.$$
Equation (2) represents the B-players’ expected payoff,
$$\pi_B = \frac{j}{N-1}\,y + \frac{N-j-1}{N-1}\,(y+\beta) = y + \frac{N-j-1}{N-1}\,\beta.$$
Equation (3) describes the payoff difference between tasks A and B in the group,
$$\pi_A - \pi_B = \sigma + \frac{(N-j)\,\alpha - (N-j-1)\,\beta}{N-1}.$$
Equation (4) is the often-used Fermi function for the strategy update,
$$p = \frac{1}{1 + \exp[-\omega(\pi_A - \pi_B)]},$$
where $p$ is the probability that an A-player replaces a B-player during evolution; $\pi_A$ and $\pi_B$ represent the payoffs of the focal A- and B-players, respectively; and $\omega$ indicates the selection strength. This research also focuses on results under weak selection (Section 3 gives details). First, we present results without restricting the selection strength.
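The payoff expressions and the Fermi rule above are straightforward to compute. The following Python sketch evaluates Equations (1), (2), and (4); all numerical values are illustrative assumptions, not parameters taken from the paper:

```python
import math

# Hypothetical parameter values for illustration only.
N, j = 50, 20            # population size, current number of A-players
x, y = 3.0, 1.0          # net incomes x = b_A - c_A, y = b_B - c_B
alpha, beta = 1.0, 1.0   # synergy payoffs earned against a B-opponent
omega = 0.1              # selection intensity

def payoff_A(j, N, x, alpha):
    """Expected payoff of an A-player, Equation (1)."""
    return x + (N - j) / (N - 1) * alpha

def payoff_B(j, N, y, beta):
    """Expected payoff of a B-player, Equation (2)."""
    return y + (N - j - 1) / (N - 1) * beta

def fermi(pi_a, pi_b, omega):
    """Fermi update rule, Equation (4): probability that an
    A-player replaces a B-player."""
    return 1.0 / (1.0 + math.exp(-omega * (pi_a - pi_b)))

p = fermi(payoff_A(j, N, x, alpha), payoff_B(j, N, y, beta), omega)
```

With equal payoffs the rule returns 1/2, and it approaches a step function as $\omega$ grows, which is why $\omega$ acts as a selection intensity.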
Following the analysis of the Moran process, Equation (5) presents the probability $T_j^{+}$ of increasing the number of A-players from $j$ to $j+1$ and the probability $T_j^{-}$ of decreasing it from $j$ to $j-1$,
$$T_j^{\pm} = \frac{j}{N}\,\frac{N-j}{N}\,\frac{1}{1 + \exp[\mp\omega(\pi_A - \pi_B)]}.$$
The probability that all players eventually adopt strategy A from any initial configuration relies on the ratio $\gamma_j = T_j^{-}/T_j^{+}$, given by Equation (6),
$$\gamma_j = \frac{T_j^{-}}{T_j^{+}} = \exp[-\omega(\pi_A - \pi_B)].$$
Equation (7) represents the fixation probability,
$$\phi_k = \frac{\sum_{i=0}^{k-1}\prod_{j=1}^{i}\gamma_j}{\sum_{i=0}^{N-1}\prod_{j=1}^{i}\gamma_j}.$$
Substituting Equations (3) and (6) into Equation (7) gives Equation (8),
$$\phi_k = \frac{\sum_{i=0}^{k-1}\exp\!\left[-i\sigma\omega + \dfrac{\omega i}{N-1}\left(\dfrac{(i+1)(\alpha-\beta)}{2} + (N-1)\beta - N\alpha\right)\right]}{\sum_{i=0}^{N-1}\exp\!\left[-i\sigma\omega + \dfrac{\omega i}{N-1}\left(\dfrac{(i+1)(\alpha-\beta)}{2} + (N-1)\beta - N\alpha\right)\right]}.$$
The fixation probability can also be calculated by the integral approximation formula [27], giving Equation (9):
$$\phi_k = \frac{\operatorname{erf}(\varepsilon_k) - \operatorname{erf}(\varepsilon_0)}{\operatorname{erf}(\varepsilon_N) - \operatorname{erf}(\varepsilon_0)},$$
where $\operatorname{erf}(x) = \frac{2}{\sqrt{\pi}}\int_0^x e^{-y^2}\,dy$ and $\varepsilon_k = \sqrt{\dfrac{\omega}{2(\beta-\alpha)(N-1)}}\,\bigl[(N-1)\sigma + (N-k)\alpha + (k-N+1)\beta\bigr]$.
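The exact formula of Equation (8) and the error-function approximation can be compared numerically. The Python sketch below implements both; the parameter values in the test are illustrative only (they correspond to a B-dominant scenario with $\beta > \alpha$, which the approximation requires), and for $\omega = 0$ the exact formula reduces to the neutral result $k/N$:

```python
import math

def fixation_exact(k, N, sigma, alpha, beta, omega):
    """Exact fixation probability of k initial A-players,
    Equations (7)-(8) of the two-player model."""
    def log_prod(i):
        # log of prod_{j=1}^{i} gamma_j in the closed form of Eq. (8)
        return -i * sigma * omega + omega * i / (N - 1) * (
            (i + 1) * (alpha - beta) / 2 + (N - 1) * beta - N * alpha)
    terms = [math.exp(log_prod(i)) for i in range(N)]
    return sum(terms[:k]) / sum(terms)

def fixation_erf(k, N, sigma, alpha, beta, omega):
    """Error-function approximation, Equation (9); requires beta > alpha
    so that the square root is real."""
    def eps(kk):
        return math.sqrt(omega / (2 * (beta - alpha) * (N - 1))) * (
            (N - 1) * sigma + (N - kk) * alpha + (kk - N + 1) * beta)
    return ((math.erf(eps(k)) - math.erf(eps(0)))
            / (math.erf(eps(N)) - math.erf(eps(0))))
```

For small $\omega$ and moderate $N$ the two functions agree closely, which is a quick check that the closed form of Equation (8) was assembled correctly.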

2.1. Scenario 1: Game with a Dominating Strategy

Suppose $\sigma > 0$ and $\sigma + \alpha > \beta$; the payoff matrix in the framework of labor division then describes a prisoner’s dilemma game. In this case, an A-player always gets a higher income than a B-player, so strategy A dominates strategy B. Figure 1 presents such a scenario, showing the evolution of strategies for different values of $\sigma$ (2.01, 3, 4, and 5) and $\omega$ (0.01, 0.05, 0.1, and 1), where $\beta = 1$ and $\alpha = 1$.
If $\sigma < 0$ and $\sigma + \alpha < \beta$, the payoff matrix in the framework of labor division describes a prisoner’s dilemma game with dominating strategy B. Figure 2 shows the evolution of strategies for different values of $\sigma$ (−2.01, −3, −4, and −5) and $\omega$ (0.01, 0.05, 0.1, and 1), where $\beta = 8$ and $\alpha = 6$.

2.2. Scenario 2: Coordination Game

If $\sigma > 0$ and $\sigma + \alpha < \beta$, the payoff matrix in the framework of labor division describes a coordination game. In this case, players should always use the same strategy. Figure 3 shows the evolution of the strategies in such a scenario.

2.3. Scenario 3: Game with Coexisting A and B

Suppose $\sigma < 0$ and $\sigma + \alpha > \beta$; the payoff matrix in the framework of labor division then describes a coexistence game. In this case, players should always use different strategies. Typical examples are the hawk-dove and snowdrift games. Figure 4 illustrates the evolution of the strategies in this scenario.

3. Two-Player Game under Weak Selection

3.1. Fixation Probability

Using a Taylor series expansion in $\omega$, the transition probabilities of Equation (5) simplify to Equation (10),
$$T_j^{\pm} \approx \frac{j}{N}\,\frac{N-j}{N}\left[\frac{1}{2} \pm \frac{1}{4}\,\omega(\pi_A - \pi_B)\right].$$
The ratio of these transition probabilities plays a vital role in determining which state the system evolves into, and is given by Equation (11),
$$\gamma_j = \frac{T_j^{-}}{T_j^{+}} \approx 1 - \omega(\pi_A - \pi_B).$$
In the Fermi process, $T_j^{+}(\omega=0) = T_j^{-}(\omega=0) = \frac{1}{2}\,\frac{j}{N}\,\frac{N-j}{N}$ are the neutral transition probabilities, giving $\gamma_j = 1$. Based on Equation (7), the fixation probability under neutral selection is Equation (12),
$$\phi_k(\omega = 0) = \frac{k}{N}.$$

3.2. Average Time

Under weak selection, the average (unconditional) fixation time starting from a single A-player in the population is $t_1$. Equation (13) shows the general form of $t_1$,
$$t_1 \approx 2N H_{N-1} + \omega v\,(N - 1 - H_{N-1}),$$
where $H_{N-1} = \sum_{l=1}^{N-1}\frac{1}{l}$ and the payoff difference is written as $\pi_A - \pi_B = u j + v$. In this research, $\pi_A - \pi_B = \sigma + \frac{(N-j)\alpha - (N-j-1)\beta}{N-1}$, so that $u = \frac{\beta-\alpha}{N-1}$ and $v = \sigma + \frac{N}{N-1}\alpha - \beta$. Therefore, Equation (14) represents the estimated average time,
$$t_1 \approx 2N H_{N-1} + \omega\,(N - 1 - H_{N-1})\left(\sigma + \frac{N}{N-1}\,\alpha - \beta\right).$$

3.3. Conditional Fixation Time

Under weak selection, the conditional fixation time starting from a single A-player in the population is $t_1^A$. Equation (15) shows the general form of $t_1^A$,
$$t_1^A \approx 2N(N-1) - \omega u N(N-1)\,\frac{N^2 + N - 6}{18}.$$
Substituting $u = \frac{\beta-\alpha}{N-1}$, Equation (16) represents the conditional fixation time,
$$t_1^A \approx 2N(N-1) - \omega N(\beta - \alpha)\,\frac{N^2 + N - 6}{18}.$$
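The leading (neutral) term of Equation (13) can be checked by simulating the pairwise comparison process directly and averaging the number of elementary update steps until one strategy fixates. The Python sketch below uses hypothetical parameter values and measures time in elementary update steps; it is an illustration, not the paper's code:

```python
import math, random

def simulate_t1(N, omega, sigma, alpha, beta, rng):
    """One run of the pairwise comparison (Fermi) process starting from a
    single A-player; returns the number of elementary update steps until
    either strategy fixates."""
    j, t = 1, 0
    while 0 < j < N:
        t += 1
        # payoff difference, Equation (3)
        diff = sigma + ((N - j) * alpha - (N - j - 1) * beta) / (N - 1)
        p_plus = (j / N) * ((N - j) / N) / (1 + math.exp(-omega * diff))
        p_minus = (j / N) * ((N - j) / N) / (1 + math.exp(omega * diff))
        r = rng.random()
        if r < p_plus:
            j += 1
        elif r < p_plus + p_minus:
            j -= 1
    return t

# Neutral benchmark: Equation (13) predicts t_1 -> 2*N*H_{N-1} as omega -> 0.
rng = random.Random(1)
N, runs = 16, 8000
avg = sum(simulate_t1(N, 0.0, 0.0, 1.0, 1.0, rng) for _ in range(runs)) / runs
predicted = 2 * N * sum(1 / l for l in range(1, N))
```

Setting $\omega > 0$ in `simulate_t1` also allows the first-order corrections of Equations (13)-(16) to be probed numerically.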

4. Multi-Player Game

Assume that the group size is $N$ and that $n$ players participate in each game. If $m$ of the other $n-1$ individuals perform task A, then $a_m$ and $b_m$ represent the corresponding payoffs when the focal individual performs task A or B, respectively. For more details, please see Table 1.
According to the payoff matrix, the expressions for $a_m$ and $b_m$ are as follows:
$$a_m = b_A m + (b_A + \alpha)(n - 1 - m) - c_A = b_A(n-1) + \alpha(n-1-m) - c_A,$$
$$b_m = b_B m + (b_B + \beta)(n - 1 - m) - c_B = b_B(n-1) + \beta(n-1-m) - c_B.$$
Let $p = b_A(n-1) - c_A$, $q = b_B(n-1) - c_B$, and $\tau = p - q$. Then $a_m$ and $b_m$ simplify to
$$a_m = p + \alpha(n-1-m),$$
$$b_m = q + \beta(n-1-m).$$
Assume a well-mixed finite population of size $N$ in which $j$ individuals perform task A and $N-j$ perform task B. The probability that an individual performing task A interacts with $m$ other task-A individuals follows a hypergeometric distribution. Therefore, Equation (21) shows the average payoff of a task-A individual,
$$\pi_A = \sum_{m=0}^{n-1} \frac{\binom{j-1}{m}\binom{N-j}{n-1-m}}{\binom{N-1}{n-1}}\,a_m.$$
Equation (22) shows the average payoff of a task-B individual,
$$\pi_B = \sum_{m=0}^{n-1} \frac{\binom{j}{m}\binom{N-j-1}{n-1-m}}{\binom{N-1}{n-1}}\,b_m.$$
Equation (23) represents the fixation probability $\phi_k$,
$$\phi_k = \frac{\sum_{i=0}^{k-1}\prod_{j=1}^{i}\gamma_j}{\sum_{i=0}^{N-1}\prod_{j=1}^{i}\gamma_j}
= \frac{\sum_{i=0}^{k-1}\prod_{j=1}^{i}\exp\!\left[-\omega\!\left(\sum_{m=0}^{n-1}\frac{\binom{j-1}{m}\binom{N-j}{n-1-m}}{\binom{N-1}{n-1}}a_m - \sum_{m=0}^{n-1}\frac{\binom{j}{m}\binom{N-j-1}{n-1-m}}{\binom{N-1}{n-1}}b_m\right)\right]}{\sum_{i=0}^{N-1}\prod_{j=1}^{i}\exp\!\left[-\omega\!\left(\sum_{m=0}^{n-1}\frac{\binom{j-1}{m}\binom{N-j}{n-1-m}}{\binom{N-1}{n-1}}a_m - \sum_{m=0}^{n-1}\frac{\binom{j}{m}\binom{N-j-1}{n-1-m}}{\binom{N-1}{n-1}}b_m\right)\right]}.$$
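Equations (21)-(23) can be evaluated directly with exact binomial coefficients. The following Python sketch does so, using the simplified payoffs $a_m = p + \alpha(n-1-m)$ and $b_m = q + \beta(n-1-m)$; the parameter values in the test are illustrative assumptions:

```python
from math import comb, exp

def payoffs_multiplayer(j, N, n, p, q, alpha, beta):
    """Average payoffs pi_A, pi_B in the n-player game, Equations (21)-(22).
    j is the number of A-players; group composition follows a
    hypergeometric distribution. math.comb returns 0 when k > n, which
    handles the boundary terms automatically."""
    denom = comb(N - 1, n - 1)
    pi_a = sum(comb(j - 1, m) * comb(N - j, n - 1 - m) / denom
               * (p + alpha * (n - 1 - m))      # simplified a_m
               for m in range(n))
    pi_b = sum(comb(j, m) * comb(N - j - 1, n - 1 - m) / denom
               * (q + beta * (n - 1 - m))       # simplified b_m
               for m in range(n))
    return pi_a, pi_b

def fixation_multiplayer(k, N, n, p, q, alpha, beta, omega):
    """Fixation probability of k initial A-players, Equation (23)."""
    log_prod, prods = 0.0, [1.0]
    for j in range(1, N):
        pi_a, pi_b = payoffs_multiplayer(j, N, n, p, q, alpha, beta)
        log_prod += -omega * (pi_a - pi_b)
        prods.append(exp(log_prod))
    return sum(prods[:k]) / sum(prods)
```

For $n = 2$ the hypergeometric weights collapse to the two-player expressions of Equations (1) and (2), and for $\omega = 0$ the function again returns the neutral value $k/N$.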
Figure 5 and Figure 6 display the results for $n = 10$. Based on the results, task allocation in the multi-player game model is similar to that in the two-player game, exhibiting similar evolutionary dynamics. This also shows that the two-player model extends naturally to the multi-player setting.

5. Simulations

To verify the rationality of the proposed model, we performed several multi-robot search experiments in unknown environments. In our experiments, the unknown area is a square region containing 10 targets at unknown locations, and in each experiment a swarm of 10 robots with selfish or altruistic strategies performs the search task together. All targets are identical, and a robot finds a target by reaching its position; the experiment ends when all targets have been found. To be closer to real application scenarios, we created different search maps, 30% of each consisting of random obstacle areas, and we assume that the area to be searched is connected in every experiment. During a search, individuals with strategy A are selfish: they search only by themselves and do not share search information with other individuals, that is, they do not tell others which areas they have already searched. Notably, sharing information reduces duplicated searches, but it also slows a robot’s movement, because sharing requires additional computing, storage, and communication units.
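The full experimental setup is not specified here, so the following Python sketch is only a simplified, hypothetical version of one search trial: an obstacle-free grid (omitting the 30% obstacle areas), random-walking robots that prefer unvisited cells, and a per-tick movement probability `speed_ratio` standing in for the speed penalty paid by information sharers. All names and parameter values are assumptions for illustration:

```python
import random

def search_steps(n_sharers, n_selfish, grid=15, n_targets=10,
                 speed_ratio=0.7, rng=None):
    """One search trial on an obstacle-free square grid. Sharers pool a
    common set of visited cells and prefer globally unvisited neighbors,
    but each tick they move only with probability speed_ratio (the cost
    of communication); selfish robots move every tick and avoid only
    their own trail. Returns ticks until all targets are found."""
    rng = rng or random.Random()
    cells = [(x, y) for x in range(grid) for y in range(grid)]
    targets = set(rng.sample(cells, n_targets))
    shared = set()
    robots = []  # each robot: [position, own visited set, is_sharer flag]
    for i in range(n_sharers + n_selfish):
        pos = rng.choice(cells)
        robots.append([pos, {pos}, i < n_sharers])
        if i < n_sharers:
            shared.add(pos)
    ticks = 0
    while targets and ticks < 100_000:
        ticks += 1
        for robot in robots:
            (x, y), seen, is_sharer = robot
            if is_sharer and rng.random() > speed_ratio:
                continue  # sharing slows this robot down this tick
            neighbors = [(x + dx, y + dy)
                         for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1))
                         if 0 <= x + dx < grid and 0 <= y + dy < grid]
            memory = shared if is_sharer else seen
            fresh = [c for c in neighbors if c not in memory]
            robot[0] = rng.choice(fresh or neighbors)
            seen.add(robot[0])
            if is_sharer:
                shared.add(robot[0])
            targets.discard(robot[0])  # stepping on a target finds it
    return ticks
```

Averaging `search_steps` over many trials and over mixed compositions (`n_sharers` from 0 to 10) at different values of `speed_ratio` would produce curves qualitatively comparable to Figure 7.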
Figure 7 presents the results for different speed ratios. Some of the curves show an upward trend, some a downward trend, and some have peaks. These results support the assumptions of our earlier theoretical analysis: there is an optimal mixture of selfish and altruistic strategies in the group.
When r is relatively large, selfish individuals have a great advantage in moving speed, and the curves show a monotonically increasing trend. This indicates that speed plays a more decisive role than the benefit of obtaining shared information from other individuals; the best strategy in such a scenario is not to share information. When r is relatively small, the curves first increase and then decrease, showing that sharing information can bring benefits and reduce the number of search steps. Overall, our conclusion is that if withholding information brings a sufficient advantage in speed, not sharing information is the dominant strategy; conversely, sharing information is a good strategy if it does not reduce the moving speed too much.
The simulation results show that for the same task, different individual combinations will bring different results. In such a scenario, individuals need to cooperate, but cooperation requires costs. Such costs reduce the ability of the individual, and thus the efficiency of the group, to perform the task. The core issue is to strike a balance between individual competence and group coordination. Of course, such a balance will be highly uncertain. When generalizing such a model, we need to take this uncertainty into account.

6. Conclusions

Effective labor division is widely applied in many fields, such as engineering and multi-robot systems. Theoretically, realizing effective labor division is closely related to individual behavior and to the scheduling of group-level tasks. The critical point is how the rationality of labor division contributes to the collective interests of a multi-agent system. In a distributed group, task allocation emerges from self-organization, without central command or global information exchange.
The search of unknown environments by underwater robot swarms is a typical task allocation scenario. Both designing specialized robots and combining different robots to perform such tasks may be reasonable solutions; which is better? Our research tries to answer this question. To do so, we use evolutionary game theory to model the evolution of cooperation in labor division games, theoretically calculating fixation probabilities and fixation times, and we develop two- and multi-player models for different scenarios requiring labor division. In this specific scenario, whether individuals share information is a decisive factor: underwater information sharing requires additional sensors for information transmission and computation, and because underwater transmission is difficult, its cost is high. We therefore take information sharing as the cooperative strategy between individuals. Cooperation means paying a certain cost but obtaining more task information, whereas selfish individuals attend only to their own tasks. Moreover, the conducted simulations confirm the reliability of our model and provide new clues to the self-organizing behavior underlying the task allocation problems faced by groups.
This study is geared towards a specific problem, but we believe it can be generalized. The results suggest that unconditional cooperation is not always the best strategy; instead, individuals can adopt different strategies to improve the efficiency of the group. When using swarms to perform tasks, we can let some individuals cooperate while others adopt selfish strategies, and update the strategies dynamically during execution; this may yield the best results. Although we carried out this modeling and analysis for search problems in unknown underwater environments, the same idea may apply to similar problems, such as multi-UAV and multi-robot search. In the future, we will conduct more experiments and apply this framework to more practical applications.

Author Contributions

Conceptualization, M.X. and G.X.; writing—original draft preparation, M.X.; writing—review and editing, G.X.; project administration, G.X.; funding acquisition, G.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China under grant numbers 61973007 and 61633002, and in part by the Beijing Natural Science Foundation under grant 4192026.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Axelrod, R. Effective Choice in the Prisoner’s Dilemma. J. Confl. Resolut. 1980, 24, 3–25.
2. Balafoutas, L.; Nikiforakis, N.; Rockenbach, B. Altruistic punishment does not increase with the severity of norm violations in the field. Nat. Commun. 2016, 7, 13327.
3. Iranzo, J.; Buldú, J.; Aguirre, J. Competition among networks highlights the power of the weak. Nat. Commun. 2016, 7, 13273.
4. Xia, F.; Jedari, B.; Yang, L.T.; Ma, J.; Huang, R. A Signaling Game for Uncertain Data Delivery in Selfish Mobile Social Networks. IEEE Trans. Comput. Soc. Syst. 2016, 3, 100–112.
5. Reddy, P.V.; Zaccour, G. Feedback Nash Equilibria in Linear-Quadratic Difference Games With Constraints. IEEE Trans. Autom. Control 2017, 62, 590–604.
6. Zhang, M.; Liu, H. Game-Theoretical Persistent Tracking of a Moving Target Using a Unicycle-Type Mobile Vehicle. IEEE Trans. Ind. Electron. 2014, 61, 6222–6233.
7. Xu, Z.; Zhang, J.; Zhang, C.; Chen, Z. Fixation of strategies driven by switching probabilities in evolutionary games. EPL 2016, 116, 58002.
8. Zhang, J.; Zhang, C.; Cao, M. How insurance affects altruistic provision in threshold public goods games. Sci. Rep. 2015, 5, 9098.
9. Gong, Y.; Yu, Q. Evolution of conformity dynamics in complex social networks. Symmetry 2019, 11, 299.
10. Musaev, A.; Borovinskaya, E. Evolutionary Optimization of Case-Based Forecasting Algorithms in Chaotic Environments. Symmetry 2021, 13, 301.
11. Szép, J.; Forgó, F. Two-Person Games; Springer: Dordrecht, The Netherlands, 1985.
12. Li, C.; Liu, Y.; Luo, Y.; Zhou, M. Collaborative content dissemination based on game theory in multimedia cloud. Knowl. Based Syst. 2017, 124, 1–15.
13. Selten, R. Evolutionary stability in extensive two-person games. Math. Soc. Sci. 1986, 5, 269–363.
14. Weitz, J.S.; Eksin, C.; Paarporn, K.; Brown, S.P.; Ratcliff, W.C. An oscillating tragedy of the commons in replicator dynamics with game-environment feedback. Proc. Natl. Acad. Sci. USA 2016, 113, E7518–E7525.
15. Zhang, C.; Zhang, J.; Xie, G.; Wang, L. Group penalty on the evolution of cooperation in spatial public goods games. J. Stat. Mech. Theory Exp. 2010, 2010, 12004.
16. Krishnanand, K.; Ghose, D. Glowworm swarm optimization for simultaneous capture of multiple local optima of multimodal functions. Swarm. Intell. 2009, 3, 87–124.
17. Bonnet, F.; Gribovskiy, A.; Halloy, J.; Mondada, F. Closed-loop interactions between a shoal of zebrafish and a group of robotic fish in a circular corridor. Swarm. Intell. 2018, 12, 227–244.
18. Lv, S.; Song, F. Particle swarm intelligence and the evolution of cooperation in the spatial public goods game with punishment. Appl. Math. Comput. 2022, 412, 126586.
19. Matsuzawa, R.; Tanimoto, J. A social dilemma structure in diffusible public goods. EPL 2016, 116, 38005.
20. Ramazi, P.; Riehl, J.; Cao, M. Networks of conforming or nonconforming individuals tend to reach satisfactory decisions. Proc. Natl. Acad. Sci. USA 2016, 113, 12985–12990.
21. Taylor, C.; Fudenberg, D.; Sasaki, A. Evolutionary game dynamics in finite populations. Bull. Math. Biol. 2004, 66, 1621–1644.
22. Nowak, M.A.; Sasaki, A.; Taylor, C.; Fudenberg, D. Emergence of cooperation and evolutionary stability in finite populations. Nature 2004, 428, 646–650.
23. Zhang, J.; Xu, Z.; Chen, Z. Effects of strategy switching and network topology on decision-making in multi-agent systems. Int. J. Syst. Sci. 2018, 49, 1934–1949.
24. Chemtob, Y.; Cazenille, L.; Bonnet, F.; Gribovskiy, A.; Mondada, F.; Halloy, J. Strategies to modulate zebrafish collective dynamics with a closed-loop biomimetic robotic system. Bioinspir. Biomim. 2020, 15, 046004.
25. Li, Y.; Fish, F.; Chen, Y.; Ren, T.; Zhou, J. Bio-inspired robotic dog paddling: Kinematic and hydro-dynamic analysis. Bioinspir. Biomim. 2019, 14, 066008.
26. Imhof, L.A.; Nowak, M.A. Evolutionary game dynamics in a Wright-Fisher process. J. Math. Biol. 2006, 52, 667–681.
27. Traulsen, A.; Claussen, J.C.; Hauert, C. Coevolutionary dynamics: From finite to infinite populations. Phys. Rev. Lett. 2005, 95, 238701.
28. Traulsen, A.; Pacheco, J.M.; Nowak, M.A. Pairwise comparison and selection temperature in evolutionary game dynamics. J. Theor. Biol. 2007, 246, 522–529.
29. Chen, J.; Liu, X. Fixation Probabilities in Evolutionary Dynamics with a Wright-Fisher Process in Finite Diploid Populations. J. Southwest Univ. 2011, 33, 40–49.
30. Nedjah, N.; Junior, L.S. Review of methodologies and tasks in swarm robotics towards standardization. Swarm Evol. Comput. 2019, 50, 100565.
31. Tang, H.; Sun, W.; Yu, H.; Lin, A.; Xue, M. A multirobot target searching method based on bat algorithm in unknown environments. Expert Syst. Appl. 2020, 141, 112945.
32. Zhou, Z.; Luo, D.; Shao, J.; Xu, Y.; You, Y. Immune genetic algorithm based multi-UAV cooperative target search with event-triggered mechanism. Phys. Commun. 2020, 41, 101103.
33. Yu, X.; Li, C.; Zhou, J.F. A constrained differential evolution algorithm to solve UAV path planning in disaster scenarios. Knowl. Based Syst. 2020, 204, 106209.
34. Alhaqbani, A.; Kurdi, H.; Youcef-Toumi, K. Fish-Inspired Task Allocation Algorithm for Multiple Unmanned Aerial Vehicles in Search and Rescue Missions. Remote Sens. 2020, 13, 27.
35. Zhu, H.; Wang, Y.; Li, X. UCAV path planning for avoiding obstacles using cooperative co-evolution Spider Monkey Optimization. Knowl. Based Syst. 2022, 246, 108713.
36. Tang, H.; Sun, W.; Yu, H.; Lin, A.; Xue, M.; Song, Y. A novel hybrid algorithm based on PSO and FOA for target searching in unknown environments. Appl. Intell. 2019, 49, 2603–2622.
37. Ni, J.; Tang, G.; Mo, Z.; Cao, W.; Yang, S. An improved potential game theory based method for multi-UAV cooperative search. IEEE Access 2020, 8, 47787–47796.
38. Brass, P.; Cabrera-Mora, F.; Gasparri, A.; Xiao, J. Multirobot tree and graph exploration. IEEE Trans. Robot. 2011, 27, 707–717.
39. Zeng, N.; Wang, Z.; Liu, W.; Zhang, H.; Hone, K.; Liu, X. A dynamic neighborhood-based switching particle swarm optimization algorithm. IEEE Trans. Cybern. 2020, 1–12.
40. Zeng, N.; Song, D.; Li, H.; You, Y.; Liu, Y.; Alsaadi, F.E. A competitive mechanism integrated multi-objective whale optimization algorithm with differential evolution. Neurocomputing 2021, 432, 170–182.
Figure 1. Fixation probabilities for the game whose dominant strategy is A with different selection intensities if α = 1, β = 1. The horizontal axis is the fixation probability of strategy A in the group, while the vertical axis represents the initial number of A-players in the group. The results show that the evolutionary dynamics of the population under such conditions is similar to the prisoner’s dilemma game.
Figure 2. Fixation probabilities for the game whose dominant strategy is A with different selection intensities if α = 6, β = 8. The horizontal axis is the fixation probability of strategy A in the group, while the vertical axis represents the initial number of A-players in the group. The only difference to the results in Figure 1 is about the positions of A and B. This is also a prisoner’s dilemma game, but the dominant strategy is B.
Figure 3. Fixation probabilities for the game with different selection intensities if α = 2, β = 10. The horizontal axis represents the initial number of A-players in the group, while the vertical axis is the fixation probability of strategy A. Unlike in Figure 1, the evolutionary dynamics here is that of a typical coordination game.
Figure 4. Fixation probabilities for the game whose dominant strategy is A with different selection intensities if α = 6, β = 1. The horizontal axis represents the initial number of A-players in the group, while the vertical axis is the fixation probability of strategy A. The only difference from the results in Figure 1 is that opposite strategies in the other group promote the fixation of a strategy in its own group.
Figure 5. Fixation probabilities for the game whose dominant strategy is A with different selection intensities when α = 1, β = 1.
Figure 6. Fixation probabilities for the game whose dominant strategy is A with different selection intensities when α = 1, β = 2.
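The Fermi-update dynamics behind Figures 1–6 can be checked numerically. The sketch below is a minimal Monte Carlo estimate of the fixation probability of strategy A under a pairwise comparison (Fermi) imitation process in a well-mixed group; the payoff entries (a, b, c, d), the group size, and all helper names are illustrative assumptions, not the paper's exact model.

```python
import math
import random

def fermi(delta, beta):
    """Probability of imitating a model player whose payoff exceeds the
    focal player's by delta, at selection intensity beta."""
    return 1.0 / (1.0 + math.exp(-beta * delta))

def payoffs(i, n, a, b, c, d):
    """Average payoffs to an A-player and a B-player in a well-mixed
    group of size n containing i A-players (no self-interaction)."""
    pi_a = (a * (i - 1) + b * (n - i)) / (n - 1)
    pi_b = (c * i + d * (n - i - 1)) / (n - 1)
    return pi_a, pi_b

def fixation_probability(i0, n, beta, game, trials=2000):
    """Monte Carlo estimate of the probability that strategy A, starting
    from i0 players out of n, takes over the whole group under a
    pairwise-comparison process with the Fermi imitation rule."""
    a, b, c, d = game
    fixed = 0
    for _ in range(trials):
        i = i0
        while 0 < i < n:
            pi_a, pi_b = payoffs(i, n, a, b, c, d)
            focal_is_a = random.random() < i / n
            # A random model player is drawn from the rest of the group;
            # only focal/model pairs with different strategies can change i.
            if focal_is_a:
                if (random.random() < (n - i) / (n - 1)
                        and random.random() < fermi(pi_b - pi_a, beta)):
                    i -= 1
            else:
                if (random.random() < i / (n - 1)
                        and random.random() < fermi(pi_a - pi_b, beta)):
                    i += 1
        if i == n:
            fixed += 1
    return fixed / trials
```

With selection intensity beta = 0 the Fermi function returns 1/2 for any payoff difference, so the process reduces to neutral drift and the estimate approaches i0/n, which gives a quick sanity check of the simulation.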
Figure 7. Evolutionary outcomes for the searching tasks on areas of different sizes: (A) L × L = 100, (B) L × L = 400, (C) L × L = 900 and (D) L × L = 1600, where L is the side length of the square area. The vertical axis represents the number of search steps, and the horizontal axis is the proportion of individual a. All areas are set to be connected, and 30% of each area consists of randomly placed, unreachable obstacles. r is the speed ratio: individuals who do not need to share information move r times faster than those who do. During the search, individuals follow a depth-first strategy.
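The simulation setup described in Figure 7 (a square L × L grid with random obstacles, searched depth-first) can be sketched as below. The obstacle generator and step counter are illustrative assumptions, not the authors' implementation, and unlike in the paper the randomly generated area is not guaranteed to be connected.

```python
import random

def make_area(L, obstacle_ratio=0.3, seed=1):
    """Generate an L x L grid in which roughly `obstacle_ratio` of the
    cells are obstacles (True); free cells (False) form the search area."""
    rng = random.Random(seed)
    return [[rng.random() < obstacle_ratio for _ in range(L)]
            for _ in range(L)]

def dfs_steps(grid, start=(0, 0)):
    """Count the steps a single depth-first searcher needs to visit every
    free cell reachable from `start`, moving in the four cardinal
    directions; each visited cell costs one step."""
    L = len(grid)
    if grid[start[0]][start[1]]:
        return 0  # the start cell itself is an obstacle
    visited = {start}
    stack = [start]
    steps = 0
    while stack:
        x, y = stack.pop()
        steps += 1
        for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nx, ny = x + dx, y + dy
            if (0 <= nx < L and 0 <= ny < L
                    and not grid[nx][ny] and (nx, ny) not in visited):
                visited.add((nx, ny))
                stack.append((nx, ny))
    return steps
```

On an obstacle-free grid the searcher visits all L × L cells, so the step count equals the area; adding obstacles shrinks the reachable region and hence the step count, which is the quantity plotted on the vertical axis of Figure 7.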
Table 1. The payoff matrix of the n-player, two-strategy task allocation game model. Each row gives the payoff a_m to an A-player and b_m to a B-player when m of the remaining n − 1 co-players use strategy A.

Number of Remaining A-Players | A | B
n − 1 | a_{n−1} | b_{n−1}
⋮ | ⋮ | ⋮
m | a_m | b_m
⋮ | ⋮ | ⋮
0 | a_0 | b_0
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.