Next Article in Journal
Improving the Performance of Quantum Cryptography by Using the Encryption of the Error Correction Data
Next Article in Special Issue
On the Analysis of Regularized Fuzzy Systems of Uncertain Differential Equations
Previous Article in Journal
Uniform Treatment of Integral Majorization Inequalities with Applications to Hermite-Hadamard-Fejér-Type Inequalities and f-Divergences
Previous Article in Special Issue
A Covariant Non-Local Model of Bohm’s Quantum Potential
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Shannon Entropy and Herfindahl-Hirschman Index as Team’s Performance and Competitive Balance Indicators in Cyclist Multi-Stage Races

1
School of Business, University of Leicester, Brookfield, Leicester LE2 1RQ, UK
2
Group of Researchers Applying Physics in Economy and Sociology (GRAPES), Beauvallon, Rue de la Belle Jardinière, 483/0021 Sart Tilman, Angleur, B-4031 Liège, Belgium
Entropy 2023, 25(6), 955; https://doi.org/10.3390/e25060955
Submission received: 18 May 2023 / Revised: 5 June 2023 / Accepted: 10 June 2023 / Published: 19 June 2023
(This article belongs to the Special Issue Selected Featured Papers from Entropy Editorial Board Members)

Abstract

:
It seems that one cannot find many papers relating entropy to sport competitions. Thus, in this paper, I use (i) the Shannon intrinsic entropy (S) as an indicator of “teams sporting value” (or “competition performance”) and (ii) the Herfindahl-Hirschman index (HHi) as a “teams competitive balance” indicator, in the case of (professional) cyclist multi-stage races. The 2022 Tour de France and 2023 Tour of Oman are used for numerical illustrations and discussion. The numerical values are obtained from classical and and new ranking indices which measure the teams “final time”, on one hand, and “final place”, on the other hand, based on the “best three” riders in each stage, but also the corresponding times and places throughout the race, for these finishing riders. The analysis data demonstrate that the constraint, “only the finishing riders count”, makes much sense for obtaining a more objective measure of “team value” and team performance”, at the end of a multi-stage race. A graphical analysis allows us to distinguish various team levels, each exhibiting a Feller-Pareto distribution, thereby indicating self-organized processes. In so doing, one hopefully better relates objective scientific measures to sport team competitions. Moreover, this analysis proposes some paths to elaborate on forecasting through standard probability concepts.

1. Introduction

Team ranking is a complex matter, as debatable as individual ranking, and it has been well known in modern times since Condorcet’s paradox [1] and Arrow’s impossibility theorem [2]. This matter is highly relevant in sports competitions and is also a common practice in almost all aspects of social life, whether real or virtual. It is unnecessary to further point out the examples of university rankings, academic hiring, or promotion. Regardless of the method used, resulting ranks lead to prestige or contempt and are often accompanied by monetary rewards. Most of the time, in social life, ranking indicators encourage progress and help avoid mediocrity.
Ranking is derived from a set of indicators, and there are many such indicators used in sports to rank athletes [3] or teams [4,5]. In team competitions like football (soccer), basketball, hockey, rowing, etc., where the focus is sometimes on specific athletes, the quality of a team is determined based on various quantitative measurements of their “quality” or “value”. The ranking “problem” often arises due to the fact that competitions involve duals between athletes or teams, with incomplete round-robin formats. The case of cycling races is somewhat unique. It is widely accepted that cycling races are won by individual riders, but the role of the team is of crucial importance [6,7]. Quoting Cabaud et al. [7]:
“A large proportion of cyclists in a race take part in support of another rider, meaning that they do not care about their personal result but instead try to help their team leader(s). Moreover, a team leader generally has one specific objective among a range of possible ones.”
In fact, cyclist races are quite different from other sport competitions, which emphasise the performance (physical or mental) of individual athletes, as in golf, boxing, escalade, tennis, judo, climbing, chess, bridge, etc. In cyclist races, many teams, typically consisting of an equal number of athletes at the start of the race, compete in order to be rewarded through monetary prizes, UCI (UCI: Union Cycliste Internationale, e.g., https://fr.uci.org/, accessed on 1 May 2023) points, or glory. The competition can be highly strategic: some athletes are geared toward a certain type of prowess, such as a time trial, sprint, mountain climbing, and other specialized cyclist races, not performed on roads.
For simplicity, races that take place “off the roads” are not considered in the discussion below, although it is evident that the concepts discussed here can be extended to such cases. Therefore, this paper specifically focuses on professional road cycling, particularly multi-stage races.

1.1. Team Ranking

The literature about ranking teams, in sport, and in various socio-economic matters, is huge, but is much less abundant about cycling teams. The papers of Sorensen 2000 [8] and Vaziri et al. 2018 [9] appear to be the most pertinent ones about ranking teams in (various) sports, because of their general viewpoints.
More pertinent to our concern, a 2020 paper by Ausloos [10] studied data of professional cyclists from the Tour de France, along the rank-size law method, deducing financial considerations from classical UCI measures (time-based) team ranking. Along a similar perspective, Ficcadenti et al. [11], in 2022, discussed the above-mentioned rank-size law on a soccer competition in Italy, thereby observing some regularities in team values within a ranking process.
One should notice that papers said to be on “performance ranking” do not truly rank teams (or athletes) from their score result at the finishing line, but rather refer to body physiology and/or athletes’ physical and biochemical conditions. These aspects, outside the present discussion, are not considered here.
For completeness, one should point to a non-scholarly website by ProCyclingStats (PCS) as a data source where interesting metrics are calculated; see https://www.procyclingstats.com/info/point-scales (accessed on 1 April 2023).

1.2. Competitive Balance

Fairness and equal chance opportunities are a priori pillars of sport competition. Hence, the literature on competitive balance is so huge, stressing qualitative and quantitative aspects, this paper mostly pertains to league activities rather than to teams point of view [12].
The construction of rules to quantify competitive balance is of utmost importance, particularly for the lower-ranked teams whose main sources of revenue come from sponsorship and viewership [13,14]. This is why specific strategies are employed in cyclist races to ensure that teams, which may not have the most likely winners, have “attacking riders” right from the starting line. Therefore, a dynamic measure is required for cyclist races to adequately capture and reward competitive balance. However, such a measure does not appear to be currently available. Physiological performance is often linked to competitive balance as well [7]. For additional perspectives, readers may refer to economic-related papers, such as those focusing on the Tour de France [15,16] (During the writing of this paper, an interesting example of indirect imbalance occurred in the 2023 Giro when organizers provided helicopters to teams for transporting a few riders down from the Gran Sasso finish line after stage 7, but at a cost that only a few teams could afford [17]).

1.3. Shannon Entropy and Herfindahl–Hirschman Index

The cyclist teams sporting performance is based here on the team’s intrinsic entropy, while the Herfindahl-Hirschman index is used as the team’s competitive balance indicator; both are mathematically defined here below, and thereafter adapted to the studied sport cases. The literature abounds on both measures, but not much on cyclist races.

1.3.1. Shannon Entropy

One can hardly comment in an original way on the use of entropy for measuring complex disorder [18]. Alas, I have not found many papers relating entropy to sport competition results, with the exception of [19,20,21]; however, their approach is somewhat different than that of the present framework. This lack of papers in the literature is surprising since one could easily transform the occurrence of sport results into probabilities, making it potentially useful for audacious forecasting and betting purposes, given appropriate scaling. Thus, let it be considered that this paper is a pioneer contribution to the field.
In information science—particularly in scientometrics, in order to discuss uncertainty in measures, the concept of entropy (S) is a classical concept [18]—although it is sometimes misunderstood, or abusively misinterpreted. Its mathematical formulation reads as follows: (In information science, entropy is usually defined through a log in base 2 in “shannon units”; in thermodynamics, the natural log is used, l n l o g 2 : then, S is given in “nat units”, as it is here; it is well known that a base change is easy, using the formula l o g a ( b ) = l o g x ( a ) / l o g x ( b ) ; l o g 2 ( a ) = l n ( a ) / l n ( 2 ) ; l n ( 2 ) = 0.69315 ).
S = i = 1 N y i j y j · ln y i j y j
where y i j y j p i is the probability of the number of occurrences, y i , of the i-th event among N possible ones. The entropy maximum occurs when all p i are equal to each other, i.e., when there is no disorder: p i = 1 / N :
S M a x = ln 1 N
Notice for the present study that the entropy maximum occurs when each team wins as many stages as another; it may happen that some teams do not win any stage. The team winning a stage might not be the team of the winning rider. It may also (often) happen that there are not enough stages such that each team wins one stage at least. The minimum entropy occurs when one team wins all L stages.

1.3.2. Herfindahl–Hirschman Index

In brief, measurements of competitive balance are often based on “Standard Deviation of Win Percentages” [22]. Other measures have been discussed [23,24].
Aside from the “Standard Deviation of Win Percentages” [22], the Herfindahl–Hirschman index is the most frequently used measure [25,26]. The H H i is a concentration measure, which is typically used in business to emphasize monopolies by measuring the “size” of companies through their market share, hence providing some numerical relationship between the firms and the competition they face. This measure can be adjusted so that it reflects some aspect of competitive balance in sport by calculating the distribution of points, won through time or place, obtained by riders in a race competition.
Notice that the H H i has been used on cyclist races such as the Tour de France in [10], but with a focus on its “market competition” aspect, i.e., measuring teams’ financial gains.
Recall that the Herfindahl-Hirschman index ( H H i ), is a measure of concentration [27]. It is usually applied to describe company sizes (from which the concentration is measured with respect to the entire market): a H H i below 0.01 indicates a highly competitive index (in more usual language, from a portfolio point of view, a low H H i implies a very diversified portfolio).
When applied to the case of sport team ranking, the H H i measure is proposed as an indicator of the level of fair competition among teams, rather than a measure of wealth concentration. Therefore, it serves as a competitive balance indicator. (In political economy and finance, the H H i in Equation (3) represents the sum of the squares of the market shares of the largest companies (traditionally, when expressed as fractions, with N = 50 ). However, in the context of sports, typically N < 50 ). Formally, the H H i measure is defined as:
H H i = i N y i j y j 2 ,
where y i represents the number of wins by the i-th team, and N is the appropriate number of teams.
The higher the value of H H i , the smaller the number of teams with a large value of wins; in other words, H H i is a measure of the number of the best-performing teams in a given competition. Thus, an increase in H H i represents a decrease in competitive balance.
In conclusion of this Introduction section, and in order to prepare the numerical illustrations and the subsequent discussion of findings and features, let it be mentioned that the data pertain to the Tour de France 2022 and the Tour of Oman 2023. In Section 2.2, I explain where these illustrative data can be obtained, i.e., from the official organizer websites.

2. Materials and Method

2.1. Application to Multi-Stage Races

Let us define the notations for clarity moving forward. Consider a race with N teams and a total of M riders participating in an L stage race. Assuming that all teams start with an equal number of riders, there are K = M / N riders per team initially. However, due to rider abandonments during certain stages (s), or riders not starting certain stages for reasons not relevant here, in a given team (#), there are K # riders who finish stage s, such that M ( s ) = # K # ( s ) ; s = 1 , , L . For future reference, it is important to note that each rider (i) is assigned a bib number ( d i ) by the race organizers at the start of the race.
On the 2022 Tour de France (TdF), there were L = 21 stages, and N = 22 teams. Each team started with eight riders; therefore, M = 176.
In the case of the 2023 Tour of Oman (ToO), there were N = 18 teams competing on L = 5 stages. Each team, with the exception of two of them, started with seven riders; therefore, M = 124.

2.2. Data

The relevant data can be obtained from the official websites of the race organizers or from media sources. To conduct the necessary treatments and analyses, it is essential to download the results for the twenty-one stages and five stages, respectively. The official reports provided by the organizers come in various formats. In order to ensure consistency and facilitate subsequent analysis, certain data treatments have been performed manually.
For instance, one can begin by accessing the stage results of the 2022 Tour de France from the official website https://www.letour.fr/en/rankings (accessed on 1 April 2023). The stages can be reviewed in reverse order starting from stage 21 and progressing backwards. Similarly, for the 2023 Tour of Oman, the official website https://www.tour-of-oman.com/en (accessed on 19 March 2023) can be visited to retrieve the final ranking in a single step through https://www.tour-of-oman.com/en/rankings (accessed on 15 February 2023). Then, the previous stages can be accessed and reviewed in reverse order.

2.3. Notations

The classical UCI counting goes as follows: the time ( t s ( # ) ) of the fastest 3 riders of a team ( # ) after a stage (s) is aggregated, in order to give the “team time” for this given stage, say T s ( # ) . The “final team time” results from the aggregation of each “team stage time”, T L ( # ) = Σ s = 1 L t s ( # ) , - even though several of the riders, considered for the aggregated “team stage time” might not have finished the whole race.
One may also rank teams according to the finishing place of the first three riders of a team at the end of each stage. (Notice that this measure differs from the so-called “green jersey race” for riders in the Tour de France).
Let us introduce the relevant notations. Let such riders be at place p i , s ( # ) , with i = 1, 2, 3. Thereafter, one can define another objective team ranking place after stage s from p s ( # ) , i.e.,
p s ( # ) = Σ i = 1 3 p i , s ( # ) ,
and calculate some
P L ( # ) = Σ s = 1 L p s ( # ) .
at the end of the multi-stage race, for the final ranking, - according to the team placing at different stages, again, irrespectively of the involved riders.
Notice that these t and p lists do not necessarily give the riders in the same order, due to the last (3) kilometer(s) neutralisation rule, allowing riders to have “technical problems”, tire punctures, even falls, or to willingly stop racing, along such a distance.
Moreover, one can define the “adjusted team final time”, A L ( # ) , calculated from
A L ( # ) = Σ j = 1 3 t j , L ( # )
where, in Equation (6), j = 1, 2, 3 refers to the “three fastest” riders of the team ( # ) having completed all L stages. Thus, A L ( # ) can only be so obtained at the end of the multistage race. Let it be strongly emphasized again that these three “j” riders might be quite different from the various three “i” riders having contributed to any t s ( # ) , and hence to T L ( # ) .
Similar to the above example, one can define “best team final place” measures, such as P L ( # ) and B L ( # ) , based on the final place of the three “best riders”, at the end of the race: P L ( # ) has been defined in Equation (5); recall that this P L ( # ) measure refers to many different riders. To further refine the team ranking by considering only the riders (j) who successfully finish the race, ensuring a more comprehensive measure of team performance throughout the entire race, another quantity is defined as:
B L ( # ) = j = 1 3 p j , L ( # )
This corresponds to A L ( # ) as defined in Equation (6). It is important to reiterate that in Equation (7), the index j ranges from 1 to 3, representing the three best-placed riders of team ( # ) in the various stages, who have successfully completed all L stages.
This leads to four different team rankings: in each case, the best teams are those which have the lowest values of the four measured “variables”, according to the ranking from measures in ascending order.
One can easily plot and observe the distributions of such values. Moreover, their ranges being found to be finite, one can scale each result, with respect to the respective sums. This allows for the definitions of a posteriori “value probabilities”, which can be interpreted as the “percentages of concentrations”.
These probabilities, denoted as p i to avoid confusion with y i in Equations (1) and (3), can be incorporated into the definitions of S and H H i to obtain:
S = i = 1 N p i · ln ( p i )
which represents the classical formulation of Shannon entropy, and
H H i = i N ( p i ) 2
which is also the standard way of measuring competitive balance through the “Standard Deviation of Win Percentages” (Trandel, 2011). It is important to note that i p i = 1 in these equations.

3. Results: Data Analysis

3.1. Team Ordering Results

3.1.1. 2022 Tour de France (TdF)

Consider the 2022 Tour de France (TdF) results. There were L = 21 stages, and 22 teams (The team acronyms are those of the UCI notations). The final team ranking T L ( # ) leads to the hierarchy IGD, GFC, TJV, BOH, MOV, … TEN, ADC, LTS, QST. In contrast, if the final ranking is based on the aggregated finishing time of the best three fastest riders that have finished the whole race, one obtains the A L ( # ) ranking: IGD, GFC, TJV, BOH, DSM, … TEN, ADC, LTS, QST.
Next, consider the arriving place (instead of the time) of the “best placed” three riders of a team in each stage. The P L ( # ) values lead to the following ranking: TJV, IGD, BOH, GFC, ARK, … TEN, BEX, ADC, LTS, QST, while the B L ( # ) hierarchy is IGD, TJV, BOH, GFC, AST, … BEX, ADC, TEN, QST, LTS.

3.1.2. 2023 Tour of Oman (ToO)

Consider this 2023 Tour of Oman (ToO) case. There were 18 teams competing in L = 5 stages. The final team ranking T L ( # ) , is given by BOH, SOQ, ICW, ARK, …, AST, HPM, TSG, ONT, while it is BOH, ICW, SOQ, ARK, …, AST, HPM, TSG, ONT, for A L ( # ) .
The relevant P L ( # ) lead to BOH, SOQ, ARK, ACT, …, JCL, HPM, TSG, ONT, but to BOH, ICW, COF, UAD, …, JCL, HPM, TSG, ONT for B L ( # ) .

3.2. Statistical Characteristics

A summary of the (rounded) main statistical characteristics of the time and place indicator distributions, with the notations defined here above, for the 2022 Tour de France and the 2023 Tour of Oman, for their respective number of competing teams is given in Table 1. It is observed from the Table that both investigated competitions are on a quite different level from a sportive point of view: e.g., the number of teams and the number of stages are quite different. From a purely statistical perspective, it is worth noting that the time or place measures, both with and without the constraint of considering only the finishing riders in the final team value or team performance measures, cover a wide range of scales. Further comments on this topic will be discussed in the next section Section 4.
Firstly, it is important to note that A L ( # ) consistently exceeds T L ( # ) , even in races with few abandonments, such as the Tour of Oman. This observation suggests that the constraint of considering only the riders who complete the entire race is meaningful. Similarly, in terms of the “place value”, B L ( # ) consistently surpasses P L ( # ) . This clear trend emphasizes the significance of the constraint on the “finishing riders only” in obtaining a more objective measure of team value and team performance at the conclusion of a multi-stage race.

4. Discussion

The time and place distributions are displayed in Figure 1, Figure 2, Figure 3 and Figure 4. The rank-time laws for the Tour de France, as shown in Figure 1, exhibit a well-defined cubic form with a high coefficient of determination ( R 2 0.985 ). It is worth noting that only the integer values on the x-axis hold significance in this context. On the other hand, for the Tour of Oman (ToO), which is a shorter race, the variations in team performance are relatively weak for most teams, except for the last three. As a result, fitting an empirical law is less meaningful. However, for the sake of completeness, it should be mentioned that a similar cubic fit with R 2 0.985 can be obtained if the weakest three teams (HPM, TSG, ONT) are excluded from the fitting process.
Interestingly, nevertheless, one can observe that aside from distinguishing two sub-distributions in the ToO results, observing again the data for TdF, Figure 3, one sees four possible sub-distributions, - in fact, related to a team’s “UCI quality level”.
Concerning the entropy data, see Figure 5, Figure 6, Figure 7 and Figure 8. Best fits to simple polynomials can be attempted. A best-fit curve to an empirical cubic function of the team entropy derived from the final time T L and adjusted final time A L , both defined here above, distributions of the 22 teams having competed on the Tour de France 2022 is displayed on Figure 5. A similar fit for the team entropy, but derived from the final place P L and adjusted final place A L measures, is shown on Figure 6.
The entropy derived from P L and B L distributions for Tour de France 2022 and that for P L and B L for Tour of Oman 2023 are found on Figure 7 and on Figure 8, respectively, with the best fit to a cubic empirical function.
Concerning the the team H H i values, they can also be displayed according to the team rank, both in increasing order. The best-fit curve to an empirical cubic function for the team H H i derived from the final time T L and adjusted final time A L , both as defined here above, distributions of the 22 teams having competed on the Tour of France 2022 and for the 18 teams having competed on the Tour of Oman 2023 are found in Figure 9 and in Figure 10, respectively. The corresponding R 2 is reported.
The corresponding displays for the H H i -rank distributions derived from the final place distributions, P L and adjusted final place B L , as defined here above, for the 22 teams having competed on the Tour de France in 2022 and for the 18 teams having competed on the Tour of Oman in 2023 are found in Figure 11 and in Figure 12, respectively. The best-fit curve to an empirical cubic function is given with the corresponding R 2 .
N.B. Let us still mention that the (cubic polynomial) fits are reasonable on the first 15 teams results only, rather than on the whole 18 bunch for the Tour of Oman cases.
These smooth variations are markedly different from classical rank-size laws analyses based on y r = a r α [28]. However, one could admit that other functions with a small number of parameters could be used in appropriately managing the x- and y-axes scales and range. Using the two-parameter ( κ , χ ) form
y ( r ) = κ N r N r + 1 χ
would be an example of this.
On a semi-log plot, Equation (10) with χ 0 , gives a flat N-shape “ n o i d ” function near its inflection point, allowing for various convex and concave data display shapes. Some rewriting [28,29], defining u = r / ( N + 1 ) , leads to
y ( r ) = Λ ^ u ϕ ( 1 u ) + ψ
which, in the case of ϕ > 0 and ψ < 0 , is the Feller-Pareto function [30], which is associated with self-organized processes [31,32]. Exploring these features for different scenarios involving the number of stages, teams, and riders would be an interesting avenue for future investigation. It is clear that such races exhibit both endogenous and exogenous self-organization dynamics.
Finally, a deduction can stem from observing different hierarchies through the various indicators: one can imagine various ways to motivate teams through financial rewards, or media publicity, based on the above indicators.

5. Conclusions

In this paper, the author proposes the use of the intrinsic entropy (S) as an indicator of “teams sporting value” or “performance” and the Herfindahl-Hirschman index ( H H i ) as a measure of “teams competitive balance” in the context of professional cyclist multi-stage races.
The motivation for this study stems from the observation that there are relatively few papers linking the concept of entropy to sport competition results, with only a few notable exceptions. This is surprising considering that sport results can be easily transformed into probabilities, which have implications for forecasting and betting purposes. However, such analysis requires extensive computer work, including the downloading of appropriate results and scaling them according to the number of teams, number of stages, stage difficulties, stage lengths, and other relevant factors.
Moreover, even though many papers have been concerned with the concept of competitive balance in sports, recall the review in [25], the present study differs from H H i -based theoretical, empirical, or simulation studies, as in the very recent [26]. Most of these papers take into account different leagues’ competitions, in different season lengths with different competitions rounds. Some relation can be tied to the present paper studying two different lengths and difficulties of multi-stages races. However, there are major differences with respect to other works on H H i in sport stem in the type of competitions which are examined. The papers in the literature mainly look at duel competitions. This is very different from professional cyclist races which involve many teams in a single event.
As a second contribution aimed at objectively measuring team value and team performance, a logical constraint is introduced for the classical measures based on the time or place of riders in a team. It is mandated that these measures be solely based on the riders who finish the multi-stage race. This constraint leads to the introduction of new indicators, which are compared to the classical ones. The classical measures aggregate values from various riders in different stages but fail to capture the crucial contributions of the best race-finishing team members.
In the present study, we utilize the 2022 Tour de France and the 2023 Tour of Oman as numerical illustrations and for discussion purposes. The former is a renowned long race featuring top professional teams, while the latter is a more modest race involving teams and riders who may be less well-known. Despite these differences, both races exhibit similar final characteristics. The numerical values are derived not only from new ranking indices that measure the teams’ “final time” and “final place” based on the top three riders in each stage but also from the aggregated final time or final place of the team’s best three finishing riders at the end of the race. The distributions of these values reveal distinct team levels through sub-distributions. Furthermore, these distributions are reasonably well-fitted by cubic (empirical) functions, reminiscent of the Feller-Pareto distribution, which suggests the presence of constrained self-organizational processes.
A conclusive final point pertains to the empirical findings: the intrinsic Shannon entropy (S) appears to be a bona fide indicator of “teams sporting value” (or “performance”), beyond its traditional role in measuring disorder. Furthermore, the analysis reveals an unexpected yet understandable phenomenon of “clustering of teams” through data observation. On the other hand, the Herfindahl-Hirschman index ( H H i ) provides a clear indication of “teams competitive balance” processes. Overall, these empirical findings highlight the effectiveness of these indicators in capturing important aspects of team performance and competitive dynamics.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

Comments by reviewers and editors have much forged the structure and content of this paper.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. De Condorcet, N. Essai sur L’application de L’analyse a la Probabilité des Décisions Rendues à la Pluralité des Voix; Imprimerie Royale: Paris, France, 1785; Reprinted by Chelsea Press: New York, NY, USA, 1973. [Google Scholar]
  2. Arrow, K. A difficulty in the theory of social welfare. J. Political Econ. 1950, 58, 328–346. [Google Scholar] [CrossRef]
  3. Collingwood, J.A.; Wright, M.; Brooks, R.J. Evaluating the effectiveness of different player rating systems in predicting the results of professional snooker matches. Eur. J. Oper. Res. 2022, 296, 1025–1035. [Google Scholar] [CrossRef]
  4. Stefani, R.T. Survey of the major world sports rating systems. J. Appl. Stat. 1997, 24, 635–646. [Google Scholar] [CrossRef]
  5. Wilson, R.L. Ranking college football teams: A neural network approach. Interfaces 1995, 25, 44–59. [Google Scholar] [CrossRef]
  6. Albert, E. Riding a line: Competition and cooperation in the sport of bicycle racing. Sociol. Sport J. 1991, 8, 341–361. [Google Scholar] [CrossRef]
  7. Cabaud, B.; Scelles, N.; François, A.; Morrow, S. Modeling performances and competitive balance in road cycling competitions. In The Economics of Professional Road Cycling; Springer International Publishing: Cham, Switzerland, 2022; pp. 253–281. [Google Scholar]
  8. Sorensen, S.P. An Overview of Some Methods for Ranking Sports Teams. University of Tennessee. Knoxville. 2000. Available online: http://sorensen.info/rankings/Documentation/Sorensen_documentation_v1.pdf (accessed on 14 February 2023).
  9. Vaziri, B.; Dabadghao, S.; Yih, Y.; Morin, T.L. Properties of sports ranking methods. J. Oper. Res. Soc. 2018, 69, 776–787. [Google Scholar] [CrossRef]
  10. Ausloos, M. Rank–size law, financial inequality indices and gain concentrations by cyclist teams. The case of a multiple stage bicycle race, like Tour de France. Physica A 2020, 540, 123161. [Google Scholar] [CrossRef] [Green Version]
  11. Ficcadenti, V.; Cerqueti, R.; Vardeí, C.H. A rank-size approach to analyse soccer competitions and teams: The case of the Italian football league “Serie A”. Ann. Oper. Res. 2023, 325, 85–113. [Google Scholar] [CrossRef]
  12. Sanderson, A.R. The many dimensions of competitive balance. J. Sport. Econ. 2002, 3, 204–228. [Google Scholar] [CrossRef]
  13. Rodríguez, C.; Pérez, L.; Puente, V.; Rodríguez, P. The determinants of television audience for professional cycling: The case of Spain. J. Sport. Econ. 2015, 16, 26–58. [Google Scholar] [CrossRef]
  14. Rodríguez-Gutiérrez, C.; Fernández-Blanco, V. Continuous TV demand in road cycling: The 2015 Vuelta a España. Eur. Sport Manag. Q. 2017, 17, 349–369. [Google Scholar] [CrossRef]
  15. Bačik, V.; Klobučník, M.; Mignot, J.F. What made the tour successful? Competitive balance in the tour de France, 1947–2017. Sport Soc. 2020, 24, 147–159. [Google Scholar] [CrossRef]
  16. Andreff, W.; Mignot, J.F. The Tour de France: A Success Story in Spite of Competitive Imbalance. In The Economics of Professional Road Cycling; Springer International Publishing: Cham, Switzerland, 2022; pp. 163–180. [Google Scholar]
  17. Lenten, L.J. Measurement of competitive balance in conference and divisional tournament design. J. Sport. Econ. 2015, 16, 3–25. [Google Scholar] [CrossRef]
  18. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef] [Green Version]
  19. Da Silva, S.; Matsushita, R.; Silveira, E. Hidden power law patterns in the top European football leagues. Phys. A Stat. Mech. Appl. 2013, 392, 5376–5386. [Google Scholar] [CrossRef] [Green Version]
  20. Silva, P.; Duarte, R.; Sampaio, J.; Aguiar, P.; Davids, K.; Araújo, D.; Garganta, J. Field dimension and skill level constrain team tactical behaviours in small-sided and conditioned games in football. J. Sport. Sci. 2014, 32, 1888–1896. [Google Scholar] [CrossRef]
  21. Silva, P.; Duarte, R.; Esteves, P.; Travassos, B.; Vilar, L. Application of entropy measures to analysis of performance in team sports. Int. J. Perform. Anal. Sport 2016, 16, 753–768. [Google Scholar] [CrossRef]
  22. Trandel, G.A.; Maxcy, J.G. Adjusting winning-percentage standard deviations and a measure of competitive balance for home advantage. J. Quant. Anal. Sport. 2011, 7. [Google Scholar] [CrossRef] [Green Version]
  23. Humphreys, B.R. Alternative Measures of Competitive Balance in Sports Leagues. J. Sport. Econ. 2002, 3, 133–148. [Google Scholar] [CrossRef]
  24. Triguero Ruiz, F.; Avila-Cano, A. The distance to competitive balance: A cardinal measure. Appl. Econ. 2019, 51, 698–710. [Google Scholar] [CrossRef]
  25. Owen, P.D.; Ryan, M.; Weatherston, C.R. Measuring competitive balance in professional team sports using the Herfindahl-Hirschman index. Rev. Ind. Organ. 2007, 31, 289–302. [Google Scholar] [CrossRef]
  26. Owen, P.D.; Owen, C.A. Simulation evidence on Herfindahl-Hirschman measures of competitive balance in professional sports leagues. J. Oper. Res. Soc. 2022, 73, 285–300. [Google Scholar] [CrossRef]
  27. Hirschman, A.O. The paternity of an index. Am. Econ. Rev. 1964, 54, 761–762. [Google Scholar]
  28. Ausloos, M. Toward fits to scaling-like data, but with inflection points & generalized Lavalette function. J. Appl. Quant. Methods 2014, 9, 1–21. [Google Scholar]
  29. Ausloos, M. Two-exponent Lavalette function: A generalization for the case of adherents to a religious movement. Phys. Rev. E 2014, 89, 062803. [Google Scholar] [CrossRef] [Green Version]
  30. Tahmasebi, S.; Behboodian, J. Shannon entropy for the Feller-Pareto (FP) family and order statistics of FP subfamilies. Appl. Math. Sci. 2010, 4, 495–504. [Google Scholar]
  31. Eliazar, I.; Cohen, M.H. The universal macroscopic statistics and phase transitions of rank distributions. Physica A 2011, 390, 4293–4303. [Google Scholar] [CrossRef]
  32. Cerqueti, R.; Rotundo, G.; Ausloos, M. Investigating the configurations in cross-shareholding: A joint copula-entropy approach. Entropy 2018, 20, 134. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Best-fit curve to an empirical cubic function of the distributions of final time T L and adjusted final time A L , both defined in the main text, of the 22 teams having competed on the Tour de France 2022; the y-axis scale is in days:hours.
Figure 1. Best-fit curve to an empirical cubic function of the distributions of final time T L and adjusted final time A L , both defined in the main text, of the 22 teams having competed on the Tour de France 2022; the y-axis scale is in days:hours.
Entropy 25 00955 g001
Figure 2. Distributions of final time T L and adjusted final place A L , both defined in the main text, of the 18 teams having competed on the Tour of Oman 2023; in order to emphasize the 2 sub-distributions, the best-fit curve to an empirical cubic function is not shown; the y-axis scale is in days:hours:minutes.
Figure 2. Distributions of final time T L and adjusted final place A L , both defined in the main text, of the 18 teams having competed on the Tour of Oman 2023; in order to emphasize the 2 sub-distributions, the best-fit curve to an empirical cubic function is not shown; the y-axis scale is in days:hours:minutes.
Entropy 25 00955 g002
Figure 3. Distributions of final place P L and adjusted final place B L measures, both defined in the main text, of the 22 teams having competed on the Tour de France 2022; in order to emphasize the four sub-distributions, the best-fit curve to an empirical cubic function is not shown, but the resulting R 2 , if the whole fit was completed, is given for information.
Figure 3. Distributions of final place P L and adjusted final place B L measures, both defined in the main text, of the 22 teams having competed on the Tour de France 2022; in order to emphasize the four sub-distributions, the best-fit curve to an empirical cubic function is not shown, but the resulting R 2 , if the whole fit was completed, is given for information.
Entropy 25 00955 g003
Figure 4. Distributions of final place P L and adjusted final place B L measures, both defined in the main text, of the 18 teams having competed on the Tour of Oman 2023; the best-fit curve to an empirical cubic function is made on the first 15 teams only.
Figure 4. Distributions of final place P L and adjusted final place B L measures, both defined in the main text, of the 18 teams having competed on the Tour of Oman 2023; the best-fit curve to an empirical cubic function is made on the first 15 teams only.
Entropy 25 00955 g004
Figure 5. Best-fit curve to an empirical cubic function of the team entropy derived from the final time T L and adjusted final time A L measures, both defined in the main text, distributions of the 22 teams having competed on the Tour de France 2022.
Figure 5. Best-fit curve to an empirical cubic function of the team entropy derived from the final time T L and adjusted final time A L measures, both defined in the main text, distributions of the 22 teams having competed on the Tour de France 2022.
Entropy 25 00955 g005
Figure 6. Team entropy derived from the final time T L and adjusted final time A L measures, both defined in the main text, distributions of the 18 teams having competed on the Tour of Oman 2023.
Figure 6. Team entropy derived from the final time T L and adjusted final time A L measures, both defined in the main text, distributions of the 18 teams having competed on the Tour of Oman 2023.
Entropy 25 00955 g006
Figure 7. Best-fit curve to an empirical cubic function of the team entropy derived from the final place distributions, P L and adjusted final place B L measures, as defined in the main text, for the 22 teams having competed on the Tour de France 2022.
Figure 7. Best-fit curve to an empirical cubic function of the team entropy derived from the final place distributions, P L and adjusted final place B L measures, as defined in the main text, for the 22 teams having competed on the Tour de France 2022.
Entropy 25 00955 g007
Figure 8. Best-fit curve to an empirical cubic function of the team entropy derived from the final place distributions, P L and adjusted final place B L measures, as defined in the main text, for the 18 teams having competed on the Tour of Oman 2023.
Figure 8. Best-fit curve to an empirical cubic function of the team entropy derived from the final place distributions, P L and adjusted final place B L measures, as defined in the main text, for the 18 teams having competed on the Tour of Oman 2023.
Entropy 25 00955 g008
Figure 9. Best-fit curve to an empirical cubic function of the team H H i derived from the final time T L and adjusted final time A L measures, both defined in the main text, distributions of the 22 teams having competed on the Tour of France 2022.
Figure 9. Best-fit curve to an empirical cubic function of the team H H i derived from the final time T L and adjusted final time A L measures, both defined in the main text, distributions of the 22 teams having competed on the Tour of France 2022.
Entropy 25 00955 g009
Figure 10. Best-fit curve to an empirical cubic function of the team H H i derived from the final time T L and adjusted final time A L measures, both defined in the main text, distributions of the 18 teams having competed on the Tour of Oman 2023; the fit is on the first 15 teams result.
Figure 10. Best-fit curve to an empirical cubic function of the team H H i derived from the final time T L and adjusted final time A L measures, both defined in the main text, distributions of the 18 teams having competed on the Tour of Oman 2023; the fit is on the first 15 teams result.
Entropy 25 00955 g010
Figure 11. Best-fit curve to an empirical cubic function of the team H H i derived from the final place distributions, P L and adjusted final place B L measures, as defined in the main text, for the 22 teams having competed on the Tour de France 2022.
Figure 11. Best-fit curve to an empirical cubic function of the team H H i derived from the final place distributions, P L and adjusted final place B L measures, as defined in the main text, for the 22 teams having competed on the Tour de France 2022.
Entropy 25 00955 g011
Figure 12. Best-fit curve to an empirical cubic function of the team H H i derived from the final place distributions, P L and adjusted final place B L measures, as defined in the main text, for the 18 teams having competed in the Tour of Oman 2023; the fit is on the first 15 teams’ results.
Figure 12. Best-fit curve to an empirical cubic function of the team H H i derived from the final place distributions, P L and adjusted final place B L measures, as defined in the main text, for the 18 teams having competed in the Tour of Oman 2023; the fit is on the first 15 teams’ results.
Entropy 25 00955 g012
Table 1. Summary of (rounded) main statistical characteristics of the time (in hours:minutes:seconds) and place indicator distributions;—see notations in the main text for the 2022 Tour de France and the 2023 Tour of Oman; M is the number of competing teams, ii the respective cases; L is the number of stages.
Table 1. Summary of (rounded) main statistical characteristics of the time (in hours:minutes:seconds) and place indicator distributions;—see notations in the main text for the 2022 Tour de France and the 2023 Tour of Oman; M is the number of competing teams, ii the respective cases; L is the number of stages.
Min.Max.SumMeanSt. Dev.Skew.Kurt.
2022 Tour de France; M = 22 ; L = 21
T L ( # ) 239:03:03249:46:165361:45:14243:42:583:01:120.43693−0.70860
A L ( # ) 240:13:39251:22:065401:26:40245:31:133:09:100.08616−0.86863
P L ( # ) 1242428859,7532716.05921.670.22809−1.10963
B L ( # ) 1788589986,7253942.051209.34−0.01383−1.12467
2023 Tour of Oman; M = 18 ; L = 5
T L ( # ) 59:55:1261:43:421084:04:3660:13:350:25:392.64276.66372
A L ( # ) 59:55:2661:44:321085:04:3160:16:550:26:502.22874.50161
P L ( # ) 26714919131507.28296.602.27984.89609
B L ( # ) 357150011,245624.72285.831.97703.26794
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ausloos, M. Shannon Entropy and Herfindahl-Hirschman Index as Team’s Performance and Competitive Balance Indicators in Cyclist Multi-Stage Races. Entropy 2023, 25, 955. https://doi.org/10.3390/e25060955

AMA Style

Ausloos M. Shannon Entropy and Herfindahl-Hirschman Index as Team’s Performance and Competitive Balance Indicators in Cyclist Multi-Stage Races. Entropy. 2023; 25(6):955. https://doi.org/10.3390/e25060955

Chicago/Turabian Style

Ausloos, Marcel. 2023. "Shannon Entropy and Herfindahl-Hirschman Index as Team’s Performance and Competitive Balance Indicators in Cyclist Multi-Stage Races" Entropy 25, no. 6: 955. https://doi.org/10.3390/e25060955

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop