Next Article in Journal
Efficiency of Crude Protein Utilisation in Grazing Dairy Cows: A Case Study Comparing Two Production Systems Differing in Intensification Level in New Zealand
Next Article in Special Issue
Genomic Structural Diversity in Local Goats: Analysis of Copy-Number Variations
Previous Article in Journal
Efficiency of Semen Cryopreservation in Stallions
Previous Article in Special Issue
Influence of the Casein Composite Genotype on Milk Quality and Coagulation Properties in the Endangered Agerolese Cattle Breed
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimation of Linkage Disequilibrium and Effective Population Size in Three Italian Autochthonous Beef Breeds

Dipartimento di Scienze e Tecnologie Agrarie, Alimentari, Ambientali e Forestali, Università di Firenze, 50144 Firenze, Italy
*
Author to whom correspondence should be addressed.
Animals 2020, 10(6), 1034; https://doi.org/10.3390/ani10061034
Submission received: 15 May 2020 / Revised: 9 June 2020 / Accepted: 11 June 2020 / Published: 14 June 2020

Abstract

:

Simple Summary

Linkage disequilibrium (LD) of genomic markers is related to various evolutionary forces, such as inbreeding, nonrandom mating, population bottleneck, drift, recombination, and mutations, and hence is an essential parameter to examine population history. In this analysis, we examined the LD pattern of three Italian local beef breeds (Calvana, Mucca Pisana, and Pontremolese) facing the risk of extinction, using the commercial Limousin beef breed as a control. Our results provide important information on the population history and the current status of the breeds and they can be further used for conservation and breeding purposes.

Abstract

The objective was to investigate the pattern of linkage disequilibrium (LD) in three local beef breeds, namely, Calvana (n = 174), Mucca Pisana (n = 270), and Pontremolese (n = 44). As a control group, samples of the Italian Limousin breed (n = 100) were used. All cattle were genotyped with the GeneSeek GGP-LDv4 33k SNP chip containing 30,111 SNPs. The genotype quality control for each breed was conducted separately, and SNPs with call rate < 0.95 and minor allele frequency (MAF) > 1% were used for the analysis. LD extent was estimated in PLINK v1.9 using the squared correlation between pairs of loci (r2) across autosomes. Moreover, r2 values were used to calculate historical and contemporary effective population size (Ne) in each breed. Average r2 was similar in Calvana and Mucca Pisana (~0.14) and higher in Pontremolese (0.17); Limousin presented the lowest LD extent (0.07). LD up to 0.11–0.15 was persistent in the local breeds up to 0.75 Mbp, while in Limousin, it showed a more rapid decay. Variation of different LD levels across autosomes was observed in all the breeds. The results demonstrated a rapid decrease in Ne across generations for local breeds, and the contemporary population size observed in the local breeds, ranging from 41.7 in Calvana to 17 in Pontremolese, underlined the demographic alarming situation.

1. Introduction

In recent years, there has been a greater interest in recovering and preserving local breeds, especially for their adaptation’s capacity in marginal areas and for their importance as reservoir of genetic diversity.
In this context, Tuscany (central region of Italy) represents an important pool of genetic diversity with six different cattle breeds native from this area [1]. Three of these six breeds (Calvana (CAL), Mucca Pisana (MUP), and Pontremolese (PON)) have been recognized from the Italian breeders Association (AIA; Associazione Italiana Allevatori, Rome) to be at risk of extinction and at present are enrolled to the register of cattle breeds at limited diffusion (Registro Anagrafico delle razze bovine autoctone a limitata diffusione). From those, PON faces a critical risk of extinction, consisting a limited number of animals (n = 49). Historically, PON originated from the provinces of Massa Carrara, La Spezia, and Parma and, in the past, was used for the transport of Carrara marble, being a robust and rustic breed. CAL (n = 366) originates from the Calvana mountain, in Prato’s province. It is particularly suitable to live in marginal areas and has ever been considered as a beef breed. MUP (n = 413) is mainly reared nowadays in the province of Pisa. The breed is a crossbreed, mainly between Schwyz and Chianina breeds [2,3]. Hence, the historical, cultural, and ecological values of these three breeds are enormous and undisputed. Furthermore, the landscape of Tuscany placed barriers on widespread intensive breeding. As a result, the conservation and increase in sample sizes of local breeds could be of economic importance for this region.
Exploring genetic diversity is essential for developing conservation programs in autochthonous breeds, and one of the most commonly used parameters to assess genetic diversity is the effective population size (Ne) [4]. Ne is defined as the size of an ideal population that explains the same rate of random genetic changes as the current population [5,6], and has been traditionally estimated from the pedigree. However, with the advent of genomic technology, Ne can also be inferred from genomic data. The pedigree-based method requires an adequate completeness of data over several generations [7], and for local breeds, this is not an easy task for practical reasons related to breeding management. In a previous study [8], Ne from pedigrees was calculated for the same Tuscan breeds, but generally, the low quality of pedigrees and the alarming demographic situation of these breeds have raised the necessity to estimate the effective population size with genomic data; the linkage disequilibrium (LD) method was chosen for this purpose.
LD of genomic markers is related to various evolutionary forces, such as inbreeding, nonrandom mating, population bottleneck, drift, recombination, and mutation, and hence is an essential parameter to examine population history and genetic diversity. LD is defined as the nonrandom association between alleles at two (or more) loci [9]. In general, it is expected that the strength of LD decreases with an increased distance between markers located on the same chromosome. The term linkage disequilibrium (also known as gametic disequilibrium) goes back to Lewontin and Kojima, in 1960 [9,10]. The interest in studying LD patterns grew together with the research in genes associated with diseases [11,12]. Moreover, LD evaluation is an important prerequisite in genome wide association studies (GWAS), useful to detect the number of markers that will be sufficient for quantitative trait locus mapping [13,14]. Information of LD has been utilized in genomic breeding programs, such as marker-assisted selection and whole genome predictions [15]. The cost reduction and the efficient implementation of genotyping in animal breeding have made LD a common analysis in various species, such as pigs [16,17], poultry [18], sheep [19], goats [20], and cattle [21,22,23]. The most common LD measures are the squared genetic correlation coefficient (r2) described by Hill and Robertson [24] and D’ reported by Lewontin [10]. LD patterns in livestock populations have been analyzed to investigate (i) the structure and the history of populations [25], (ii) gene mapping [26], and (iii) the effective population size (Ne) with molecular data [7,27,28].
The aim of this study was to investigate the LD and the LD-based Ne patterns of three Italian beef cattle breeds (CAL, MUP, and PON) facing risk of extinction. The commercial Limousin beef breed (LIM) was included in the analysis to allow for comparisons between local unselected breeds and a cosmopolitan counterpart.

2. Materials and Methods

2.1. Animals and Sampling

A total of 588 beef cattle from four breeds (CAL = 174, MUP = 270, PON = 44, and LIM = 100) were genotyped. The percentage of the sampled cattle relative to the entire pool per breed for the local breeds was 47.5%, 65.4%, and 89.8% for CAL, MUP, and PON, respectively. Regarding the LIM, a sample of 100 cattle was extracted at random from a pool of 533 genotyped cattle, belonging to the last three generations and balanced by sex (52 males and 48 females). Genotypic data from LIM were provided by ANACLI (Associazione nazionale allevatori delle razze bovine Charolaise e Limousin, Roma) [29].

2.2. Genotyping and Quality Control

All cattle were genotyped with GeneSeek GGP-LDv4 33k (Illumina Inc., San Diego, CA, USA). Genotype quality control (QC) and data filtering were performed with PLINK v1.9 [30], and was conducted separately for each breed. Only SNPs located on the 29 autosome chromosomes were included (n = 28,289). SNPs with minor allele frequency (MAF) lower than 1%, and with call rate <0.95 were removed. Further, SNPs with more than 10% missingness values and deviated from Hardy Weinberg Equilibrium (p < 0.000001), as well as animals with more than 10% missingness, were also removed. After filtering, 164, 263, 41, and 100 cattle and 23,646, 23,436, 22,791, and 23,279 SNPs remained for CAL, MUP, PON, and LIM, respectively (Table 1).

2.3. Genomic Relationship Matrix

The genomic relationship matrix (GRM) was created per breed to (i) investigate the among breed identical by state relationships and (ii) compare the status of the genotypic samples among the populations under study. The GRM was calculated with the following formula by VanRaden [31]:
G R M = Z Z 2 p i ( 1 p i )
where Z is a centered matrix of marker genotypes of all individuals and pi is the frequency of the second allele at locus i. Z was calculated from genotypes of reference population subtracting 2pi from the matrix X that defines the genotypes for each individual as 0, 1 or 2. Heatmap graphs for the GRM of each breed were produced in R software [32].
For each GRM, the following parameters were taken into account: (i) the mean of the diagonal values; (ii) the mean of all the off-diagonal; (iii) the minimum and maximum of the diagonal values; and (iv) the minimum and maximum of the off-diagonal values. For all the above-mentioned parameters, the absolute value and squared root were also estimated.

2.4. Linkage Disequilibrium

Linkage disequilibrium was measured using r2, which is the squared correlation of the alleles at two loci [24]. The r2 is considered to be a better measure of LD than D’ because it is more robust and less sensitive to changes in effective population size and gene frequency [33,34]. The r2 ranges between 0 and 1 and was calculated as follows:
r 2 = ( f r e q   ( A B ) × f r e q   ( a b ) f r e q   ( A b ) × f r e q   ( a B ) ) 2 ( f r e q   ( A ) × f r e q   ( a ) × f r e q   ( B ) × f r e q   ( b ) )
where freq (A), freq (a), freq (B), and freq (b) are the allele frequencies and freq (AB), freq (ab), freq (Ab), and freq (aB) are the genotype frequencies. The LD extent was calculated for all SNPs pairs of each chromosome using PLINK v1.9 [30] under the command: --r2 –ld-window 99999 --ld-window-r2 0, in order to take an interval less than 99,999 SNPs and to save in the output all SNPs pairs.
The LD decay was analyzed in order to compare differences between and within breeds: (i) LDs were binned into four intervals of 0.25 Mbp (0–0.25, 0.25–0.5, 0.5–0.75, and 0.75–1 Mbp), the mean and the standard deviation (SD) of r2 values were computed for each interval; (ii) LD was also investigated for each autosome, considering windows of 1 Kbp. The LD-decay plots were visualized using the ggplot2 R package [35].

2.5. Estimation of Historical and Contemporary Effective Population Size

The historical and recent Ne for all breeds were estimated with the SNeP software [36], which is based on the relationships between LD, Ne, and the recombination rate. The default options were used. Ne was analyzed starting 13 generations ago because the default maximum distance in SNeP was 4000 Kbp. High LD in closely linked SNPs reflects ancient population history (50 Kbp ≈ 1500 generations ago), while high LD between distant SNPs describes more recent history (4000 Kbp ≈ 12.5 generations ago) [37]. The following formula was used to estimate Ne from LD [28]:
N e ( t ) = 1 ( 4 f ( c t ) )   (   1 E [ r a d j 2 | c t | α )
where Ne(t) is the effective population size estimated for t generation ago, which is calculated as t = 1 / ( 2 f ( c t ) ) [38]; ct is the recombination rate at t generations ago, defined for a specific physical distance between markers; r2adj is the LD estimate adjusted for sample size (164, 263, 41, and 100 for CAL, MUP, PON, and LIM, respectively); and α is a constant and set to 1, as suggested by Ohta and Kimura [39]. The contemporary effective population size (cNe) was calculated with NEESTIMATOR v.2 [40] with the mating model set to random; cNe means that the results referred to the time period of the sample size included in the analysis.

3. Results

3.1. Quality Control and Genomic Relationship Matrix

Table 1 summarizes the number of SNPs and individuals from each breed, before and after the quality control.
The GRM heatmaps per breed are presented in Figure 1. Individuals were ordered by farm to consider the farm management. Diagonal blocks, indicating highly related individuals, were mainly found for CAL and MUP. Moreover, this grouping was mainly attributed to the farm level. The off-diagonal values among the different farms indicate that there is reduced gene-flow among the farms, with the exception of CAL. In CAL, higher levels of relationship between individuals both between and within farms were observed. The lowest relatedness average between animals was found for LIM.
GRM summary statistics for each breed are reported in Table S1. The mean of the diagonal value was <1 in all breeds. The average of the diagonal values was 0.99 for CAL, MUP, and LIM, and 0.97 for PON. The highest diagonal values were 1.75, 1.72, 1.54, and 1.22 for MUP, PON, CAL, and LIM, respectively. The minimum diagonal values ranged from 0.67 (MUP) to 0.78 (LIM). The average of off-diagonal was negative for all breeds. The highest off-diagonal maximum was found in MUP (1.09), followed by PON (0.84) and CAL (0.79), while the lowest was found in LIM (0.50).

3.2. Linkage Disequilibrium

The total autosomal length in the analyzed SNP chip was 2512 Mbp, with the shortest Bos taurus autosome (BTA) being the BTA25 (~42.8 Mbp) and the longest being the BTA1 (~158.5 Mbp). The dimensions of each chromosome, the number of SNPs, the mean distance between SNPs, and the longest interval between pairwise SNPs for each chromosome are shown in Table S2. In all breeds analyzed, the highest number of SNPs was found on BTA1 (ranging from 1328 to 1282, in CAL and PON, respectively), and the smallest was found in BTA27 (from 451 to 435 SNPs in CAL and PON, respectively). The average distances between adjacent SNPs were 0.105, 0.106, and 108 ± 0.08 Mbp in CAL and MUP, LIM, and PON, respectively. The largest distances between two adjacent SNPs were found on BTA12, that is, 4.44, 2.89, 2.87, and 2.09 Mbp in LIM, CAL, MUP, and PON, respectively.
The average of the highest r2 values was found in PON (0.17), and the lowest in LIM (0.07), with CAL and MUP (0.14) at intermediate values. The mean and SD of r2 per chromosome and breed are summarized in Table 2. For CAL, the highest values were found in BTA20 (0.21), followed by BTA6 (0.18). BTA21 (0.19), BTA4, and BTA9 (0.17) were the three autosomes that showed the highest extent of LD in MUP. For PON, BTA16 had the highest r2 (0.23), followed by BTA21 (0.22), BTA20, and BTA1 (0.20). BTA20 and BTA16 were also the chromosomes with higher LD extent for LIM, albeit at a much lower degree (0.13 and 0.12, respectively). The lowest LD was found in BTA22 (for PON and LIM) and in BTA27 (for CAL, MUP, and LIM). For LIM, the lowest LD values (0.05) were found in more BTA apart from BTA22 and BTA27 (Table 2).
The distribution of r2 across BTA is shown in Figure 2. The highest variation was observed for PON. On the contrary, LIM had both the lowest variation and the lowest r2 values. Median values were similar across BTA within breed except for PON, where more fluctuations were present. The reduced area of the first quartiles indicated that the observations had similar values and were close to 0. The third quartiles were wider than the first in all breeds. Moreover, the whiskers, representing maximum (and minimum) values, suggested the high differentiation of LD extent within the local breed and between the local and commercial breeds.
In order to analyze the LD decay within a specific distance between SNPs, independently of differences within chromosomes, LD was investigated considering intervals of 0.25 Mbp up to 1 Mbp (Figure 3). Differences among breeds were observed, especially for LIM, which was clearly separated from the local breeds with a steeper decay with an increasing distance.
PON and LIM presented the two opposite extremes (top and bottom on Figure 3, respectively) while CAL and MUP had similar and intermediate trends. For PON, an r2 close to 0.15 was maintained even at 1 Mbp distance, while for CAL and MUP, it was maintained at ~0.1. For LIM, r2 < 0.1 was found for distances >0.12 Mbp (Figure 3). In general, for the local breeds, a sharp decay was observed till ~0.12 Mbp, while for LIM, the r2 was stabilized after ~0.25 Mbp. Average r2 for each bin was reported in Table S3. Values of r2 > 0.2 in the first bin were found only for PON, while from 0.25 to 1 Mbp, values remained close to 0.15. MUP and CAL had similar trends, with r2 ~0.19 ± 0.25 in the first interval, whereas from 0.25 to 0.5 Mbp, CAL showed higher mean r2 value than MUP; while with the longest distance (>0.75 and up to 1 Mbp), r2 was close to 0.10 for both breeds. LIM showed the more rapid LD decay, with average r2 ranging from 0.141 (<0.25 Mbp) to 0.035 (>0.75 Mbp).
The r2 was also calculated per BTA in windows of 1 Kbp. The LD decay was found to be different among chromosomes, but with a similar trend in the local breeds (Figure S1). CAL and PON presented a higher variation of r2 within the analyzed interval of 1 Kbp; indeed, the decay did not linearly decrease, while LIM showed a faster decay.

3.3. Estimation of Historical and Contemporary Effective Population Size

On the basis of LD estimates, the trend of Ne over time (per generation) for each breed was investigated (Figure 4). Similar to LD, the Ne pattern of the local breeds was different from that of LIM. The decrease in Ne was clear in all breeds, with a sharper decay for LIM. Ne decreased from ~275 (80th generation ago) to 79 and 65 (13th generation ago) in CAL and MUP, respectively, while for PON, Ne estimates decreased from 204 to 45 (in the most distant and recent generation, respectively). The historical trend of LIM was very different than those for the local breeds, decreasing from 920 to 310 (80th to 13th generations ago).
Regarding the contemporary Ne (cNe), CAL, MUP, PON, and LIM showed values of 41.7, 18.7, 17.0, and 327.9, respectively.

4. Discussion

In this study, LD was investigated to assess the genomic architecture and the evolutionary history of autochthonous populations. LD provides various information that is used in several applications. For example, LD between linked markers determines the power and the precision of association mapping studies, because it influences the ability to localize genes, loci affecting economic traits, and diseases. Indeed, understanding the extent of LD improves the planning and the performance of genomic breeding programs [41]. However, direct comparison between the studies should be done with caution, owing to some factors that influence LD estimates, that is, sample size, population history and structure, LD measure (r2 or D′), marker type (microsatellites or SNPs), marker filtering, density, and distribution [34]. In the present study, a genome-wide LD extent and Ne parameters were calculated for three Italian local beef breeds (CAL, MUP, and PON). The results were contrasted to the cosmopolitan Limousin, a commercial beef breed undergoing selection. This is the first study that performed an in-depth analysis of LD extent in these three Italian local breeds. In the last years, great importance has been given to genetic diversity and has underlined local breeds as important genetic resources as they harbor unique gene pools as a result of adaptation to the local environment [42]. In the absence of high-quality pedigree information, a GRM was constructed for each breed to define the degree of relatedness within a sampled breed. This information was used to investigate potential differences among the breeds that might be responsible for differences in LD and Ne patterns. It is known that, in a closed inbred population, the recombination decreases and the LD increases [43]. The heatmaps showed that this situation has been avoided; farm groups were present, especially in CAL, but the relatedness within them was not found to be worrisome. Differences in LD and Ne estimates of LIM compared with local breeds could be a result of different levels of relationship among the animals sampled, but, most of all, differences could be caused by the greater sample size and the breeding program used for LIM, which is absent for local breeds.
Regarding LD extent, our results revealed an r2 variation among BTA within the breed. This could be partly attributed to different lengths of the chromosomes [22]. However, BTA1, which was the longest chromosome, presented a high LD level only in PON. Interestingly, BTA16, BTA20, and BTA21 were characterized by high r2 values shared in more than one breed: BTA16 showed high r2 values in PON and LIM; BTA20 in CAL, PON, and LIM; and BTA21 in PON and MUP. These autosomes should be investigated in further works because the markers with high r2 values could reveal potential Quantitative Trait Loci (QTLs).
Average LD was different between local breeds and LIM. LIM is one of the most reared beef breeds in Italy, and was analyzed in this study to compare local breeds LD patterns with that belonging to a breed undergoing directional selection. In general, in commercial breeds, artificial selection and insemination allow the control of matings, and thus the inbreeding, which is further linked to LD, as inbreeding augments the covariance between alleles at different loci [9]. Furthermore, if the breed had an expansion in population size or consists of a large population, genetic drift is weaker and, as a consequence, LD decreases, converging to an equilibrium [43]. The higher LD observed in the local breeds is likely related to a higher ancestral relatedness and to a historically smaller Ne [44], as shown in Figure 4. Moreover, the LD decay of the local breeds had the characteristic trend of populations that have suffered a collapse in population size and/or a bottleneck, as described by Rogers [43]. A previous study on Italian local cattle breeds carried out by Mastrangelo et al. [45] confirmed the slow LD decay found in this study. Mastrangelo et al. [44] analyzed two local breeds reared in Sicily, an island in the South of Italy. The r2 values were 0.16 for Cinisara and 0.20 for Modicana. No other studies on the level of LD in Italian local cattle were found in the literature. Hence, comparisons were done with foreign breeds. Mustafa et al. [46] investigated the Sahiwal dairy breed, which is under threat of extinction, showing an average r2 value equal to 0.18, which is similar to our Tuscan local breeds (ranged from 0.14 to 0.17). Nevertheless, differences were found for the LD decay, which was significantly more rapid than in Tuscan populations. Tunisian local breeds, consisting of big populations, studied by Jemaa et al. [22], showed more similar LD patterns to LIM than the Tuscan breeds, with r2 values lower than 0.05 in larger distances (>0.5 Mbp). Another study that corroborated the hypothesis that the decay of Tuscan breeds was slower than other local breeds was performed by Makina et al. [47], who studied four local African cattle compared with Angus and Holstein breeds. In all six breeds, at 1 Mbp, r2 values were lower than 0.1 and in the four local breeds (Afrikaner, Nguni, Drakensberger, and Bonsmara), and r2 did not exceed the value of 0.05 (in our study, all local populations had r2 > 0.1 at 1 Mbp distance).
More studies on LD on commercial beef cattle breeds can be found in the literature. For instance, in Chinese Simmental breed [48] in the window of 0.5–1 Mbp, the r2 was 0.05. This is similar to our estimates in LIM (0.03) and significantly lower than our findings in the local breeds (0.11 for CAL and MUP and 0.15 for PON). Biegelmeyer et al. [49] investigated the LD patterns in Hereford and Braford, and found average r2 values equal to 0.07 and 0.06, respectively, for the same window of 1 Mbp. Both breeds showed a great level of LD in the short term, suggesting a faster decay than in Tuscan breeds and more similar to LIM decay.
Other studies always based on selected breeds reported smaller average r2 by chromosomes for Limousin breed [15,50] than that found in the present research, likely owing to the different history of populations and to a more ancient breeding selection system compared with Italian Limousin.
Sample size is another factor that influences the estimate of LD. Khatkar et al. [51] declared that, if r2 parameter is used to calculate LD, a minimum sample size of 75 animals is required both for an accurate estimation and to avoid bias. LIM, as well as CAL and MUP, consisting of larger populations than PON, were sampled considering the threshold of the aforementioned study. In our analysis, only PON did not reach this threshold. However, this was because, at present, only 49 PON alive animals are officially enrolled to the register of cattle at limited diffusion. Hence, our samples referred to ~81% of the entire PON population. Regarding the two other local breeds, the genotypic data analyzed represented 47.5% and 65.4% of Calvana and Mucca Pisana, respectively. Overall, our study considered a much higher number of genotyped animals per breed compared with previous studies on autochthonous cattle breeds [7,16,20,22,23,45,52].
In fact, sample size could be the cause of the different results found in the present research when compared with previous studies. For example, Kukučková et al. [37] investigated the trend of historical Ne of 15 European cattle breeds, Limousin included, and reported lower values (Ne was equal to ~300 in the 60th generation ago) than the Italian LIM, but the sample size of that study was limited to 20 animals. Mastrangelo et al. [3] investigated the trends of Ne of the same Tuscan breeds, among a plethora of cattle breeds, and related different trends of historical effective population size. The recent Ne (13th generation) was less than 100, as in our study, but the trends for PON and MUP contrast with this latter with the lowest values and no overlapping with Calvana as instead related in the present study. Regarding LIM, both studies showed different trends between local and cosmopolitan breeds, but the LIM historical Ne was higher in our study (80th generation Ne was 920, whereas it was <500 in Mastrangelo et al. [3]). Obviously, the contemporary Ne (cNe) also presented some differences, even maintaining the same ranking: cNe was greater in the present study for the Tuscan breeds, namely, 41.7, 18.7, and 17 (for CAL, MUP, and PON, respectively) and 33.5, 8.7, and 7.2, respectively, for the same breeds in the cited paper. Only LIM had lower cNe value (327.9 instead of 468.9), probably because of the sampling strategy (only animals of the last three generations were taken). However, CAL, and definitely MUP and PON, were at risk of loss of genetic diversity, presenting Ne lower than 50 animals for generation, which is the threshold suggested by Food and Agriculture Organization (FAO) [53].

5. Conclusions

Our results, utilizing a medium-density SNP chip, demonstrated differences in LD and Ne patterns in four beef breeds, three locals under extinction (Calvana, Mucca Pisana, and Pontremolese) and one cosmopolitan (Limousin), reared in Italy. The greater genetic diversity loss, as dictated by the Ne estimates, was found for Mucca Pisana. This work complements previous analysis carried out on pedigree information, both describing the population structure of the three local breeds. It is necessary to carefully monitor these populations and to strengthen the use of accurate mating plans in order to both increase the census size and control relatedness and inbreeding. LD results could be utilized in association studies as well as in the development of low-density SNP chip panels, for example, for parentage testing. Future studies could focus on the genomic regions with high r2 values, as well as investigate runs of homozygosity in the three local breeds.

Supplementary Materials

The following are available online at https://www.mdpi.com/2076-2615/10/6/1034/s1, Figure S1: Linkage Disequilibrium decay described for each chromosome considering intervals of 1 Kbp; Table S1: Genomic relationship matrix (GRM) summary statistics for each breed; Table S2: Summary of analyzed SNPs for each breed, split by autosomal chromosomes; Table S3: Average and standard deviation (SD) of linkage disequilibrium (r2), estimated in four intervals of 0.25 Mbp, per breed.

Author Contributions

R.B. and C.D. conceived the idea, formulated the objectives of this study, and supervised the project. M.C.F. conducted the analysis and wrote the first draft of the paper. C.D. helped in data preparation and in supervising the project. All authors read and approved the final manuscript.

Funding

This research was funded by ANACLI, grant number 2015.99.2264.1127.

Acknowledgments

The research was funded by ANACLI through the I-BEEF project PSRN 2014–2020. Sottomisura 10.2: Biodiversità animale. We acknowledge Associazione Nazionale Allevatori delle razze bovine Charolaise e Limousine (ANACLI) for providing the data.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Bozzi, R.; Alvarez, I.; Crovetti, A.; Fernández, I.; De Petris, D.; Goyache, F. Assessing priorities for conservation in Tuscan cattle breeds using microsatellites. Anim. Int. J. Anim. Biosci. 2012, 6, 203–211. [Google Scholar] [CrossRef] [Green Version]
  2. Negrini, R.; Milanesi, E.; Bozzi, R.; Pellecchia, M.; Ajmone-Marsan, P. Tuscany autochthonous cattle breeds: An original genetic resource investigated by AFLP markers. J. Anim. Breed. Genet. 2006, 123, 10–16. [Google Scholar] [CrossRef]
  3. Mastrangelo, S.; Ciani, E.; Ajmone Marsan, P.; Bagnato, A.; Battaglini, L.; Bozzi, R.; Carta, A.; Catillo, G.; Cassandro, M.; Casu, S.; et al. Conservation status and historical relatedness of Italian cattle breeds. Genet. Sel. Evol. 2018, 50, 35. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Leroy, G.; Mary-Huard, T.; Verrier, E.; Danvy, S.; Charvolin, E.; Danchin-Burge, C. Methods to estimate effective population size using pedigree data: Examples in dog, sheep, cattle and horse. Genet. Sel. Evol. 2013, 45, 1. [Google Scholar] [CrossRef] [Green Version]
  5. Palstra, F.P.; Ruzzante, D.E. Genetic estimates of contemporary effective population size: What can they tell us about the importance of genetic stochasticity for wild population persistence? Mol. Ecol. 2008, 17, 3428–3447. [Google Scholar] [CrossRef] [PubMed]
  6. Crow, J.F.; Kimura, M. An introduction to population genetics theory. Introd. Popul. Genet. Theory 1970, 2, 174–208. [Google Scholar]
  7. Flury, C.; Tapio, M.; Sonstegard, T.; Drögemüller, C.; Leeb, T.; Simianer, H.; Hanotte, O.; Rieder, S. Effective population size of an indigenous Swiss cattle breed estimated from linkage disequilibrium. J. Anim. Breed. Genet. 2010, 127, 339–347. [Google Scholar] [CrossRef]
  8. Fabbri, M.C.; Gonçalves de Rezende, M.P.; Dadousis, C.; Biffani, S.; Negrini, R.; Souza Carneiro, P.L.; Bozzi, R. Population Structure and Genetic Diversity of Italian Beef Breeds as a Tool for Planning Conservation and Selection Strategies. Animals 2019, 9, 880. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Slatkin, M. Linkage disequilibrium—understanding the evolutionary past and mapping the medical future. Nat. Rev. Genet. 2008, 9, 477–485. [Google Scholar] [CrossRef] [Green Version]
  10. Lewontin, R.C. The Interaction of Selection and Linkage. I. General Considerations; Heterotic Models. Genetics 1964, 49, 49–67. [Google Scholar]
  11. Park, L. Population-specific long-range linkage disequilibrium in the human genome and its influence on identifying common disease variants. Sci. Rep. 2019, 9, 1–13. [Google Scholar] [CrossRef] [PubMed]
  12. Kruglyak, L. Prospects for whole-genome linkage disequilibrium mapping of common disease genes. Nat. Genet. 1999, 22, 139–144. [Google Scholar] [CrossRef] [PubMed]
  13. Lee, S.H.; Cho, Y.M.; Lim, D.; Kim, H.C.; Choi, B.H.; Park, H.S.; Kim, O.H.; Kim, S.; Kim, T.H.; Yoon, D.; et al. Linkage Disequilibrium and Effective Population Size in Hanwoo Korean Cattle. Asian-Australas. J. Anim. Sci. 2011, 24, 1660–1665. [Google Scholar] [CrossRef]
  14. Meuwissen, T.H.; Goddard, M.E. Fine mapping of quantitative trait loci using linkage disequilibria with closely linked marker loci. Genetics 2000, 155, 421–430. [Google Scholar] [PubMed]
  15. Lu, D.; Sargolzaei, M.; Kelly, M.; Li, C.; Vander Voort, G.; Wang, Z.; Plastow, G.; Moore, S.; Miller, S.P. Linkage disequilibrium in Angus, Charolais, and Crossbred beef cattle. Front. Genet. 2012, 3. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Muñoz, M.; Bozzi, R.; García-Casco, J.; Núñez, Y.; Ribani, A.; Franci, O.; García, F.; Škrlep, M.; Schiavo, G.; Bovo, S.; et al. Genomic diversity, linkage disequilibrium and selection signatures in European local pig breeds assessed with a high density SNP chip. Sci. Rep. 2019, 9, 1–14. [Google Scholar] [CrossRef]
  17. Du, F.-X.; Clutter, A.C.; Lohuis, M.M. Characterizing Linkage Disequilibrium in Pig Populations. Int. J. Biol. Sci. 2007, 166–178. [Google Scholar] [CrossRef] [PubMed]
  18. Seo, D.; Lee, D.H.; Choi, N.; Sudrajad, P.; Lee, S.-H.; Lee, J.-H. Estimation of linkage disequilibrium and analysis of genetic diversity in Korean chicken lines. PLoS ONE 2018, 13, e0192063. [Google Scholar] [CrossRef] [Green Version]
  19. Ghoreishifar, S.M.; Moradi-Shahrbabak, H.; Parna, N.; Davoudi, P.; Khansefid, M. Linkage disequilibrium and within-breed genetic diversity in Iranian Zandi sheep. Arch. Tierz. 2019, 62, 143–151. [Google Scholar] [CrossRef]
  20. Monau, P.I.; Visser, C.; Muchadeyi, F.C.; Okpeku, M.; Nsoso, S.J.; Van Marle-Köster, E. Population structure of indigenous southern African goats based on the Illumina Goat50K SNP panel. Trop. Anim. Health Prod. 2020. [Google Scholar] [CrossRef]
  21. Edea, Z.; Dadi, H.; Dessie, T.; Lee, S.-H.; Kim, K.-S. Genome-wide linkage disequilibrium analysis of indigenous cattle breeds of Ethiopia and Korea using different SNP genotyping BeadChips. Genes Genom. 2015, 37, 759–765. [Google Scholar] [CrossRef]
  22. Jemaa, S.B.; Thamri, N.; Mnara, S.; Rebours, E.; Rocha, D.; Boussaha, M.; Jemaa, S.B.; Thamri, N.; Mnara, S.; Rebours, E.; et al. Linkage disequilibrium and past effective population size in native Tunisian cattle. Genet. Mol. Biol. 2019, 42, 52–61. [Google Scholar] [CrossRef] [PubMed]
  23. Cañas-Álvarez, J.J.; Mouresan, E.F.; Varona, L.; Díaz, C.; Molina, A.; Baro, J.A.; Altarriba, J.; Carabaño, M.J.; Casellas, J.; Piedrafita, J. Linkage disequilibrium, persistence of phase, and effective population size in Spanish local beef cattle breeds assessed through a high-density single nucleotide polymorphism chip. J. Anim. Sci. 2016, 94, 2779–2788. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Hill, W.G.; Robertson, A. Linkage disequilibrium in finite populations. Theor. Appl. Genet. 1968, 38, 226–231. [Google Scholar] [CrossRef]
  25. Gautier, M.; Faraut, T.; Moazami-Goudarzi, K.; Navratil, V.; Foglio, M.; Grohs, C.; Boland, A.; Garnier, J.-G.; Boichard, D.; Lathrop, G.M.; et al. Genetic and Haplotypic Structure in 14 European and African Cattle Breeds. Genetics 2007, 177, 1059–1070. [Google Scholar] [CrossRef]
  26. Arias, J.A.; Keehan, M.; Fisher, P.; Coppieters, W.; Spelman, R. A high density linkage map of the bovine genome. BMC Genet. 2009, 10, 18. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Wang, J. Estimation of effective population sizes from data on genetic markers. Philos. Trans. R. Soc. B Biol. Sci. 2005, 360, 1395–1409. [Google Scholar] [CrossRef] [Green Version]
  28. Corbin, L.J.; Liu, A.Y.H.; Bishop, S.C.; Woolliams, J.A. Estimation of historical effective population size using linkage disequilibria with marker data. J. Anim. Breed. Genet. 2012, 129, 257–270. [Google Scholar] [CrossRef]
  29. Statistics of the Italian Limousine and Charolaise Breeds, ANACLI Site. Available online: http://www.anacli.it/WEBSITE/index.php?&pagid=2499&sessione= (accessed on 12 February 2020).
  30. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.R.; Bender, D.; Maller, J.; Sklar, P.; de Bakker, P.I.W.; Daly, M.J.; et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef] [Green Version]
  31. VanRaden, P.M. Efficient methods to compute genomic predictions. J. Dairy Sci. 2008, 91, 4414–4423. [Google Scholar] [CrossRef] [Green Version]
  32. The R Project for Statistical Computing. Available online: https://www.r-project.org/ (accessed on 31 January 2020).
  33. Zhao, H.; Nettleton, D.; Dekkers, J.C.M. Evaluation of linkage disequilibrium measures between multi-allelic markers as predictors of linkage disequilibrium between single nucleotide polymorphisms. Genet. Res. 2007, 89, 1–6. [Google Scholar] [CrossRef]
  34. Jasielczuk, I.; Gurgul, A.; Szmatoła, T.; Semik-Gurgul, E.; Pawlina-Tyszko, K.; Szyndler-Nędza, M.; Blicharski, T.; Szulc, K.; Skrzypczak, E.; Bugno-Poniewierska, M. Comparison of linkage disequilibrium, effective population size and haplotype blocks in Polish Landrace and Polish native pig populations. Livest. Sci. 2020, 231, 103887. [Google Scholar] [CrossRef]
  35. Wickham, H. ggplot2: Elegant Graphics for Data Analysis; Springer: New York, NY, USA, 2016; ISBN 978-3-319-24277-4. [Google Scholar]
  36. Barbato, M.; Orozco-terWengel, P.; Tapio, M.; Bruford, M.W. SNeP: A tool to estimate trends in recent effective population size trajectories using genome-wide SNP data. Front. Genet. 2015, 6. [Google Scholar] [CrossRef] [Green Version]
  37. Kukučková, V.; Moravčíková, N.; Ferenčaković, M.; Simčič, M.; Mészáros, G.; Sölkner, J.; Trakovická, A.; Kadlečík, O.; Curik, I.; Kasarda, R. Genomic characterization of Pinzgau cattle: Genetic conservation and breeding perspectives. Conserv. Genet. 2017, 18, 893–910. [Google Scholar] [CrossRef]
  38. Hayes, B.J. Novel Multilocus Measure of Linkage Disequilibrium to Estimate Past Effective Population Size. Genome Res. 2003, 13, 635–643. [Google Scholar] [CrossRef] [Green Version]
  39. Ohta, T.; Kimura, M. Linkage Disequilibrium between Two Segregating Nucleotide Sites under the Steady Flux of Mutations in a Finite Population. Genetics 1971, 68, 571–580. [Google Scholar]
  40. Do, C.; Waples, R.S.; Peel, D.; Macbeth, G.M.; Tillett, B.J.; Ovenden, J.R. NeEstimator v2: Re-implementation of software for the estimation of contemporary effective population size (Ne) from genetic data. Mol. Ecol. Resour. 2014, 14, 209–214. [Google Scholar] [CrossRef]
  41. Qanbari, S. On the Extent of Linkage Disequilibrium in the Genome of Farm Animals. Front. Genet. 2020, 10. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Upadhyay, M.; Eriksson, S.; Mikko, S.; Strandberg, E.; Stålhammar, H.; Groenen, M.A.M.; Crooijmans, R.P.M.A.; Andersson, G.; Johansson, A.M. Genomic relatedness and diversity of Swedish native cattle breeds. Genet. Sel. Evol. 2019, 51, 56. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  43. Rogers, A.R. How Population Growth Affects Linkage Disequilibrium. Genetics 2014, 197, 1329–1341. [Google Scholar] [CrossRef] [Green Version]
  44. Pérez O’Brien, A.M.; Mészáros, G.; Utsunomiya, Y.T.; Sonstegard, T.S.; Garcia, J.F.; Van Tassell, C.P.; Carvalheiro, R.; da Silva, M.V.B.; Sölkner, J. Linkage disequilibrium levels in Bos indicus and Bos taurus cattle using medium and high density SNP chip data and different minor allele frequency distributions. Livest. Sci. 2014, 166, 121–132. [Google Scholar] [CrossRef]
  45. Mastrangelo, S.; Saura, M.; Tolone, M.; Salces-Ortiz, J.; Di Gerlando, R.; Bertolini, F.; Fontanesi, L.; Sardina, M.T.; Serrano, M.; Portolano, B. The genome-wide structure of two economically important indigenous Sicilian cattle breeds1. J. Anim. Sci. 2014, 92, 4833–4842. [Google Scholar] [CrossRef] [PubMed]
  46. Mustafa, H.; Ahmad, N.; Heather, H.J.; Eui-soo, K.; Khan, W.A.; Ajmal, A.; Javed, K.; Pasha, T.N.; Ali, A.; Kim, J.J.; et al. Whole genome study of linkage disequilibrium in Sahiwal cattle. S. Afr. J. Anim. Sci. 2018, 48, 353–360. [Google Scholar] [CrossRef] [Green Version]
  47. Makina, S.O.; Taylor, J.F.; van Marle-Köster, E.; Muchadeyi, F.C.; Makgahlela, M.L.; MacNeil, M.D.; Maiwashe, A. Extent of Linkage Disequilibrium and Effective Population Size in Four South African Sanga Cattle Breeds. Front. Genet. 2015, 6. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Zhu, M.; Zhu, B.; Wang, Y.H.; Wu, Y.; Xu, L.; Guo, L.P.; Yuan, Z.R.; Zhang, L.P.; Gao, X.; Gao, H.J.; et al. Linkage Disequilibrium Estimation of Chinese Beef Simmental Cattle Using High-density SNP Panels. Asian-Australas. J. Anim. Sci. 2013, 26, 772–779. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  49. Biegelmeyer, P.; Gulias-Gomes, C.C.; Caetano, A.R.; Steibel, J.P.; Cardoso, F.F. Linkage disequilibrium, persistence of phase and effective population size estimates in Hereford and Braford cattle. BMC Genet. 2016, 17, 32. [Google Scholar] [CrossRef] [Green Version]
  50. Rodríguez-Ramilo, S.T.; García-Cortés, L.A.; de Cara, M.Á.R. Artificial selection with traditional or genomic relationships: Consequences in coancestry and genetic diversity. Front. Genet. 2015, 6. [Google Scholar] [CrossRef] [Green Version]
  51. Khatkar, M.S.; Nicholas, F.W.; Collins, A.R.; Zenger, K.R.; Cavanagh, J.A.L.; Barris, W.; Schnabel, R.D.; Taylor, J.F.; Raadsma, H.W. Extent of genome-wide linkage disequilibrium in Australian Holstein-Friesian cattle based on a high-density SNP panel. BMC Genom. 2008, 9, 187. [Google Scholar] [CrossRef] [Green Version]
  52. Sharma, A.; Lim, D.; Chai, H.-H.; Choi, B.-H.; Cho, Y. Demographic Trends in Korean Native Cattle Explained Using Bovine SNP50 Beadchip. Genom. Inform. 2016, 14, 230–233. [Google Scholar] [CrossRef] [Green Version]
  53. Woolliams, J.A.; Gwaze, D.P.; Meuwissen, T.H.E.; Planchenault, D.; Renard, J.-P.; Thibier, M.; Wagner, H. Secondary Guidelines for Development of National Farm Animal Genetic Resources Management Plans–Measurement of Domestic Animal Diversity (MoDAD); FAO: Rome, Italy, 1998; p. 58. [Google Scholar]
Figure 1. Heatmaps of genomic relationship matrix of (A) Calvana, (B) Mucca Pisana, (C) Pontremolese, and (D) Limousin breeds.
Figure 1. Heatmaps of genomic relationship matrix of (A) Calvana, (B) Mucca Pisana, (C) Pontremolese, and (D) Limousin breeds.
Animals 10 01034 g001aAnimals 10 01034 g001b
Figure 2. Boxplots of linkage disequilibrium (LD) (r2) of Bos taurus autosomes (BTAs) per breed. The yellow asterisks represent the mean and horizontal lines within each boxplot are the medians (outliers have been removed). CAL = Calvana; MUP = Mucca Pisana; PON = Pontremolese, LIM = Limousin.
Figure 2. Boxplots of linkage disequilibrium (LD) (r2) of Bos taurus autosomes (BTAs) per breed. The yellow asterisks represent the mean and horizontal lines within each boxplot are the medians (outliers have been removed). CAL = Calvana; MUP = Mucca Pisana; PON = Pontremolese, LIM = Limousin.
Animals 10 01034 g002
Figure 3. Linkage disequilibrium (LD), calculated as r2, in different distances of the genome and up to 1 Mbp for Calvana (CAL), Mucca Pisana (MUP), Pontremolese (PON), and Limousin (LIM).
Figure 3. Linkage disequilibrium (LD), calculated as r2, in different distances of the genome and up to 1 Mbp for Calvana (CAL), Mucca Pisana (MUP), Pontremolese (PON), and Limousin (LIM).
Animals 10 01034 g003
Figure 4. Average estimated historical effective population size (Ne) in the Italian breeds: Calvana (CAL), Mucca Pisana (MUP), Pontremolese (PON), and Limousin (LIM).
Figure 4. Average estimated historical effective population size (Ne) in the Italian breeds: Calvana (CAL), Mucca Pisana (MUP), Pontremolese (PON), and Limousin (LIM).
Animals 10 01034 g004
Table 1. Number of autosomal SNPs and individuals before (pre-) and after (post-)quality control (QC) per breed.
Table 1. Number of autosomal SNPs and individuals before (pre-) and after (post-)quality control (QC) per breed.
Breed 1 N SNPs Pre-QCN SNPs Post-QCN Individuals Pre-QCN Individuals Post-QC
CAL28,28923,646174164
MUP28,28923,436270263
PON28,28922,7914441
LIM28,28923,279100100
1 CAL = Calvana; MUP = Mucca Pisana; PON = Pontremolese; LIM = Limousin.
Table 2. The average and standard deviation (SD) of linkage disequilibrium (r2) for Bos taurus autosomes (BTAs) per breed.
Table 2. The average and standard deviation (SD) of linkage disequilibrium (r2) for Bos taurus autosomes (BTAs) per breed.
Breed 1CALMUPPONLIM
AutosomeAverage r2SDAverage r2SDAverage r2SDAverage r2SD
BTA10.140.200.130.200.200.250.080.16
BTA20.160.230.130.190.170.220.100.18
BTA30.150.210.120.170.160.220.080.15
BTA40.140.190.170.260.200.260.100.22
BTA50.150.200.150.200.180.230.070.12
BTA60.180.260.140.200.190.230.080.14
BTA70.120.160.130.170.170.210.070.12
BTA80.140.190.110.160.160.210.060.11
BTA90.150.220.170.230.150.210.080.17
BTA100.130.180.130.170.150.180.060.10
BTA110.120.180.120.160.150.180.060.11
BTA120.120.170.140.190.170.210.060.10
BTA130.150.210.120.170.160.200.050.09
BTA140.130.180.130.180.190.230.060.11
BTA150.140.180.150.210.200.240.090.17
BTA160.150.220.150.220.230.280.120.20
BTA170.130.180.150.200.150.180.060.10
BTA180.130.170.150.200.170.200.060.12
BTA190.150.200.140.190.160.210.070.13
BTA200.210.270.150.210.200.250.130.21
BTA210.140.210.190.240.220.250.100.18
BTA220.120.160.110.160.140.170.050.08
BTA230.140.200.140.190.150.190.080.15
BTA240.140.190.120.160.170.220.060.11
BTA250.130.170.120.160.150.190.060.10
BTA260.130.170.130.180.180.220.050.09
BTA270.110.150.100.140.150.180.050.09
BTA280.120.160.130.160.150.190.060.10
BTA290.150.220.120.170.200.250.050.09
1 CAL = Calvana; MUP = Mucca Pisana; PON = Pontremolese, LIM = Limousin.

Share and Cite

MDPI and ACS Style

Fabbri, M.C.; Dadousis, C.; Bozzi, R. Estimation of Linkage Disequilibrium and Effective Population Size in Three Italian Autochthonous Beef Breeds. Animals 2020, 10, 1034. https://doi.org/10.3390/ani10061034

AMA Style

Fabbri MC, Dadousis C, Bozzi R. Estimation of Linkage Disequilibrium and Effective Population Size in Three Italian Autochthonous Beef Breeds. Animals. 2020; 10(6):1034. https://doi.org/10.3390/ani10061034

Chicago/Turabian Style

Fabbri, Maria Chiara, Christos Dadousis, and Riccardo Bozzi. 2020. "Estimation of Linkage Disequilibrium and Effective Population Size in Three Italian Autochthonous Beef Breeds" Animals 10, no. 6: 1034. https://doi.org/10.3390/ani10061034

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop