Next Article in Journal
Green Extraction Technology of Polyphenols from Food By-Products
Next Article in Special Issue
Recent Advances in OMICs Technologies and Application for Ensuring Meat Quality, Safety and Authenticity
Previous Article in Journal
Nutritional and Antioxidant Properties of Moringa oleifera Leaves in Functional Foods
Previous Article in Special Issue
Preliminary Results about Lamb Meat Tenderness Based on the Study of Novel Isoforms and Alternative Splicing Regulation Pathways Using Iso-seq, RNA-seq and CTCF ChIP-seq Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Interlaboratory Validation of a DNA Metabarcoding Assay for Mammalian and Poultry Species to Detect Food Adulteration

1
Austrian Agency for Health and Food Safety (AGES), Department for Molecular Biology and Microbiology, Institute for Food Safety Vienna, Spargelfeldstrasse 191, 1220 Vienna, Austria
2
QuoData GmbH, Prellerstrasse 14, 01309 Dresden, Germany
3
Department of Analytical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 38, 1090 Vienna, Austria
*
Authors to whom correspondence should be addressed.
Foods 2022, 11(8), 1108; https://doi.org/10.3390/foods11081108
Submission received: 5 March 2022 / Revised: 5 April 2022 / Accepted: 8 April 2022 / Published: 12 April 2022

Abstract

:
Meat species authentication in food is most commonly based on the detection of genetic variations. Official food control laboratories frequently apply single and multiplex real-time polymerase chain reaction (PCR) assays and/or DNA arrays. However, in the near future, DNA metabarcoding, the generation of PCR products for DNA barcodes, followed by massively parallel sequencing by next generation sequencing (NGS) technologies, could be an attractive alternative. DNA metabarcoding is superior to well-established methodologies since it allows simultaneous identification of a wide variety of species not only in individual foodstuffs but even in complex mixtures. We have recently published a DNA metabarcoding assay for the identification and differentiation of 15 mammalian species and six poultry species. With the aim to harmonize analytical methods for food authentication across EU Member States, the DNA metabarcoding assay has been tested in an interlaboratory ring trial including 15 laboratories. Each laboratory analyzed 16 anonymously labelled samples (eight samples, two subsamples each), comprising six DNA extract mixtures, one DNA extract from a model sausage, and one DNA extract from maize (negative control). Evaluation of data on repeatability, reproducibility, robustness, and measurement uncertainty indicated that the DNA metabarcoding method is applicable for meat species authentication in routine analysis.

1. Introduction

Food authentication is known to be a challenging task. The methodology applied depends on several factors, including sample type, type of adulteration, and the information required. Meat products are most commonly adulterated by the replacement of high-priced animal species by lower-quality or cheaper ones. Since DNA-based methodologies are highly suitable to detect genetic variations such as single nucleotide polymorphisms (SNPs), insertions, and deletions, they play a crucial role in the identification and differentiation of animal species in meat products [1,2,3]. DNA-based methodologies target either species-specific sequences in nuclear DNA or conserved regions in mitochondrial DNA [4]. Currently, authentication of meat products in official food laboratories is mainly based on real-time polymerase chain reaction (PCR) assays and/or DNA arrays. Multiplex real-time PCR assays, allowing the identification and quantification of multiple species in one and the same well, are particularly applicable for routine analysis because they allow saving time and costs. Multiplex real-time PCR assays have not only been developed for domesticated species, e.g., beef, pig, chicken, and turkey [5], and beef, pig, horse, and sheep [6], but also for game species, e.g., roe deer, red deer, fallow deer, and sika deer [7]. However, the low number of optical channels of real-time PCR instruments limits the number of species that can be targeted simultaneously.
Next generation sequencing (NGS) technologies, in particular massively parallel sequencing of PCR products based on the analysis of species-specific differences in DNA sequences (DNA barcoding), is being considered a promising alternative [8,9,10]. So-called DNA metabarcoding offers the possibility to identify a wide variety of species not only in individual foodstuffs but even in complex mixtures. Moreover, in contrast to real-time PCR assays, it is an untargeted approach, allowing the detection of species one has not been looking for.
We have recently developed a DNA metabarcoding method for 15 mammalian species and six poultry species, which are quite frequently contained in European foodstuff [11]. In order to detect both mammalian and poultry species, a primer pair for mammals and a primer pair for poultry species was combined in a duplex PCR assay. A ~120 bp fragment of the mitochondrial 16S ribosomal DNA gene serves as barcode region. The DNA metabarcoding method has been validated with regard to specificity, repeatability, robustness, accuracy, and limit of detection (LOD) [11]. In-house validation data showed that the DNA metabarcoding method can be used for routine applications. Meat species can be identified down to a concentration of 0.1%. Very recently, the applicability of the DNA metabarcoding method for routine analysis was further investigated by the analysis of a total of 104 samples (25 reference samples, 56 food products, and 23 pet food products). Results obtained by DNA metabarcoding were in line with those obtained by real-time PCR and/or a commercial DNA array [12]. However, interlaboratory evaluation of novel methods is a prerequisite for standardization and harmonization.
In this study, we summarized the results of an interlaboratory ring trial for the DNA metabarcoding method, initiated by the §64 German Food and Feed Code (LFGB) working group “NGS Species Identification”, chaired by the Federal Office of Consumer Protection and Food Safety (BVL) in Germany. One goal of the working group is the validation and standardization of (screening) methods for the identification and differentiation of animal species based on next generation amplicon sequencing for food authentication. The interlaboratory ring trial was coordinated by the Austrian Agency for Health and Food Safety (AGES) in 2020 and involved 15 participating laboratories. The aim was to evaluate the performance (e.g., repeatability, reproducibility, accuracy) of the DNA metabarcoding method in detail.

2. Materials and Methods

2.1. Participating Laboratories

The interlaboratory ring trial was organized by the AGES on behalf of the BVL. The following laboratories participated in the ring trial (in alphabetical order): Bavarian State Office for Food Safety and Health (LGL), Oberschleißheim, Germany; Chemical and Veterinary Analytical Institute Muensterland-Emscher-Lippe (CVUA-MEL), Muenster, Germany; Chemical and Veterinary Investigation Office Freiburg (CVUA-FR), Freiburg, Germany; Chemical and Veterinary Investigation Office Karlsruhe (CVUA-KA), Karlsruhe, Germany; Eurofins Genomics Europe Applied Genomics GmbH, Ebersberg, Germany; StarSEQ GmbH, Mainz, Germany; Labor Kneissler GmbH and Co. KG, Burglengenfeld, Germany; Saxony-Anhalt State Office for Consumer Protection (LAV S-A), Halle, Germany; State Office Laboratory Hessen (LHL), Kassel, Germany; Max Rubner-Institut (MRI)/National Reference Centre for Authentic Food (NRZ-Authent), Kulmbach, Germany; Max Planck Institute for Plant Breeding Research (MPIPZ), Köln, Germany; Lower Saxony State Office for Consumer Protection and Food Safety (LAVES), Niedersachsen, Germany; AGES, Vienna, Austria; PLANTON Laboratory for Analysis and Biotechnology GmbH, Kiel, Germany; SGS Institute Fresenius GmbH, Freiburg, Germany.

2.2. Samples

In the course of the interlaboratory ring trial, eight samples had to be analyzed: six DNA extract mixtures (samples 1–6), one DNA extract from a model sausage (sample 7), and one DNA extract from maize (sample 8), serving as a negative control (Table 1).
In total, seven animal species, including five mammalian species (Sus scrofa domesticus (pig), Bos taurus (cattle), Equus caballus (horse), Ovis gmelini aries (sheep), and Capra aegagrus hircus (goat)), and two poultry species (Gallus gallus domesticus (chicken) and Meleagris gallopavo (turkey)) were covered by the samples. All samples originated from muscle meat and were purchased from local meat suppliers.
Sample 1 contained DNA from seven animal species: DNA from pig as major component (94%, v/v), and DNA from six animal species (cattle, horse, sheep, goat, chicken, turkey; 1% (v/v) each). Samples 2–6 consisted of DNA from five animal species in varying proportions, ranging from 0.1% (v/v) to 67.5% (v/v). All DNA extract mixtures were prepared at the AGES.
The model sausage was produced according to the Codex Alimentarius Austriacus by the Higher Technical College for Food Technology Hollabrunn (Hollabrunn, Austria). The model sausage consisted of 50% (w/w) beef, 40% (w/w) pig, 5% (w/w) chicken, and 5% (w/w) turkey.

2.3. Genomic DNA Extraction

Extraction of genomic DNA from muscle meat of the seven animal species (pig, beef, horse, sheep, goat, chicken, turkey) was carried out at the CVUA-FR by applying the official method L 00.00–119 [13]. Identity of the animal species was verified by subjecting DNA extracts to Sanger sequencing and matching a ~464 base pair (bp) fragment of the mitochondrial cytochrome b gene against public databases provided by the National Center for Biotechnology Information (NCBI, Bethesda, MD, USA) [14,15]. After verification by Sanger sequencing and isolating genomic DNA fourfold, individual DNA extracts were combined. Total DNA of the (combined) DNA extracts was quantified by spectroscopy employing a UV/VIS spectrophotometer, adjusted to a DNA concentration of 20 ng/µL and sent to AGES. Isolation of DNA from the homogenized model sausage was performed at AGES [13].
The copy number of the mitochondrial 16S ribosomal DNA gene in the extracts from the respective animal species was determined by droplet digital PCR (ddPCR, QX200 Droplet Generator, QX200 Droplet Reader (Bio-Rad, Hercules, CA, USA)) using the EvaGreen Supermix. DNA extract mixtures were prepared at the AGES, by taking the copy numbers (pig (870 copies/µL), beef (1069 copies/µL), horse (1795 copies/µL), sheep (520 copies/µL), goat (790 copies/µL), chicken (620 copies/µL), turkey (673 copies/µL)), into account. The percentages of samples 1 to 6 given in Table 1 were calculated by relating the DNA copy number of the respective animal species to the total number of copies of animal species in the sample.

2.4. Study Design

The interlaboratory ring trial for validation of the DNA metabarcoding method for mammalian and poultry species [11] was conducted in the framework of the §64 LFGB working group “NGS Species Identification” under the coordination of AGES. Statistical data analysis was performed by QuoData GmbH (Dresden, Germany).
For sequencing, three benchtop NGS instruments from two companies were used. Benchtop instruments from Illumina (Illumina, San Diego, CA, USA) were employed by eleven laboratories, whereof eight used the MiSeq instrument, three the iSeq 100 instrument, and one participant used both the MiSeq and the iSeq 100 instrument. Four laboratories applied the Ion GeneStudio S5 instrument from Thermo Fisher Scientific (Thermo Fisher Scientific, Waltham, MA, USA).
Each participant obtained 16 anonymously labelled samples, comprising two subsamples of each of the eight samples (Table 1). Sixteen samples were chosen to allow the iSeq 100 platform to be included in the ring trial. This also enabled the use of the most cost-effective MiSeq Reagent Micro Kit v2 for a small number of samples on the Illumina platforms. Participants directly used all individual DNA extracts for DNA library preparation and subsequently for amplicon sequencing on a next-generation sequencing instrument. Together with the “ready-to-use” DNA extracts, the participants obtained reagents for creating DNA libraries, a sequencing kit, and a step-by-step instruction.
In order to be able to perform sequencing on the Ion GeneStudio S5 instrument (Thermo Fisher Scientific, Waltham, MA, USA), the protocol for preparation of DNA libraries and sequencing published previously by Dobrovolny et al. (2019) [11] had to be adapted as follows. Each of the two forward primers were elongated by a 3 bp barcode adapter, one of 16 different 10 bp barcodes (index sequence) and the overhang adapter sequence (A adapter). Each of the two reverse primers was linked to an overhang adapter sequence (trP1 adapter). The PCR setup did not include additional magnesium chloride solution. In the magnetic bead cleaning step, a total of 37.5 µL Agencourt® AMPure® XP beads (Beckman Coulter, Brea, CA, USA) was used and the DNA was eluted with 50 µL Tris-EDTA (TE) buffer. The average library size was 190 bp, and all DNA libraries were adjusted to 100 pM and were mixed together in a single 1.5 mL tube. A 25 pM DNA pool was used for sequencing. In general, any deviations from the protocols had to be reported by the participants.
Paired-end sequencing on an Illumina instrument was performed using either the MiSeq Reagent Micro Kit v2 (300-cycles) or the iSeq 100 i1 Reagent v2 (300-cycles), which included a 5% PhiX spike-in. The Ion Chef instrument was used with the Ion 510TM and Ion 520TM and Ion 530TM Kit-Chef and the Ion 520TM Chip Kit to perform template preparation, enrichment and chip loading. Finally, the sequencing reaction was started on the Ion GeneStudio S5 instrument.
To obtain information about the presence of the animal species in the samples, the sequencing output in FastQ format was processed with a multi-step analysis pipeline by using Galaxy (version 19.01) as described previously [12]. Before the resulting FastQ files were used as input for the data analysis, the raw binary base call (bcl) files generated by Illumina devices were converted to text files using the conversion software bcl2fastq2-v2.19.0.316 (Illumina, San Diego, CA, USA). The default demultiplexing option of one allowed mismatch in the barcode recognition of the Illumina software (-- barcode mismatches) was thereby set to zero (value: 0) and the step was also integrated into the pipeline. Preliminary tests had shown that this can increase the quality of index recognition. The Thermo Fisher instrument software uses this setting by default. The analysis pipeline for sequencing data of the Thermo Fisher Scientific (Waltham, MA, USA) platforms was modified because paired-end FastQ files do not exist in this case. Consequently, the primer sequences were adapted according to the requirements of the analysis tool Cutadapt (Galaxy version 1.16.6 [16]) and the tool fastq-join (Galaxy version 1.1.2-806.1) [17] was removed. Dereplicated reads were directly matched against a customized database (AGES database) including 51 mitochondrial genomes from animals (Supplementary Table S1) and the public databases provided by NCBI using BLASTn [18]. The AGES database contains verified sequences from the NCBI database exclusively from food-relevant animal species. This reduces the time needed for alignment and is intended to avoid nonsense results. For each of the samples, results were listed automatically in a table according to taxonomy and abundance and a formula calculated the proportions of animal species by relating the number of reads for the respective species to the number of total reads (after pipeline) across all animal species obtained for the subsample. For further statistical analysis, all Excel spreadsheets were sent to QuoData GmbH.

2.5. Statistical Data Analysis

Statistical analyses were performed by QuoData GmbH. Even though the DNA metabarcoding method used in this interlaboratory comparison is commonly regarded as a qualitative method, the underlying decision process is based on the comparison of a quantitative value, namely the proportion of a single species, with a specific threshold. The performance of such a method can be assessed both on the basis of the qualitative result (yes/no) and on the basis of the underlying quantitative data. Because the information content of the quantitative data can be far greater than the corresponding qualitative data, the quantitative data were used in addition to the qualitative data to describe the performance of the DNA metabarcoding method.
In addition, the study of quantitative data also aimed to verify the extent to which this method can also be used for quantitative determinations.

2.5.1. Quantitative Statistical Analyses

Proportions of animal species ranged between 0.1% and 94%. To avoid asymmetric distributions for proportions near 0% and 100%, and to ensure equality of variances for the individual combinations of samples/animal species, the proportions were subjected to a logit transformation:
logit ( proportion ) = ln   ( proportion 1 - proportion )
The logit-transformed proportions can be retransformed as follows:
proportion = e logit ( proportion ) 1 + e logit ( proportion )
The logit-transformed proportions were then subjected to several outlier tests. Data were checked for systematic errors across samples and/or animal species affecting the mean values (Mandel h statistics) and/or variances (Mandels k statistics). In addition, the occurrence of sample- and animal species-specific outliers regarding the laboratory mean values and variances was tested for by means of the Grubbs and the Cochran tests (significance level 1%), respectively. Proportions identified as outliers were excluded from further statistical analyses.
Logit-transformed and outlier-cleaned data were checked for normal distribution using the Shapiro–Wilk test. Then, repeatability, reproducibility, and accuracy of the proportions of animal species were determined according to the criteria of QuoData certified with the aid of the software solution for method comparison studies and interlaboratory comparison studies PROLab Plus, version 2021.7.22.0 [19] (QuoData, Dresden, Germany), using the statistical methods according to DIN ISO 5725-2 and according to the specifications in the Official Collection of Test Methods ASU §64 LFGB for the statistical evaluation of ring trials for method validation [20]. Taking into account the obtained repeatability and reproducibility standard deviations for samples 1–7, variance functions describing the functional relationship between standard deviations and overall mean for the individual combinations of samples/animal species were modelled.
For each of the combinations, the bias (difference) between this overall mean and the proportion of the animal species added to the sample was also determined. Furthermore, the standard deviation of this bias was calculated.
Prediction profiles and measurement uncertainty profiles were constructed, both not considering the bias (based on reproducibility standard deviations), and considering the bias (based on reproducibility standard deviations as well as on the standard deviation of the bias).
In addition, the z scores for each combination of lab/sample/animal species were determined, providing a measure for the standardized deviations of laboratory mean values from the respective overall mean value.

2.5.2. Qualitative Statistical Analyses

A sample was classified as false positive if for at least one animal species that had not been added, a proportion above a defined threshold was obtained. By contrast, a sample was considered false negative for a specific animal species if the proportion was below a defined threshold for this species.
The probability of detection for an arbitrary animal at a defined threshold for (1) a laboratory with average performance, (2) a laboratory with positive bias, and (3) a laboratory with negative bias was determined based on the variance functions by the quantitative statistical analysis.

3. Results and Discussion

Fourteen of the fifteen laboratories submitted their sequencing results in time. Ten laboratories applied the Illumina platform, with seven laboratories (01, 02, 03, 04, 06, 08, 14) using the MiSeq, two laboratories (07, 13) the iSeq 100, and one laboratory utilizing both the MiSeq and the iSeq 100 (referred to as “laboratory 15” and “laboratory 20”, respectively). The remaining four laboratories (09, 10, 11, 12) applied the Ion GeneStudio S5 system from Thermo Fisher Scientific.
Each of the laboratories submitted 16 sequencing results in total (eight samples, two subsamples each), with the exception of laboratories 07, 12, and 13. Laboratory 07 did not provide the result for subsample 8B (negative control), whereas datasets submitted by laboratories 12 and 13 were lacking results for both subsamples of sample 8. With the exception of laboratory 03, sequencing was done by using the test kit provided by the AGES. The suitability of the MiSeq Reagent Kit v2 applied by laboratory 03 had been demonstrated in preliminary experiments.
FastQ data provided by the participating laboratories was evaluated by the AGES by using the analysis pipeline in Galaxy. For identification of animal species, the DNA sequences (reads) were aligned, once with the customized AGES database and once with the NCBI database. Supplementary Table S2 summarizes the total number of reads for each laboratory, taking into account the results obtained for each of the fourteen subsamples containing animal species (samples 1–7).
Total numbers of raw reads that passed the analysis pipeline were quite different between laboratories. Very low total numbers of raw reads can, for example, be caused by errors during wet-lab activities, e.g., pipetting errors or error rate of DNA polymerase. In addition, problems with adapter- and index-recognition are known to have an impact. Loss of reads or their elimination by pipeline tools can also be caused by errors in PCR amplification of the library (e.g., index hopping), sequencing errors (e.g., inserts, substitutions, or deletions) or insufficient cluster resolution [21]. All these errors may affect the quality of raw data (FastQ file) and thus the number of DNA sequences (reads) after analysis pipeline.
Total numbers of reads obtained with the Ion GeneStudio S5 were significantly higher than those obtained with the MiSeq (p < 0.001) and the iSeq 100 (p < 0.001). Differences observed between the Illumina and the Thermo Fisher technology are caused by differences in data filtering. The instrument-specific software of the Ion GeneStudio S5 removes datasets of lower quality by filtering before starting the analysis pipeline. Thus, considerably more sequences remain after primary data analysis compared to the instruments from Illumina.
Differences in recoveries, by relating the total number of reads to the number of raw reads before analysis pipeline, between laboratories using the same instrument type (MiSeq, iSeq 100, or Ion GeneStudio S5, Table S3) hint at differences in the quality of the sequencing run and unintended loss of reads. Significant differences (p < 0.001) in recoveries between laboratories using Illumina instruments and those applying the Ion GeneStudio S5 were, however, expected. These differences are caused by the fact that the pipeline of the Ion GeneStudio S5 neither included paired-end sequencing nor a “joining step” as was the case with the Illumina platforms.

3.1. Quantitative Evaluation of Ring Trial Data

The aim of quantitative evaluation of ring trial data was to determine average proportions of the animal species that had been added to samples 1–7, and to identify resulting error components within and in between laboratories.

3.1.1. Proportions of Animal Species in Samples 1–7

Proportions of animal species were calculated by relating the number of reads for the respective species to the number of total reads (after pipeline) across all animal species obtained for the sample. Table 2 gives the proportions of animal species determined for sample 1 containing seven animal species (Table 2a), sample 2 (as a representative of samples consisting of five animal species; Table 2b) and sample 7, a model food sample (Table 2c). Results for samples 3, 4, 5, and 6 are shown as stacked bar plots (Figure 1).
Preliminary evaluation of the results indicated that the proportions of animal species determined considerably depended on the sequencing platform/technology applied. Due to the low number of laboratories using the sequencing technology from Thermo Fisher Scientific, only results obtained by laboratories applying Illumina platforms were included into statistical evaluation. Results obtained by laboratories 09–12 using the Ion GeneStudio S5 are only shown for comparison.

3.1.2. Logit Transformation

Proportions of animal species in samples 1–7 were quite different, ranging from 0.1% to 94%. However, a prerequisite for the evaluation of ring trial data according to ASU §64 LFGB is that the proportions of animal species follow normal or at least symmetric distribution. In order to allow assumption of normal distribution and ensure equality of variances of the individual combinations of samples/animal species (after elimination of outliers), proportions of animal species were subjected to a logit transformation. The logit is the logarithm of the proportion of the animal species divided by 1 minus the proportion of the animal species. Proportions of, e.g., 0.1%, 1%, 10%, 30%, 50%, and 70%, resulted in logit values of −6.91, −4.60, −2.20, −0.85, 0, and 0.85, respectively. Since the logit for 0% and 100% is not defined, it was set to surrogate values of −15 and +15, respectively.

3.1.3. Outlier Tests

In the course of evaluating ring trial data according to ASU §64 LFGB, logit-transformed proportions of animal species were subjected to several outlier tests (see also Section 2.5.1). Table 3 summarizes the outliers and reasons for their elimination.
In case an outlier was only identified for one database (either AGES or NCBI database), it was, however, eliminated for both databases to ensure data comparability. In total, 15 of 396 (3.8%) combinations of laboratory/sample/animal species were identified as outliers and excluded from further evaluation for each of the databases.

3.1.4. Distribution of Sample-Specific Proportions of Animal Species

After outlier elimination, sample-specific logit-transformed proportions of animal species were tested for normal distribution by Kernel density estimation and the Shapiro–Wilk test.
A small number of cases were found to have a bimodal distribution. However, the Shapiro–Wilk test did not show evidence for non-normality for any of the combinations of samples/animal species. Thus, logit-transformed proportions of animal species could be subjected to further statistical evaluation.

3.1.5. Statistical Parameters According to ASU §64 LFGB

Logit-transformed and outlier-cleaned data were normally distributed and thus could be subjected to statistical evaluation according to ASU §64 LFGB. Table 4 gives—for each animal species and both databases (AGES and NCBI)—main statistical parameters, including re-transformed mean value, reproducibility standard deviation (sR), and repeatability standard deviation (sr). For the sake of completeness, the logit-transformed parameters are shown as well. The reproducibility standard deviation characterizes the variability of results between laboratories, and the repeatability standard deviation the variability within a laboratory under constant conditions, i.e., the variability of results obtained for the two subsamples of the same sample.

3.1.6. Dependence of Bias, Reproducibility Standard Deviation, and Repeatability Standard Deviation on the Mean Proportion of Animal Species

Next, it was evaluated whether the bias between the proportion of the animal species added to the sample and the overall mean determined in the ring trial, as well as whether reproducibility standard deviation and repeatability standard deviation depended on the proportion of the respective animal species (Figure 2A) and/or the predominant animal species in the sample (Figure 2B). Evaluation was based on the statistical parameters for logit-transformed proportions of animal species.
As expected, the reproducibility standard deviation and repeatability standard deviation were found to be higher for lower proportions of animal species than for proportions of about 50% (logit = 0), independent of the database selected for alignment. It was found that pig tends to have higher standard deviations in reproducibility, but not in repeatability, compared to other animal species (Figure 2A). A tendency towards higher reproducibility standard deviations was also observed in case beef was the predominant animal species in the sample. This also held true for the repeatability standard deviation, although to a lower extent.
The bias between the proportion of the animal species added to the sample and the overall mean determined in the ring trial was in the range from −0.6 to 0.6 logits, with just two exceptions. Neither the animal species nor the proportion of the animal species was found to have a systematic effect on the bias. The animal species and the proportion of the animal species did not have a systematic impact on the reproducibility standard deviation either.
Thus, the standard deviation induced by the bias (“bias standard deviation”), absolute reproducibility standard deviation, and absolute repeatability standard deviation could be modeled across animal species and samples, for both the AGES and NCBI databases. The modeled variance function was similar for both databases. The lowest bias standard deviation, reproducibility standard deviation, and repeatability standard deviation were found for a proportion of 50% (logit = 0). The closer the proportion to 0% or 100%, the higher the standard deviations. Table 5 summarizes the modeled and re-transformed standard deviations, which were found to be independent of the database used for alignment.

3.1.7. Variability across Animal Species and Measuring Uncertainty

To allow predictions for further analyses, the measuring uncertainty was evaluated. Since the database (AGES or NCBI) was not found to have an impact on bias standard deviation, reproducibility standard deviation, or repeatability standard deviation, the measuring uncertainty was only evaluated for the AGES database, representative for both databases.
Based on the reproducibility standard deviation, a prediction profile was established in terms of the 95% confidence interval of the results of all laboratories across all animal species. In addition, the 95% confidence interval was established by considering both the reproducibility standard deviation and the bias standard deviation.
The upper part of Figure 3 shows the 95% confidence interval of the (outlier-cleaned) results depending on the respective overall mean value of the proportion of an animal species (without considering the bias). The left side of the figure shows the entire range, and the right side an enlarged view of proportions from 0 to 10%. The figure indicates that for a “true proportion” (assuming that the overall mean across laboratories applying Illumina platforms reflects the “true proportion”) of, e.g., 5%, the 95% confidence interval is 4.1–6.2%. In total, 4.7% of the individual values are outside the 95% confidence interval.
The 95% confidence interval of the (outlier-cleaned) results of all laboratories, depending on the respective proportion added (by considering the bias) is shown in the lower part of Figure 3. The left and right sides show the entire range and an enlarged view of proportions from 0 to 10%, respectively. For example, for an added proportion of 5%, the 95% confidence interval is 2.5–9.9%. In total, 3.7% of the individual values are outside the 95% confidence interval.
From the prediction profiles, a measurement uncertainty profile was established, indicating how far the proportion determined may deviate from the “true“ proportion of the animal species. Figure 4 shows the 95% measurement uncertainty intervals depending on the proportion of the animal species determined, on the left side without consideration of the bias (assuming that the “true” proportion equals the overall mean across laboratories applying Illumina platforms), on the right side under consideration of the bias (assuming that the “true” proportion equals the proportion of animal species added to the sample).
For a 50% proportion of the animal species, the “true value” can be 3.6% (percentage points) lower or higher, if the bias is not taken into account, or even 15.2%, if the bias is taken into consideration.

3.1.8. z Scores

z scores were calculated for interlaboratory evaluation across samples and animal species (Figure 5). z scores measure standardized deviations of laboratory mean values from the overall mean value. An absolute z score > 2 hints at a statistically significant deviation of the respective laboratory.
LASSO regression was applied to check whether absolute z scores depended on the instrument (MiSeq, iSeq 100), NGS experience of the respective laboratory, and/or activation of the function ”adapter trimming“ before starting the sequencing run. Analyses were performed excluding data previously identified as outliers. NGS experience of the laboratory was found to significantly affect the z score. For laboratories more experienced in NGS (laboratories 01, 04, 08, 15, 20), lower absolute z scores were determined compared to those with lower NGS experience. By contrast, neither the instrument (MiSeq, iSeq 100) nor activation of the function “adapter trimming” before starting the sequencing run was found to significantly affect the z score.

3.2. Qualitative Evaluation of Ring Trial Data

3.2.1. False Positive Rate

Next, it was investigated whether animal species were identified that had not been added to the samples, and whether the false positive rate depended on the database selected for alignment. A sample was classified as positive if for at least one animal species that had not been added, a proportion above a defined threshold was obtained. The threshold was set to 0.1%, 0.5%, 1.0%, or 2.0% proportion of the animal species.
Table 6 indicates that alignment against the AGES database (Table 6A) resulted in less false positive reads compared to alignment against the NCBI database (Table 6B).
Alignment against the AGES database only yielded false positive samples at threshold values of 0.05% or 0.1% (Figure 6).
Higher false positive rates for the NCBI database were inevitably caused by the higher number of entries in the NCBI database compared to the AGES database. Most species resulting in false positive reads when using the NCBI database were not contained in the AGES database and thus could not be identified with the latter.
From the ring trial data it can be concluded that by using an appropriate customized database and by setting the threshold to 0.5%, false positive rates < 1% will be obtained.

3.2.2. False Negative Rate

Next, the false negative rate was evaluated at threshold values of 0.05%, 0.1%, 0.5%, and 1% for both the AGES and the NCBI databases (Table 7).
There was no considerable difference between the AGES and the NCBI databases regarding the proportion of false negative results obtained for samples 1–7. At a threshold of 0.05%, false negative results were only obtained for pig in sample 4. At a threshold of 0.1%, none of the combinations of sample–animal species led to false negative results, with the exception of proportions being close to the threshold (0.1%). The data indicate that at a threshold of ≥0.5%, the probability of obtaining false negative results is very low.

3.2.3. Probability of Detection (Qualitative Evaluation)

Figure 7 shows the probability of detection for three thresholds (0.1%, 0.5%, and 1%) and three scenarios, namely a laboratory with average performance, a laboratory with positive bias, and a laboratory with negative bias.
Figure 7 indicates that a threshold value of 1% seems to be a good compromise, provided that the variance function determined in the ring trial is equal to the actual variance function. A threshold value of 1% guarantees that a laboratory with a positive bias (overestimating the actual proportion) does not complain if the proportion of a certain animal species is 0.5%, whereas even a laboratory with a negative bias (underestimating the actual proportion) will be able to identify proportions >1.5% reliably.

3.3. Negative Control

Sample 8 was a DNA extract from maize, serving as a negative control. Since the marker system designed for mammals and poultry species does not detect maize, all reads that were obtained for sample 8 had to be regarded as false positive.
Table 8 lists the number of total reads (after pipeline) per laboratory and species. Laboratories 12 and 13 did not submit results for sample 8, laboratory 07 only provided results for one of the two subsamples.
In total, eight animal species were identified in the negative control by alignment against the AGES database. Fourteen further animal species, including the species Homo sapiens were identified, when the NCBI database was used. Per laboratory, up to five animal species were only identified with the NCBI database, with the exception of laboratory 06, which even detected eight additional animal species in subsample A.
In most cases, within a laboratory, the number of reads per animal species was similar for both subsamples. When the AGES database was used for alignment, most reads were assigned to beef and pig. Sample multiplexing in general, together with an inappropriate index layout, carries the risk of index misassignment. This is obviously the reason for the over-represented number of reads for pig and beef in the negative controls, as these animal species represent the main quantities in the samples. Although the index kit was used according to the manufacturer’s instructions, the number of reads of these animal species could be reduced to the expected level in a supplementary experiment with an alternative index layout. Alignment against the NCBI database also resulted in considerably high numbers of reads (laboratories 06, 07, and 14 > 300, laboratory 07 even > 1000) for Homo sapiens, which was not contained in the AGES database.
As mentioned above, the marker system applied does not detect maize. Thus, the high number of reads for maize obtained by laboratory 06 for one subsample seems to be caused by a random error.
In general, the total number of false reads obtained for both subsamples was similar within a laboratory. Larger differences between the total reads of subsamples was observed for laboratories 03 and 14 (AGES and NCBI databases) and laboratory 06 (only NCBI database).

4. Conclusions

In summary, evaluation of data from the interlaboratory ring trial indicates that the DNA metabarcoding method performed on an Illumina platform is applicable for determining the proportion of the seven animal species with the given precision. Furthermore, the applicability of the method for testing foodstuff was demonstrated by the correct identification of the ingredients of a model sausage, which also supports the results in our study published recently.
Based on the data of the ring trial, a threshold of 0.5% is suitable to reliably assess whether a certain animal species is contained in a sample. The DNA metabarcoding method turned out to be rather robust and is therefore suitable to be implemented in routine analysis in official food control laboratories. Even laboratories that did not have much experience in NGS were able to provide reliable results. We suggest strictly following the given protocol. The results of the interlaboratory ring trial indicate that even alternative test kits or various sequencing platforms might be applied. However, the impact of any deviations from the experimental conditions has obviously to be tested before implementation in routine analysis.
Correct index recognition is of particular importance for pooled DNA libraries. We recommend frequently changing the index kits or the use of longer index sequences to avoid false positive and/or false negative results.
For taxonomic assignment, we suggest applying a customized database, as the pipeline is completed significantly faster and no nonsense results from erroneous database entries occur. However, if unexpected read losses and non-identifiable reads occur, the additional use of the entire NCBI database or any other appropriate sequence database is recommended.
In order to increase interlaboratory comparability of results obtained by DNA metabarcoding methods, it would be necessary to establish a reference database with verified sequence entries of relevant species. Access to adequate reference material would also facilitate harmonization of the methods used.
In general, determination of the meat content (w/w) from the number of NGS reads or the determined target DNA concentration is a well-known difficulty, especially in the quantification of meat species in processed foods. The result is also influenced by the degree of processing of the sample present and by the type of animal ingredients used. Data from testing reference samples out of proficiency testing schemes confirm the limitations known for DNA quantification in meat products [12]. Quantitative results should therefore serve only as rough estimates for weight ratios of different species in food.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/foods11081108/s1, Supplementary Table S1: Sequences included into the reference database, Supplementary Table S2: Total number of reads after pipeline (n = 14, samples 1–7, two subsamples each). min: minimal value, max: maximal value, mean: arithmetic mean, RSD: relative standard deviation, Supplementary Table S3: Recovery (%) (total number of reads after pipeline related to the number of raw reads before analysis pipeline). (n = 14, samples 1–7, two subsamples each). min: minimal value, max: maximal value, mean: arithmetic mean, RSD: relative standard deviation.

Author Contributions

Conceptualization, R.H.; methodology, S.D. and R.H.; validation, S.U., K.F., A.S., K.N. and K.S.; writing—original draft preparation, M.C.-M., S.D. and R.H.; writing—review and editing, R.H., M.C.-M., S.D., S.U., K.F., A.S.; visualization, M.C.-M., K.F. and A.S.; supervision, R.H. All authors have read and agreed to the published version of the manuscript.

Funding

The ring trial was supported by the Federal Office of Consumer Protection and Food Safety, Germany (BVL). BVL funded all reagents for performing the tests and the statistical evaluation by QuoData GmbH.

Institutional Review Board Statement

Ethical review and approval was waived for this study as no live animals were used or slaughtered to achieve the aims of the study.

Data Availability Statement

The datasets generated during the current study are available from the corresponding authors upon reasonable request.

Acknowledgments

We thank the members of the working group “NGS Species identification” within the scope of the official method collection according to §64 of the German Food and Feed Code (LFGB) for their participation in the study.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ballin, N.Z.; Vogensen, F.K.; Karlsson, A.H. Species determination—Can we detect and quantify meat adulteration? Meat Sci. 2009, 83, 165–174. [Google Scholar] [CrossRef] [PubMed]
  2. Ballin, N.Z. Authentication of meat and meat products. Meat Sci. 2010, 86, 577–587. [Google Scholar] [CrossRef] [PubMed]
  3. Sajali, N.; Wong, S.C.; Abu Bakar, S.; Khairil Mokhtar, N.F.; Manaf, Y.N.; Yuswan, M.H.; Mohd Desa, M.N. Analytical approaches of meat authentication in food. Int. J. Food Sci. Technol. 2021, 56, 1535–1543. [Google Scholar] [CrossRef]
  4. Kumar, A.; Kumar, R.R.; Sharma, B.D.; Gokulakrishnan, P.; Mendiratta, S.K.; Sharma, D. Identification of species origin of meat and meat products on the DNA basis: A review. Crit. Rev. Food Sci. Nutr. 2015, 55, 1340–1351. [Google Scholar] [CrossRef] [PubMed]
  5. Köppel, R.; Ruf, J.; Zimmerli, F.; Breitenmoser, A. Multiplex real-time PCR for the detection and quantification of DNA from beef, pork, chicken and turkey. Eur. Food Res. Technol. 2008, 227, 1199–1203. [Google Scholar] [CrossRef]
  6. Köppel, R.; Ruf, J.; Rentsch, J. Multiplex real-time PCR for the detection and quantification of DNA from beef, pork, horse and sheep. Eur. Food Res. Technol. 2011, 232, 151–155. [Google Scholar] [CrossRef]
  7. Kaltenbrunner, M.; Hochegger, R.; Cichna-Markl, M. Tetraplex real-time PCR assay for the simultaneous identification and quantification of roe deer, red deer, fallow deer and sika deer for deer meat authentication. Food Chem. 2018, 269, 486–494. [Google Scholar] [CrossRef] [PubMed]
  8. Staats, M.; Arulandhu, A.J.; Gravendeel, B.; Holst-Jensen, A.; Scholtens, I.; Peelen, T.; Prins, T.W.; Kok, E. Advances in DNA metabarcoding for food and wildlife forensic species identification. Anal. Bioanal. Chem. 2016, 408, 4615–4630. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Nehal, N.; Choudhary, B.; Nagpure, A.; Gupta, R.K. DNA barcoding: A modern age tool for detection of adulteration in food. Crit. Rev. Biotechnol. 2021, 41, 767–791. [Google Scholar] [CrossRef] [PubMed]
  10. Galimberti, A.; Casiraghi, M.; Bruni, I.; Guzzetti, L.; Cortis, P.; Berterame, N.M.; Labra, M. From DNA barcoding to personalized nutrition: The evolution of food traceability. Curr. Opin. Food Sci. 2019, 28, 41–48. [Google Scholar] [CrossRef]
  11. Dobrovolny, S.; Blaschitz, M.; Weinmaier, T.; Pechatschek, J.; Cichna-Markl, M.; Indra, A.; Hufnagl, P.; Hochegger, R. Development of a DNA metabarcoding method for the identification of fifteen mammalian and six poultry species in food. Food Chem. 2019, 272, 354–361. [Google Scholar] [CrossRef] [PubMed]
  12. Preckel, L.; Brünen-Nieweler, C.; Denay, G.; Petersen, H.; Cichna-Markl, M.; Dobrovolny, S.; Hochegger, R. Identification of mammalian and poultry species in food and pet food samples using 16s rdna metabarcoding. Foods 2021, 10, 2875. [Google Scholar] [CrossRef] [PubMed]
  13. ISO 21571 I; Foodstuffs—Methods of Analysis for the Detection of Genetically Modified Organisms and Derived Products—Nucleic Acid Extraction. ISO: Geneva, Switzerland, 2005.
  14. Meyer, R.; Höfelein, C.; Lüthy, J.; Candrian, U. Polymerase chain reaction-restriction fragment length polymorphism analysis: A simple method for species identification in food. J. AOAC Int. 1995, 78, 1542–1551. [Google Scholar] [CrossRef] [PubMed]
  15. Palumbi, S.R.; Martin, A.; Romano, S.; McMillan, W.O.; Stice, L.; Grabowski, G. The Simple Fool’s Guide to PCR; Deptartment of Zoology and Kewalo Marine Laboratory, University of Hawaii: Honolulu, HI, USA, 2002. [Google Scholar]
  16. Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011, 17, 10. [Google Scholar] [CrossRef]
  17. Aronesty, E. Comparison of sequencing utility programs. Open Bioinform. J. 2013, 7, 1–8. [Google Scholar] [CrossRef]
  18. Edgar, R.C. Search and clustering orders of magnitude faster than BLAST. Bioinformatics 2010, 26, 2460–2461. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  19. Quodata. PROLab Plus Software for Planning, Organizing, Performing and Analyzing Interlaboratory Studies. Available online: www.quodata.de (accessed on 10 December 2021).
  20. Official Collection of Test Methods A. Statistik—Planung und Statistische Auswertung von Ringversuchen zur Methodenvalidierung; Federal Office of Consumer Protection and Food Safety (BVL), Beuth Verlag: Berlin, Germany, 2006.
  21. MacConaill, L.E.; Burns, R.T.; Nag, A.; Coleman, H.A.; Slevin, M.K.; Giorda, K.; Light, M.; Lai, K.; Jarosz, M.; McNeill, M.S.; et al. Unique, dual-indexed sequencing adapters with UMIs effectively eliminate index cross-talk and significantly improve sensitivity of massively parallel sequencing. BMC Genom. 2018, 19, 30. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Proportions of animal species determined for samples 3, 4, 5, and 6. (A) AGES database, (B) NCBI database. Laboratories 01–06, 08, 14, 15: MiSeq; laboratories 07, 13, 20: iSeq 100; laboratories 09–12: Ion GeneStudio S5.
Figure 1. Proportions of animal species determined for samples 3, 4, 5, and 6. (A) AGES database, (B) NCBI database. Laboratories 01–06, 08, 14, 15: MiSeq; laboratories 07, 13, 20: iSeq 100; laboratories 09–12: Ion GeneStudio S5.
Foods 11 01108 g001
Figure 2. Bias, reproducibility standard deviation, and repeatability standard deviation depending on the overall mean determined in the ring trial based on logit-transformed proportions of animal species. Colors refer to (A) the respective animal species and (B) the predominant animal species in the respective sample.
Figure 2. Bias, reproducibility standard deviation, and repeatability standard deviation depending on the overall mean determined in the ring trial based on logit-transformed proportions of animal species. Colors refer to (A) the respective animal species and (B) the predominant animal species in the respective sample.
Foods 11 01108 g002
Figure 3. 95% confidence interval for the mean (top) and added (bottom) proportion of animal species based on a single measurement, independent of the animal species.
Figure 3. 95% confidence interval for the mean (top) and added (bottom) proportion of animal species based on a single measurement, independent of the animal species.
Foods 11 01108 g003
Figure 4. 95% measuring uncertainty interval for the proportion of the animal species determined, based on a single measurement, independent of the animal species.
Figure 4. 95% measuring uncertainty interval for the proportion of the animal species determined, based on a single measurement, independent of the animal species.
Foods 11 01108 g004
Figure 5. z scores for determination of proportions of animal species. (A) AGES database, (B) NCBI database. Absolute z scores < 2 are shown in blue, absolute z scores > 2 in red.
Figure 5. z scores for determination of proportions of animal species. (A) AGES database, (B) NCBI database. Absolute z scores < 2 are shown in blue, absolute z scores > 2 in red.
Foods 11 01108 g005
Figure 6. False positive rates by sample, depending on the defined threshold for the AGES (left) and the NCBI (right) database.
Figure 6. False positive rates by sample, depending on the defined threshold for the AGES (left) and the NCBI (right) database.
Foods 11 01108 g006
Figure 7. Probability of detection for a laboratory with average performance, a laboratory with positive bias, and a laboratory with negative bias, depending on the threshold and the database.
Figure 7. Probability of detection for a laboratory with average performance, a laboratory with positive bias, and a laboratory with negative bias, depending on the threshold and the database.
Foods 11 01108 g007
Table 1. Sample composition. Samples 1–6: DNA extract mixtures; percentage refers to DNA (v/v). Sample 7: extract from a model sausage; percentage refers to meat content (w/w). Sample 8: DNA extract from maize (negative control). - indicates that the species is not in the DNA extract mixture/sample.
Table 1. Sample composition. Samples 1–6: DNA extract mixtures; percentage refers to DNA (v/v). Sample 7: extract from a model sausage; percentage refers to meat content (w/w). Sample 8: DNA extract from maize (negative control). - indicates that the species is not in the DNA extract mixture/sample.
SampleChickenHorseTurkeyBeefSheepPigGoat
Percentage (%)
111111941
2-1.90.565.71.930-
31.966.11.9-0.5-30
4-0.5-3067.50.11.9
567.5--1.9300.50.1
60.13067.50.5-1.9-
75-550-40-
8-------
Table 2. Proportion of animal species determined for samples 1 (a), 2 (b), and 7 (c). A and B refer to subsamples A and B, respectively.
Table 2. Proportion of animal species determined for samples 1 (a), 2 (b), and 7 (c). A and B refer to subsamples A and B, respectively.
(a). Sample 1
SpeciesSpiking
Level (%)
LaboratoryAGES DatabaseNCBI DatabaseLaboratoryAGES DatabaseNCBI Database
ABABABAB
Pig940191.2591.7791.2191.741095.5894.8395.5694.81
Chicken11.421.371.421.370.731.080.731.08
Horse11.421.341.421.330.600.670.600.66
Turkey11.171.031.161.020.490.660.410.56
Beef11.551.511.551.510.570.580.570.58
Sheep11.651.571.621.561.051.151.051.15
Goat11.541.421.561.440.971.030.981.04
Pig940290.3190.3390.2990.301194.6294.9094.5994.87
Chicken11.501.571.501.570.470.300.470.30
Horse11.641.591.641.591.011.091.001.09
Turkey11.321.411.311.410.540.430.510.40
Beef11.771.671.771.671.021.031.021.03
Sheep11.811.731.791.721.191.161.191.16
Goat11.661.691.681.711.151.101.161.11
Pig940392.6792.4892.6592.461294.3094.6894.2794.65
Chicken11.041.111.041.110.560.360.570.36
Horse11.041.141.041.141.101.131.101.13
Turkey10.900.900.890.890.660.480.640.47
Beef11.601.551.601.550.981.030.981.04
Sheep11.441.481.431.461.241.191.241.19
Goat11.311.341.321.351.161.131.161.14
Pig940492.0892.2892.0692.251392.0191.8091.9891.77
Chicken11.161.211.161.211.271.331.271.33
Horse11.221.141.221.141.351.381.351.38
Turkey11.091.001.090.991.221.171.221.16
Beef11.561.561.561.561.451.521.451.52
Sheep11.451.491.451.481.381.371.381.37
Goat11.431.331.441.341.321.431.341.44
Pig940690.3690.4190.3390.381490.4493.4890.4193.46
Chicken11.511.531.511.531.570.971.570.97
Horse11.521.561.511.561.320.921.320.92
Turkey11.361.331.361.321.140.821.130.82
Beef11.871.841.871.842.171.502.171.50
Sheep11.721.671.711.661.811.241.801.24
Goat11.661.661.681.671.561.061.581.07
Pig940792.4892.1992.4792.171592.2292.2392.1892.20
Chicken11.081.171.081.171.191.181.191.18
Horse11.211.271.211.271.141.211.141.21
Turkey11.031.101.021.100.980.960.980.96
Beef11.511.501.511.501.601.551.601.55
Sheep11.361.381.371.381.481.451.471.44
Goat11.321.391.321.401.391.411.411.42
Pig940891.3191.2891.2891.252092.5992.6392.5892.61
Chicken11.341.401.341.401.031.091.031.09
Horse11.401.321.401.321.241.281.241.28
Turkey11.281.231.271.231.111.031.111.02
Beef11.681.621.681.621.441.391.441.39
Sheep11.531.591.521.571.311.301.311.29
Goat11.461.561.481.581.271.281.281.29
Pig940994.6293.6194.6093.59
Chicken10.641.070.641.08
Horse11.061.191.061.19
Turkey10.631.100.611.07
Beef10.920.900.920.90
Sheep11.101.051.101.05
Goat1 1.041.071.051.08
(b). Sample 2
SpeciesSpiking
Level (%)
LaboratoryAGES DatabaseNCBI DatabaseLaboratoryAGES DatabaseNCBI Database
ABABABAB
Beef65.70164.5965.9164.4265.731053.2655.4353.1455.32
Pig3030.0429.3629.9329.2441.4440.0941.2539.91
Horse1.92.241.972.231.962.211.632.191.61
Sheep1.92.612.332.582.302.892.712.882.70
Turkey0.50.460.400.450.390.190.130.160.11
Beef65.70262.9963.1862.8363.011166.0165.7865.8965.68
Pig3031.3031.2431.2031.1229.5229.7529.4029.63
Horse1.92.492.442.482.432.032.002.021.98
Sheep1.92.602.562.572.532.162.272.152.26
Turkey0.50.610.540.600.530.280.200.260.18
Beef65.70371.6770.5571.4870.361266.2964.9766.2264.88
Pig3024.7525.5024.6825.4329.1330.2528.9830.08
Horse1.91.371.441.361.442.082.232.062.21
Sheep1.91.902.161.882.132.232.312.222.30
Turkey0.50.300.330.290.330.270.240.250.23
Beef65.70467.2267.5567.0567.391362.2161.7362.0061.52
Pig3028.2628.1028.1728.0132.3732.7532.2632.64
Horse1.91.901.761.891.752.522.552.512.54
Sheep1.92.182.162.172.142.312.312.302.30
Turkey0.50.400.370.390.370.560.620.550.62
Beef65.70666.4665.4566.2865.241470.2068.6569.9968.47
Pig3028.6329.3528.5429.2426.0727.1825.9927.11
Horse1.92.092.102.082.091.441.631.431.62
Sheep1.92.322.542.312.521.942.151.922.13
Turkey0.50.470.520.470.520.320.360.320.36
Beef65.70765.6466.9665.4666.771569.7767.6369.6167.48
Pig3029.7828.8929.6828.8026.4327.9826.3627.90
Horse1.91.931.731.921.721.491.771.481.76
Sheep1.92.172.042.172.031.962.201.932.16
Turkey0.50.470.370.470.370.310.370.310.37
Beef65.70868.9069.2668.7069.062068.0165.3367.8165.14
Pig3026.4526.1226.3626.0328.1030.1028.0230.01
Horse1.91.921.851.911.841.692.061.692.05
Sheep1.92.262.312.232.281.852.091.842.08
Turkey0.50.430.430.430.430.340.420.340.42
Beef65.70963.6463.1163.6463.11
Pig3031.5932.2131.3831.99
Horse1.92.262.212.242.18
Sheep1.92.262.252.242.23
Turkey0.50.240.220.230.21
(c). Sample 7
SpeciesSpiking
Level (%)
LaboratoryAGES DatabaseNCBI DatabaseLaboratoryAGES DatabaseNCBI Database
ABABABAB
Beef500145.7945.8045.7545.781034.3334.5834.3434.59
Pig4042.3142.7142.2542.6461.1660.6061.0360.47
Chicken57.567.357.557.353.323.463.323.45
Turkey54.314.084.284.051.161.330.951.13
Beef500248.5448.1848.5048.161145.2043.0345.1943.03
Pig4040.6840.4640.6240.4050.4145.7350.3045.63
Chicken56.716.776.706.762.526.592.526.59
Turkey53.994.263.974.251.814.601.704.35
Beef500352.1552.3052.0852.241245.8742.7245.9142.76
Pig4038.5038.6838.4538.6449.6544.8449.5044.72
Chicken56.105.876.095.862.587.142.587.13
Turkey53.243.133.203.101.845.241.785.08
Beef500449.3149.5949.2949.571344.1343.4644.0843.41
Pig4040.4740.1640.4240.1143.4644.0943.4044.03
Chicken56.236.216.226.207.627.447.617.43
Turkey53.703.773.693.764.634.854.624.84
Beef500650.0549.5250.0149.461450.9551.4650.9051.40
Pig4038.4039.3338.3539.2738.4938.7138.4438.66
Chicken57.427.127.417.116.756.226.746.22
Turkey54.084.004.073.993.783.593.763.58
Beef500747.4448.5247.3948.471549.5348.7849.4948.74
Pig4042.2241.8342.1841.7839.3839.8439.3439.80
Chicken56.215.896.205.887.037.247.027.24
Turkey54.093.724.083.724.024.134.004.11
Beef500851.8251.7351.7551.672046.9946.1546.9446.11
Pig4035.1636.2335.1136.1841.4842.0341.4542.00
Chicken58.617.828.597.816.847.186.847.18
Turkey54.384.184.374.164.664.614.654.60
Beef500944.5943.0444.7043.14
Pig4050.6551.2350.4551.03
Chicken52.873.472.863.46
Turkey51.872.221.802.15
Table 3. Summary of eliminated outliers.
Table 3. Summary of eliminated outliers.
SampleSpeciesLaboratoryReason
1All species (n = 7)14Excessive variance of results for both subsamples
3Sheep06Too high laboratory mean value
15Too high laboratory mean value
4Pig02Excessive variance of results for both subsamples
5Chicken08Too high laboratory mean value
13Too high laboratory mean value
Sheep08Too low laboratory mean value
13Too low laboratory mean value
6Chicken15Excessive variance of results for both subsamples
Table 4. Statistical parameters for relative proportions of animal species according to ASU §64 LFGB. AGES: AGES database, NCBI: NCBI database.
Table 4. Statistical parameters for relative proportions of animal species according to ASU §64 LFGB. AGES: AGES database, NCBI: NCBI database.
Sample
SpeciesParameter 11234567
AGESNCBIAGESNCBIAGESNCBIAGESNCBIAGESNCBIAGESNCBIAGESNCBI
ChickenNumber of labs1111 1111 111111111111
Number of labs after outlier elimination101011119910101111
Proportion (v/m)1%1.9%67.5%0.1%5%
Mean value1.27%1.26%2.12%2.11%55.49%55.34%0.15%0.15%6.89%6.88%
sR0.18%0.18%0.23%0.23%0.84%0.84%0.03%0.03%0.72%0.71%
sr0.04%0.04%0.08%0.08%0.57%0.57%0.01%0.01%0.24%0.24%
Logit proportion−4.60−4.60−3.94−3.940.730.73−6.91−6.91−2.94−2.94
Logit mean value−4.36−4.36−3.83−3.830.220.21−6.53−6.53−2.60−2.61
Logit sr0.140.140.110.110.030.030.210.210.110.11
Logit sr0.030.030.040.040.020.020.050.050.040.04
HorseNumber of labs1111111111111111 1111
Number of labs after outlier elimination10101111111111111111
Proportion (v/m)1%1.9%66.1%0.5%30%
Mean value1.31%1.31%1.89%1.89%60.3%60.0%0.47%0.47%42.0%41.9%
sR0.17%0.17%0.37%0.37%1.1%1.1%0.08%0.08%1.9%1.9%
sr0.05%0.05%0.14%0.14%0.3%0.3%0.04%0.04%0.7%0.7%
Logit proportion−4.60−4.60−3.94−3.940.670.67−5.29−5.29−0.85−0.85
Logit mean value−4.32−4.32−3.95−3.950.420.40−5.35−5.36−0.32−0.33
Logit sr0.130.130.200.200.050.050.180.170.080.08
Logit sr0.040.040.080.080.010.010.080.080.030.03
TurkeyNumber of labs111111111111 11111111
Number of labs after outlier elimination10101111111111111111
Proportion (v/m)1%0.5%1.9%67.5%5%
Mean value1.12%1.12%0.42%0.42%1.86%1.85%54.0%53.8%4.03%4.02%
sR0.16%0.16%0.09%0.09%0.19%0.19%2.1%2.0%0.46%0.46%
sr0.05%0.05%0.04%0.04%0.07%0.07%0.9%0.9%0.14%0.14%
Logit proportion−4.60−4.60−5.29−5.29−3.94−3.940.730.73−2.94−2.94
Logit mean value−4.48−4.48−5.47−5.48−3.97−3.970.160.15−3.17−3.17
Logit sr0.140.140.220.220.100.100.080.080.120.12
Logit sr0.050.050.100.100.040.040.040.040.040.04
BeefNumber of labs11111111 1111111111111111
Number of labs after outlier elimination101011111111111111111111
Proportion (v/m)1%65.7%30%1.9%0.5%50%
Mean value1.58%1.58%66.9%66.7%33.02%32.87%2.79%2.79%0.91%0.91%48.7%48.7%
sR0.13%0.13%2.8%2.8%0.55%0.56%0.20%0.20%0.09%0.09%2.6%2.6%
sr0.04%0.04%1.0%1.0%0.31%0.31%0.10%0.10%0.05%0.05%0.4%0.4%
Logit proportion−4.60−4.600.650.65−0.85−0.85−3.94−3.94−5.29−5.2900
Logit mean value−4.13−4.130.700.69−0.71−0.71−3.55−3.55−4.69−4.69−0.05−0.05
Logit sr0.080.080.130.130.020.030.070.070.100.100.110.10
Logit sr0.020.020.040.040.010.010.040.040.060.060.020.02
SheepNumber of labs11111111111111111111
Number of labs after outlier elimination1010111199111199
Proportion (v/m)1%1.9%0.5%67.5%30%
Mean value1.50%1.49%2.21%2.19%0.14%0.14%64.41%63.67%40.92%40.62%
sR0.15%0.14%0.22%0.22%0.01%0.01%0.65%0.60%0.62%0.65%
sr0.03%0.03%0.14%0.13%0.01%0.01%0.27%0.27%0.49%0.46%
Logit proportion−4.60−4.60−3.94−3.94−5.29−5.290.730.73−0.85−0.85
Logit mean value−4.18−4.19−3.79−3.80−6.58−6.600.590.56−0.37−0.38
Logit sr0.100.100.100.100.100.090.030.030.030.03
Logit sr0.020.020.060.060.080.080.010.010.020.02
PigNumber of labs11111111 1111111111111111
Number of labs after outlier elimination101011111010111111111111
Proportion (v/m)94%30%0.1%0.5%1.9%40%
Mean value91.77%91.74%28.5%28.4%0.04%0.04%0.61%0.61%2.90%2.90%40.2%40.1%
sR0.83%0.83%2.2%2.2%0.02%0.02%0.12%0.12%0.26%0.26%2.3%2.3%
sr0.16%0.16%0.7%0.7%0%0%0.04%0.04%0.13%0.13%0.4%0.4%
Logit proportion2.752.75−0.85−0.85−6.91−6.91−5.29−5.29−3.94−3.94−0.41−0.41
Logit mean value2.412.41−0.92−0.92−7.73−7.74−5.09−5.10−3.51−3.51−0.40−0.40
Logit sr0.110.110.110.110.410.410.190.190.090.090.100.10
Logit sr0.020.020.030.030.100.100.070.070.050.050.020.02
GoatNumber of labs1111 111111111111
Number of labs after outlier elimination1010111111111111
Proportion (v/m)1%30%1.9%0.1%
Mean value1.44%1.45%35.4%35.6%2.00%2.01%0.09%0.09%
sR0.14%0.14%1.0%1.0%022%0.23%0.02%0.02%
sr0.05%0.05%0.3%0.3%0.10%0.10%0.02%0.02%
Logit proportion−4.60−4.600.85−0.85−3.94−3.94−6.91−6.91
Logit mean value−4.23−4.22−0.60−0.59−3.89−3.89−7.05−6.99
Logit sr0.100.100.040.050.110.110.190.19
Logit sr0.040.040.010.010.050.050.190.17
1 sR: reproducibility standard deviation; sr: repeatability standard deviation; v: volume; m: mass.
Table 5. Bias standard deviation, reproducibility standard deviation, and repeatability standard deviation (absolute, i.e., retransformed to proportions of animal species) depending on the proportion of animal species.
Table 5. Bias standard deviation, reproducibility standard deviation, and repeatability standard deviation (absolute, i.e., retransformed to proportions of animal species) depending on the proportion of animal species.
Standard DeviationProportion of Animal Species
5%/95%50%
Absolute bias standard deviation1.8%7.2%
Absolute reproducibility standard deviation0.5%1.8%
Absolute repeatability standard deviation0.2%0.6%
Table 6. False positive reads obtained for samples 01–07. (A): AGES database, (B): NCBI database. Laboratories 01–06, 08, 14, 15: MiSeq; laboratories 07, 13, 20: iSeq 100.
Table 6. False positive reads obtained for samples 01–07. (A): AGES database, (B): NCBI database. Laboratories 01–06, 08, 14, 15: MiSeq; laboratories 07, 13, 20: iSeq 100.
(A)
SampleSubsampleLaboratory
0102030406070813141520
1A621-2-2-22-
B25-33- -14-
2A99412195786764144875
B438931121738744334121
3A493251412341779147359918864
B513694013561968544355795235
4A555123810669573905710
B383420641173780101006710
5A6512286173121561242717965
B957328621262661373963932
6A837211127480110331614164
B550710231446106501697127
7A4518842608996269250616239
B80717425961408386294492825
(B)
SampleSubsampleLaboratory
0102030406070813141520
1A104751224972406520487219
B120911645981245835295425
2A630715612743804242891465526637388
B4897831047753898388752540432682345
3A3877664561639497394209583355297242
B2417986331792606383229621365177127
4A21334002738148219081095221467011642428585
B158420432896171719411477218283713902585668
5A9878461963112712591177988531863924487
B1711117922611063136846812026281165547297
6A5015902354634796584556437525371261
B6275802619584853394518440612543200
7A250452496788322210426459244229175
B260950422819627366388519245192132
Table 7. Proportion of results below a thresholds of 0.05%, 0.1%, 0.5%, and 1%.
Table 7. Proportion of results below a thresholds of 0.05%, 0.1%, 0.5%, and 1%.
SampleSpeciesSpiking Level (%)Proportion of Results below a Threshold of
0.05%0.1%0.5%1%0.05%0.1%0.5%1%
AGES DatabaseNCBI Database
1Pig94--------
Chicken1---1/22 (5%)---1/22 (5%)
Horse1---1/22 (5%)---1/22 (5%)
Turkey1---5/22 (23%)---6/22 (27%)
Beef1--------
Sheep1--------
Goat1--------
2Beef65.7--------
Pig30.0--------
Horse1.9--------
Sheep1.9--------
Turkey0.5--17/22 (77%)22/22 (100%)--17/22 (77%)22/22 (100%)
3Horse66.1--------
Goat30.0--------
Chicken1.9--------
Turkey1.9--------
Sheep0.5--22%22 (100%)22/22 (100%)--22/22 (100%)22/22 (100%)
4Sheep67.5--------
Beef30.0--------
Goat1.9--------
Horse0.5--13/22 (59%)22/22 (100%)--14/22 (64%)22/22 (100%)
Pig0.114/22 (64%)20/22 (91%)22/22 (100%)22/22 (100%)14/22 (64%)20/22 (91%)22/22 (100%)22/22 (100%)
5Chicken67.5--------
Sheep30.0--------
Beef1.9--------
Pig0.5--4/22 (18%)22/22 (100%)--4/22 (18%)22/22 (100%)
Goat0.1-15/22 (68%)22/22 (100%)22/22 (100%)-12/22 (55%)22/22 (100%)22/22 (100%)
6Turkey67.5--------
Horse30.0--------
Pig1.9--------
Beef0.5---19/22 (86%)---19/22 (86%)
Chicken0.1--22/22 (100%)22/22 (100%)--22/22 (100%)22/22 (100%)
7Beef50.0--------
Pig40.0--------
Chicken5.0--------
Turkey5.0--------
Table 8. Reads obtained for sample 8 (negative control). (A): AGES database, (B): NCBI database.
Table 8. Reads obtained for sample 8 (negative control). (A): AGES database, (B): NCBI database.
(A)
SpeciesLaboratory
01020304060708141520
ABABABABABABABABABAB
Chicken--1328--4230498--2-1017721085
Horse--76-25751-2-----1--75
Turkey--511-65152371--32326615139
Beef10925222171176510197222-4351185148579911055
Pig-1132387113216114563-2121915638545026
Sheep51451792282743919-31113--910
Goat11-1--11112----1---1
Bison bonasus-------------1-2----
Total reads162510215027268261276328328807-6977529394103178197111
(B)
SpeciesLaboratory
01020304060708141520
ABABABABABABABABABAB
Chicken--1328--4230498--2-1017721085
Horse--76-25751-2-----1--75
Turkey--511-65150371--32316615139
Beef10925222171176510197222-4352187149579911155
Pig 1132399113226114564-2121915738545027
Sheep61450762280743919-3173--910
Goat11-1--11112----1---1
Bison bonasus---------------1----
Bos mutus---------1---1--1---
Brachypodium sylvaticum--------1-----------
Coregonus migratorius--------2-----------
Equus zebra--------------------
Eukaryotic synthetic---15----------------
Homo sapiens--584630188822153027841086-5124687451-15-49
Meleagris ocellata-------2------1-----
Oncorhynchus environmental--------11----------
Oncorhynchus mykiss--------3791----------
Ovis ammon-115--211-----4-----
Ovis vignei------11------------
Phascolosoma esculenta-----1--------------
Synthetic construct--------20-----------
Zea mays---2---38513--------2-
total reads172616021557469344496155312081894-742021218846104193200161
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Dobrovolny, S.; Uhlig, S.; Frost, K.; Schlierf, A.; Nichani, K.; Simon, K.; Cichna-Markl, M.; Hochegger, R. Interlaboratory Validation of a DNA Metabarcoding Assay for Mammalian and Poultry Species to Detect Food Adulteration. Foods 2022, 11, 1108. https://doi.org/10.3390/foods11081108

AMA Style

Dobrovolny S, Uhlig S, Frost K, Schlierf A, Nichani K, Simon K, Cichna-Markl M, Hochegger R. Interlaboratory Validation of a DNA Metabarcoding Assay for Mammalian and Poultry Species to Detect Food Adulteration. Foods. 2022; 11(8):1108. https://doi.org/10.3390/foods11081108

Chicago/Turabian Style

Dobrovolny, Stefanie, Steffen Uhlig, Kirstin Frost, Anja Schlierf, Kapil Nichani, Kirsten Simon, Margit Cichna-Markl, and Rupert Hochegger. 2022. "Interlaboratory Validation of a DNA Metabarcoding Assay for Mammalian and Poultry Species to Detect Food Adulteration" Foods 11, no. 8: 1108. https://doi.org/10.3390/foods11081108

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop