Next Article in Journal
Understanding the Biophysical Interaction of LTX-315 with Tumoral Model Membranes
Next Article in Special Issue
Trypanosoma brucei rhodesiense Inhibitor of Cysteine Peptidase (ICP) Is Required for Virulence in Mice and to Attenuate the Inflammatory Response
Previous Article in Journal
Selected Biomarkers of Depression: What Are the Effects of Cytokines and Inflammation?
Previous Article in Special Issue
The Mammalian Cysteine Protease Legumain in Health and Disease
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Complex of Proline-Specific Peptidases in the Genome and Gut Transcriptomes of Tenebrionidae Insects and Their Role in Gliadin Hydrolysis

by
Valeriia F. Tereshchenkova
1,
Irina Y. Filippova
1,
Irina A. Goptar
2,
Yakov E. Dunaevsky
3,
Mikhail A. Belozersky
3 and
Elena N. Elpidina
3,*
1
Faculty of Chemistry, Lomonosov Moscow State University, Moscow 119991, Russia
2
Izmerov Research Institute of Occupational Health, Moscow 105275, Russia
3
A.N. Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow 119991, Russia
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2023, 24(1), 579; https://doi.org/10.3390/ijms24010579
Submission received: 30 November 2022 / Revised: 21 December 2022 / Accepted: 25 December 2022 / Published: 29 December 2022
(This article belongs to the Special Issue Peptidases: Role and Function in Health and Disease)

Abstract

:
A detailed analysis of the complexes of proline-specific peptidases (PSPs) in the midgut transcriptomes of the larvae of agricultural pests Tenebrio molitor and Tribolium castaneum and in the genome of T. castaneum is presented. Analysis of the T. castaneum genome revealed 13 PSP sequences from the clans of serine and metal-dependent peptidases, of which 11 sequences were also found in the gut transcriptomes of both tenebrionid species’ larvae. Studies of the localization of PSPs, evaluation of the expression level of their genes in gut transcriptomes, and prediction of the presence of signal peptides determining secretory pathways made it possible to propose a set of peptidases that can directly participate in the hydrolysis of food proteins in the larvae guts. The discovered digestive PSPs of tenebrionids in combination with the post-glutamine cleaving cysteine cathepsins of these insects effectively hydrolyzed gliadins, which are the natural food substrates of the studied pests. Based on the data obtained, a hypothetical scheme for the complete hydrolysis of immunogenic gliadin peptides by T. molitor and T. castaneum digestive peptidases was proposed. These results show promise regarding the development of a drug based on tenebrionid digestive enzymes for the enzymatic therapy of celiac disease and gluten intolerance.

1. Introduction

Tribolium castaneum and Tenebrio molitor are genetic and biochemical model insects in the family Tenebrionidae. These insects are stored product pests of grain crops and wheat flour, and their main dietary proteins are therefore proline- and glutamine-rich gliadins—the prolamins of wheat and the main component of gluten. Gliadins contain up to 50% glutamine and up to 30% proline residues [1,2]. Proline is structurally unique and the only imino acid among 20 natural proteinogenic amino acids. Proline bonds cannot be hydrolyzed by most peptidases with broad substrate specificity because their unique conformation provides a rigid structure of proteins and peptides that protects them from degradation. In nature, only a special group of peptidases, proline-specific peptidases (PSPs), almost exclusively hydrolyzes bonds formed by proline. Due to the need to digest gliadin-containing food, tenebrionids should possess enzymes capable of hydrolyzing the peptide bonds formed by proline, which thus makes them a good natural enzymatic system to study the broad spectrum of PSPs.
Proteolysis of proline-rich proteins and peptides is also important for providing insects with free proline. This imino acid is an important component of insect metabolism as oxidation of free proline in the mitochondria may provide an important energy source for overwintering larvae or flying adults [3,4]. It has been shown that increased levels of proline correlate with specific environmental conditions in insects, which can be observed in overwintering insects and those with cold tolerance [3,5,6]. Thus, the study of peptidases that participate in the release of free proline from dietary proteins are of great interest. Such enzymes are especially important in the metabolism of insects that feed on proline-rich proteins.
Information about known eukaryotic PSPs has been summarized in several reviews [7,8,9]. However, the complexes of PSPs in distinct species, including insects, have not been described. This group of peptidases includes enzymes that specifically hydrolyze peptide bonds formed (i) by the carboxyl group of proline (prolyl oligopeptidase (POP) (which includes the so-called prolyl endopeptidase (PEP), which actually is POP), dipeptidyl peptidases (DPP) 2, 4, 8, and 9, fibroblast activation protein α (FAP), and prolyl carboxypeptidase (PRCP)) and (ii) by the imino group of proline (prolidase (XPD, Xaa-Pro dipeptidase) and aminopeptidases P (APP) 1, 2, and 3). Most of the representatives of PSPs that have been described up to now are exopeptidases; POP is an endopeptidase, and FAP can show both exo- and endopeptidase activities. The rest of the PSPs include a rather diverse group of different exopeptidases that cleave amino acids from both ends of the polypeptide chain.
Besides the above, there also exist substrates with N-terminal Pro residue, which are not hydrolyzed by any PSP from the described complex. These substrates can be hydrolyzed by peptidases with broad substrate specificity with no preference to Pro residue (post-proline cleaving peptidases with broad substrate specificity, PPCPbs). Thus, N-terminal Pro from a polypeptide chain of any length can be cleaved by leucyl aminopeptidase (LAP). The dipeptide Pro-X can be cleaved by cytosolic non-specific dipeptidase (CND), which hydrolyzes dipeptides of any composition, except Pro-Pro.
The list of PSPs described in the comprehensive collection of exhaustively characterized peptidases [8] includes 19 types of PSPs from all kingdoms of living organisms. Previously, we reviewed 13 representatives of human PSPs [9] from two clans according to the MEROPS classification [10]: serine peptidases and metallopeptidases. POP, DPP, and FAP belonged to the S9 family of serine peptidases: S9A (POP), S9B (DPP 4, 8, 9, FAP), S9X (DPP 6, 10). PRCP and DPP 2 were included in the S28 family of serine peptidases. APP1, APP2, APP3, and XPD belonged to the M24 family of metallopeptidases. For effective release of Pro residues from dietary proteins, each animal species used a specific complex of digestive PSPs, which is a shorter list than the whole list of animal PSPs. We could not find publications devoted to a description of digestive PSP complexes in animals including insects, while inhibition of its members being an effective way to control insect pests. Among the human intestinal PSPs that could participate in digestion, only DPP 4 [11] and XPD [12,13,14,15] activities have been reported in the literature.
In this paper, we present a detailed analysis of our own data about PSPs complexes in the midgut transcriptomes of stored product pests (T. molitor and T. castaneum larvae), T. castaneum genome [16], in comparison with the PSP complex in the human genome [9].

2. Results

2.1. Search for PSP Sets in Tribolium Castaneum and Tenebrio Molitor

Analysis of the T. castaneum genome using all eukaryotic PSPs as a query revealed 13 PSP sequences (Table 1): single sequences for POP, DPP 8, APP3, and XPD; two sequences for DPP 4, PRCP, and APP1; and three sequences for DPP 10 (inactive DPP 4 homolog). The T. castaneum genome also contained three sequences homologous to human LAP and one sequence homologous to CND. Most of the found sequences were annotated in the UniProt and NCBI databases, but for the DPP4.2_Tc (hereinafter the name before the dot means the type of peptidase, the number after the dot can occur if there are several variants of sequences of one type of peptidase, and the abbreviation after the underscore refers to the organism: Tc—T. castaneum, Tm—T. molitor, and Hs—H. sapiens), DPP10.2_Tc, and DPP10.3_Tc sequences annotation was refined by us during phylogenetic analysis and multiple alignment with human homologs. Comparison of PSP sets found in the T. castaneum genome and T. castaneum larval gut transcriptome showed that some of the found PSPs were not transcribed in the gut, namely POP_Tc, DPP10.3_Tc, LAP.2_Tc, and LAP.3_Tc; thus, the T. castaneum larval gut transcriptome contained 11 PSP sequences and two PPCPbs sequences.
The set of T. molitor PSPs was analyzed using the larval gut transcriptome. It contained 11 predicted protein sequences of PSPs similar to those presented in the T. castaneum and human genomes and the T. castaneum larval gut transcriptome (Table 1): single sequences for POP, DPP 8, PRCP, APP1, APP2, APP3, and XPD and two sequences for DPP 4 and DPP 10. We also found sequences homologous to LAP and CND.
The sets of human and insect PSPs are presented in Table 1. Overall, the sets of PSPs for all studied organisms were similar, but insects did not possess some of the PSPs found in humans: DPP 9 (which is very close to DPP 8 and DPP 4), FAP, DPP 6 (an inactive homolog of DPP 4), and DPP 2. Interestingly, insects, unlike humans, had some PSPs duplicated (DPP 4, DPP 10, PRCP, APP1, and LAP), and some of the duplicates were found only in the T. castaneum genome.

2.2. General Characteristics of the PSP Sequences

2.2.1. Serine PSPs

For all peptidases, residues that determine primary specificity were identified. Active sites of active serine PSPs from S9A, S9B, and S28 consisted of a highly conserved catalytic triad in the order Ser, Asp, His. Surrounding residues of all peptidases were also very conservative, with few synonymic substitutions.
Peptidases from the S9A subfamily hydrolyze substrates of the general formula (Xaa)1-15-Xbb-Pro↓Xbb-(Xaa)1-15, where Xaa represents any amino acid and Xbb represents any amino acid except Pro. The active site of the S9A serine peptidase subfamily including POP consists of Ser554, Asp641, and His680 (hereinafter numbering is given by human POP, UniProt P48147) with conserved sequence motifs NGGSNGG, ADHDDRV, and RAGHGAG [17,18,19]. In the sequences found in Tenebrionidae insects (POP_Tc and POP_Tm), composition of the catalytic triad Ser, Asp, His was similar, with surrounding residues and residues forming binding subsites being highly conserved.
Peptidases from the S9B subfamily hydrolyze substrates of the general formula Xbb-Pro↓Xbb-(Xaa)n, where n represents any number. The active site of the S9B serine peptidase subfamily consists of Ser630, Asp708, and His740 (hereinafter numbering is given by human DPP 4, UniProt P27487). Homologous sequences of the S9B serine peptidase subfamily found in Tenebrionidae insects had the same order in the catalytic triad. In the structure of human DPP 4, the Pro (P1) residue of the substrate is located in the hydrophobic S1 subsite surrounded by Tyr631, Val656, Trp659, Tyr662, Tyr666, and Val711. The S2 subsite includes Asn710, Arg125, Glu205, and Glu206 [18,20,21]. Active insect peptidases contained conserved substrate-binding subsite residues, with the only exception being the S1 residue Trp659, which was replaced in DPP4.1_Tc Trp659Leu and DPP4.1_Tm Trp659Phe. All DPP 4-like peptidases found in Tenebrionidae insects, together with human DPP 4, possessed only exopeptidase activity, having Asp or Asn residues in an important Asp663 position similar to human DPP 4. This fact distinguished DPP 4 from FAP, containing in human smaller Ala657 residue [22] in a corresponding position, which explains the presence of endopeptidase activity in addition to exopeptidase activity in FAP [20,22,23,24]. An inactive homolog of DPP 4 (human DPP 10) had a Gly residue instead of Ser630 in the active site [25,26]. In the T. castaneum genome, three sequences of DPP 10 were found. Two of them also had a Gly residue instead of Ser630 and formed homologous pairs with T. molitor sequences. The third—DPP10.3_Tc—had Ser residue in the active site similar to active members of the S9B subfamily, and presumably can thus have enzymatic activity similar to DPP 4, although the sequence structure is characteristic of DPP 10.
PRCP from the S28 subfamily hydrolyzes substrates of the general formula (Xaa)n-Xbb-Pro↓Xbb. DPP 2 hydrolyzes substrates of the same general formula as the S9B subfamily, cleaving Xbb-Pro dipeptide from the N-terminus, as described above; however, no such activity was found in tenebrionids. The catalytic triad of the S28 serine peptidase family contains Ser179, Asp430, and His455 (hereinafter the numbering is given by human PRCP, UniProt P42785) residues. PRCP sequences from Tenebrionidae insects also had conserved residues surrounding the catalytic triad with minor replacements, as well as similar Pro-binding S1 subsites (Table 2).

2.2.2. Metallodependent PSPs

All metallodependent PSPs (APP1, APP2, APP3, and XPD) belong to the M24B subfamily and are double Mn2+-dependent enzymes, requiring both ions for activity [27]. APP hydrolyzes substrates of the general formula Xbb↓Pro(Xaa)1-9. XPD hydrolyzes dipeptides of the general formula Xbb↓Pro. Insect peptidases contained two conserved metal-binding sites, as well as residues essential for maintaining activity similar to human enzymes (Table 3).

2.2.3. Post-Proline Cleaving Peptidases with Broad Substrate Specificity (PPCPbs)

Additional PPCPbs (LAP and CND) are also metallopeptidases. LAP hydrolyzes substrates of the general formula Xaa↓Xbb-(Xaa)n. It is a Zn2+-dependent peptidase from the M17 family and contains two Zn2+ ions. CND hydrolyzes dipeptides of the general formula Xaa↓Xbb. It is from the M20 family and contains two Mn2+ ions. In general, all sequences found in Tenebrionidae insects were highly conserved in metal-binding sites (Table 4, numbering is given by human peptidases).
This analysis showed that all found PSP sequences from T. molitor had a conserved structure compared to T. castaneum and human enzymes sequences, which made sequence annotation easier.

2.3. Comparison and Phylogeny of PSPs in T. molitor and T. castaneum

The phylogenetic tree represented in Figure 1 includes H. sapiens, T. castaneum, and T. molitor PSPs. The tree was divided into separate branches, each corresponding to one of the three families of PSPs (S9, S28, M24) or one of the two PPCPbs (M20 and M17). Each branch in turn was subdivided into types of peptidases, which formed orthologous pairs. A clade of DPP 4 proteins was divided into two distinct orthologous pairs: DPP4.1_Tc–DPP4.1_Tm and DPP4.2_Tc–DPP4.2_Tm. Insect DPP 10 formed a separate clade. Among them, DPP10.1_Tc–DPP10.1_Tm and DPP10.2_Tc–DPP10.2_Tm formed two distinct orthologous pairs. DPP10.3_Tc had a Ser residue in the catalytic triad that was located near the second pair. Separate clades were also formed by other PSPs. Distinct groups were formed by the DPP 8, POP, PRCP, APP1, APP2, APP3, XPD, CND, and LAP proteins. In each clade, homologous insect sequences formed orthologous pairs. Human proteins were located near them due to the high structural and functional similarities.

2.4. Localization of PSPs

The predicted molecular masses of serine PSPs from the S9 and S28 families ranged from 54 to 103 kDa, and for metallodependent PSPs from the M24B subfamily these predictions ranged from 54 to 88 kDa. In most cases, calculated molecular masses were similar for enzymes of the same type (Table 5).
To get closer to understanding the possible functions of PSPs, we analyzed the presence of specific localization domains connected with secretory pathways, which in this case was the signal peptide (with transportation to the gut or lysosomes). We also searched for the mitochondrial transit peptide and transmembrane domains. The results of this study are summarized in Table 5.
Human and insect POP was predicted to lack signal peptide according to SignalP-6.0. However, POP_Tm was predicted to have it by the TargetP-2.0 server. Based on previous studies, human [28] and T. molitor [29] POP were shown to be cytoplasmic enzymes.
Among DPP 4 sequences, DPP4_Hs had a small transmembrane domain (20 amino acid residues). The TargetP-2.0 server predicted signal peptides for this sequence, but the probability of its appearance was not very high. According to experimental data, this protein exists in two forms: membrane-bound with a large extracellular domain on the cell surface [30] or soluble protein resulting from the cleavage of the extracellular domain from the short transmembrane part [31]. Tenebrionidae insects had two pairs of sequences coding DPP 4 proteins. The first homologous pair of sequences (DPP4.1_Tc–DPP4.1_Tm) had signal peptides predicted with high probability and proteins were presumed to be secreted. The other pair of DPP 4 sequences (DPP4.2_Tc–DPP4.2_Tm) had transmembrane domains and no signal peptide, similar to the human protein. The rest of the DPP 4-like proteins (DPP 8 and DPP 10 for both humans and insects) had no signal peptides. Human DPP 8 was shown to be a cytoplasmic peptidase [32,33,34]. All DPP 10 sequences had transmembrane domains, which is supported by previous data for this human protein [25,26,35].
Both human and insect PRCP sequences from the S28 family had signal peptides and were predicted to be secreted in agreement with previous studies [29,36,37,38,39,40].
Almost all metallopeptidase sequences had no specific domains connected to secretion, except for human and T. molitor APP2 and T. castaneum APP1.2_Tc, which had N-terminal signal peptides predicted. The APP1.2_Tc sequence is currently annotated in UniProt and NCBI databases as APP 1-like protein, but due to the predicted localization domain and conducted phylogenetic analysis (Figure 1) it probably belongs to APP2 proteins. Although there are no reliable data on the localization of such peptidases in the studied insect, there is some information about the localization of mammalian peptidases. It is known that signal peptide can serve as a signal-anchor sequence for transmembrane proteins of type III [41] and this is supported by data for human APP2, which was shown to be plasma membrane peptidase [42]. APP1 was localized predominantly in the cytoplasm, and this is supported by data for rat APP1 [43]. Human APP3 had mitochondrial localization [44], which is consistent with the predicted presence of the mitochondrial transit peptide in APP3_Hs and APP3_Tc (high probability), as well as its predicted presence in APP3_Tm (low probability). All LAP sequences, except for LAP.1_Tc, had the predicted mitochondrial transit peptide. Human LAP was found predominantly in microsomal fractions and cytoplasm [45], and CND was found also in cytoplasm with ubiquitous tissue distribution [46].
Overall, there was good agreement between the localization of predicted bioinformatics and experimental data on mammalian PSPs found in the literature (Table 5). Based on the predicted signal peptides for insect peptidases, only DPP4.1 and PRCP were presumed to be secreted proteins.
These bioinformatics predictions correlate with our previous and new biochemical data on the localization of several PSPs in the midgut of T. molitor larvae. We found that DPP 4 [48] and PRCP [29] were located mainly in the AM (anterior mesenteron, anterior part of the midgut), POP [29] was distributed evenly between the AM and PM (posterior mesenteron, posterior part of the midgut), and XPD [49] was localized predominantly in the PM. DPP 4 and PRCP were located in soluble gut contents, while POP and XPD were mainly located in tissue fractions.
We also show that the remaining PSP (APP) is distributed evenly between AM and PM parts of the gut, with more than half of its total activity belonging to soluble tissue fractions (Table 6).

2.5. Analysis of the Expression Levels of Genes Coding PSPs in T. castaneum and T. molitor Guts

Data on expression levels of genes coding the identified T. castaneum and T. molitor gut PSPs are presented in Table 7 as reads per kilobase per million mapped reads (RPKM). For better comparison of the possible digestive role of PSPs between two insects, we also calculated the expression levels as a percentage from the sum of expressions of all active PSPs. This calculation involved only active PSPs: POP, DPP 4, DPP 8, PRCP, APP1, APP2, APP3, and XPD. DPP 10 was not included in this calculation because it is an inactive DPP 4 homolog and does not possess peptidase activity, and LAP and CND were not included because they possess broad substrate specificity and the cleavage of post-proline bonds is only additional, not primary.
The highest expression level of S9 PSP mRNA was observed in the orthologous pair of secreted DPP4.1_Tc–DPP4.1_Tm (25% each from the sum of expressions of all active PSPs). The membrane pair DPP4.2_Tc–DPP4.2_Tm had lower expression levels of 8% and 17%, respectively.
The expression level of mRNA in T. castaneum PRCP (PRCP.1_Tc) was only 0.4%, whereas orthologous mRNA in T. molitor PRCP (PRCP_Tm) exhibited a 9% expression level. However, T. castaneum had another sequence (PRCP.2_Tc) with an expression level of 10%, for which there was no ortholog in the T. molitor transcriptome.
Among metal-dependent PSP mRNA, the APP1.1_Tc–APP1_Tm pair possessed high expression levels of about 36% and 15%. High expression was also evaluated for the XPD_Tc–XPD_Tm pair, about 19% and 25%, respectively.
All other mRNA of active PSPs from both organisms had expression levels of less than 5% of the sum of expressions of all active PSPs, including the single oligopeptidase POP that was expressed only in the gut of T. molitor (POP_Tm, 4%).
Among the rest of the mRNA sequences, there was a highly expressed inactive DPP 4 homolog in T. molitor (DPP10.1_Tm) whereas both orthologs of T. castaneum were low in terms of expression. PPCPbs mRNA (LAP and CND) had comparable expression levels in both insects, and CND had the highest expression levels among studied sequences in T. molitor and T. castaneum.
In general, there were similar expression profiles of PSP mRNA in both T. castaneum and T. molitor (Figure 2). Prospective orthologs had comparable expression levels.
The highest level of gut expression among active PSPs belonged to the exopeptidases DPP 4, PRCP, APP1, and XPD, which may indicate their important role in tenebrionid physiology, particularly participation in the digestive process. It should be noted that POP had a low or non-existent level of gut expression, which makes its participation in the proteolysis of food proteins unlikely.
Localization studies together with the evaluation of the expression level of genes coding T. molitor PSPs in the gut transcriptome, as well as the presence of signal peptides connected with secretion, allowed for confirmation of the set of peptidases that can directly participate in the hydrolysis of food proteins in insect larval guts. This set consists of DPP4.1_Tm and PRCP_Tm, which are characterized as secreted peptidases localized in soluble gut contents with high expression levels of their mRNA in the gut transcriptome.

2.6. Gliadin Hydrolysis

Gliadin, a major storage protein of wheat, cannot be completely hydrolyzed by human digestive peptidases with the usual broad specificity. Proline- and glutamine-rich peptides, with the high content of proline and glutamine residues in their composition [1,2], remain unhydrolyzed [50] and can cause the development of diseases of the human gastrointestinal tract in predisposed people. In this regard, digestive peptidases of Tenebrionidae insects, in particular PSPs, may present the potential to hydrolyze these remaining fragments by serving as a proposed oral medicine as tenebrionids are stored product pests and gliadins are their major dietary proteins.
We studied the effect of the major luminal digestive T. molitor peptidases DPP4.1_Tm and PRCP_Tm, which are capable of cleaving bonds formed by proline on the total preparation of wheat gliadins (Figure 3). In addition, the effect of both individual PSPs and their combination with digestive post-glutamine cleaving peptidases of T. molitor (the total preparation of larval digestive cysteine cathepsins (CC)) [51] was studied. Proline-specific luminal enzymes (DPP 4 and PRCP) are exopeptidases, and CC are endopeptidases. Thus, we were able to study the combined action of endo- and exopeptidases on gliadins.
The enzymes were taken in equal amounts per larvae. Among the individual enzymes, the highest intensity of gliadin hydrolysis was achieved by the action of post-glutamine cleaving endopeptidases: CC preparation (62% of which belongs to the major digestive cathepsin L of T. molitor NCBI ID AJF94885 according to [52]) was eight times more active than studied PSPs. Among the used PSPs, DPP 4 exhibited 1.5-fold higher activity with gliadins than PRCP. Separate addition of DPP 4 and PRCP to CC increased hydrolysis by 1.4 and 1.3 times, respectively. The combination of all three peptidase preparations improved the rate of hydrolysis by 1.7 times compared to single CC preparation (Figure 3). It should be noted that, in all cases, the combined effect of endo- and exopeptidases was greater than the sum of the separate enzyme actions. This indicates that hydrolysis with endopeptidases forms additional substrates for exopeptidases, and this leads to an increase in the hydrolysis rate of the original protein.

2.7. Hypothetical Scheme for the Complete Hydrolysis of Glutamine- and Proline-Rich 26-Mer Immunogenic Gliadin Fragments by T. molitor and T. castaneum Digestive Peptidases

Based on the obtained data, we suggested a hypothetical scheme for the complete hydrolysis of some gliadin fragments by T. molitor and T. castaneum digestive peptidases (Figure 4). As an example, we have taken the 26-mer FLQPQQPFPQQPQQPYPQQPQQPFPQ of γ-gliadin (UniProt: Q8L6B2), which along with the 33-mer peptide is a strong immunogenic peptide [50]. During the first stage of proteolysis, digestive post-glutamine cleaving endopeptidase CC can most effectively hydrolyze peptide bonds between two glutamine residues. Despite the ability of CC to hydrolyze bonds after the glutamine residue [53], the QP bond should not be cleaved by them, since the bonds formed by proline residue are hydrolyzed by the PSPs almost exclusively. Resulting fragments can be the substrates for DPP 4, which cleaves dipeptides such as QP, FP, and YP from their N-terminus. The remaining tripeptides (QPQ) can be effectively hydrolyzed by PRCP. Prolidase XPD may complete gliadin degradation, cleaving final dipeptides to single amino acids in the cytoplasm of the epithelial cells in the gut [49]. The remaining fragment FLQP is not immunogenic [50].
According to the scheme, the cleavage of bonds formed by glutamine residues by CC is an important first step in the hydrolysis process, and each of the found PSPs is essential for complete digestion of glutamine- and proline-rich gliadin fragments in the midgut of tenebrionids.

3. Discussion

A bioinformatic search for PSPs in the T. castaneum genome and both Tenebrionidae larvae gut transcriptomes revealed 14 distinct types of sequences in both insects: one POP, two DPP 4, one DPP 8, two PRCP, two APP1, one APP2, one APP3, one XPD sequence (all active peptidases), and three DPP 10 (inactive homologs of DPP 4). There also were two types of peptidases found with broad substrate specificity, possessing the ability to cleave bonds formed by proline residue (three LAP and one CND). The main difference in PSP sets in Tenebrionidae larvae gut transcriptomes was that T. castaneum, unlike T. molitor, had two PRCP sequences, two APP1 sequences, and lacked APP2 and POP sequences. Generally, Tenebrionidae insect PSP sets were similar to those in humans, lacking only DPP 9, FAP, DPP 2, as well as DPP 6, which is inactive homolog of DPP 4 together with DPP 10 found in all three species. Inactive homologs are represented in all major peptidase families. According to the MEROPS database, it is the serine peptidases that have the highest number of identified inactive enzymes, especially in insects. It has been suggested that a large number of both inactive and active serine peptidases indicates that the evolution of inactive peptidases and their new functions were beneficial for insects [54]. We have also shown a large number of serine peptidases and their inactive homologs in T. molitor [55], which can indicate that the emergence of inactive homologs is not an accident, but rather a natural process.
Phylogenetic analysis and multiple alignment of Tenebrionidae insect PSPs with those of humans allowed us to annotate all T. molitor peptidase sequences and refine annotation for several T. castaneum peptidase sequences based on analysis of cladograms of the tree, active centers, and substrate binding subsites.
The major digestive organ of T. molitor larvae is the midgut, where pH varies from 5.2 to 6.5 in the AM and 7.8 to 8.2 in the PM [56,57,58]. This gradient determines localization of the activity of different digestive enzymes along the midgut in accordance with their pH optima [59]. Gut peptidases may be localized either inside the gut lumen contents, and thus directly participate in digestion, or in the epithelial tissues lining the gut lumen. The combination of gene expression studies in the larval gut transcriptome, bioinformatic prediction of signal peptides, and biochemical localization experiments demonstrate the digestive function of the highly expressed secreted PSPs.
Using our previous and new data on the localization of T. molitor PSPs, we can summarize that DPP 4 [48] and PRCP [29] were located mainly in the AM, POP [29] and newly studied APP were distributed evenly between the AM and PM parts of the gut, and XPD [49] was localized predominantly in the PM. Analysis of the tissue distribution of the PSPs showed that DPP 4 [48] and PRCP [29] were located in soluble gut contents and can participate in T. molitor digestion, and this is in agreement with bioinformatic data (these peptidases were predicted to be secreted and highly expressed). The remaining POP, APP, and XPD were localized in tissues. Among them, XPD definitely seems to participate in digestion during the final stages as the substrates for XPD are dipeptides that can be absorbed by epithelial tissues [49]. There is also the possibility of APP participating in digestion because it can cleave oligopeptides and tripeptides, and tripeptides also can be absorbed by midgut epithelia [60]. POP hydrolyzes only oligopeptides, which cannot be absorbed by tissues. Compared to POP and APP, the participation of XPD in gliadin hydrolysis seems most likely and is supported by its high expression level. Since about 60% of XPD activity was located in the PM, it can act during the final stages of digestion and thus provide complete hydrolysis of gliadins.
Information from various sources about PSPs that can effectively break down gliadins was summarized in a previous review [61]. Attempts to hydrolyze gliadins were mostly limited by the search for peptidases capable of cleaving only peptide bonds formed by proline, such as prolyl oligopeptidases from Myxococcus xanthus, Sphingomonas capsulata, and Flavobacterium meningosepticum [62], prolyl endopeptidase from Aspergillus niger [63], and PSPs from the lactic acid bacteria Lactobacillus plantarum and Pediococcus pentosaceus [64]. However, the use of single PSPs did not achieve the complete hydrolysis of gliadins, and only the discovery of cysteine peptidase from Hordeum vulgare (EP-B2) in 2006 [65,66] allowed for the creation and testing of a dosage form for celiac disease treatment, which was based on a mixture of prolyl oligopeptidase from S. capsulata and cysteine peptidase from H. vulgare (EP-B2) (ALV003 Alvine, USA) and which is now under clinical trial [67,68]. The other cysteine peptidase capable of participating in the hydrolysis of gliadins was later described from Triticum aestivum [69].
Previously, we showed that T. molitor DPP 4 is able to hydrolyze gliadins more efficiently than human DPP 4 [48]. In the present study, we tested a natural complex of digestive enzymes of tenebrionids to discover whether this would increase the efficiency of hydrolysis by providing efficient hydrolysis of their proline-rich natural food substrates (such as gliadins) in natural conditions and ensuring sufficient intake of proline. Gliadin hydrolysis was studied using single digestive T. molitor peptidases and their mixtures capable of cleaving bonds formed by glutamine and proline residues. Among the separate enzymes, the highest intensity of hydrolysis was achieved by the action of post-glutamine cleaving cysteine cathepsins (CC), which corresponds with the hypothesis that they arose during evolution for the efficient hydrolysis of gliadins [51]. Individual digestive PSPs (DPP 4 and PRCP) were less active on intact gliadins, but the combined action of endo- and exopeptidases demonstrated the synergistic effect; this indicates that, after hydrolysis by endopeptidases, additional substrates for exopeptidases were formed, thus leading to an increase in the hydrolysis rate of the original protein.
Based on the obtained data, we suggested a hypothetical scheme for the complete hydrolysis of glutamine- and proline-rich 26-mer immunogenic gliadin fragments by T. molitor and T. castaneum digestive peptidases. The first stage of hydrolysis was carried out with endopeptidase CC, which can hydrolyze bonds formed by glutamine residues. DPP 4 then cleaved dipeptides from the resulting oligopeptides. The remaining tripeptides were hydrolyzed by PRCP. During the final stage, XPD degraded dipeptides to single amino acids in the cytoplasm of midgut epithelial cells.
The suggested hypothetical scheme is original and interesting because it can explain the hydrolysis of glutamine- and proline-rich gliadin fragments from different gliadins, including not only γ-gliadins (UniProt: P21292) containing PQQPFPQ repeats and ω-gliadins (UniProt: Q9FUW7) with PQQPFPQQ (which are the richest in glutamine and proline residues) but also α-gliadins (UniProt: P02863) with PQPQPFP and PQQPY repeats. Importantly, considering the suggested scheme, several proline- and glutamine-rich gliadin peptides may be degraded, such as QLQPFPQPQLPY, PQPQLPYPQPQLP, LGQQQPFPPQQPYPQPQPF, LQLQPF(PQPQLPY)3PQPQPF, and FLQPQQPF(PQQ)2PY(PQQ)2PFPQ that are not hydrolyzed by human digestive peptidases [50,70] and which cause autoimmune celiac disease. This indicates that it may be possible to develop an enzyme therapy drug for celiac disease and gluten intolerance that is based on tenebrionid digestive enzymes.

4. Materials and Methods

4.1. Preparation of the Biological Material and Sequencing of cDNA

Preparation of biological material from T. molitor and T. castaneum larvae guts and sequencing of cDNA was described previously [52]. The preparation of total gut RNA from T. molitor larvae from three independent biological replicates was sent to a sequencing facility (National Center for Genome Resources (NCGR), Santa Fe, NM, USA), where mRNA was isolated by polyA, standard libraries were made, and paired-end sequencing was performed on an Illumina HiSeq 2000 (San Diego, CA, USA) using standard protocols from the manufacturer. We obtained approximately 240 million sequence reads with a 250 bp insert.
Paired-end sequencing on the Illumina HiSeq 2000 of T. castaneum cDNA was performed by the High Throughput Genomics Center, Seattle, USA. We obtained approximately 340 million sequence reads, and insert size was approximately 250 bp.

4.2. Assembly of Contigs

A custom assembly of T. molitor sequences combined from all replicates was made by the NCGR, resulting in 197,800 contigs (minimum length = 100 and maximum length = 51328; Q1 = 123, Q2 = 153, Q3 = 335; N50 = 2232, B1000 = 71.9%, B2000 = 54.1%). For T. molitor sequences, we also combined the replicate data and included previous Sanger sequencing [71] and pyrosequencing [72] databases of mRNA from the larval gut and performed additional de novo assemblies using SeqManNGen (v. 4.0.1.4, DNAStar, Madison, WI USA) and custom assembly programs. For T. castaneum sequences, we used SeqManNGen to map sequences to the T. castaneum genome (Tcas3, NCBI; parameters for alignment were merSize = 19; 309,572,610 bp submitted, 263,305,494 aligned, 17,268,742 unaligned; sequence count score was >90%), as well as the Galaxy platform [73,74]. Potential coding sequences, starting at methionine and covering at least 20% of the mRNA sequence, were found in the T. molitor contigs using custom software.

4.3. Search for Insect PSP Sequences

Reference sequences were taken from the human genome and downloaded from the UniProt database [75,76]. The list of reference human PSPs is given in Table 1. Initially, translated T. molitor ORFs homologous to human PSP query sequences were determined using the algorithm tblastx [77,78]. Multiple alignment was constructed by means of pairwise alignments using the Wagner–Fisher algorithm [79]. ORFs that were grouped into blocks with an identity of at least 95% and that overlapped with another block of at least 10 amino acid residues were considered as referring to one unique peptidase. Multiple sequence alignment (MAFFT) [80] was used to refine and build consensus sequences. The predicted start of the T. molitor mature enzyme sequences was identified by sequence homology through alignment with mature human PSPs. Multiple alignments of the analyzed amino acid sequences were performed using the program Clustal Omega [81,82].
T. castaneum peptidase sequences were found in the NCBI database [83,84] via Protein BLAST, using human PSPs as queries. Found sequences that lacked annotation in the database (namely DPP4.2_Tc, DPP10.2_Tc, and DPP10.3_Tc) were annotated during the subsequent construction of a phylogenetic tree and the analysis of multiple alignments.

4.4. Phylogenetic Analysis and Orthology Predictions

A phylogenetic tree of human, T. castaneum, and T. molitor PSPs was constructed in MEGA X [85,86] using the maximum likelihood method and 200 bootstrap repetitions. Based on the cladogram, pairs of sequences from T. castaneum and T. molitor were considered orthologous if they formed a single clade.

4.5. Analysis of the Expression Levels

Analysis of the expression levels was carried out as written in [52]. To analyze the expression of peptidase transcripts in T. castaneum, we used reads mapped to the genome of T. castaneum (Tcas3) by SeqManPro (DNAStar) to obtain expression values in normalized reads per kilobase per million mapped reads (RPKM) [87].
For T. molitor, about 40 million reads were obtained from each of the three RNA-Seq datasets. BLAST was used to obtain all nucleotide sequences in the contigs that potentially encode peptidases. The contigs were used to assemble and refine sequences of complete peptidase mRNA. Refined peptidase mRNA sequences were used to identify contigs in each assembly with at least 97% sequence identity, which were then used for expression calculations. If a contig aligned only partially to the mRNA, its contribution to expression was proportional to the aligned part if it was more than 50% of the contig’s length. RPKM was used to calculate the number of reads mapped to a contig when calculating each multiread as one unit.

4.6. Analysis of the Structure of PSP Sequences

Active centers and substrate binding subsites were analyzed using multiple alignments of studied peptidases (performed in Clustal Omega [81,82] and visualized in GeneDoc [88,89]). The SignalP-6.0 server [90,91] and TargetP-2.0 [92,93] were used to analyze N-terminal parts in the translated mRNA sequences, predict the presence and location of signal peptide cleavage sites and the subsequent secretion of proteins, and predict the presence of mitochondrial transit peptides. Trans-membrane domains were predicted using the DeepTMHMM server [94,95]. Molecular masses were calculated using the online server ExPASy [96]. Molecular masses and sequence lengths for human and T. castaneum PSPs were taken from the UniProt and NCBI databases.

4.7. Enzyme Assays Using Chromogenic p-Nitroanilide Substrates

Peptidase activity with chromogenic p-nitroanilide substrates was determined spectrophotometrically according to [97] at 405 nm with an ELx808 Absorbance Microplate Reader (BioTek Instruments, Inc., Winooski, VT, USA) and was calculated using the initial rates of hydrolysis.
Exact protocols for studied peptidases were described earlier: CC with 0.5 mM Glp-Phe-Gln-pNA (was obtained according to [53]) at pH 5.6 with 6 mM Cys [53], DPP 4 with 0.25 mM Ala-Pro-pNA (was synthesized according to standard procedures [98]) at pH 7.9 [48], and PRCP with 0.5 mM Z-Ala-Ala-Pro-pNA (was synthesized according to standard procedures [98]) at pH 5.6 [99].

4.8. Enzyme Assays Using Fluorogenic Substrates

The reaction mixture was placed in a microplate well and included 5-50 µL of APP enzyme, 10 µL of 15 mM MnCl2, and 50 mM Tris-HCl buffer (pH 7.5) to a total volume of 195 µL. After 20 min of preincubation at room temperature, 5 µL of 3 mM Lys(Abz)-Pro-Pro-pNA (Bachem, Bubendorf BL, Switzerland) dissolved in DMF was added. APP activity toward the fluorogenic substrate Lys(Abz)-Pro-Pro-pNA was determined at λex 340 nm and λem 400 nm with a FLx800 Fluorescence Microplate Reader (BioTek Instruments, Inc., USA), and was calculated as described in the previous section.

4.9. Localization Studies

For localization studies, fractionation of AM (anterior mesenteron, anterior part of the midgut) and PM (posterior mesenteron, posterior part of the midgut) parts from T. molitor larvae was performed as described in [48]. AM and PM sections were isolated from fourth-instar T. molitor larvae and were split longitudinally. The contents were extruded and retained, while the tissue was rinsed with 0.9% saline. The lumen contents or the gut tissues were pooled from 50 AM or PM (for each of the four biological replicates) and were homogenized separately at 4 °C in 0.5 mL of double distilled water in a glass Dounce homogenizer and centrifuged for 1 h at 35,000× g. There were two fractions after centrifugation of the lumen contents: supernatant (soluble contents) and precipitate (insoluble contents). After centrifugation of the tissues there were also two fractions: supernatant (soluble tissues fraction) and precipitate (insoluble tissues fraction). All fractions were stored at −70 °C. The activity of the studied peptidase was measured in each fraction using the same larval gut equivalents of material and the corresponding substrate. Measurements were carried out in 3 biological replicates with 3–5 technical repetitions. Standard deviation did not exceed 5–10%. The relative content of each peptidase in the fractions was then calculated.

4.10. Gliadin Proteolysis Assay

To isolate digestive peptidases, extract from 200 pooled AM gut sections (2–2.5 mL) was applied to a Sephadex G-150 column (2.7 × 110 cm). Elution was performed with 0.01 M phosphate buffer (pH 5.6) containing 0.5 M NaCl at 4 °C. Fractions of 9.0 mL in volume were collected and analyzed for protein content at 280 nm and for hydrolytic activity with a specific chromogenic substrate for each peptidase: 0.5 mM Glp-Phe-Gln-pNA at pH 5.6 with 6 mM Cys for CC, 0.25 mM Ala-Pro-pNA at pH 7.9 for DPP 4, and 0.5 mM Z-Ala-Ala-Pro-pNA at pH 5.6 for PRCP. Active fractions corresponding to the major peak of each peptidase’s activity were pooled separately, concentrated, and desalted on Amicon YM3 membranes (Amicon, Winsum, Netherlands) at 4 °C and either used immediately for further analysis or stored at −70 °C.
Proteolysis of gliadins was measured by an increase in free amino groups with 2,4,6-trinitrobenzene sulfonic acid (TNBS) (Reachem, Moscow, Russia) [100]. For a comparative assay of activity, the enzymes were taken in equal amounts per larvae. We used an aliquot of studied peptidases (CC, DPP 4, or PRCP) corresponding to the activity of 1.7 T. molitor larval guts. This was calculated using data regarding the activity of the enzymes in the initial midgut extract in the presence of 2 M urea. The activity of CC was measured with 0.5 mM Glp-Phe-Gln-pNA at pH 5.6 with 6 mM Cys, DPP 4 was measured with 0.25 mM Ala-Pro-pNA at pH 7.9, and PRCP was measured with 0.5 mM Z-Ala-Ala-Pro-pNA at pH 5.6. The gliadin proteolysis reaction mixture included 750 µg of gliadins (MP Biomedicals, Illkirch, France) dissolved in 0.1 M universal acetate–phosphate–borate buffer (UB) [101] at a pH of 6.5, 3 mM DTT, 2 M urea, and T. molitor purified enzymes equal to 1.7 larvae. The final volume was 307.5 µL. In total, 20 μL of the reaction mixture was removed at time zero and at 7 min intervals over a 35 min incubation period at 37 °C. The reaction with TNBS was carried out immediately according to the protocol described in [100]. The optical absorbance of the colored reaction product was measured at 405 nm with an ELx808 Absorbance Microplate Reader (BioTek Instruments, Inc., USA).

Author Contributions

Conceptualization, E.N.E.; data curation, I.Y.F.; formal analysis, Y.E.D.; investigation, V.F.T. and I.A.G.; methodology, V.F.T.; validation, M.A.B.; writing—original draft, V.F.T.; writing—review and editing, I.Y.F., Y.E.D. and E.N.E. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Russian Science Foundation (project № 22-24-00553).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We are grateful to Brenda Oppert for participating in the discussion of our results and providing valuable comments.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Shewry, P.R.; Tatham, A.S. The prolamin storage proteins of cereal seeds: Structure and evolution. Biochem. J. 1990, 267, 1–12. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Shewry, P.R.; Halford, N.G. Cereal seed storage proteins: Structures, properties and role in grain utilization. J. Exp. Bot. 2002, 53, 947–958. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Misener, S.R.; Chen, C.; Walker, V.K. Cold tolerance and proline metabolic gene expression in Drosophila melanogaster. J. Insect Physiol. 2001, 47, 393–400. [Google Scholar] [CrossRef]
  4. Teulier, L.; Weber, J.M.; Crevier, J.; Darveau, C.A. Proline as a fuel for insect flight: Enhancing carbohydrate oxidation in hymenopterans. Proc. Biol. Sci. 2016, 283, 20160333. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Bonnot, G.; Peypelut, L.; Gérard Febvay, L.; Fleurat-Lessard, F.; Fields, P.G. The effect of cold acclimation and deacclimation on cold tolerance, trehalose and free amino acid levels in Sitophilus granarius and Cryptolestes ferrugineus (Coleoptera). J. Insect Physiol. 1998, 44, 955–965. [Google Scholar] [CrossRef]
  6. Bonnett, T.R.; Robert, J.A.; Pitt, C.; Fraser, J.D.; Keeling, C.I.; Bohlmann, J.; Huber, D.P. Global and comparative proteomic profiling of overwintering and developing mountain pine beetle, Dendroctonus ponderosae (Coleoptera: Curculionidae), larvae. Insect Biochem. Mol. Biol. 2012, 42, 890–901. [Google Scholar] [CrossRef] [Green Version]
  7. Cunningham, D.F.; O’Connor, B. Proline specific peptidases. Biochim. Biophys. Acta 1997, 1343, 160–186. [Google Scholar] [CrossRef]
  8. Rawlings, N.D.; Salvesen, G. (Eds.) Handbook of Proteolytic Enzymes, 3rd ed.; Academic Press: London, UK, 2013; p. 4104. [Google Scholar]
  9. Dunaevsky, Y.E.; Tereshchenkova, V.F.; Oppert, B.; Belozersky, M.A.; Filippova, I.Y.; Elpidina, E.N. Human proline specific peptidases: A comprehensive analysis. Biochim. Biophys. Acta Gen. Subj. 2020, 1864, 129636. [Google Scholar] [CrossRef]
  10. Rawlings, N.D.; Waller, M.; Barrett, A.J.; Bateman, A. MEROPS: The database of proteolytic enzymes, their substrates and inhibitors. Nucleic Acids Res. 2014, 42, D503–D509. [Google Scholar] [CrossRef] [Green Version]
  11. Darmoul, D.; Voisin, T.; Couvineau, A.; Rouyer-Fessard, C.; Salomon, R.; Wang, Y.; Swallow, D.M.; Laburthe, M. Regional expression of epithelial dipeptidyl peptidase IV in the human intestines. Biochem. Biophys. Res. Commun. 1994, 203, 1224–1229. [Google Scholar] [CrossRef]
  12. Rubino, A.; Vetrella, M.; Pierro, M.; Rey, J.; Jos, J.; Auricchio, S. Intestinal glutaminyl-proline dipeptidase activity in children with treated coeliac disease. Rev. Eur. Etud. Clin. Biol. 1972, 17, 993–997. [Google Scholar] [PubMed]
  13. Grävinghoff, J.; Hütter, H.J. The activities of alanine aminopeptidase, leucine aminopeptidase, proline dipeptidase and prolyl dipeptidase in the mucosa of the small intestine. Investigations on normal children and patients with the malabsorption syndrome. Eur. J. Pediatr. 1977, 127, 57–62. [Google Scholar] [CrossRef] [PubMed]
  14. Sjöström, H.; Norén, O.; Krasilnikoff, P.A.; Gudmand-Høyer, E. Intestinal peptidases and sucrase in coeliac disease. Clin. Chim. Acta 1981, 109, 53–58. [Google Scholar] [CrossRef]
  15. Ganapathy, V.; Pashley, S.J.; Roesel, R.A.; Pashley, D.H.; Leibach, F.H. Inhibition of rat and human prolidases by captopril. Biochem. Pharmacol. 1985, 34, 1287–1291. [Google Scholar] [CrossRef]
  16. Tribolium Genome Sequencing Consortium. The genome of the model beetle and pest Tribolium castaneum. Nature 2008, 452, 949–955. [Google Scholar] [CrossRef] [Green Version]
  17. Polgár, L. The prolyl oligopeptidase family. Cell Mol. Life Sci. 2002, 59, 349–362. [Google Scholar] [CrossRef]
  18. Rea, D.; Fülöp, V. Structure-function properties of prolyl oligopeptidase family enzymes. Cell Biochem. Biophys. 2006, 44, 349–365. [Google Scholar] [CrossRef]
  19. Szeltner, Z.; Polgár, L. Structure, function and biological relevance of prolyl oligopeptidase. Curr. Protein Pept. Sci. 2008, 9, 96–107. [Google Scholar] [CrossRef]
  20. Aertgeerts, K.; Ye, S.; Tennant, M.G.; Kraus, M.L.; Rogers, J.; Sang, B.C.; Skene, R.J.; Webb, D.R.; Prasad, G.S. Crystal structure of human dipeptidyl peptidase IV in complex with a decapeptide reveals details on substrate specificity and tetrahedral intermediate formation. Protein Sci. 2004, 13, 412–421. [Google Scholar] [CrossRef] [Green Version]
  21. Hiramatsu, H.; Yamamoto, A.; Kyono, K.; Higashiyama, Y.; Fukushima, C.; Shima, H.; Sugiyama, S.; Inaka, K.; Shimizu, R. The crystal structure of human dipeptidyl peptidase IV (DPPIV) complex with diprotin A. Biol. Chem. 2004, 385, 561–564. [Google Scholar] [CrossRef]
  22. Meadows, S.A.; Edosada, C.Y.; Mayeda, M.; Tran, T.; Quan, C.; Raab, H.; Wiesmann, C.; Wolf, B.B. Ala657 and conserved active site residues promote fibroblast activation protein endopeptidase activity via distinct mechanisms of transition state stabilization. Biochemistry 2007, 46, 4598–4605. [Google Scholar] [CrossRef] [PubMed]
  23. Park, J.E.; Lenter, M.C.; Zimmermann, R.N.; Garin-Chesa, P.; Old, L.J.; Rettig, W.J. Fibroblast activation protein, a dual specificity serine protease expressed in reactive human tumor stromal fibroblasts. J. Biol. Chem. 1999, 274, 36505–36512. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Aertgeerts, K.; Levin, I.; Shi, L.; Snell, G.P.; Jennings, A.; Prasad, G.S.; Zhang, Y.; Kraus, M.L.; Salakian, S.; Sridhar, V.; et al. Structural and kinetic analysis of the substrate specificity of human fibroblast activation protein alpha. J. Biol. Chem. 2005, 280, 19441–19444. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Yokotani, N.; Doi, K.; Wenthold, R.J.; Wada, K. Non-conservation of a catalytic residue in a dipeptidyl aminopeptidase IV-related protein encoded by a gene on human chromosome 7. Hum. Mol. Genet. 1993, 2, 1037–1039. [Google Scholar] [CrossRef]
  26. Qi, S.Y.; Riviere, P.J.; Trojnar, J.; Junien, J.L.; Akinsanya, K.O. Cloning and characterization of dipeptidyl peptidase 10, a new member of an emerging subgroup of serine proteases. Biochem. J. 2003, 373 (Pt 1), 179–189. [Google Scholar] [CrossRef] [Green Version]
  27. Li, X.; Lou, Z.; Li, X.; Zhou, W.; Ma, M.; Cao, Y.; Geng, Y.; Bartlam, M.; Zhang, X.C.; Rao, Z. Structure of human cytosolic X-prolyl aminopeptidase: A double Mn(II)-dependent dimeric enzyme with a novel three-domain subunit. J. Biol. Chem. 2008, 283, 22858–22866. [Google Scholar] [CrossRef] [Green Version]
  28. Walter, R. Partial purification and characterization of post-proline cleaving enzyme: Enzymatic inactivation of neurohypophyseal hormones by kidney preparations of various species. Biochim. Biophys. Acta 1976, 422, 138–158. [Google Scholar] [CrossRef]
  29. Goptar, I.A.; Filippova, I.Y.; Lysogorskaya, E.N.; Oksenoit, E.S.; Vinokurov, K.S.; Zhuzhikov, D.P.; Bulushova, N.V.; Zalunin, I.A.; Dunaevsky, Y.E.; Belozersky, M.A.; et al. Localization of post-proline cleaving peptidases in Tenebrio molitor larval midgut. Biochimie 2008, 90, 508–514. [Google Scholar] [CrossRef]
  30. Alfalah, M.; Jacob, R.; Naim, H.Y. Intestinal dipeptidyl peptidase IV is efficiently sorted to the apical membrane through the concerted action of N- and O-glycans as well as association with lipid microdomains. J. Biol. Chem. 2002, 277, 10683–10690. [Google Scholar] [CrossRef] [Green Version]
  31. Ghersi, G.; Zhao, Q.; Salamone, M.; Yeh, Y.; Zucker, S.; Chen, W.T. The protease complex consisting of dipeptidyl peptidase IV and seprase plays a role in the migration and invasion of human endothelial cells in collagenous matrices. Cancer Res. 2006, 66, 4652–4661. [Google Scholar] [CrossRef]
  32. Abbott, C.A.; Yu, D.M.; Woollatt, E.; Sutherland, G.R.; McCaughan, G.W.; Gorrell, M.D. Cloning, expression and chromosomal localization of a novel human dipeptidyl peptidase (DPP) IV homolog, DPP8. Eur. J. Biochem. 2000, 267, 6140–6150. [Google Scholar] [CrossRef]
  33. Ajami, K.; Abbott, C.A.; McCaughan, G.W.; Gorrell, M.D. Dipeptidyl peptidase 9 has two forms, a broad tissue distribution, cytoplasmic localization and DPIV-like peptidase activity. Biochim. Biophys. Acta 2004, 1679, 18–28. [Google Scholar] [CrossRef] [PubMed]
  34. Yu, D.M.; Ajami, K.; Gall, M.G.; Park, J.; Lee, C.S.; Evans, K.A.; McLaughlin, E.A.; Pitman, M.R.; Abbott, C.A.; McCaughan, G.W.; et al. The in vivo expression of dipeptidyl peptidases 8 and 9. J. Histochem. Cytochem. 2009, 57, 1025–1040. [Google Scholar] [CrossRef] [Green Version]
  35. Allen, M.; Heinzmann, A.; Noguchi, E.; Abecasis, G.; Broxholme, J.; Ponting, C.P.; Bhattacharyya, S.; Tinsley, J.; Zhang, Y.; Holt, R.; et al. Positional cloning of a novel gene influencing asthma from chromosome 2q14. Nat. Genet. 2003, 35, 258–263. [Google Scholar] [CrossRef]
  36. Kakimoto, T.; Oshima, G.; Yeh, H.S.; Erdös, E.G. Purification of lysosomal prolycarboxypeptidase angiotensinase C. Biochim. Biophys. Acta 1973, 302, 178–182. [Google Scholar] [CrossRef]
  37. Odya, C.E.; Marinkovic, D.V.; Hammon, K.J.; Stewart, T.A.; Erdös, E.G. Purification and properties of prolylcarboxypeptidase (angiotensinase C) from human kidney. J. Biol. Chem. 1978, 253, 5927–5931. [Google Scholar] [CrossRef]
  38. Tan, F.; Morris, P.W.; Skidgel, R.A.; Erdös, E.G. Sequencing and cloning of human prolylcarboxypeptidase (angiotensinase C). Similarity to both serine carboxypeptidase and prolylendopeptidase families. J. Biol. Chem. 1993, 268, 16631–16638. [Google Scholar] [CrossRef]
  39. Chiravuri, M.; Agarraberes, F.; Mathieu, S.L.; Lee, H.; Huber, B.T. Vesicular localization and characterization of a novel post-proline-cleaving aminodipeptidase, quiescent cell proline dipeptidase. J. Immunol. 2000, 165, 5695–5702. [Google Scholar] [CrossRef] [Green Version]
  40. Maes, M.B.; Lambeir, A.M.; Gilany, K.; Senten, K.; Van der Veken, P.; Leiting, B.; Augustyns, K.; Scharpé, S.; De Meester, I. Kinetic investigation of human dipeptidyl peptidase II (DPPII)-mediated hydrolysis of dipeptide derivatives and its identification as quiescent cell proline dipeptidase (QPP)/dipeptidyl peptidase 7 (DPP7). Biochem. J. 2005, 386 (Pt 2), 315–324. [Google Scholar] [CrossRef] [Green Version]
  41. Hyde, R.J.; Hooper, N.M.; Turner, A.J. Molecular cloning and expression in COS-1 cells of pig kidney aminopeptidase P. Biochem. J. 1996, 319 (Pt 1), 197–201. [Google Scholar] [CrossRef]
  42. Venema, R.C.; Ju, H.; Zou, R.; Venema, V.J.; Ryan, J.W. Cloning and tissue distribution of human membrane-bound aminopeptidase P. Biochim. Biophys. Acta 1997, 1354, 45–48. [Google Scholar] [CrossRef] [PubMed]
  43. Czirják, G.; Burkhart, W.A.; Moyer, M.B.; Antal, J.; Shears, S.B.; Enyedi, P. Cloning and functional expression of the cytoplasmic form of rat aminopeptidase P. Biochim. Biophys. Acta 1999, 1444, 326–336. [Google Scholar] [CrossRef] [PubMed]
  44. O’Toole, J.F.; Liu, Y.; Davis, E.E.; Westlake, C.J.; Attanasio, M.; Otto, E.A.; Seelow, D.; Nurnberg, G.; Becker, C.; Nuutinen, M.; et al. Individuals with mutations in XPNPEP3, which encodes a mitochondrial protein, develop a nephronophthisis-like nephropathy. J. Clin. Investig. 2010, 120, 791–802. [Google Scholar] [CrossRef] [PubMed]
  45. Kohno, H.; Kanno, T. Properties and activities of aminopeptidases in normal and mitogen-stimulated human lymphocytes. Biochem. J. 1985, 226, 59–65. [Google Scholar] [CrossRef] [Green Version]
  46. Teufel, M.; Saudek, V.; Ledig, J.P.; Bernhardt, A.; Boularand, S.; Carreau, A.; Cairns, N.J.; Carter, C.; Cowley, D.J.; Duverger, D.; et al. Sequence identification and characterization of human carnosinase and a closely related non-specific dipeptidase. J. Biol. Chem. 2003, 278, 6521–6531. [Google Scholar] [CrossRef] [Green Version]
  47. Durinx, C.; Lambeir, A.M.; Bosmans, E.; Falmagne, J.B.; Berghmans, R.; Haemers, A.; Scharpé, S.; De Meester, I. Molecular characterization of dipeptidyl peptidase activity in serum: Soluble CD26/dipeptidyl peptidase IV is responsible for the release of X-Pro dipeptides. Eur. J. Biochem. 2000, 267, 5608–5613. [Google Scholar] [CrossRef]
  48. Tereshchenkova, V.F.; Goptar, I.A.; Kulemzina, I.A.; Zhuzhikov, D.P.; Serebryakova, M.V.; Belozersky, M.A.; Dunaevsky, Y.E.; Oppert, B.; Filippova, I.Y.; Elpidina, E.N. Dipeptidyl peptidase 4—An important digestive peptidase in Tenebrio molitor larvae. Insect Biochem. Mol. Biol. 2016, 76, 38–48. [Google Scholar] [CrossRef] [Green Version]
  49. Tereshchenkova, V.F.; Goptar, I.A.; Zhuzhikov, D.P.; Belozersky, M.A.; Dunaevsky, Y.E.; Oppert, B.; Filippova, I.Y.; Elpidina, E.N. Prolidase is a critical enzyme for complete gliadin digestion in Tenebrio molitor larvae. Arch. Insect Biochem. Physiol. 2017, 95, e21395. [Google Scholar] [CrossRef]
  50. Shan, L.; Qiao, S.W.; Arentz-Hansen, H.; Molberg, Ø.; Gray, G.M.; Sollid, L.M.; Khosla, C. Identification and analysis of multivalent proteolytically resistant peptides from gluten: Implications for celiac sprue. J. Proteome Res. 2005, 4, 1732–1741. [Google Scholar] [CrossRef] [Green Version]
  51. Goptar, I.A.; Semashko, T.A.; Danilenko, S.A.; Lysogorskaya, E.N.; Oksenoit, E.S.; Zhuzhikov, D.P.; Belozersky, M.A.; Dunaevsky, Y.E.; Oppert, B.; Filippova, I.Y.; et al. Cysteine digestive peptidases function as post-glutamine cleaving enzymes in tenebrionid stored-product pests. Comp. Biochem. Physiol. B Biochem. Mol. Biol. 2012, 161, 148–154. [Google Scholar] [CrossRef]
  52. Martynov, A.G.; Elpidina, E.N.; Perkin, L.; Oppert, B. Functional analysis of C1 family cysteine peptidases in the larval gut of Тenebrio molitor and Tribolium castaneum. BMC Genom. 2015, 16, 75. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Filippova, I.Y.; Dvoryakova, E.A.; Sokolenko, N.I.; Simonyan, T.R.; Tereshchenkova, V.F.; Zhiganov, N.I.; Dunaevsky, Y.E.; Belozersky, M.A.; Oppert, B.; Elpidina, E.N. New Glutamine-Containing Substrates for the Assay of Cysteine Peptidases From the C1 Papain Family. Front. Mol. Biosci. 2020, 7, 578758. [Google Scholar] [CrossRef]
  54. Pils, B.; Schultz, J. Inactive enzyme-homologues find new function in regulatory processes. J. Mol. Biol. 2004, 340, 399–404. [Google Scholar] [CrossRef] [PubMed]
  55. Zhiganov, N.I.; Tereshchenkova, V.F.; Oppert, B.; Filippova, I.Y.; Belyaeva, N.V.; Dunaevsky, Y.E.; Belozersky, M.A.; Elpidina, E.N. The dataset of predicted trypsin serine peptidases and their inactive homologs in Tenebrio molitor transcriptomes. Data Brief 2021, 38, 107301. [Google Scholar] [CrossRef] [PubMed]
  56. Terra, W.R.; Ferreira, C.; Bastos, F. Phylogenetic consideration of insect digestion. Disaccharidases and the spatial organization of digestion in the Tenebrio molitor larvae. Insect Biochem. 1985, 15, 443–449. [Google Scholar] [CrossRef]
  57. Vinokurov, K.S.; Elpidina, E.N.; Oppert, B.; Prabhakar, S.; Zhuzhikov, D.P.; Dunaevsky, Y.E.; Belozersky, M.A. Diversity of digestive proteinases in Tenebrio molitor (Coleoptera: Tenebrionidae) larvae. Comp. Biochem. Physiol. B Biochem. Mol. Biol. 2006, 145, 126–137. [Google Scholar] [CrossRef]
  58. Elpidina, E.N.; Goptar, I.A. Digestive peptidases in Tenebrio molitor and possibility of use to treat celiac disease. Entomol. Res. 2007, 37, 139–147. [Google Scholar] [CrossRef]
  59. Vinokurov, K.S.; Elpidina, E.N.; Oppert, B.; Prabhakar, S.; Zhuzhikov, D.P.; Dunaevsky, Y.E.; Belozersky, M.A. Fractionation of digestive proteinases from Tenebrio molitor (Coleoptera: Tenebrionidae) larvae and role in protein digestion. Comp. Biochem. Physiol. B Biochem. Mol. Biol. 2006, 145, 138–146. [Google Scholar] [CrossRef] [PubMed]
  60. Silk, D.B. Digestion and absorption of dietary protein in man. Proc. Nutr. Soc. 1980, 39, 61–70. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  61. Dunaevsky, Y.E.; Tereshchenkova, V.F.; Belozersky, M.A.; Filippova, I.Y.; Oppert, B.; Elpidina, E.N. Effective Degradation of Gluten and Its Fragments by Gluten-Specific Peptidases: A Review on Application for the Treatment of Patients with Gluten Sensitivity. Pharmaceutics 2021, 13, 1603. [Google Scholar] [CrossRef]
  62. Shan, L.; Marti, T.; Sollid, L.M.; Gray, G.M.; Khosla, C. Comparative biochemical analysis of three bacterial prolyl endopeptidases: Implications for coeliac sprue. Biochem. J. 2004, 383 (Pt 2), 311–318. [Google Scholar] [CrossRef] [Green Version]
  63. ClinicalTrials.gov. Available online: https://clinicaltrials.gov/ct2/show/NCT00810654 (accessed on 30 November 2022).
  64. Gerez, C.L.; Font de Valdez, G.; Rollán, G.C. Functionality of lactic acid bacteria peptidase activities in the hydrolysis of gliadin-like fragments. Lett. Appl. Microbiol. 2008, 47, 427–432. [Google Scholar] [CrossRef] [PubMed]
  65. Gass, J.; Vora, H.; Bethune, M.T.; Gray, G.M.; Khosla, C. Effect of barley endoprotease EP-B2 on gluten digestion in the intact rat. J. Pharmacol. Exp. Ther. 2006, 318, 1178–1186. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  66. Gass, J.; Bethune, M.T.; Siegel, M.; Spencer, A.; Khosla, C. Combination enzyme therapy for gastric digestion of dietary gluten in patients with celiac sprue. Gastroenterology 2007, 133, 472–480. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  67. Tye-Din, J.A.; Anderson, R.P.; Ffrench, R.A.; Brown, G.J.; Hodsman, P.; Siegel, M.; Botwick, W.; Shreeniwas, R. The effects of ALV003 pre-digestion of gluten on immune response and symptoms in celiac disease in vivo. Clin. Immunol. 2010, 134, 289–295. [Google Scholar] [CrossRef]
  68. Lähdeaho, M.L.; Kaukinen, K.; Laurila, K.; Vuotikka, P.; Koivurova, O.P.; Kärjä-Lahdensuu, T.; Marcantonio, A.; Adelman, D.C.; Mäki, M. Glutenase ALV003 attenuates gluten-induced mucosal injury in patients with celiac disease. Gastroenterology 2014, 146, 1649–1658. [Google Scholar] [CrossRef] [Green Version]
  69. Savvateeva, L.V.; Gorokhovets, N.V.; Makarov, V.A.; Serebryakova, M.V.; Solovyev, A.G.; Morozov, S.Y.; Reddy, V.P.; Zernii, E.Y.; Zamyatnin, A.A., Jr.; Aliev, G. Glutenase and collagenase activities of wheat cysteine protease Triticain-α: Feasibility for enzymatic therapy assays. Int. J. Biochem. Cell Biol. 2015, 62, 115–124. [Google Scholar] [CrossRef]
  70. Shan, L.; Molberg, Ø.; Parrot, I.; Hausch, F.; Filiz, F.; Gray, G.M.; Sollid, L.M.; Khosla, C. Structural basis for gluten intolerance in celiac sprue. Science 2002, 297, 2275–2279. [Google Scholar] [CrossRef] [Green Version]
  71. Prabhakar, S.; Chen, M.S.; Elpidina, E.N.; Vinokurov, K.S.; Smith, C.M.; Marshall, J.; Oppert, B. Sequence analysis and molecular characterization of larval midgut cDNA transcripts encoding peptidases from the yellow mealworm, Tenebrio molitor L. Insect Mol. Biol. 2007, 16, 455–468. [Google Scholar] [CrossRef]
  72. Oppert, B.; Dowd, S.E.; Bouffard, P.; Li, L.; Conesa, A.; Lorenzen, M.D.; Toutges, M.; Marshall, J.; Huestis, D.L.; Fabrick, J.; et al. Transcriptome profiling of the intoxication response of Tenebrio molitor larvae to Bacillus thuringiensis Cry3Aa protoxin. PLoS ONE 2012, 7, e34624. [Google Scholar] [CrossRef]
  73. Giardine, B.; Riemer, C.; Hardison, R.C.; Burhans, R.; Elnitski, L.; Shah, P.; Zhang, Y.; Blankenberg, D.; Albert, I.; Taylor, J.; et al. Galaxy: A platform for interactive large-scale genome analysis. Genome Res. 2005, 15, 1451–1455. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  74. Goecks, J.; Nekrutenko, A.; Taylor, J.; Galaxy Team. Galaxy: A comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol. 2010, 11, R86. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  75. UniProt Consortium. UniProt: The universal protein knowledgebase in 2021. Nucleic Acids Res. 2021, 49, D480–D489. [Google Scholar] [CrossRef] [PubMed]
  76. UniProtKB. Available online: https://www.uniprot.org (accessed on 2 November 2022).
  77. Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410. [Google Scholar] [CrossRef]
  78. Basic Local Alignment Search Tool. Available online: https://blast.ncbi.nlm.nih.gov/Blast.cgi?PROGRAM=blastx&PAGE_TYPE=BlastSearch&LINK_LOC=blasthome (accessed on 2 November 2022).
  79. Wagner, R.A.; Fischer, M.J. The string-to-string correction problem. JACM 1974, 21, 168–173. [Google Scholar] [CrossRef]
  80. Katoh, K.; Kuma, K.; Toh, H.; Miyata, T. MAFFT version 5: Improvement in accuracy of multiple sequence alignment. Nucleic Acids Res. 2005, 33, 511–518. [Google Scholar] [CrossRef]
  81. Li, W.; Cowley, A.; Uludag, M.; Gur, T.; McWilliam, H.; Squizzato, S.; Park, Y.M.; Buso, N.; Lopez, R. The EMBL-EBI bioinformatics web and programmatic tools framework. Nucleic Acids Res. 2015, 43, W580–W584. [Google Scholar] [CrossRef] [Green Version]
  82. Clustal Omega. Available online: https://www.ebi.ac.uk/Tools/msa/clustalo/ (accessed on 2 November 2022).
  83. NCBI Resource Coordinators. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2018, 46, D8–D13. [Google Scholar] [CrossRef] [Green Version]
  84. The National Center for Biotechnology Information (NCBI). Available online: https://www.ncbi.nlm.nih.gov (accessed on 2 November 2022).
  85. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef]
  86. Molecular Evolutionary Genetics Analysis (MEGA). Available online: https://www.megasoftware.net/home (accessed on 23 November 2022).
  87. Mortazavi, A.; Williams, B.A.; McCue, K.; Schaeffer, L.; Wold, B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods 2008, 5, 621–628. [Google Scholar] [CrossRef]
  88. Nicholas, K.B.; Nicholas, H.B., Jr.; Deerfield, D.W., II. GeneDoc: Analysis and Visualization of Genetic Variation. EMBNEW.NEWS 1997, 4, 14. [Google Scholar]
  89. GeneDoc. Available online: https://genedoc.software.informer.com (accessed on 2 November 2022).
  90. Teufel, F.; Almagro Armenteros, J.J.; Johansen, A.R.; Gíslason, M.H.; Pihl, S.I.; Tsirigos, K.D.; Winther, O.; Brunak, S.; von Heijne, G.; Nielsen, H. SignalP 6.0 predicts all five types of signal peptides using protein language models. Nat. Biotechnol. 2022, 40, 1023–1025. [Google Scholar] [CrossRef] [PubMed]
  91. SignalP-6.0. Available online: https://biolib.com/DTU/SignalP-6/ (accessed on 23 November 2022).
  92. Almagro Armenteros, J.J.; Salvatore, M.; Emanuelsson, O.; Winther, O.; von Heijne, G.; Elofsson, A.; Nielsen, H. Detecting sequence signals in targeting peptides using deep learning. Life Sci. Alliance 2019, 2, e201900429. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  93. TargetP-2.0. Available online: https://services.healthtech.dtu.dk/service.php?TargetP-2.0 (accessed on 23 November 2022).
  94. Hallgren, J.; Tsirigos, K.D.; Pedersen, M.D.; Almagro Armenteros, J.J.; Marcatili, P.; Nielsen, H.; Krogh, A.; Winther, O. DeepTMHMM predicts alpha and beta transmembrane proteins using deep neural networks. bioRxiv, 2022; in press. [Google Scholar] [CrossRef]
  95. DeepTMHMM. Available online: https://dtu.biolib.com/DeepTMHMM (accessed on 23 November 2022).
  96. Compute pI/Mw Tool. Available online: http://web.expasy.org/compute_pi/ (accessed on 2 November 2022).
  97. Erlanger, B.F.; Kokowsky, N.; Cohen, W. The preparation and properties of two new chromogenic substrates of trypsin. Arch. Biochem. Biophys. 1961, 95, 271–278. [Google Scholar] [CrossRef]
  98. Pennington, M.W.; Dunn, B.M. (Eds.) Peptide Synthesis Protocols; Humana Press: Totowa, NJ, USA, 1995; p. 321. [Google Scholar] [CrossRef]
  99. Goptar, I.A.; Shagin, D.A.; Shagina, I.A.; Mudrik, E.S.; Smirnova, Y.A.; Zhuzhikov, D.P.; Belozersky, M.A.; Dunaevsky, Y.E.; Oppert, B.; Filippova, I.Y.; et al. A digestive prolyl carboxypeptidase in Tenebrio molitor larvae. Insect Biochem. Mol. Biol. 2013, 43, 501–509. [Google Scholar] [CrossRef]
  100. Fields, R. The rapid determination of amino groups with TNBS. Methods Enzymol. 1972, 25, 464–468. [Google Scholar] [CrossRef]
  101. Frugoni, J.A.C. Tampone universale di Britton e Robinson a forza ionica constante. Gazz. Chem. Ital. 1957, 87, 403–407. [Google Scholar]
Figure 1. Phylogenetic tree of T. castaneum, T. molitor, and H. sapiens PSPs and PPCPbs constructed using the maximum likelihood method.
Figure 1. Phylogenetic tree of T. castaneum, T. molitor, and H. sapiens PSPs and PPCPbs constructed using the maximum likelihood method.
Ijms 24 00579 g001
Figure 2. Gut mRNA expression levels of T. castaneum (light grey columns) and T. molitor (dark grey columns) active PSPs. * This calculation involved only active PSPs: POP, DPP 4, DPP 8, PRCP, APP1, APP2, APP3, and XPD. DPP 10 was not included because it is an inactive DPP 4 homolog and does not possess peptidase activity. LAP and CND were not included because cleavage of post-proline bonds is not their main specificity.
Figure 2. Gut mRNA expression levels of T. castaneum (light grey columns) and T. molitor (dark grey columns) active PSPs. * This calculation involved only active PSPs: POP, DPP 4, DPP 8, PRCP, APP1, APP2, APP3, and XPD. DPP 10 was not included because it is an inactive DPP 4 homolog and does not possess peptidase activity. LAP and CND were not included because cleavage of post-proline bonds is not their main specificity.
Ijms 24 00579 g002
Figure 3. Proteolysis of gliadins by six samples of T. molitor digestive peptidase preparations: 1. PRCP (white squares), 2. DPP 4 (black squares), 3. CC (white triangles), 4. CC + PRCP mixture (black triangles), 5. CC + DPP 4 mixture (white circles), and 6. CC + DPP 4 + PRCP mixture (black circles) (description of the curves from bottom to top). Proteolysis of gliadins was detected by an increase in free amino groups using 2,4,6-trinitrobenzene sulfonic acid (TNBS). The error bars represent standard deviation.
Figure 3. Proteolysis of gliadins by six samples of T. molitor digestive peptidase preparations: 1. PRCP (white squares), 2. DPP 4 (black squares), 3. CC (white triangles), 4. CC + PRCP mixture (black triangles), 5. CC + DPP 4 mixture (white circles), and 6. CC + DPP 4 + PRCP mixture (black circles) (description of the curves from bottom to top). Proteolysis of gliadins was detected by an increase in free amino groups using 2,4,6-trinitrobenzene sulfonic acid (TNBS). The error bars represent standard deviation.
Ijms 24 00579 g003
Figure 4. Hypothetical scheme for the complete hydrolysis of glutamine- and proline-rich 26-mer immunogenic gliadin fragments by T. molitor and T. castaneum digestive CC and PSPs.
Figure 4. Hypothetical scheme for the complete hydrolysis of glutamine- and proline-rich 26-mer immunogenic gliadin fragments by T. molitor and T. castaneum digestive CC and PSPs.
Ijms 24 00579 g004
Table 1. Classification of PSPs found in the T. castaneum genome and gut transcriptome and the T. molitor gut transcriptome compared with human sequences.
Table 1. Classification of PSPs found in the T. castaneum genome and gut transcriptome and the T. molitor gut transcriptome compared with human sequences.
MEROPS
Classification
T. castaneumT. molitorH. sapiens
PSP NameUniProt IDNCBI IDPSP NameUniProt IDNCBI IDPSP NameUniProt IDNCBI ID
S9APOP_Tc *D6W775 *XP_015833515 *POP_TmA0A4D5SUD3QAY29071POP_HsP48147NP_002717
S9BDPP4.1_TcD6WJ57XP_975053DPP4.1_TmA0A4D5SUH2QAY29072DPP4_HsP27487NP_001926
DPP4.2_TcD6WL31XP_972016DPP4.2_TmA0A4D5SUB4QAY29073
DPP8_TcD6WD50XP_971949DPP8_TmA0A4D5SUP4QAY29074DPP8_HsQ6V1X1NP_932064
DPP9_HsQ86TI2NP_631898
FAP_HsQ12884NP_004451
S9X DPP6_HsP42658NP_570629
DPP10.1_TcD6WQR4XP_015837625DPP10.1_TmA0A4D5SUP7QAY29075DPP10_HsQ8N608NP_065919
DPP10.2_TcD6WJ59XP_008194406DPP10.2_TmA0A4D5SUC4QAY29076
DPP10.3_Tc *D6WS52 *XP_008196030 *
S28PRCP.1_TcD6W8E7XP_971305PRCP_TmA0A4D5SUH1QAY29077PRCP_HsP42785NP_005031
PRCP.2_TcD6WVZ7XP_972807
DPP2_HsQ9UHL4NP_037511
M24APP1.1_TcD6WLY7XP_974698APP1_TmA0A4D5SVH2QAY29078APP1_HsQ9NQW7NP_001161076
APP1.2_TcD6X4C7EEZ97287
APP2_TmA0A4D5SUC7QAY29079APP2_HsO43895NP_003390
APP3_TcD6X4Q9XP_008199099APP3_TmA0A4D5SUE5QAY29080APP3_HsQ9NQH7NP_071381
XPD_TcA0A139WNR5XP_971576XPD_TmA0A4D5SUD6QAY29081XPD_HsP12955NP_000276
M17LAP.1_TcD6WAL0EEZ97983LAP_TmA0A4D5SUH6QAY29082LAP_HsP28838NP_056991
LAP.2_Tc *D6WAL5 *XP_974656 *
LAP.3_Tc *D6WIH3 *XP_972664 *
M20CND_TcD6WGK3XP_971451CND_TmA0A4D5SUC3QAY29083CND_HsQ96KP4NP_060705.2
* These PSP sequences were found only in the T. castaneum genome and were absent in the gut transcriptome. PSP name: the name before the dot means the type of peptidase, the number after the dot can occur if there are several variants of sequences of one type of peptidase, and the abbreviation after the underscore refers to the organism (Tc—T. castaneum, Tm—T. molitor, Hs—H. sapiens).
Table 2. General characteristics of the serine PSP sequences from the S9 and S28 families.
Table 2. General characteristics of the serine PSP sequences from the S9 and S28 families.
PSPSequence NameActive Site
Residues
Residues of S1
Binding Subsite
Residues of S2 Binding SubsiteResidues of S3 Binding Subsite
SerAspHis
POPPOP_HsS554D641H680W595, F476, V644, V580, Y599, Y473R643F173, M235, C255, I591, A594
POP_TcSDHW, F, V, V, Y, YRF, R, C, I, A
POP_TmSDHW, F, V, V, Y, YRF, R, C, I, A
DPP 4DPP4_HsS630D708H740Y631, V656, W659, Y662, Y666, V711N710, R125, E205, E206-
DPP4.1_TcSDHY, V, L, Y, Y, VN, R, E, E-
DPP4.2_TcSDHY, V, W, Y, Y, VN, R, E, E-
DPP4.1_TmSDHY, V, F, Y, Y, VN, R, E, E-
DPP4.2_TmSDHY, V, W, Y, Y, VN, R, E, E-
DPP 8DPP8_HsSDHY, V, W, Y, Y, VN, -, E, E-
DPP8_TcSDHY, V, W, Y, Y, VN, -, E, E-
DPP8_TmSDHY, V, W, Y, Y, VN, -, E, E-
DPP 9DPP9_HsSDHY, V, W, Y, Y, VN, -, E, E-
FAPFAP_HsSDHY, V, W, Y, Y, VN, R, E, E-
DPP 6DPP6_HsDDHY, I, F, Y, F, I,K, Q, E, E-
DPP 10DPP10_HsGDHY, I, L, Y, F, VK, H, E, E-
DPP10.1_TcGDHY, I, W, H, F, VS, K, N, E-
DPP10.2_TcGDHY, I, W, Y, F, AT, R, E, D-
DPP10.3_TcSDHY, V, W, Y, Y, VN, R, E, E-
DPP10.1_TmGDHY, I, W, Y, F, VS, K, N, E-
DPP10.2_TmGDHY, I, W, Y, F, AT, R, E, D-
PRCPPRCP_HsS179D430H455M183, W359, M369, W432--
PRCP.1_TcSDHM, W, M, W--
PRCP.2_TcSDHM, W, M, W--
PRCP_TmSDHM, W, M, W--
DPP 2DPP2_HsSDHM, W, L, W
Table 3. General characteristics of the metallodependent PSP sequences from the M24B subfamily.
Table 3. General characteristics of the metallodependent PSP sequences from the M24B subfamily.
PSPSequence NameResidues of Mn2+-Binding Sites
II/IIIIIII/II
APP1APP1_HsD415D426H489E523E537
APP1.1_TcDDHEE
APP1.2_TcDDHEE
APP1_TmDDHEE
APP2APP2_HsDDHEE
APP2_TmDDHEE
APP3APP3_HsDDHEE
APP3_TcDDHEE
APP3_TmDDHEE
XPDXPD_HsDDHEE
XPD_TcDDHEE
XPD_TmDDHEE
Table 4. General characteristics of the LAP and CND sequences from the M17 and M20 families.
Table 4. General characteristics of the LAP and CND sequences from the M17 and M20 families.
PPCPbsSequence NameResidues of Zn2+-Binding Sites
III/II IIII/II
LAPLAP_HsK282D287K294D305D364E366R368
LAP.1_TcKDKDDER
LAP.2_TcKDKDDER
LAP.3_TcKNKDDAR
LAP_TmKDKDDER
Residues of Mn2+-Binding Sites
II I/II II/II I
CNDCND_HsH99D101D132E166E167D195H228H445
CND_TcHDDEEDHH
CND_TmHDDEEDHH
Table 5. Analysis of the localization domains of PSPs.
Table 5. Analysis of the localization domains of PSPs.
PSPSequence
Name
LengthProtein
Mm (Da)
SignalP-6.0 * TargetP-2.0 *DeepTMHMM ServerExperimental Data
(For Mammalian PSPs Only)
Signal PeptideSignal PeptideMitochondrial
Transit Peptide
TMhelix
POPPOP_Hs71080,700 Cytoplasm [28]
POP_Tc70480,212
POP_Tm70580,116 1–21 (0.90/0.79)
DPP 4DPP4_Hs76688,279 1–22 (0.78/0.40) 7–27Membrane [30,31], soluble [47]
DPP4.1_Tc76787,3331–19 (0.99/0.98)1–19 (0.99/0.73)
DPP4.2_Tc82192,585 34–54
DPP4.1_Tm76886,5761–19 (0.99/0.98)1–19 (0.99/0.78)
DPP4.2_Tm80390,278 34–54
DPP 8DPP8_Hs898103,358 Cytoplasm [26,33]
DPP8_Tc82595,048
DPP8_Tm82594,870
DPP 10DPP10_Hs79690,888 35–55Extracellular membrane [26,35]
DPP10.1_Tc83596,053 37–58
DPP10.2_Tc84595,598 46–66
DPP10.3_Tc85897,129 25–45
DPP10.1_Tm84396,808 45–66
DPP10.2_Tm88299,854 79–100
PRCPPRCP_Hs49655,8001–21 (0.99/0.98)1–21 (0.99/0.48) Lysosome [36,37,38]
PRCP.1_Tc48855,1311–16 (0.99/0.98)1–16 (0.92/0.88)
PRCP.2_Tc47853,8431–17 (0.99/0.98)1–17 (0.99/0.96)
PRCP_Tm48855,1051–15 (0.99/0.98)1–15 (0.99/0.87)
APP 1APP1_Hs62369,918 Cytoplasm [43]
APP1.1_Tc61569,050
APP1.2_Tc69079,2731–13 (0.99/0.98)1–14 (0.89/0.37)
APP1_Tmpartialpartial
APP 2APP2_Hs67475,6251–21 (0.99/0.98)1–23 (0.99/0.54) Plasma membrane [42]
APP2_Tm70981,0411–25 (0.83/0.81)1–25 (0.99/0.97)
APP 3APP3_Hs50757,034 1–31 (0.89/0.72) Mitochondrion [44]
APP3_Tc50357,581 1–22 (0.58/0.43)
APP3_Tm50457,356 –(0.32)
XPDXPD_Hs49354,548 -
XPD_Tc48754,319
XPD_Tm48954,168
LAPLAP_Hs51956,166 1–27 (0.95/0.60) Microsomes, cytoplasm [45]
LAP.1_Tc44748,087
LAP.2_Tc53058,105 1–26 (0.94/0.71)
LAP.3_Tc53458,852 1–21 (0.93/0.81)
LAP_Tm51355,124 1–24 (0.91/0.31)
CNDCND_Hs47552,878 Cytoplasm [46]
CND_Tc47853,583
CND_Tm47853,794
* After the cleavage site position of the signal peptide or mitochondrial transit peptide in brackets, the number before the slash is the likelihood of the specified localization domain appearing and the number after the slash is the probability of cleavage at the indicated position.
Table 6. Localization of T. molitor larval midgut peptidases *.
Table 6. Localization of T. molitor larval midgut peptidases *.
PSPs
DPP 4PRCPPOPAPPXPD
Part of the midgutPeptidase activity, % from the total activity (AM + PM)
AM6073495237
PM4027514863
Tissue distribution, %
Soluble contents8110041829
Insoluble contents3--54
Soluble tissue fraction70945857
Insoluble tissue fraction921910
* The relative content of each peptidase in different fractions is presented.
Table 7. Gut mRNA expression levels of T. castaneum and T. molitor PSPs and PPCPbs.
Table 7. Gut mRNA expression levels of T. castaneum and T. molitor PSPs and PPCPbs.
PeptidaseT. castaneumT. molitor
Sequence NameExpression (RPKM)Percentage from the Sum of Expressions of All Active PSPs *Sequence NameExpression (RPKM)Percentage from the Sum of Expressions of All Active PSPs *
POPPOP_Tc-0POP_Tm254.2
DPP 4DPP4.1_Tc168825.1DPP4.1_Tm14724.6
DPP4.2_Tc5337.9DPP4.2_Tm10217.1
DPP 8DPP8_Tc751.1DPP8_Tm203.3
DPP 10DPP10.1_Tc8-DPP10.1_Tm370-
DPP10.2_Tc5-DPP10.2_Tm7-
PRCPPRCP.1_Tc270.4PRCP_Tm538.9
PRCP.2_Tc67510.0
APP1APP1.1_Tc242035.9APP1_Tm8814.7
APP1.2_Tc80.1
APP2 APP2_Tm30.5
APP3APP3_Tc580.9APP3_Tm122.0
XPDXPD_Tc125418.6XPD_Tm14824.7
LAPLAP.1_Tc86-LAP_Tm15-
CNDCND_Tc2462-CND_Tm625-
* This calculation involved only active PSPs: POP, DPP 4, DPP 8, PRCP, APP1, APP2, APP3, and XPD. DPP 10 was not included because it is an inactive DPP 4 homolog and does not possess peptidase activity. LAP and CND were not included because cleavage of post-proline bonds is not their main specificity.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Tereshchenkova, V.F.; Filippova, I.Y.; Goptar, I.A.; Dunaevsky, Y.E.; Belozersky, M.A.; Elpidina, E.N. Complex of Proline-Specific Peptidases in the Genome and Gut Transcriptomes of Tenebrionidae Insects and Their Role in Gliadin Hydrolysis. Int. J. Mol. Sci. 2023, 24, 579. https://doi.org/10.3390/ijms24010579

AMA Style

Tereshchenkova VF, Filippova IY, Goptar IA, Dunaevsky YE, Belozersky MA, Elpidina EN. Complex of Proline-Specific Peptidases in the Genome and Gut Transcriptomes of Tenebrionidae Insects and Their Role in Gliadin Hydrolysis. International Journal of Molecular Sciences. 2023; 24(1):579. https://doi.org/10.3390/ijms24010579

Chicago/Turabian Style

Tereshchenkova, Valeriia F., Irina Y. Filippova, Irina A. Goptar, Yakov E. Dunaevsky, Mikhail A. Belozersky, and Elena N. Elpidina. 2023. "Complex of Proline-Specific Peptidases in the Genome and Gut Transcriptomes of Tenebrionidae Insects and Their Role in Gliadin Hydrolysis" International Journal of Molecular Sciences 24, no. 1: 579. https://doi.org/10.3390/ijms24010579

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop