Next Article in Journal
Highly Stable CsPbI3 Perovskite Quantum Dots Enabled by Single SiO2 Coating toward Down-Conversion Light-Emitting Diodes
Next Article in Special Issue
Correlation Dependence between Hydrophobicity of Modified Bitumen and Water Saturation of Asphalt Concrete
Previous Article in Journal
Comparison of Land Cover Categorical Data Stored in OSM and Authoritative Topographic Data
Previous Article in Special Issue
Validation of Diesel Fraction Content in Heavy Oils Measured by High Temperature Simulated Distillation and Physical Vacuum Distillation by Performance of Commercial Distillation Test and Process Simulation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Mass–Energy Profiles Obtained by Quantum Chemical Computing Applied in Mass Spectrometry: A Case Study with Identification of a Group of Acetalized Monosaccharide Isomers

1
Department of Biochemistry and Pharmacology, “Victor Babes” University of Medicine and Pharmacy, RO-300041 Timisoara, Romania
2
Department of Dental Medicine, Faculty of Dentistry, “Vasile Goldiș” Western University, RO-310414 Arad, Romania
3
Department of Technical and Natural Sciences, Faculty of Food Engineering, Tourism and Environmental Protection, “Aurel Vlaicu” University, RO-310330 Arad, Romania
4
Department of Functional Sciences, “Victor Babes” University of Medicine and Pharmacy, RO-300041 Timisoara, Romania
5
Department of Pharmaceutical Sciences, Faculty of Pharmacy, “Vasile Goldiș” Western University, RO-310414 Arad, Romania
6
National Institute of Research & Development for Electrochemistry and Condensed Matter, RO-300569 Timisoara, Romania
*
Author to whom correspondence should be addressed.
Appl. Sci. 2023, 13(13), 7530; https://doi.org/10.3390/app13137530
Submission received: 24 May 2023 / Revised: 19 June 2023 / Accepted: 23 June 2023 / Published: 26 June 2023
(This article belongs to the Special Issue Validation and Measurement in Analytical Chemistry: Practical Aspects)

Abstract

:
Accurate modeling of small molecules substantially reduces the logistical effort and time consumption to discover and then obtain chemicals with various applications. Molecular stereochemistry is fundamentally involved in the intermolecular interactions that give rise to biological activity. Establishing the configuration of the asymmetric carbon in diastereomers can be decisive in drug design. In the presented analytical technique, on the basis of quantitative structure–fragmentation relationship (QSFR), mass–energy profiles obtained by electron ionization mass spectrometry (EI-MS) for analytes are used, along with some profiles for candidate structures calculated by quantum chemical (QC) methods. Our paper establishes the analytical conditions that lead to the best matching scores of such profiles corresponding to the actual structures for some isomers of acetalized monosaccharides. The optimization was achieved by group validation of five analytes, using four independent variables: the QC method, the descriptor of calculated energy, the impact energy of electrons, and the descriptor of experimental energy. The true structures were obtained using experimental profiles obtained at low electronic impact energies, and profiles were calculated using the DFT (B3LYP/6-31G) and RM1 QC methods. The double quantification of the ionic mass and the energy that generates it, for only a few primary ions of the mass spectrum, even allows the differentiation of acetalized diastereomers.

1. Introduction

Accurate modeling of small molecules is a useful starting point in stereochemical structure identification. This approach substantially reduces the logistical and time-consuming effort to discover and then obtain chemicals with various applications. For example, molecular symmetry/asymmetry is essentially involved in the intermolecular interactions that give rise to biological activity. In turn, identifying the configuration of asymmetric carbon atoms in diastereomers may be decisive in structure-based drug design. Recent studies show such investigations, especially those related to the biological interactions of chiral phytochemicals for the rapid discovery of potential drugs in SARS-CoV-2 [1]. The transfer of the biologically active structure from in silico to in vitro requires precise analytical methods to discriminate it from its isomers. Quantitative structure–fragmentation relationship (QSFR) techniques based on mass spectrometry could be a solution for controlling the bioactive structure along the entire path from synthesis to its use.
The continuous increase in the number of chemical compounds and their isomers in databases makes their recognition and tracing increasingly subtle and difficult to achieve. Chemical analysis methods must cope with the identification of more and more similar structures to eliminate false-positive candidates. In the case of establishing the isomer symmetry, no orthogonal analytical method is superfluous because only the convergence of as many analytical results as possible can provide structural certainty [2,3,4].
Chromatography and mass spectrometry (MS) are orthogonal methods of analysis because they are based on different physicochemical processes. The coupling of chromatography with mass spectrometry has offered the possibility of analyzing complex mixtures of analytes [5]. In addition to the classical use of standards, they use new variants of data processing and interpretation, which have been effectively translated to computational molecular simulation approaches [6,7,8].
In some QSFR analytical techniques using mass–energy profiles in mass spectrometry, the experimental profile resulting from the mass spectrum is compared with the calculated profiles for the candidate structures. Maximum fit for the proper structure should be obtained if the calculated energy descriptors correctly describe the fragmentation [7,8]. For these techniques, the use of elements of the “lock and key” concept of enzymology (E. Fischer) [9] can be intuitive and productive. In this approach, the “lock” would describe the fragmentation mass–energy profile associated with the mass spectrum of the analyte, and the “keys” would be the computed mass–energy profiles for the candidate structures. However, breakdown graphs show that the relativity of ionic intensities in the mass spectrum is in a continuous dynamic depending on the impact energies [10]. Many other factors may influence this relativity. Therefore, it is to be expected that, even for the true structure, the maximum matching of the calculated and experimental mass–energy profiles will be achieved only under certain conditions, or maybe not at all. This situation leads to an analytical impossibility because there are many profiles (keys) that can be calculated for a certain structure; on the other hand, the profile of the lock is not stable, whereby, for a structure, many mass spectra can be obtained, containing different ionic relativities. Another concept in enzymology, “induced fit” (D. Koshland), could provide a solution to this situation by using the matching score of the profiles [9]. Such matching scores can be calculated for all possible key–lock pairs generated by the factors that influence the dynamics of the calculated and experimental profiles. The maximum score should correspond to the actual structure.
Since QSFR techniques have been proven capable of identifying isomers with similar structures and mass spectra [7,8], our study aims to determine if this type of analysis will have good results in the case of some acetalized diastereomers of some monosaccharides. The QSFR technique presented here uses the well-known correlation between enthalpies and Gibbs energies, along with ionic currents (IC), as a tool for mining chemical structures from many calculated mass–energy profiles for the possible structures of the analyte. In the analytical context mentioned above, our study has the following main objectives: (i) the extension of the QSFR analytical technique based on mass–energy profiles to structural isomers and acetalized diastereomers of some monosaccharides; (ii) the simulation of structural analysis for five such isomers using the mining of experimental profiles obtained by mass spectrometry, in the set of quantum chemical (QC)-calculated mass–energy profiles for the candidate structures; (iii) the establishment of the analytical conditions that lead to the best matching scores between the experimental and calculated profiles, such that they indicate the true structures; (iv) the detailed presentation of the chemical structure mining algorithm and the relationship for the calculation of the profile matching score, in order to disseminate this QSFR technique; (v) the validation of QC-calculated mass–energy profiles for such isomers, located in a database obtained in a previous work [11]. The main novelty of the work is that the mining of isomeric structures is conducted in parallel in two databases with mass–energy profiles, by calculating the matching score. One of the databases was quantum chemically calculated in a previous study [11]. The other was obtained in the present paper from the mass spectrometry analysis of the five isomeric chemical standards. Furthermore, the fulfillment of each of these objectives has an original conception and/or route. The optimization of the technique was achieved by group validation of analytes, a variant frequently used in chromatography because it can simultaneously cover a variety of structures [6,8]. The optimization used four independent variables: the quantum chemical (QC) method, the descriptor of calculated energy, the impact energy (IE) of electrons, and the descriptor of experimental energy. If the mass–energy profiles, calculated and experimental, contain real structural information about the analytes, then the best matching score should correspond to its true positive structures.

2. Materials and Methods

For the validation and optimization of this technique based on QSFR, five structures of acetalized monosaccharides were assigned to the five corresponding chemical standards. For a first stage of optimization and validation of this QSFR technique, the mass–energy profiles calculated for the five candidate structures and the mass–energy profiles resulting from the mass spectra of the respective standards were used. In the absence of a QSFR algorithm for mining chemical structures implemented on personal computers, the group distribution of five analytes to five candidate structures seems feasible from a logistical and time/effort-consuming perspective.

2.1. Calculated Mass–Energy Profiles

Each mass–energy profile consists of a series of pairs of m/z values and energies necessary for the formation of the respective ions. Semiempirical methods such as RM1 and PM7, as well as functional B3LYP, are currently used to calculate the activation energy descriptors of molecular fragmentation in EI-MS, since the fragmentation is an intramolecular transformation. Any other QC method can be used to generate these descriptors because the mining algorithm removes any energy profile that does not provide the maximum docking score. The mass–energy profiles calculated in a previous paper with HyperChem 8.0.10, MOPAC2012, and Gaussian 09 software were used (Table A1, Table A2, Table A3 and Table A4) [11,12,13,14]. They correspond to structures 15 from Figure 1. In these profiles, the strings of the calculated and experimental energy descriptors correspond to the ionic masses in ascending order. In this QSFR-based technique, the calculated energy descriptors were enthalpies and Gibbs energies of ion formation (ΔfHion or ΔfGion) and enthalpies and Gibbs energies of fragmentation (ΔfHfrag or ΔfGfrag), respectively. They were calculated with Equations (1) and (2), respectively:
ΔfHfrag = ΔfH (Ii+) + ΣΔfH (Fi) − ΔfH (M),
ΔfGfrag = ΔfG (Ii+) + ΣΔfG (Fi) − ΔfG (M),
where ΔfH (Ii+) is the formation enthalpy of the resulted fragmentation ion, ΣΔfH (Fi) is the sum of the formation enthalpies of accompanying fragments, and ΔfH (M) is the molecular enthalpy of candidate structure. The meanings are similar for ΔfG [11]
For ease of reference, they were also assigned the following acronyms: DAG (diacetone-D-glucose) (1), DAM (diacetone-D-mannose) (2), DAGal (diacetone-D-galactose) (3), DAF (diacetone-D-fructose) (4), and DAS (diacetone-L-sorbose) (5). Through the perspective of the “Lock and Key” concept, the calculated profiles represent the keys. Each of them contains five of the seven “pins” representing the primary ion energies used in the analysis, possibly obtained from the candidate structures 15 in the mass spectrometer (Table 1).
The absence of some energy values from the calculated profiles indicates differences in primary fragmentations between the candidate structures. Thus, the ions at m/z 245, m/z 187, and m/z 127 had calculated energies for all structures (15). This is because each of them could fragment into those ions by losing methyl, hydroxyl and/or one or two molecules of acetone (Table 1, Figure 1). Instead, several cells in Table A1, Table A2, Table A3 and Table A4 contain the letter “a” for “absent” because the candidate structures could not form the respective primary ions. Thus, the ions at m/z 229 and m/z 171 appeared only for structures 35 because they explicitly contain the –CH2OH group, which the other structures (1 and 2) do not contain (Figure 1, Table 1, Table A1, Table A2, Table A3 and Table A4). Similarly, the ions at m/z 159 and m/z 101 appeared only for structures 1 and 2, as they explicitly contain the –C5H9O2 group, which the other structures (35) do not contain (Figure 1, Table 1, Table A1, Table A2, Table A3 and Table A4) [11].
Primary ions result from the cleavage of a single chemical bond from the molecular structure, accompanied or not by the elimination of some small molecules. Their formation requires small fragmentation energies compared to those of ions that are formed after multiple molecular cleavages. If primary ions with the same m/z originate from different molecular structures, they will need different energies for their formation. QC methods were even able to highlight energetic differences, indeed minor, between the mass–energy profiles of some acetalized diastereomers of monosaccharides [11].

2.2. Experimental Mass–Energy Profiles

In the mass spectrometer, each molecule under analysis receives energy that determines the ionization and release of ions, radicals, and smaller molecules (Table 1). The experimental mass–energy profiles are composed of strings of pairs, m/z, and descriptors of the energy consumed in the formation of the respective primary ion.

2.2.1. Reference Standards and Mass Spectrometry

Sugar-based acetals are common substrates in carbohydrate synthesis [15]. De Jongh and Biemann described the relationship between EI mass spectra of O-isopropylidene derivatives and their molecular weight, ring size and position, and the stereochemistry of the monosaccharide precursors, thus highlighting the importance of information obtained by such analyses in the structure elucidation and identification of monosaccharides [16]. Working at low energy levels for the molecule ionization offers the advantage of simplifying the mass spectra while providing for a quick examination and interpretation. However, diagnostic ions, which would allow the determination of the structure based on the mass spectrum alone, have not been identified for each of these isomers [17].
Using the commercially (Aldrich and Fluka) available standards 15 (Figure 1), 5 mg/mL solutions of these in methylene chloride were prepared. For gas chromatography/mass spectrometry (GC–MS) analysis, a Hewlett Packard HP 6890 Series gas chromatograph, coupled with a Hewlett Packard 5973 mass-selective detector (MSD) system, was used in positive ion mode (calibration factor 1.0). An HP-5 MS capillary column (30 m length, 0.25 mm i.d., 0.25 μm film thickness) was chosen for the separation step. The temperature program was set up from 100 °C to 250 °C with 15 °C/min; both the injector and the detector temperatures were set at 280 °C, and He was used as the carrier gas. The injection volume was 1 μL. Ionization energies of 70, 20, 15, 10, and 5 eV were used for the mass spectrometry experiments, with a source temperature of 150 °C. Ionization energy is the most important factor in the optimization of this QSFR technique, because (i) it greatly influences the values of ICs, thus ensuring a large dispersion of the mass–energy profiles, (ii) EI-MS is the biggest consumer of resources and time from the analytic workflow, and must be optimized, and (iii) the 5–70 eV range is accessible to the usual devices with MSD. The MSD was operated in full-scan acquisition mode between m/z 34 and 400 to obtain the total ion chromatogram (TIC). Automatic background subtraction was applied to obtain clean and interference-free mass spectra [17].

2.2.2. Experimental Energy Descriptors

The mass spectrum does not explicitly provide energies. Instead, the energy received by the fragmentable structure, as well as the transformation induced by it, is what establishes the relativity of the IC intensities in the spectrum. High ionic stabilities (describable by enthalpies and Gibbs energies of formation) cause the appearance of large ICs, while high fragmentation activation energies (describable by enthalpies and Gibbs energies of fragmentation) produce small ICs [18]. Although ion intensities are essential in quantitative mass spectrometry, more and more studies use ion intensities to obtain structural information [19,20]. Even in the chemistry of sugars, a recent study used the ratio of ionic intensities for the differentiation of sugar moieties from saponins and to establish their number [21]. The relationship between the energy of the bonds’ cleavage during the fragmentation of the analyte and the IC in the spectrum was revealed by numerous equations. They either have a quasi-linear form (Equation (A1)) or a quasi-exponential one (Equation (A2)) [18,22]. Therefore, IC and ln IC could be useful descriptors of the activation energies for the fragmentations.

2.3. The Matching Score of the Mass–Energy Profiles

The linear correlation function (Pearson) was used to calculate the matching score of the calculated energy descriptors with the experimental ones. This function has already proven its efficiency in the group identification of six positional isomers of tetrachlorinated biphenyls [8]. Thus, the series of energies calculated for the ions resulting from the primary fragmentation of the candidate structure was compared with the series of energy descriptors (IC or ln IC) from the spectrum of the standard. The linear correlation index Ri was, thus, obtained.
The matching of the masses from the calculated and experimental profiles was assessed by the (nmi)/n ratio, where n is the number of primary ions used to filter the five candidate structures (seven ions in this case, Table 1), while mi is the number of mismatches in the sequence of values and, therefore, of empty places in the correlated series of energy descriptors (Table A1, Table A2, Table A3, Table A4 and Table A5). The difference (nmi) represents the number of pairs of energy descriptors that can be formed for the candidate structures and the mass spectra (namely enthalpies and ICs or ln ICs pairs) to establish the score. Since min, it follows that (nmi)/n is a positive subunit number or it has a zero value. In other words, the (nmi)/n ratio quantifies the matching of the analyte’s primary fragmentation with that of the candidate structure. Thus, the absence of a primary ion from the mass spectrum decreases the matching score. The calculation formula from Equation (1) provides the individual scores in percentages (Pi(%)):
Pi(%) = 100 × (1 − Ri·(n − mi)/n)/2.
Since the simulation of the analysis is conducted in a group of five analytes, the score for each variant of structural attribution is the arithmetic mean of five individual scores:
P(%) = (P1 + … + P5)/5.
Equation (3) reduces the matching score if the “key” or “lock” does not contain all the “pins” (energies). Nevertheless, the sequence of candidate structures is not eliminated but only relegated to the list of possible variants. Therefore, Equations (3) and (4) can allow the quantification of calculated and experimental mass–energy profile matching for structural analysis by mass spectrometry. These equations were kept constant throughout the data processing because their optimization was not the subject of this work. Since a commercial application for personal computers that calculates the matching score of the mass–energy profiles has not been developed, we designed accessible worksheets in MS Excel.

2.4. Design of Experiments

The following independent factors were considered for the calculated profiles (Table 2):
  • The quantum chemical methods, used in four variants for the calculation of enthalpies (ΔfH) and Gibbs energies (ΔfG): RM1, PM7, DFT (B3LYP/6-31G) ΔfH, and DFT (B3LYP/6-31G) ΔfG,
  • The calculated energy descriptors, in two variants: (ΔfHion or ΔfGion) and (ΔfHfrag or ΔfGfrag), and two independent factors for the experimental profiles (Table 2):
  • The descriptors of experimental energy in two variants: IC and ln IC,
  • The impact energy of electrons on five levels: 5, 10, 15, 20, and 70 eV.
The levels of the independent factors generated 4 × 2 × 5 × 2 = 80 variants of matching profiles. A list of scores corresponds to each variant. In turn, the score list contains  A 5 5 = 120  possibilities for distributing the five candidate structures for the five chemical standards. The response factors which followed were the group matching scores (P(%)) and the place (Rank S) occupied by the group with the true structures in the descending probability lists. The lists with matching scores allowed the identification of the conditions in which the technique correctly identified the structures.

3. Results

3.1. Mass Spectra

Table A5 shows the primary ions and the corresponding ICs from the tabular mass spectra recorded for the five chemical standards (DAF, DAG, DAGal, DAM, and DAS) at five electronic impact energies (5, 10, 15, 20, and 70 eV).

3.2. Matching Scores of Mass–Energy Profiles

The matching algorithms were run in MS Excel worksheets (Figure 2a,b and Supplementary Materials). In each worksheet, the P(%) score is calculated between the table of experimental profiles (first one from the top) and each of the 120 variants of calculated profiles found in the tables placed below (see Supplementary Materials).
Matching scores were calculated using Equations (3) and (4). For each standard (as analyte), the column in the matrix with experimental energy descriptor (IC or ln IC) values is compared with the corresponding column in the matrix with calculated energy descriptor values (Figure 2a,b). In this way, Ri × (n − mi)/n and then P(%) are obtained. This operation is repeated for each of the 120 possible structural assignments. Cells with a green background represent true-positive results, and those with a pink background represent false-positive results. In the optimal analysis conditions, a simple inversion toward the correct structural assignment (e.g., between the DAGal column and the DAM column in the third table in Figure 2a,b, and Supplementary Materials), reduces the matching score of the profiles, and the respective variant is pushed lower in the score list.

3.3. Scores Lists

The score lists bring together all 120 possible distributions of the five candidate structures for the five standards. They are placed in descending order of matching scores, P(%). One such list was calculated for each of the 80 variants generated by the four variable factors used for optimization. Each list of scores allowed for establishing the maximum probability (Pmax) and the place (Rank S) occupied by the correct distribution of the structures (the cells with green background in Figure 3). The lists (a) and (b) in Figure 3 correspond to the spreadsheets (a) and (b), respectively, from Figure 2 (see also Supplementary Materials).

3.4. Validation and Optimization Panel

In Table 3, all 80 pairs of Pmax and Rank S are centralized on the basis of quantum calculation variants. The analysis variant is correct and, therefore, useful only when the maximum matching score is obtained for the true sequence of candidate structures, i.e., Pmax corresponds to Rank S = 1. This situation is fulfilled only in three variants for the independent factors (formatted cells with green background in Table 3):
  • The ln IC descriptor from the mass spectrum at 5 eV and ΔfGfrag descriptor calculated using the DFT (B3LYP/6-31G) method (Rank S = 1 at Pmax = 84.52%),
  • The ln IC descriptor from the mass spectrum at 5 eV and ΔfHfrag descriptor calculated using the RM1 method (Rank S = 1 at Pmax = 81.99%),
  • The ln IC descriptor from the mass spectrum at 5 eV and ΔfHion descriptor calculated using the RM1 method (Rank S = 1 at Pmax = 79.20%).
The PM7 method can also give Rank S = 1 (Table 3, cells with pink background), but the respective variants only happen with a small probability (≅55%) when compared with the maximum. Profiles calculated with this method only offer Rank S = 3 for Pmax = 83.12%. Thus, such a quantum method is not useful in a real analysis where the true structure is established only with Pmax.
The table with the results of the technique’s optimization and validation (Table 3) can be considered as a docking panel of the calculated mass–energy profiles with the experimental ones. The maximum docking score (84.52%) correctly indicates the five structures. The optimal conditions of independent factors that can ensure a correct structural analysis show the following:
  • There is a tendency to increase the score with decreased electronic impact energy. The best match is performed above the ionization threshold in its vicinity.
  • The ln IC descriptors offer better results than IC. For this reason, it can be assumed that Equation (A2) is more appropriate to describe the kinetics of dissociation under these optimal conditions.
  • The DFT (B3LYP/6-31G) and RM1 quantum calculation methods provide useful values in the description of energy fragmentation profiles of acetalized monosaccharide isomers.
  • The ΔfGfrag and ΔfHfrag descriptors provide better results than the ΔfHion and ΔfGion descriptors.
This first stage of optimization shows that the precision offered by the new QSFR technique proposed in this work can discriminate between the structural isomers of some acetalized diastereomers of monosaccharides into the group analysis variant. This precision is insufficient for an individual analysis of such a structure. However, the results obtained by the first stage of optimization open perspectives for improving precision through a second stage by shortening the intervals of the analytical factors, increasing the mining resolution in these intervals, improving mass spectrometry, QC methods, descriptors, etc.

4. Discussion

The double quantification of the ionic mass and the energy that generates it for only a few primary ions of the mass spectra allows the differentiation of some structural isomers and acetalized diastereomers of monosaccharides. The increase in the profile matching score at low energy impact agrees with the quasi-equilibrium theory (QET) in mass spectrometry. According to QET, the rate constant (k) of a unimolecular reaction is a function of the reactants’ excess energy (E) in the transition state. Thus, k(E) strongly depends on the internal energy distribution of any ion. However, there is an upper limit for the rate of dissociation that is defined by the vibrational frequency of the bond to be cleaved [10,23]. Hence, at low impact energy, the excess energy E is small, and the activation energy E0 of the transition states becomes the main factor that establishes the relativity of the rate constants, k(E), for the primary dissociations (Equations (A1) and (A2)). Under these conditions, an increasing series of E0 energy descriptors (ΔfHfrag or ΔfGfrag) corresponds to a decreasing series of k(E) descriptors (IC or ln IC). As the electronic impact energy increases, the vibrational frequency of the bond to be cleaved becomes an increasingly important factor in determining k(E), thus altersing the alignment of the energy profiles. Multiple cleavages and molecule isomerizations before fragmentation in the mass spectrometer can also cover the analytical signal. However, the optimization shows that, at low electronic impact energies, these disturbing factors become insignificant for this variant of this QSFR technique with mass–energy profiles.
After this technique has demonstrated its effectiveness for the group analysis of the considered isomers of acetalized monosaccharides, the next step consists of their identification using the optimal variants established in this paper: ΔfGfrag and ΔfHfrag energy descriptors calculated with the quantum DFT (B3LYP/6-31G) and RM1 methods, respectively, and the experimental descriptor ln IC around the 5 eV impact energy.

5. Conclusions

Both QC computations and EI-MS can generate orthogonal mass–energy profiles, in silico and in vitro, respectively, which can be used to discriminate the chosen isomers of acetalized monosaccharides. The docking with the profile maximum scores in the case of true-positive structures demonstrates the correctness of the data obtained with some QC methods, the EI-MS equipment, and the theories that underpin the ionization–fragmentation process. All the algorithms used in this QSFR technique can be easily implemented on personal computers and constitute important candidates for the development of applications for the automation of chemical analyses. However, a commercial application for personal computers that uses the entire analytical workflow of this QSFR technique has not been developed. Completing the databases of small organic molecules with the computer-generated primary mass–energy profiles, together with the technique of their docking to the experimental profiles presented here, could become a generalized tool for the discrimination of isomeric organic molecules. Furthermore, the QSFR technique presented here, which uses mass–energy profiles generated in silico, along with some generated in vitro by mass spectrometry, can improve the accuracy of the current means of conversion between virtual and real isomeric structures.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/app13137530/s1, File S1: Matching Scores of Mass–Energy Profiles at 5 eV for ln IC descriptor and ΔfHfrag calculated with the RM1 method (a) and the list with matching scores of mass–energy profiles—120 positions in descending order of P(%) for ln IC descriptor and ΔfHfrag descriptor calculated by RM1 method (b); File S2: Matching Scores of Mass–Energy Profiles at 5 eV for ln IC descriptor and ΔfGfrag descriptor calculated by DFT (B3LYP/6-31G) method (a) and the list with matching scores of mass–energy profiles—120 positions in descending order of P(%) for ln IC descriptor and ΔfGfrag descriptor calculated by DFT (B3LYP/6-31G) method (b).

Author Contributions

Conceptualization, M.-C.P., N.D., C.C. and A.S.; methodology, M.-C.P., N.D. and E.S.; software, M.-C.P., C.C., N.D. and E.S.; validation, M.-C.P., N.D., E.S., A.S. and M.G.; formal analysis, M.-C.P., N.D., C.C., A.S. and E.S.; investigation, M.-C.P., N.D., C.C., E.S., A.S. and M.G.; resources, C.C. and M.G.; data curation, M.-C.P., N.D., C.C., E.S. and A.S.; writing—original draft preparation, M.-C.P., N.D., C.C., A.S. and M.G.; writing—review and editing, M.-C.P., N.D., C.C., A.S. and M.G.; visualization, N.D., M.-C.P., E.S., A.S. and M.G.; supervision, N.D., M.-C.P., E.S. and M.G.; project administration, M.G., C.C., M.-C.P. and E.S. All authors have read and agreed to the published version of the manuscript.

Funding

Part of this research was conducted at the Center of Genomic Medicine at the “Victor Babes” UMFT (UEFISCDI through POSCCE 185/48749; contract No. 677).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data are available in this publication (including Appendix A).

Acknowledgments

The authors would like to thank Ramona Curpan and Mircea Mracec from “Coriolan Dragulescu” Institute of Chemistry of the Romanian Academy, and Mihai Medeleanu from Politehnica University Timisoara.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. ΔfH (kcal/mol) database calculated with RM1 [11]. The letter “a” indicates the absence of the respective value.
Table A1. ΔfH (kcal/mol) database calculated with RM1 [11]. The letter “a” indicates the absence of the respective value.
Structure →DAF (4)DAG (1)DAGal (3)DAM (2)DAS (5)
ΔfH(M)−283.1−287.1−282.6−288.1−286.7
ΔfH (ion)
m/z 245−96.6−99.6−97.3−102.0−101.8
m/z 229−46.8a−42.4a−33.0
m/z 187−34.2−39.4−36.4−40.2−4.1
m/z 17134.5a9.6a37.0
m/z 159a11.7a8.8a
m/z 127103.7105.187.279.4134.0
m/z 101a98.8a98.8a
ΔfH (M frag) = ΔfH (ion) + ΣΔfH (F) − ΔfH (M)
M → m/z 245211.4212.4210.1211.0209.7
M → m/z 229214.9a218.8a232.3
M → m/z 187221.1219.9218.3220.1254.7
M → m/z 171243.5a218.1a249.6
M → m/z 159a220.0a218.2a
M → m/z 127284.1289.5267.1264.9318.0
M → m/z 101a221.1a219.8a
Table A2. ΔfH database (kcal/mol) calculated with PM7 [11]. The letter “a” indicates the absence of the respective value.
Table A2. ΔfH database (kcal/mol) calculated with PM7 [11]. The letter “a” indicates the absence of the respective value.
Structure →DAF (4)DAG (1)DAGal (3)DAM (2)DAS (5)
ΔfH(M)−277.1−280.8−277.7−281.7−281.6
ΔfH (ion)
m/z 245−99.4−102.0−96.2−102.1−100.7
m/z 229−54.3a−41.6a−34.0
m/z 187−37.5−33.4−37.7−38.2−11.5
m/z 17113.2a40.4a36.5
m/z 159a14.3a14.6a
m/z 127109.7155.7118.887.9161.6
m/z 101a89.9a89.9a
ΔfH (M frag) = ΔfH (ion) + ΣΔfH (F) − ΔfH (M)
M  m/z 245205.6206.7209.5207.6208.8
M  m/z 229201.5a214.8a226.2
M  m/z 187212.1219.8212.5216.0242.6
M  m/z 171213.5a241.4a241.3
M  m/z 159a220.5a221.6a
M  m/z 127283.0332.8292.9265.9339.5
M  m/z 101a215.6a213.5a
Table A3. ΔfH database (kcal/mol) calculated with DFT (B3LYP/6-31G) [11]. The letter “a” indicates the absence of the respective value.
Table A3. ΔfH database (kcal/mol) calculated with DFT (B3LYP/6-31G) [11]. The letter “a” indicates the absence of the respective value.
Structure →DAF (4)DAG (1)DAGal (3)DAM (2)DAS (5)
ΔfH(M)−1559.5−1565.4−1562.9−1564.0−1566.7
ΔfH (ion)
m/z 245−1264.7−1266.0−1263.3−1267.8−1273.3
m/z 229−1239.6a−1227.6a−1237.7
m/z 187−867.0−873.4−870.4−865.2−830.3
m/z 171−827.6a−835.4a−815.7
m/z 159a−728.9a−726.2a
m/z 127−522.8−531.4−534.6−554.1−494.5
m/z 101a−465.7a−465.7a
ΔfH (M frag) = ΔfH (ion) + ΣΔfH (F) − ΔfH (M)
M  m/z 245206.3211.0211.0207.8204.9
M  m/z 229213.5a228.8a222.6
M  m/z 187210.5210.0210.5216.8254.4
M  m/z 171232.0a227.5a251.1
M  m/z 159a229.4a230.7a
M  m/z 127266.2263.5257.8239.4301.8
M  m/z 101a227.8a223.2a
Table A4. ΔfG database (kcal/mol) calculated with DFT (B3LYP/6-31G) [11]. The letter “a” indicates the absence of the respective value.
Table A4. ΔfG database (kcal/mol) calculated with DFT (B3LYP/6-31G) [11]. The letter “a” indicates the absence of the respective value.
Structure →DAF (4)DAG (1)DAGal (3)DAM (2)DAS (5)
ΔfG (M)−1886.7−1892.7−1892.1−1891.0−1895.0
ΔfG (ion)
m/z 245−1557.9−1557.4−1557.3−1559.4−1567.4
m/z 229−1523.3a−1510.7a−1519.8
m/z 187−1072.4−1076.9−1075.9−1070.0−1032.0
m/z 171−1019.7a−1031.1a−1005.5
m/z 159a−913.0a−911.2a
m/z 127−654.9−663.0−665.1−686.7−624.9
m/z 101a−591.7a−591.7a
ΔfG (M frag) = ΔfG (ion) + ΣΔfG (F) − ΔfG (M)
M  m/z 245219.4225.9225.4222.2218.3
M  m/z 229226.6A244.6a238.4
M  m/z 187238.7240.2240.7245.5287.4
M  m/z 171264.1A258.1a286.6
M  m/z 159a245.2a245.4a
M  m/z 127309.1306.9304.3281.6347.3
M  m/z 101a244.7a239.6a
k ( E ) = v × ( E E 0 E ) s 1 ,
k ( E ) = v × e ( s 1 ) E 0 E ,
where k is the rate constant for the dissociation of isolated ions as a function of internal energy, E, and activation energy of the reaction, E0, while ν is a frequency factor that is determined by the number and density of vibrational states. The exponent is given by the number of degrees of freedom, s, minus 1 for the breaking bond [18].
Table A5. The ICs of primary ions from the mass spectra of the standards (15), recorded at 70, 20, 15, 10, and 5 eV. The letter “a” indicates the absence of the respective value.
Table A5. The ICs of primary ions from the mass spectra of the standards (15), recorded at 70, 20, 15, 10, and 5 eV. The letter “a” indicates the absence of the respective value.
IonDAF 70 eVDAG 70 eVDAGal 70 eVDAM 70 eVDAS 70 eV
m/z 245846,208552,640813,440703,872619,840
m/z 229211,26412276740359675,632
m/z 18746,528246,784155,456177,984125,816
m/z 171314,56077538,8721329197,952
m/z 15981541,928117111,662134,144
m/z 127341,120231,872185,664122,17696,888
m/z 10153,168613,95254,928547,136117,048
IonDAF 20 eVDAG 20 eVDAGal 20 eVDAM 20 eVDAS 20 eV
m/z 245282,816403,072211,520419,392638,528
m/z 22964,5763321401148678,920
m/z 18712,985179,45640,480102,544106,456
m/z 171105,72856011,025585199,744
m/z 15919432,1043257702161,472
m/z 127118,432158,78449,80864,89679,624
m/z 10118,488503,04016,568362,240101,728
IonDAF 15 eVDAG 15 eVDAGal 15 eVDAM 15 eVDAS 15 eV
m/z 24516,9152129,848131,328186,560338,880
m/z 22937,448a984a39,928
m/z 187719353,65623,83242,73647,264
m/z 17155,600a6233a96,712
m/z 15913811,515176384996,440
m/z 12758,84851,57624,68826,82437,648
m/z 1018954240,5129001209,34455,768
IonDAF 10 eVDAG 10 eVDAGal 10 eVDAM 10 eVDAS 10 eV
m/z 24530,07226,79241,24035,90477,624
m/z 2296135a243a8035
m/z 1878949062617564256633
m/z 1716110a1637a13,461
m/z 159a19645066718,224
m/z 12753205395450425744274
m/z 10166339,784173028,1206267
IonDAF 5 eVDAG 5 eVDAGal 5 eVDAM 5 eVDAS 5 eV
m/z 24545204604408359217360
m/z 229825aaa749
m/z 187a659305523152
m/z 171189a65a319
m/z 159a116a93873
m/z 1279667a71a
m/z 101a198956161660

References

  1. Vicidomini, C.; Roviello, V.; Roviello, G.N. In Silico Investigation on the Interaction of Chiral Phytochemicals from Opuntia ficus-indica with SARS-CoV-2 Mpro. Symmetry 2021, 13, 1041. [Google Scholar] [CrossRef]
  2. Vaniya, A.; Fiehn, O. Using fragmentation trees and mass spectral trees for identifying unknown compounds in metabolomics. Trends Analyt. Chem. 2015, 69, 52–61. [Google Scholar] [CrossRef] [Green Version]
  3. Böcker, S.; Dührkop, K. Fragmentation trees reloaded. J. Cheminform. 2016, 8, 5. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Rasche, F.; Scheubert, K.; Hufsky, F.; Zichner, T.; Kai, M.; Svatoš, A.; Böcker, S. Identifying the Unknowns by Aligning Fragmentation Trees. Anal. Chem. 2012, 84, 3417–3426. [Google Scholar] [CrossRef] [PubMed]
  5. Bojin, L.A.; Georgescu, M.; Cojocaru, C.; Pascariu, M.C.; Purcarea, V.L.; Ivan, M.V.; Puiu, M.; Dehelean, C.; Serb, A.F.; Sisu, E.; et al. Structural investigation of raw and modified glycans by MALDI-TOF mass spectrometry. Farmacia 2020, 68, 891–897. [Google Scholar] [CrossRef]
  6. Fedorova, E.; Stavrianidi, A.; Minenkova, I.; Buryak, A. Calculations of the Thermodynamic Characteristics and Physicochemical Properties of Symmetric and Asymmetric Isomeric Compounds for Identification in Chromatography-Mass Spectrometry. Symmetry 2021, 13, 1681. [Google Scholar] [CrossRef]
  7. Dinca, N.; Covaci, A. Structural identification by differential mass spectrometry as a criterion for selecting the best quantum chemical calculation of formation enthalpy for tetrachlorinated biphenyls. Rapid Commun. Mass Spectrom. 2012, 26, 2033–2040. [Google Scholar] [CrossRef] [PubMed]
  8. Dinca, N.; Dragan, S.; Dinca, M.; Sisu, E.; Covaci, A. New quantitative structure-fragmentation relationship strategy for chemical structure identification using the calculated enthalpy of formation as a descriptor for the fragments produced in electron ionization mass spectrometry: A case study with tetrachlorinated biphenyls. Anal. Chem. 2014, 86, 4949–4955. [Google Scholar] [PubMed]
  9. Tripathi, A.; Bankaitis, V.A. Molecular Docking: From Lock and Key to Combination Lock. J. Mol. Med. Clin. Appl. 2017, 2, 1–19. [Google Scholar] [CrossRef]
  10. Rosenstock, H.M.; Wallenstein, M.B.; Wahrhaftig, A.L.; Eyring, H. Absolute rate theory for isolated systems and the mass spectra of polyatomic molecules. Proc. Natl. Acad. Sci. USA 1952, 38, 667–678. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  11. Pascariu, M.C.; Dinca, N.; Cojocariu, C.; Sisu, E.; Serb, A.; Birza, R.; Georgescu, M. Computed Mass-Fragmentation Energy Profiles of Some Acetalized Monosaccharides for Identification in Mass Spectrometry. Symmetry 2022, 14, 1074. [Google Scholar] [CrossRef]
  12. Hypercube, Inc. HyperChem™Professional, Version 8.0.10 for Windows; Hypercube, Inc.: Gainesville, FL, USA. Available online: http://www.hypercubeusa.com/?tabid=360 (accessed on 20 March 2021).
  13. MOPAC2012; Stewart, J.J.P. Stewart Computational Chemistry, Colorado Springs, CO, USA. Version 15.027W. Available online: http://OpenMOPAC.net/ (accessed on 20 March 2021).
  14. Frisch, M.J.; Trucks, G.W.; Schlegel, H.B.; Scuseria, G.E.; Robb, M.A.; Cheeseman, J.R.; Scalmani, G.; Barone, V.; Mennucci, B.; Petersson, G.A.; et al. Gaussian 09. Revision B.01; Gaussian, Inc.: Wallingford, CT, USA, 2010. [Google Scholar]
  15. Robertson, J.; Stafford, P.M. Carbohydrates; Osborn, H.M.I., Ed.; Academic Press: London, UK, 2003; pp. 29–33. [Google Scholar]
  16. De Jongh, D.C.; Biemann, K. Mass Spectra of O-Isopropylidene Derivatives of Pentoses and Hexoses. J. Am. Chem. Soc. 1964, 86, 67. [Google Scholar] [CrossRef]
  17. Pascariu, M.C.; Sisu, E.; Ordodi, V.L.; Rusnac, L.M. Spectral Analysis of Diisopropylidenated Monosaccharides. Low Energy EI-MS Fragmentation Study. Bul. Stiint. Univ. Politeh. Timis. Ser. Chim. Ing. Mediu. 2011, 56, 6. [Google Scholar]
  18. Gross, J. Mass Spectrometry—A Textbook, 3rd ed.; Springer: Cham, Switzerland, 2011; pp. 36–40. [Google Scholar]
  19. Grimme, S. Towards first principles calculation of electron impact mass spectra of molecules. Angew. Chem. Int. Ed. Engl. 2013, 52, 6306–6312. [Google Scholar] [CrossRef] [PubMed]
  20. Bauer, C.; Grimme, S. How to Compute Electron Ionization Mass Spectra from First Principles. J. Phys. Chem. A 2016, 120, 3755–3766. [Google Scholar] [CrossRef] [PubMed]
  21. Wang, L.L.; Han, L.F.; Yu, H.S.; Sang, M.M.; Liu, E.W.; Zhang, Y.; Fang, S.M.; Wang, T.; Gao, X.M. Analysis of the Constituents in “Zhu She Yong Xue Shuan Tong” by Ultra High-Performance Liquid Chromatography with Quadrupole Time-of-Flight Mass Spectrometry Combined with Preparative High Performance Liquid Chromatography. Molecules 2015, 20, 20518–20537. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Beynon, J.H.; Gilbert, J.R. Energetics and Mechanisms of Unimolecular Reactions of Positive Ions: Mass Spectrometric Methods. In Gas Phase Ion Chemistry; Bowers, M.T., Ed.; Academic Press: New York, NY, USA, 1979; Volume 2, Chapter 13; pp. 153–179. [Google Scholar]
  23. Wahrhaftig, A.L. Ion Dissociations in the Mass Spectrometer. In Advances in Mass Spectrometry; Waldron, J.D., Ed.; Pergamon: Oxford, UK, 1959; pp. 274–286. [Google Scholar]
Figure 1. 1,2:5,6-di-O-isopropylidene-α-D-glucofuranose (1), 2,3:5,6-di-O-isopropylidene-α-D-mannofuranose (2), 1,2:3,4-di-O-isopropylidene-α-D-galactopyranose (3), 2,3:4,5-di-O-isopropylidene-β-D-fructopyranose (4), and 2,3:4,6-di-O-isopropylidene-α-L-sorbofuranose (5).
Figure 1. 1,2:5,6-di-O-isopropylidene-α-D-glucofuranose (1), 2,3:5,6-di-O-isopropylidene-α-D-mannofuranose (2), 1,2:3,4-di-O-isopropylidene-α-D-galactopyranose (3), 2,3:4,5-di-O-isopropylidene-β-D-fructopyranose (4), and 2,3:4,6-di-O-isopropylidene-α-L-sorbofuranose (5).
Applsci 13 07530 g001
Figure 2. Fragments of two MS Excel sheets for the calculation of matching scores for the following cases: (a) at 5 eV for ln IC descriptor and ΔfHfrag calculated with the RM1 method; (b) at 5 eV for ln IC descriptor and ΔfGfrag calculated with the DFT (B3LYP/6-31G) method. The letter “a” indicates the absence of the respective value.
Figure 2. Fragments of two MS Excel sheets for the calculation of matching scores for the following cases: (a) at 5 eV for ln IC descriptor and ΔfHfrag calculated with the RM1 method; (b) at 5 eV for ln IC descriptor and ΔfGfrag calculated with the DFT (B3LYP/6-31G) method. The letter “a” indicates the absence of the respective value.
Applsci 13 07530 g002
Figure 3. Lists with matching scores of mass–energy profiles—the first 20 positions in descending order of P(%)—in the cases: (a) at 5 eV for ln IC descriptor and ΔfHfrag descriptor calculated by RM1 method; (b) at 5 eV for ln IC descriptor and ΔfGfrag descriptor calculated by DFT (B3LYP/6-31G) method.
Figure 3. Lists with matching scores of mass–energy profiles—the first 20 positions in descending order of P(%)—in the cases: (a) at 5 eV for ln IC descriptor and ΔfHfrag descriptor calculated by RM1 method; (b) at 5 eV for ln IC descriptor and ΔfGfrag descriptor calculated by DFT (B3LYP/6-31G) method.
Applsci 13 07530 g003
Table 1. The seven predicted primary ions of isomers 15.
Table 1. The seven predicted primary ions of isomers 15.
Primary IonFragmentation
m/z 245[M–CH3]+
m/z 229See Figure 1
m/z 187[M–acetone–CH3]+
m/z 171[M–acetone–HOCH2]+
m/z 159See Figure 1
m/z 127[M–2×acetone–HO]+
m/z 101See Figure 1
Table 2. The variable independent factors used in optimization and analytical validation.
Table 2. The variable independent factors used in optimization and analytical validation.
Mass–Energy ProfileOptimized by
Independent Variable
Number of LevelsLevels
CalculatedVariants of quantum chemical method4RM1
PM7
DFT (B3LYP/6-31G) ΔfH
DFT (B3LYP/6-31G) ΔfG
Descriptors of calculated energy2Ionic energy (ΔfHion or ΔfGion)
Fragmentation energy (ΔfHfrag or ΔfGfrag)
ExperimentalThe impact energy of electrons55 eV
10 eV
15 eV
20 eV
70 eV
Descriptors of experimental energy2IC
ln IC
Table 3. The pairs of values for Rank S and Pmax were obtained at validation and optimization of the technique with mass–energy profiles applied in EI-MS for identification of the considered acetalized monosaccharide isomers. Cells with a green background indicate the variants that accurately establish the chemical structures.
Table 3. The pairs of values for Rank S and Pmax were obtained at validation and optimization of the technique with mass–energy profiles applied in EI-MS for identification of the considered acetalized monosaccharide isomers. Cells with a green background indicate the variants that accurately establish the chemical structures.
Experimental Descriptors →ICln IC
Calculated Descriptors →Fragmentation Energy (ΔfHfrag or ΔfGfrag)Ionic Energy
fHion or ΔfGion)
Fragmentation Energy (ΔfHfrag or ΔfGfrag)Ionic Energy
fHion or ΔfGion)
EIQC Rank SPmaxRank SPmaxRank SPmaxRank SPmax
5 eVDFT ΔfG979.6680.35184.52479.37
10 eVDFT ΔfG574.3810275.891969.6911372.57
15 eVDFT ΔfG1469.9711473.546264.7111469.24
20 eVDFT ΔfG1766.9811470.31559.377859.62
70 eVDFT ΔfG1363.5511570.311555.569060.43
5 eVDFT ΔfH778.67680.27284.25479.29
10 eVDFT ΔfH872.8110275.861571.4211372.61
15 eVDFT ΔfH3068.6111373.534566.1811469.28
20 eVDFT ΔfH4666.9911370.232062.087859.17
70 eVDFT ΔfH3366.9511370.242661.88660.43
5 eVPM72276.73580.28383.12380.7
10 eVPM7369.288672.95564.7710669.44
15 eVPM7265.4511070.243560.1511065.78
20 eVPM7462.3611070.11155.459260.23
70 eVPM7462.2211070.05154.739360.75
5 eVRM1376.79279.71181.99179.2
10 eVRM1670.310173.632765.3511469.97
15 eVRM11766.1811570.999461.0911566.22
20 eVRM11363.6411571.31256.1711661.13
70 eVRM11363.5511571.151555.5611661.68
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cojocariu, C.; Dinca, N.; Georgescu, M.; Sisu, E.; Serb, A.; Pascariu, M.-C. Mass–Energy Profiles Obtained by Quantum Chemical Computing Applied in Mass Spectrometry: A Case Study with Identification of a Group of Acetalized Monosaccharide Isomers. Appl. Sci. 2023, 13, 7530. https://doi.org/10.3390/app13137530

AMA Style

Cojocariu C, Dinca N, Georgescu M, Sisu E, Serb A, Pascariu M-C. Mass–Energy Profiles Obtained by Quantum Chemical Computing Applied in Mass Spectrometry: A Case Study with Identification of a Group of Acetalized Monosaccharide Isomers. Applied Sciences. 2023; 13(13):7530. https://doi.org/10.3390/app13137530

Chicago/Turabian Style

Cojocariu, Carolina, Nicolae Dinca, Marius Georgescu, Eugen Sisu, Alina Serb, and Mihai-Cosmin Pascariu. 2023. "Mass–Energy Profiles Obtained by Quantum Chemical Computing Applied in Mass Spectrometry: A Case Study with Identification of a Group of Acetalized Monosaccharide Isomers" Applied Sciences 13, no. 13: 7530. https://doi.org/10.3390/app13137530

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop