Next Article in Journal
On Generalized Bivariate (p,q)-Bernoulli–Fibonacci Polynomials and Generalized Bivariate (p,q)-Bernoulli–Lucas Polynomials
Previous Article in Journal
Cell-Dependent Mechanical Properties of Asymmetric Crosslinked Metallic Wire Mesh with Hybrid Patterns Based on Arbitrary Poisson’s Ratio
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Chromatic Differentiation of Functional Mappings of the Composition of Nucleic Acids

by
Ivan V. Stepanyan
1,2,* and
Mihail Y. Lednev
1,*
1
Mechanical Engineering Research Institute of the Russian Academy of Sciences (IMASH RAN), 4, M. Kharitonyevskiy Pereulok, Moscow 101990, Russia
2
Peoples’ Friendship University of Russia (RUDN University), 6, Miklukho-Maklaya Street, Moscow 117198, Russia
*
Authors to whom correspondence should be addressed.
Symmetry 2023, 15(4), 942; https://doi.org/10.3390/sym15040942
Submission received: 20 February 2023 / Revised: 31 March 2023 / Accepted: 17 April 2023 / Published: 20 April 2023
(This article belongs to the Section Life Sciences)

Abstract

:
Color visualization of the DNA of diverse living beings can help in the exploration of the issue of chromatic differentiation of functional mappings of the nucleotide composition of DNA molecules. By “chromatic differentiation”, we mean the coloring of these mappings. Algorithms for coloring genetic representations improve the perception of complex genetic information using color. Methodologically, to build the chromatic differentiation of functional mappings of the nucleotide composition of DNA, we employed the system of nucleotide Walsh functions and the Chaos Game Representation (CGR) algorithm. The authors compared these two approaches and proposed a modified CGR algorithm. The work presents various algorithms of chromatic differentiation based on the nucleotide Walsh functions at a specific location of the fragment in the nucleotide chain and on the frequencies of those fragments. The results of the analysis provide examples of chromatic differentiation in a variety of parametric spaces. The paper describes various approaches to coloring and video animation of DNA molecules in their chromatically differentiated spans of physicochemical parameters.

1. Introduction

Color solutions improve the perception of complex information. According to Maxwell’s theory of color perception [1], there are three principal color channels—red, green, and blue. These three channels are structurally comparable to the trinity of nucleotide sub-alphabets. As with nucleotides, these channels have a “complementary” CMYK system (Figure 1). The presence of a trinity of binary sub-alphabets underlies the comparison: 2 or 3 hydrogen bonds, keto-amino properties, and purine-pyrimidine properties. There are also three sub-alphabets for the complementary strand of DNA. These triplets of sub-alphabets are structurally related to the color triangle. There are also more complex relationships between the phenomenon of the degeneracy of genetic coding and color theory, described in the works of Sergey Petoukhov [2,3]. Petoukhov described the connection of these color triangles with genetic algebras [4]. However, we will consider other features also explored by Petoukhov. Namely, we will colorize our visualizations, previously analyzed and presented in [5,6,7], according to the principles illustrated in this work. This group of complex visualization methods combines several techniques, including animated computer graphics and combining various techniques for visualizing genetic information.
Visualization of the structure of molecular genetic information has long been of interest to researchers. Presently, it is possible to observe the DNA structure by employing X-ray analysis. However, because of the quantum mechanical effects, we cannot receive a clear picture of DNA, not to mention color. Simultaneously, color perception is inherent in humans and simplifies the processing of vast volumes of complexly structured information. It is worth noting that the geometrization and visualization of voluminous data issues related to the ergonomics of perception and optimization of the intellectual work of operators working with considerable amounts of information. To date, it has been possible to visualize complex molecular genetic information using various graphs, pictograms, diagrams, and analytical algorithms. If you open a file containing sequenced genetic data in a text editor, you can observe a four-character sequence—nucleotides A, G, T/U, and C. That is a display of a specific DNA section on the monitor screen. Simultaneously, the present methods for representing molecular genetic information have advantages and disadvantages. The benefits include displaying and highlighting particular genes and building various analytical histograms and distributions, including the possibility of interaction of the operator–researcher with genetic information through computer interfaces. The drawbacks of such methods include that they are not always practical or sufficiently clear to the end user.
Let us consider some aspects of DNA imaging technologies and molecular genetic modeling covered in the scientific literature. The nucleus of a cell is a complex, highly organized system. In [8], advancements in chromosome imaging tools reveal further insights into the topology of the three-dimensional genome. It is worth noting that chromosome imaging techniques have demonstrated the presence of chromosome territories, useful nuclear landmarks, and topologically related domains. A crucial principle governing nuclear reactions is the spatial placement of the chromosome [9]. The software 4D Nucleome Network is used to clarify the intricate arrangement of chromosomes in the nucleus and the consequences of their disruption on biology [10]. Ref. [11] describes the software package to retrieve, group, and display the genetic data from the knowledge base. Utilizing the tool described in [12], users can obtain a clear image of the chromosomal distribution pattern through the visualization of genome-wide data mapping. The paper [13] describes the optical reconstruction of chromatin architecture (ORCA), a method that can trace the DNA path in single cells with nanoscale accuracy and genomic resolution reaching two kilobases.
The study [14] proposes taxonomies and tasks related to genomic data visualization. The study [15] describes an alignment genetic visualization tool. It is a system that helps understand structural variants by displaying the position of alignments. The paper [16] presents a method for predicting the 3D structures of chromosomes. In their research [17], authors demonstrate how to simulate the supercoiling and condensation of a bacterial chromosome by using a molecular dynamics engine. Interestingly, modeling and visualization of chromosomes are closely related to the mechanical properties of the genetic molecule. The mechanistic models presented in the article [18] offer verifiable theories regarding the fundamentals of genome folding. By concentrating on the fundamental biophysical mechanisms, fitting-free mechanistic models based on polymer simulations forecast chromatin folding in three dimensions. As follows from the literature analysis, there are several main groups for visualization, including methods for visualizing quantitative data (charts, histograms, and graphs) and techniques for visualizing textual information (semantic networks, tree displays, and diagrams). Furthermore, they distinguish between methods for visualizing concepts (concept maps or Gantt charts), methods for the visualization of metaphors (a template can serve as a basis), and methods for the visualization of strategies (for example, road maps). Refs. [19,20,21] discuss some facets of visualization methods. Refs. [22,23,24] describe the mathematical aspects of the four-color problem.
The study aims to analyze the results of DNA visualization algorithms using various coloring options. Let us show two variants of coloring—one based on the first letter of the fragments of the molecule and the other based on the frequencies of these fragments.

2. Materials and Methods

In his works, Sergey Petoukhov, the founder of matrix genetics, paid much attention to the theory of color and its connection with the algebras of genetic code. Figure 1 shows the Star of David, a multiplication-closed set of genetic matrices obtained by analyzing the degeneracy of the genetic code using the matrix approach. These matrices were acquired by permutations in triplets. Ref. [25] describes this algorithm at greater length. It is worth noting the structural connection between these matrices and color theory. Matrices fully reproduce color representations because of the multiplication operation. Algorithms for coloring genetic representations help enhance the perception of complex genetic information using color. Simultaneously, this opens up some perspectives in the study of the algebras of the genetic code. That also opens up a connection with the degeneracy of the genetic code encoded by triplets.
As we noted, the objective of this study is to describe the techniques of chromatic differentiation (coloring) of the mappings of molecular genetic information described earlier in our papers [5,6,7]. Earlier in our work, we proposed the concept of geometry or genetic visualization through algebraic geometrization in biology. We used different representations in parametric spaces, various dimensions, and diverse coordinate systems.
Let us describe the chief ideas of the proposed set of methods for chromatic differentiation of the composition of nucleic acids. The genetic code represents a system of four nucleotides, mathematically described by Hadamard matrices and Walsh functions. As shown in [26], tetra representations remain an adequate description of the phenomenon of genetic coding. In genometric visualization algorithms, we deal with nucleic acid fragments of identical length. We will color each one in the same color as its last nucleotide. We have four nucleotides in total. Thereupon, we will make use of four colors. Each color will correspond to a nucleotide. Thus, considering the quaternary nature of the molecular genetic code, it seems rational to employ the basic set of four colors. To select specific colors, let us turn to the RGB color model.
To color genometric mappings with four colors, one can rely on the well-known RGB scheme proposed by Maxwell [1,27]. The RGB representation in Figure 2 structurally corresponds to the projection of a pair of Sierpinski tetrahedra, sculpting a three-dimensional figure comprising two complementary tetrahedra. We previously constructed the latter utilizing a system of binary sub-alphabets and Hadamard nucleotide matrices [5]. Note that the missing two vertices, which are absent in Figure 2, are present in volume—these are black and white, which can be checked simply by mixing the colors of the vertices of each corresponding triangle. Each N-plet, moving inside the Sierpinski fractal with its two coordinates bijectively and functionally, sets the color. We will make use of this feature for further research. However, before we proceed to the results, let us describe the relationship between the system of methods we have been developing and the well-known DNA visualization system—Chaos Game Representation [28].

3. Relation with Chaos Game Representation (CGR)

The creation of a fractal using the chaos game method is as follows:
  • Set the starting point coordinate—the center of the plane (X = width/2, Y = height/2);
  • Divide the X and Y values in two (see Figure 3);
  • Depending on the current nucleotide, perform the following operations over the coordinates:
    • A: X = X + width/2; Y = Y + height/2;
    • C: X remains unaltered; Y = Y + height/2;
    • G: X = X + width/2; Y remains unaltered;
    • T/U: X remains unaltered; Y remains unaltered;
  • Mark a point in the plane.
  • Perform steps 2 to 5 over the next nucleotide; concurrently, the starting point will be the current one, i.e., the one marked in step 4.
Thus, each nucleotide falls within its subquadrant.
Compared to our described and developed methods with nucleotide Walsh functions at N = 1, the operational logic will remain the same. However, the nucleotide frequency is displayed in the subquadrant with the brightness of the point and not through its fractal.
Thus, the number of points from subquadrant A of the chaos game method will equal the frequency in subquadrant A by the techniques employing Walsh functions, matching the number of nucleotides A in the sequence. Figure 4 demonstrates different approache between CGR and Walsh methods.
It is possible to extend coordinate variation rules to 3D cases, following the system of binary sub-alphabets (Figure 5). There, for 1, we set a coordinate value, and 0 remains unaltered:
A:
  • 1: X = X + width/2;
  • 1: Y = Y + height/2;
  • 1: Z = Z + depth/2;
C:
  • 0: X remains unaltered;
  • 1: Y = Y + height/2;
  • 0: Z = remains unaltered;
G:
  • 1: X = X + width/2;
  • 0: Y remains unaltered;
  • 0: Z remains unaltered;
T/U:
  • 0: X remains unaltered;
  • 0: Y remains unaltered;
  • 1: Z = Z + depth/2.
In this way, all sub-quadrants can be compared across all projections of nucleotide binary sub-alphabets and the chaos game.
Following the chaos game method, we have developed a visualization technique for N-plets, in which not simply each nucleotide but a substring of nucleotides is considered (as with the binary sub-alphabets system).
CGR extension. Let us complete the Chaos Game Representation (CGR) algorithm as follows:
  • Take the starting point coordinate–the center of the plane
    • X = width/2, Y = height/2;
  • Divide the X and Y values in two (see Figure 6);
  • Depending on the current nucleotide, perform the following operations over the coordinates:
    • A: X = X + width/2; Y = Y + height/2;
    • C: X remains unaltered; Y = Y + height/2;
    • G: X = X + width/2; Y remains unaltered;
    • T/U: X remains unaltered; Y remains unaltered.
  • Shift the center of coordinates to the obtained point. Wherein:
    • width = width/2,
    • height = height/2.
  • Employ the next nucleotide from the N-plet and follow steps 2 through 5 until the entire nucleotide is processed. In this case, the starting point will be the current one-the new center of coordinates obtained in step 4.
  • Mark a point in the plane when all nucleotides of the N-plet have been processed.
  • Return the starting width and height values, and follow steps 2 to 7 for the next N-plet. In this case, the starting point will be the current one, i.e., obtained in step 6.
Figure 7 shows the final comparison of visualizations obtained using the CGR and Walsh methods at different parameters of the scaling factor N.

4. Results and Discussion

We created the first pilot version of our program in Python. However, this language proved to be disappointingly slow and unproductive for processing the voluminous genetic data. Therefore, we obtained all the results in this article using C++. This considerably accelerated the calculations and allowed the construction of more accurate graphical representations.
In computational experiments, we saw multiple genomes of many organisms in various coordinate systems. Spherical and cyclic coordinate systems in two and three dimensions are of particular interest. These visualizations exhibit beautiful fractal patterns and variety.
It is worth noting that with chromatic differentiation, there is an increase in computational loads compared to the basic approach. Therefore, we refrained from calculating the colors for each point to conserve computational resources. We computed colors based solely on the last nucleotide in the N-plet. Hence, only four colors were utilized in the visualizations, per the pyramid of Sierpinski, as depicted in Figure 2. Figure 8, Figure 9, Figure 10 and Figure 11 are visualizations of the tenth human chromosome.
Animated 3D visualizations of human chromosome 10 at N = 3 are shown in the following video: https://youtu.be/FdyrLvCsLGA, accessed on 1 April 2023.

5. A Method of Chromatic Differentiation That Binds the Color to the Frequency of N-Plets

Considering the relevance of the frequency of occurrence of N-plets, we have developed an alternative method of chromatic differentiation (or genometric mapping coloring), a technique that allows each frequency to have its color. The frequency gradation and an example of visualization are shown in Figure 12. Blue corresponds to low frequencies, and red corresponds to high frequencies. This visualization method appears to be more aesthetic and beautiful. Colors are selectable thanks to the color gradient bound to such objective physical and chemical parameters of DNA and RNA as the frequencies of particular molecule fragments.
From the point of view of biological applications, this topic can be potentially valuable for visualization and comparative visualization of genomes. Earlier in [5,6], we showed comparative genomic algorithms, in which we employed color tools to mark differences and similarities in different genes.
For a pictorial presentation, our team created an animation in which we demonstrated an example of integrated visualization with coloring in a similar manner. The scale, i.e., the length of N-plets, will change within each animation frame. Given the high resolution and high-information content, this animation is of particular scientific interest, as it displays the nucleotide sequence structure in dynamics when zoomed. The video illustrates a change in the parameter N in dynamics. For each frame, we calculated the frequency of points anew; each frame had its frequency range.
Video 1. Human chromosome 10, an integral visualization, XZ projection (https://youtube.com/shorts/c4b6vTracDI, accessed on 1 April 2023); Video 2. Salmonella enterica, integral imaging, XY projection (https://youtube.com/shorts/WHdAMWmRylQ, accessed on 1 April 2023).
We find it rational to calculate the color for each point once and for all instead of performing repeated calculations. Considering the presence of symmetries in the arrangement of colors relative to the principal diagonal, the number of computations will significantly decrease. This fits into the concept of an analog of quantum computing, where instead of calculations, we read the addresses of the answers. Note that we can witness manifestations of this computational concept in many other phenomena and processes, described, like DNA, by symmetric square matrices.
Identification of features of the genetic composition of nucleotide sequences allows for a visual comparison of the physicochemical parameters of DNA in various organisms. It is effective for comparative analysis of DNA, establishing relationships, visualization of genetic markers of predisposition to diseases, etc. Several approaches to DNA and RNA visualizations are equally possible based on structural, integral, and frequency mappings in parametric spaces of various dimensions. Figure 13 and Figure 14 show DNA chromatic differentiation images from different organisms in different types of imaging.

6. Conclusions

Functional representation of the molecular genetic composition of DNA in the form of visualization in a specific parametric space is an urgent scientific problem. The proposed methodology extends the previously developed visualization algorithms by coloring these representations. The presented methods help facilitate the perception of voluminous genetic data stored in DNA and RNA molecules. Earlier, we developed tools to analyze various DNA and RNA, including a comparative study of diverse viruses and their variants [6]. The new algorithms rely upon the basis system of Hadamard-Walsh binary-orthogonal functions, enabling the mapping of nucleic acids according to levels corresponding to their physicochemical parameters. These algorithms additionally allow for the performance of spectral composition and decomposition [7] with visualization and chromatic differentiation of their structural–parametric representations.
The results of the conducted research additionally allow us to speak about a new direction of scientific art, as everything created by nature is beautiful. New visualization algorithms allow for showing this beauty in high resolution. This area of computer art is associated with the concept of pentagram music we developed earlier [29].
As a result of the research, meaningful and convenient or perceptual color displays were obtained, some of which are suitable for the visual presentation of genetic information. The other part is yet to be comprehended. Thus, we have considered various options for visualization with chromatic differentiation. These visualizations are rather vivid and representative sets that allow you to see the internal structure and beauty of the molecular genetic code within.
It is worth noting the enormous computational complexity of these methods, which, despite their great simplicity, remain very demanding on computational resources, even though they are highly parallelizable.
For all our measurements and studies, we used an in-house specialized computer program exploiting supercomputer technologies. Furthermore, we constructed great 3D animated models that enabled us to modify various visualization parameters dynamically and obtain fantastic animations. The results presented in this and our previous works allow us to speak about a particular nexus of biology and computer science, namely a theory of information processing and computer graphics.
The analysis and color visualization of colossal amounts of genetic information made it possible to construct structures resembling something alive but, at the same time, with traceable computer–metaphorical meanings. All the above allows us to look at the DNA and everything it controls within the cell as the most complex molecular machine, which, in principle, a living cell is.

Author Contributions

Conceptualization—I.V.S., software—M.Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

This publication has been supported by the RUDN University Scientific Projects Grant System, project No. 202235-2-000.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Lakshminarayanan, V. Maxwell, color vision, and the color triangle. In Light in Nature VII; SPIE: San Diego, CA, USA, 2019; Volume 11099, pp. 54–66. [Google Scholar]
  2. Petoukhov, S.V. Binary oppositions, algebraic holography and stochastic rules in genetic informatics. Biosystems 2022, 221, 104760. [Google Scholar] [CrossRef]
  3. He, M.; Petoukhov, S. Mathematics of Bioinformatics: Theory, Methods and Applications; John Wiley & Sons: Hoboken, NJ, USA, 2011. [Google Scholar]
  4. Petoukhov, S.V. The system-resonance approach in modeling genetic structures. Biosystems 2016, 139, 1–11. [Google Scholar] [CrossRef]
  5. Stepanyan, I.V.; Lednev, M.Y. Parametric Multispectral Mappings and Comparative Genomics. Symmetry 2022, 14, 2517. [Google Scholar] [CrossRef]
  6. Stepanyan, I.V.; Lednev, M.Y. A Comparative Analysis of Different Strains of Coronavirus Based on Genometric Mappings. Symmetry 2022, 14, 942. [Google Scholar] [CrossRef]
  7. Stepanyan, I.; Lednev, M. Spectral Decomposition of Mappings of Molecular Genetic Information in the System Basis of Single Nucleotide Functions. Symmetry 2022, 14, 844. [Google Scholar] [CrossRef]
  8. Kempfer, R.; Pombo, A. Methods for mapping 3D chromosome architecture. Nat. Rev. Genet. 2020, 21, 207–226. [Google Scholar] [CrossRef]
  9. Fritz, A.J.; Sehgal, N.; Pliss, A.; Xu, J.; Berezney, R. Chromosome territories and the global regulation of the genome. Genes Chromosomes Cancer 2019, 58, 407–426. [Google Scholar] [CrossRef] [PubMed]
  10. Reiff, S.B.; Schroeder, A.J.; Kırlı, K.; Cosolo, A.; Bakker, C.; Mercado, L.; Lee, S.; Veit, A.D.; Balashov, A.K.; Vitzthum, C.; et al. The 4D Nucleome Data Portal as a resource for searching and visualizing curated nucleomics data. Nat. Commun. 2022, 13, 2365. [Google Scholar] [CrossRef]
  11. Soudy, M.; Anwar, A.M.; Ahmed, E.A.; Osama, A.; Ezzeldin, S.; Mahgoub, S.; Magdeldin, S. UniprotR: Retrieving and visualizing protein sequence and functional information from Universal Protein Resource (UniProt knowledgebase). J. Proteom. 2020, 213, 103613. [Google Scholar] [CrossRef]
  12. Hao, Z.; Lv, D.; Ge, Y.; Shi, J.; Weijers, D.; Yu, G.; Chen, J. RIdeogram: Drawing SVG graphics to visualize and map genome-wide data on the idiograms. PeerJ Comput. Sci. 2020, 6, e251. [Google Scholar] [CrossRef]
  13. Mateo, L.J.; Murphy, S.E.; Hafner, A.; Cinquini, I.S.; Walker, C.A.; Boettiger, A.N. Visualizing DNA folding and RNA in embryos at single-cell resolution. Nature 2019, 568, 49–54. [Google Scholar] [CrossRef]
  14. Nusrat, S.; Harbig, T.; Gehlenborg, N. Tasks, techniques, and tools for genomic data visualization. In Computer Graphics Forum; Wiley: Hoboken, NJ, USA, 2019; Volume 38, pp. 781–805. [Google Scholar]
  15. Nattestad, M.; Aboukhalil, R.; Chin, C.S.; Schatz, M.C. Ribbon: Intuitive visualization for complex genomic variation. Bioinformatics 2021, 37, 413–415. [Google Scholar] [CrossRef] [PubMed]
  16. Hovenga, V.; Kalita, J.; Oluwadare, O. HiC-GNN: A generalizable model for 3D chromosome reconstruction using graph convolutional neural networks. Comput. Struct. Biotechnol. J. 2023, 21, 812–836. [Google Scholar] [CrossRef] [PubMed]
  17. Jewett, A.I.; Stelter, D.; Lambert, J.; Saladi, S.M.; Roscioni, O.M.; Ricci, M.; Autin, L.; Maritan, M.; Bashusqeh, S.M.; Keyes, T. Moltemplate: A tool for coarse-grained modeling of complex biological matter and soft condensed matter physics. J. Mol. Biol. 2021, 433, 166841. [Google Scholar] [CrossRef]
  18. Chiang, M.; Brackley, C.A.; Marenduzzo, D.; Gilbert, N. Predicting genome organisation and function with mechanistic modelling. Trends Genet. 2022, 38, 364–378. [Google Scholar] [CrossRef]
  19. Vellido, A. The importance of interpretability and visualization in machine learning for applications in medicine and health care. Neural Comput. Appl. 2020, 32, 18069–18083. [Google Scholar] [CrossRef]
  20. Firat, E.E.; Joshi, A.; Laramee, R.S. VisLitE: Visualization Literacy and Evaluation. IEEE Comput. Graph. Appl. 2022, 42, 99–107. [Google Scholar] [CrossRef] [PubMed]
  21. Bikakis, N.; Chrysanthis, P.K.; Papastefanatos, G.; Schreck, T. Special Issue on Machine Learning Approaches in Big Data Visualization. IEEE Comput. Graph. Appl. 2022, 42, 39–40. [Google Scholar] [CrossRef]
  22. Jin, L.; Wong, T.L.; Zhu, X. Colouring of S-labelled planar graphs. Eur. J. Comb. 2021, 92, 103198. [Google Scholar] [CrossRef]
  23. Canizales, J.; Chahal, J.S. A note on the four color theorem. Aequ. Math. 2023, 97, 1–11. [Google Scholar] [CrossRef]
  24. Rowland, D. THE MATHEMATICAL GAZETTE: Combinatorics: A very short introduction. Math. Gaz. 2019, 103, 375–376. [Google Scholar] [CrossRef]
  25. Petoukhov, S.; He, M. (Eds.) Symmetrical Analysis Techniques for Genetic Systems and Bioinformatics: Advanced Patterns and Applications: Advanced Patterns and Applications; IGI Global: Hershey, PA, USA, 2009. [Google Scholar]
  26. Petoukhov, S.V. The stochastic organization of genomes and the doctrine of energy-information evolution based on bio-antenna arrays. Biosystems 2022, 218, 104712. [Google Scholar] [CrossRef] [PubMed]
  27. Pavlidis, G. The Modern Era of Colour Theory. In A Brief History of Colour Theory: Foundations of Colour Science; Springer International Publishing: Cham, Switzerland, 2022; pp. 23–125. [Google Scholar]
  28. Kania, A.; Sarapata, K. Multifarious aspects of the chaos game representation and its applications in biological sequence analysis. Comput. Biol. Med. 2022, 151, 106243. [Google Scholar] [CrossRef] [PubMed]
  29. Quehenberger, R.C.; Stepanyan, I.; Skepper, B. [C G A T] Epita Matrix Genetics: Toward a Visualization of Genetic Codes via “genetic Music”. Advance online publication on Project MUSE. Available online: https://www.muse.jhu.edu/article/741772 (accessed on 1 April 2023).
Figure 1. A structural analogy between the system of binary sub-alphabets and the circle of colors (an image from [25]). In the center, we see a schematic illustration of the two cyclic sequences of the mosaic matrices, which arise because of the positional permutations in the triplets: YY +,123[CAUG] → YY +,231[CAUG] →YY +,312[CAUG] → YY +,123[CAUG] and YY +,321[CAUG] → YY +,213[CAUG] → YY +,132[CAUG] → YY +,321[CAUG]. The numbers above each matrix show a relevant type of permutation of positions in all the triplets.
Figure 1. A structural analogy between the system of binary sub-alphabets and the circle of colors (an image from [25]). In the center, we see a schematic illustration of the two cyclic sequences of the mosaic matrices, which arise because of the positional permutations in the triplets: YY +,123[CAUG] → YY +,231[CAUG] →YY +,312[CAUG] → YY +,123[CAUG] and YY +,321[CAUG] → YY +,213[CAUG] → YY +,132[CAUG] → YY +,321[CAUG]. The numbers above each matrix show a relevant type of permutation of positions in all the triplets.
Symmetry 15 00942 g001
Figure 2. The RGB color triangle and, “complementary” to it, the CMY triangle, as ultrametric subsets of complementary in terms of physicochemical characteristics of DNA of Sierpinski tetrahedra.
Figure 2. The RGB color triangle and, “complementary” to it, the CMY triangle, as ultrametric subsets of complementary in terms of physicochemical characteristics of DNA of Sierpinski tetrahedra.
Symmetry 15 00942 g002
Figure 3. A schematic representation of the coordinate definition for each type of nucleotide. The dotted line shows the transformation mentioned above in step 2, and the solid line depicts the alteration outlined earlier in step 3.
Figure 3. A schematic representation of the coordinate definition for each type of nucleotide. The dotted line shows the transformation mentioned above in step 2, and the solid line depicts the alteration outlined earlier in step 3.
Symmetry 15 00942 g003
Figure 4. Coloring each nucleotide in its color: A (red), C (blue), G (green), T/U (black). A chaos game method application is shown on the left, and on the right is an application of the Walsh method at n = 1.
Figure 4. Coloring each nucleotide in its color: A (red), C (blue), G (green), T/U (black). A chaos game method application is shown on the left, and on the right is an application of the Walsh method at n = 1.
Symmetry 15 00942 g004
Figure 5. A triple of nucleotide binary sub-alphabets with reference to X, Y, and Z coordinates.
Figure 5. A triple of nucleotide binary sub-alphabets with reference to X, Y, and Z coordinates.
Symmetry 15 00942 g005
Figure 6. A schematic representation of the calculation of the N-plet coordinate (CCG) for the chaos game method. Red dotted line—dividing the coordinate in half (step 2 of the algorithm), red solid line—modification of coordinates depending on the current nucleotide (step 3 of the algorithm).
Figure 6. A schematic representation of the calculation of the N-plet coordinate (CCG) for the chaos game method. Red dotted line—dividing the coordinate in half (step 2 of the algorithm), red solid line—modification of coordinates depending on the current nucleotide (step 3 of the algorithm).
Symmetry 15 00942 g006
Figure 7. A Comparison of visualizations obtained employing the Walsh method and that of the chaos game. For chromatic differentiation, the dot color originates from the last nucleotide in the N-plet. With relatively large N-plets, when running the chaos game, the sub-quadrants merge into a point because of the modest size of the image. Consequently, the visualizations become identical to those of Walsh. Brightness denotes the frequency of a dot: higher frequencies produce much richer colors.
Figure 7. A Comparison of visualizations obtained employing the Walsh method and that of the chaos game. For chromatic differentiation, the dot color originates from the last nucleotide in the N-plet. With relatively large N-plets, when running the chaos game, the sub-quadrants merge into a point because of the modest size of the image. Consequently, the visualizations become identical to those of Walsh. Brightness denotes the frequency of a dot: higher frequencies produce much richer colors.
Symmetry 15 00942 g007
Figure 8. A chromatic differentiation at N = 1 for diverse projections and coordinate systems.
Figure 8. A chromatic differentiation at N = 1 for diverse projections and coordinate systems.
Symmetry 15 00942 g008
Figure 9. Coloring at various scale parameters in the rectangular coordinate system in two and three dimensions. Two-dimensional visualizations are presented in the XY projection.
Figure 9. Coloring at various scale parameters in the rectangular coordinate system in two and three dimensions. Two-dimensional visualizations are presented in the XY projection.
Symmetry 15 00942 g009
Figure 10. A chromatic differentiation at various scale parameters in the cyclic coordinate system in two and three dimensions. Two-dimensional visualizations are presented in the XZ projection.
Figure 10. A chromatic differentiation at various scale parameters in the cyclic coordinate system in two and three dimensions. Two-dimensional visualizations are presented in the XZ projection.
Symmetry 15 00942 g010
Figure 11. A chromatic differentiation at various scale parameters in the polar and spherical coordinate systems (see on top and at the bottom of the figure, respectively). Visualizations in the polar coordinate system are presented in the XY projection.
Figure 11. A chromatic differentiation at various scale parameters in the polar and spherical coordinate systems (see on top and at the bottom of the figure, respectively). Visualizations in the polar coordinate system are presented in the XY projection.
Symmetry 15 00942 g011
Figure 12. Examples of visual displays (projections) with varying levels of contrast: (a) a scale for converting the brightness of a point to its corresponding color; contrast ratios: (b) 100%; (c) 80%; (d) 50%.
Figure 12. Examples of visual displays (projections) with varying levels of contrast: (a) a scale for converting the brightness of a point to its corresponding color; contrast ratios: (b) 100%; (c) 80%; (d) 50%.
Symmetry 15 00942 g012
Figure 13. Chromatic differentiation renderings of Drosophila melanogaster chromosome X in the XY projection. Rectangular (a), cyclic (b), and polar (c) coordinate systems.
Figure 13. Chromatic differentiation renderings of Drosophila melanogaster chromosome X in the XY projection. Rectangular (a), cyclic (b), and polar (c) coordinate systems.
Symmetry 15 00942 g013
Figure 14. Integral colored representations of chromosome 1 in Homo sapiens (a), Morella rubra (b), and Rattus norvegicus (c) with scaling parameter of N = 300. One-dimensional representations (see three horizontal rows on the left), where the first line shows X, the second line shows Y, and the third line shows Z. Two-dimensional representations (see square displays on the right); the coordinates (top to bottom, abscissa/ordinate) are X/Y, Y/Z, and X/Z. Homo sapiens, visualizations of chromosome 1. Morella rubra, visualizations of chromosome 1. Rattus norvegicus, visualizations of chromosome 1.
Figure 14. Integral colored representations of chromosome 1 in Homo sapiens (a), Morella rubra (b), and Rattus norvegicus (c) with scaling parameter of N = 300. One-dimensional representations (see three horizontal rows on the left), where the first line shows X, the second line shows Y, and the third line shows Z. Two-dimensional representations (see square displays on the right); the coordinates (top to bottom, abscissa/ordinate) are X/Y, Y/Z, and X/Z. Homo sapiens, visualizations of chromosome 1. Morella rubra, visualizations of chromosome 1. Rattus norvegicus, visualizations of chromosome 1.
Symmetry 15 00942 g014aSymmetry 15 00942 g014b
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Stepanyan, I.V.; Lednev, M.Y. Chromatic Differentiation of Functional Mappings of the Composition of Nucleic Acids. Symmetry 2023, 15, 942. https://doi.org/10.3390/sym15040942

AMA Style

Stepanyan IV, Lednev MY. Chromatic Differentiation of Functional Mappings of the Composition of Nucleic Acids. Symmetry. 2023; 15(4):942. https://doi.org/10.3390/sym15040942

Chicago/Turabian Style

Stepanyan, Ivan V., and Mihail Y. Lednev. 2023. "Chromatic Differentiation of Functional Mappings of the Composition of Nucleic Acids" Symmetry 15, no. 4: 942. https://doi.org/10.3390/sym15040942

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop