Application of Bioinformatics in Microbiome

A special issue of Genes (ISSN 2073-4425). This special issue belongs to the section "Microbial Genetics and Genomics".

Deadline for manuscript submissions: 5 August 2024 | Viewed by 8003

Special Issue Editor

The Forsyth Institute, Cambridge, MA 02142, USA
Interests: microbiome research; next-generation sequencing; gene markers; genomics; meta-genomics; meta-transcriptomics; binformatics analysis; artifical intelligence

Special Issue Information

Dear Colleagues,

The microorganisms found in an environment and their activities are collectively referred to as the microbiome. The advent of sequencing technologies has enabled new means of studying microbiomes; it is now possible to answer the questions of “who are there”, “What can they do” and “what are they doing” for a habitat by sequencing its marker genes, meta-genomes and meta-transcriptomes, respectively. With high-throughput next-generation sequencing (NGS), there has been an explosion of sequencing data, which require novel and efficient computing algorithms, software and pipelines to process, manage and interpret their embedded information. Bioinformatics is an interdisciplinary field of science that combines biology, computer science, informatics, mathematics and statistics to analyze and interpret biological and clinical data. Bioinformatics is not only crucial but essential to extracting meaningful information from cryptic NGS data.

This Special Issue aims to demonstrate the latest developments in bioinformatics in the NGS era that have helped advanced our understanding of the microbiome. The issue’s scope includes, but is not limited to, the following topics:

  1. New computing algorithms for analyzing NGS data;
  2. Data management platforms (online or standalone desktop databases) for NGS data;
  3. Pipeline development (a collection of analytic software streamlined from upstream to downstream applications);
  4. Bioinformatics application in the cloud platform;
  5. Artificial intelligence (AI) in microbiome research;
  6. New microbiome research discovery, both medical and non-medical, with a significant bioinformatics component;
  7. Reviews of the above applications.

Dr. Tsute Chen
Guest Editor

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Genes is an international peer-reviewed open access monthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2600 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • microbiome
  • microbiota
  • next-generation sequencing
  • genomics
  • meta-genomics
  • meta-transcriptomics
  • taxonomy
  • artificial intelligence
  • bioinformatics

Published Papers (4 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Research

21 pages, 3244 KiB  
Article
Characterization and Genome Study of a Newly Isolated Temperate Phage Belonging to a New Genus Targeting Alicyclobacillus acidoterrestris
by Dziyana Shymialevich, Michał Wójcicki, Olga Świder, Paulina Średnicka and Barbara Sokołowska
Genes 2023, 14(6), 1303; https://doi.org/10.3390/genes14061303 - 20 Jun 2023
Viewed by 1583
Abstract
The spoilage of juices by Alicyclobacillus spp. remains a serious problem in industry and leads to economic losses. Compounds such as guaiacol and halophenols, which are produced by Alicyclobacillus, create undesirable flavors and odors and, thus, decrease the quality of juices. The [...] Read more.
The spoilage of juices by Alicyclobacillus spp. remains a serious problem in industry and leads to economic losses. Compounds such as guaiacol and halophenols, which are produced by Alicyclobacillus, create undesirable flavors and odors and, thus, decrease the quality of juices. The inactivation of Alicyclobacillus spp. constitutes a challenge because it is resistant to environmental factors, such as high temperatures, and active acidity. However, the use of bacteriophages seems to be a promising approach. In this study, we aimed to isolate and comprehensively characterize a novel bacteriophage targeting Alicyclobacillus spp. The Alicyclobacillus phage strain KKP 3916 was isolated from orchard soil against the Alicyclobacillus acidoterrestris strain KKP 3133. The bacterial host’s range and the effect of phage addition at different rates of multiplicity of infections (MOIs) on the host’s growth kinetics were determined using a Bioscreen C Pro growth analyzer. The Alicyclobacillus phage strain KKP 3916, retained its activity in a wide range of temperatures (from 4 °C to 30 °C) and active acidity values (pH from 3 to 11). At 70 °C, the activity of the phage decreased by 99.9%. In turn, at 80 °C, no activity against the bacterial host was observed. Thirty minutes of exposure to UV reduced the activity of the phages by almost 99.99%. Based on transmission-electron microscopy (TEM) and whole-genome sequencing (WGS) analyses, the Alicyclobacillus phage strain KKP 3916 was classified as a tailed bacteriophage. The genomic sequencing revealed that the newly isolated phage had linear double-stranded DNA (dsDNA) with sizes of 120 bp and 131 bp and 40.3% G+C content. Of the 204 predicted proteins, 134 were of unknown function, while the remainder were annotated as structural, replication, and lysis proteins. No genes associated with antibiotic resistance were found in the genome of the newly isolated phage. However, several regions, including four associated with integration into the bacterial host genome and excisionase, were identified, which indicates the temperate (lysogenic) life cycle of the bacteriophage. Due to the risk of its potential involvement in horizontal gene transfer, this phage is not an appropriate candidate for further research on its use in food biocontrol. To the best of our knowledge, this is the first article on the isolation and whole-genome analysis of the Alicyclobacillus-specific phage. Full article
(This article belongs to the Special Issue Application of Bioinformatics in Microbiome)
Show Figures

Figure 1

12 pages, 1482 KiB  
Article
Novel Clustering Methods Identified Three Caries Status-Related Clusters Based on Oral Microbiome in Thai Mother–Child Dyads
by Samantha Manning, Jin Xiao, Yihong Li, Prakaimuk Saraithong, Bruce J. Paster, George Chen, Yan Wu and Tong Tong Wu
Genes 2023, 14(3), 641; https://doi.org/10.3390/genes14030641 - 03 Mar 2023
Viewed by 1280
Abstract
Early childhood caries (ECC) is a disease that globally affects pre-school children. It is important to identify both protective and risk factors associated with this disease. This paper examined a set of saliva samples of Thai mother–child dyads and aimed to analyze how [...] Read more.
Early childhood caries (ECC) is a disease that globally affects pre-school children. It is important to identify both protective and risk factors associated with this disease. This paper examined a set of saliva samples of Thai mother–child dyads and aimed to analyze how the maternal factors and oral microbiome of the dyads influence the development of ECC. However, heterogeneous latent subpopulations may exist that have different characteristics in terms of caries development. Therefore, we introduce a novel method to cluster the correlated outcomes of dependent observations while selecting influential independent variables to unearth latent groupings within this dataset and reveal their association in each group. This paper describes the discovery of three heterogeneous clusters in the dataset, each with its own unique mother–child outcome trend, as well as identifying several microbial factors that contribute to ECC. Significantly, the three identified clusters represent three typical clinical conditions in which mother–child dyads have typical (cluster 1), high–low (cluster 2), and low–high caries experiences (cluster 3) compared to the overall trend of mother–child caries status. Intriguingly, the variables identified as the driving attributes of each cluster, including specific taxa, have the potential to be used in the future as caries preventive measures. Full article
(This article belongs to the Special Issue Application of Bioinformatics in Microbiome)
Show Figures

Figure 1

11 pages, 1795 KiB  
Article
SCP4ssd: A Serverless Platform for Nucleotide Sequence Synthesis Difficulty Prediction Using an AutoML Model
by Jianqi Zhang, Shuai Ren, Zhenkui Shi, Ruoyu Wang, Haoran Li, Huijuan Tian, Miao Feng, Xiaoping Liao and Hongwu Ma
Genes 2023, 14(3), 605; https://doi.org/10.3390/genes14030605 - 28 Feb 2023
Viewed by 1322
Abstract
DNA synthesis is widely used in synthetic biology to construct and assemble sequences ranging from short RBS to ultra-long synthetic genomes. Many sequence features, such as the GC content and repeat sequences, are known to affect the synthesis difficulty and subsequently the synthesis [...] Read more.
DNA synthesis is widely used in synthetic biology to construct and assemble sequences ranging from short RBS to ultra-long synthetic genomes. Many sequence features, such as the GC content and repeat sequences, are known to affect the synthesis difficulty and subsequently the synthesis cost. In addition, there are latent sequence features, especially local characteristics of the sequence, which might affect the DNA synthesis process as well. Reliable prediction of the synthesis difficulty for a given sequence is important for reducing the cost, but this remains a challenge. In this study, we propose a new automated machine learning (AutoML) approach to predict the DNA synthesis difficulty, which achieves an F1 score of 0.930 and outperforms the current state-of-the-art model. We found local sequence features that were neglected in previous methods, which might also affect the difficulty of DNA synthesis. Moreover, experimental validation based on ten genes of Escherichia coli strain MG1655 shows that our model can achieve an 80% accuracy, which is also better than the state of art. Moreover, we developed the cloud platform SCP4SSD using an entirely cloud-based serverless architecture for the convenience of the end users. Full article
(This article belongs to the Special Issue Application of Bioinformatics in Microbiome)
Show Figures

Figure 1

17 pages, 2419 KiB  
Article
Supervised Machine Learning Enables Geospatial Microbial Provenance
by Chandrima Bhattacharya, Braden T. Tierney, Krista A. Ryon, Malay Bhattacharyya, Jaden J. A. Hastings, Srijani Basu, Bodhisatwa Bhattacharya, Debneel Bagchi, Somsubhro Mukherjee, Lu Wang, Elizabeth M. Henaff and Christopher E. Mason
Genes 2022, 13(10), 1914; https://doi.org/10.3390/genes13101914 - 21 Oct 2022
Cited by 3 | Viewed by 2709
Abstract
The recent increase in publicly available metagenomic datasets with geospatial metadata has made it possible to determine location-specific, microbial fingerprints from around the world. Such fingerprints can be useful for comparing microbial niches for environmental research, as well as for applications within forensic [...] Read more.
The recent increase in publicly available metagenomic datasets with geospatial metadata has made it possible to determine location-specific, microbial fingerprints from around the world. Such fingerprints can be useful for comparing microbial niches for environmental research, as well as for applications within forensic science and public health. To determine the regional specificity for environmental metagenomes, we examined 4305 shotgun-sequenced samples from the MetaSUB Consortium dataset—the most extensive public collection of urban microbiomes, spanning 60 different cities, 30 countries, and 6 continents. We were able to identify city-specific microbial fingerprints using supervised machine learning (SML) on the taxonomic classifications, and we also compared the performance of ten SML classifiers. We then further evaluated the five algorithms with the highest accuracy, with the city and continental accuracy ranging from 85–89% to 90–94%, respectively. Thereafter, we used these results to develop Cassandra, a random-forest-based classifier that identifies bioindicator species to aid in fingerprinting and can infer higher-order microbial interactions at each site. We further tested the Cassandra algorithm on the Tara Oceans dataset, the largest collection of marine-based microbial genomes, where it classified the oceanic sample locations with 83% accuracy. These results and code show the utility of SML methods and Cassandra to identify bioindicator species across both oceanic and urban environments, which can help guide ongoing efforts in biotracing, environmental monitoring, and microbial forensics (MF). Full article
(This article belongs to the Special Issue Application of Bioinformatics in Microbiome)
Show Figures

Figure 1

Back to TopTop