Literature Mining of Disease Associated Noncoding RNA in the Omics Era
Abstract
:1. Introduction
2. Process of Biomedical Literature Mining
- (1)
- Dictionary-based approaches, usually utilizing a comprehensive controlled vocabulary to directly match and identify the entity names from the documents;
- (2)
- Semantic rule-based approaches, applying a set of handcrafted syntactic and semantic rules to fragment the sentences and capture nouns and predicates from the phrases;
- (3)
3. Literature Mining of Noncoding RNA
3.1. Tools for NER, NEN and RE
3.2. Tools for Disease-Associated ncRNA Annotation
3.3. Tools for Knowledge Discovery and Validation
3.4. When Bibliomics Meets Multiomics
4. Challenges and Perspectives
5. Conclusions
Funding
Data Availability Statement
Conflicts of Interest
References
- Reuter, J.A.; Spacek, D.V.; Snyder, M.P. High-throughput sequencing technologies. Mol. Cell 2015, 58, 586–597. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Amaral, P.P.; Dinger, M.E.; Mercer, T.R.; Mattick, J.S. The eukaryotic genome as an RNA machine. Science 2008, 319, 1787–1789. [Google Scholar] [CrossRef] [PubMed]
- Bartel, D.P. MicroRNAs: Target recognition and regulatory functions. Cell 2009, 136, 215–233. [Google Scholar] [CrossRef] [Green Version]
- Kennell, D. Inhibition of host protein synthesis during infection of Escherichia coli by bacteriophage T4. I. Continued synthesis of host ribonucleic acid. J. Virol. 1968, 2, 1262–1271. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Gopalakrishnan, V.; Jha, K.; Jin, W.; Zhang, A. A survey on literature based discovery approaches in biomedical domain. J. Biomed. Inform. 2019, 93, 103141. [Google Scholar] [CrossRef] [PubMed]
- Fan, J. Method and Application of Linguistics in Cross-Cultural Biomedicine text mining. Sci. Educ. Artic. Collect. 2013, 1, 80–81. [Google Scholar]
- Claudio, G.; Alberto, L.; Lorenza, R. Exploiting Shallow Linguistic Information for Relation Extraction from Biomedical Literature. In Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, 3–7 April 2006. [Google Scholar]
- Eddy, S.R. What is a hidden Markov model? Nat. Biotechnol. 2004, 22, 1315–1316. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Noble, W.S. What is a support vector machine? Nat. Biotechnol. 2006, 24, 1565–1567. [Google Scholar] [CrossRef]
- Zhao, S.; Su, C.; Lu, Z.; Wang, F. Recent advances in biomedical literature mining. Brief. Bioinform. 2021, 22, 1. [Google Scholar]
- Doan, S.; Xu, H. Recognizing Medication related Entities in Hospital Discharge Summaries using Support Vector Machine. Proc. COLING. Int. Conf. Comput. Linguist. 2010, 2010, 259–266. [Google Scholar]
- Leaman, R.; Gonzalez, G. BANNER: An executable survey of advances in biomedical named entity recognition. Pac. Symp. Biocomput. Pac. Symp. Biocomput. 2008, 652–663. [Google Scholar] [CrossRef] [Green Version]
- Chen, G.; Liu, T.; Yu, B.; Wang, B.; Peng, Q. CircRNA-UBE2G1 regulates LPS-induced osteoarthritis through miR-373/HIF-1a axis. Cell Cycle 2020, 19, 1696–1705. [Google Scholar] [CrossRef] [PubMed]
- Blaschke, C.; Andrade, M.A.; Ouzounis, C.; Valencia, A. Automatic extraction of biological information from scientific text: Protein-protein interactions. In Proceedings of the 7th International Conference on Intelligent Systems for Molecular Biology, Heidelberg, Germany, 6–10 August 1999; pp. 60–67. [Google Scholar]
- Yang, H.; Robinson, P.N.; Wang, K. Phenolyzer: Phenotype-based prioritization of candidate genes for human diseases. Nat. Methods 2015, 12, 841–843. [Google Scholar] [CrossRef] [PubMed]
- Herrero-Zazo, M.; Segura-Bedmar, I.; Martinez, P.; Declerck, T. The DDI corpus: An annotated corpus with pharmacological substances and drug-drug interactions. J. Biomed. Inform. 2013, 46, 914–920. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Swanson, D.R. Fish oil, Raynaud’s syndrome, and undiscovered public knowledge. Perspect. Biol. Med. 1986, 30, 7–18. [Google Scholar] [CrossRef] [PubMed]
- Bagewadi, S.; Bobic, T.; Hofmann-Apitius, M.; Fluck, J.; Klinger, R. Detecting miRNA Mentions and Relations in Biomedical Literature. F1000Research 2014, 3, 205. [Google Scholar] [CrossRef] [PubMed]
- Henry, S.; Wijesinghe, D.S.; Myers, A.; McInnes, B.T. Using Literature Based Discovery to Gain Insights Into the Metabolomic Processes of Cardiac Arrest. Front. Res. Metr. Anal. 2021, 6, 644728. [Google Scholar] [CrossRef] [PubMed]
- Naeem, H.; Kuffner, R.; Csaba, G.; Zimmer, R. miRSel: Automated extraction of associations between microRNAs and genes from the biomedical literature. BMC Bioinform. 2010, 11, 135. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Seal, R.L.; Chen, L.L.; Griffiths-Jones, S.; Lowe, T.M.; Mathews, M.B.; O’Reilly, D.; Pierce, A.J.; Stadler, P.F.; Ulitsky, I.; Wolin, S.L.; et al. A guide to naming human non-coding RNA genes. EMBO J. 2020, 39, e103777. [Google Scholar] [CrossRef] [PubMed]
- Li, G.; Ross, K.E.; Arighi, C.N.; Peng, Y.; Wu, C.H.; Vijay-Shanker, K. miRTex: A Text Mining System for miRNA-Gene Relation Extraction. PLoS Comput. Biol. 2015, 11, e1004391. [Google Scholar] [CrossRef] [Green Version]
- Gupta, S.; Ross, K.E.; Tudor, C.O.; Wu, C.H.; Schmidt, C.J.; Vijay-Shanker, K. miRiaD: A Text Mining Tool for Detecting Associations of microRNAs with Diseases. J. Biomed. Semant. 2016, 7, 9. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Lamurias, A.; Clarke, L.A.; Couto, F.M. Extracting microRNA-gene relations from biomedical literature using distant supervision. PLoS ONE 2017, 12, e0171929. [Google Scholar]
- Karagkouni, D.; Paraskevopoulou, M.D.; Chatzopoulos, S.; Vlachos, I.S.; Tastsoglou, S.; Kanellos, I.; Papadimitriou, D.; Kavakiotis, I.; Maniou, S.; Skoufos, G.; et al. DIANA-TarBase v8: A decade-long collection of experimentally supported miRNA-gene interactions. Nucleic Acids Res. 2018, 46, D239–D245. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Huang, H.Y.; Lin, Y.C.; Cui, S.; Huang, Y.; Tang, Y.; Xu, J.; Bao, J.; Li, Y.; Wen, J.; Zuo, H.; et al. miRTarBase update 2022: An informative resource for experimentally validated miRNA-target interactions. Nucleic Acids Res. 2022, 50, D222–D230. [Google Scholar] [CrossRef] [PubMed]
- Dweep, H.; Gretz, N.; Sticht, C. miRWalk database for miRNA-target interactions. Methods Mol. Biol. 2014, 1182, 289–305. [Google Scholar] [PubMed]
- Salhi, A.; Essack, M.; Alam, T.; Bajic, V.P.; Ma, L.; Radovanovic, A.; Marchand, B.; Schmeier, S.; Zhang, Z.; Bajic, V.B. DES-ncRNA: A knowledgebase for exploring information about human micro and long noncoding RNAs based on literature-mining. RNA Biol. 2017, 14, 963–971. [Google Scholar] [CrossRef]
- Roychowdhury, D.; Gupta, S.; Qin, X.; Arighi, C.N.; Vijay-Shanker, K. emiRIT: A text-mining-based resource for microRNA information. Database J. Biol. Databases Curation 2021, 2021, baab031. [Google Scholar] [CrossRef]
- Roy, S.; Curry, B.C.; Madahian, B.; Homayouni, R. Prioritization, clustering and functional annotation of MicroRNAs using latent semantic indexing of MEDLINE abstracts. BMC Bioinform. 2016, 17 (Suppl. 13). [Google Scholar] [CrossRef] [Green Version]
- Le, D.H.; Tran, T.T.H. RWRMTN: A tool for predicting disease-associated microRNAs based on a microRNA-target gene network. BMC Bioinform. 2020, 21, 244. [Google Scholar] [CrossRef]
- Friedrich, J.; Hammes, H.P.; Krenning, G. miRetrieve-an R package and web application for miRNA text mining. NAR Genom. Bioinform. 2021, 3, lqab117. [Google Scholar] [CrossRef]
- Joppich, M.; Weber, C.; Zimmer, R. Using Context-Sensitive Text Mining to Identify miRNAs in Different Stages of Atherosclerosis. Thromb. Haemost. 2019, 119, 1247–1264. [Google Scholar] [CrossRef] [PubMed]
- Thilakaratne, M.; Falkner, K.; Atapattu, T. A systematic review on literature-based discovery workflow. PeerJ Comput. Sci. 2019, 5, e235. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Kim, J.D.; Wang, Y.; Fujiwara, T.; Okuda, S.; Callahan, T.J.; Cohen, K.B. Open Agile text mining for bioinformatics: The PubAnnotation ecosystem. Bioinformatics 2019, 35, 4372–4380. [Google Scholar] [CrossRef] [PubMed]
Tools | Tasks | Methods | PMID | |||||||
---|---|---|---|---|---|---|---|---|---|---|
NER & NEN | RE | DC | KD | Dictionary-Based | Co-Occurrence | Semantic Approaches | Rule-Based | Machine Learning | ||
Bagewadi et al. [18] | Y | Y | Y | Y | Y | 26535109 | ||||
miRSel | Y | Y | Y | Y | 20233441 | |||||
miRTex | Y | Y | Y | Y | 26407127 | |||||
miRiaD | Y | Y | Y | 27216254 | ||||||
IBRel | Y | Y | Y | 28263989 | ||||||
DES-ncRNA | Y | Y | Y | 28387604 | ||||||
emiRIT | Y | Y | Y | 34048547 | ||||||
miRetrieve | Y | Y | Y | 34988440 | ||||||
LSI | Y | Y | 27766940 | |||||||
RWRMTN | Y | Y | 32539680 | |||||||
atheMir | Y | Y | Y | Y | 31378854 | |||||
Henry et al. [19]. | Y | Y | 34250435 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Fan, J. Literature Mining of Disease Associated Noncoding RNA in the Omics Era. Molecules 2022, 27, 4710. https://doi.org/10.3390/molecules27154710
Fan J. Literature Mining of Disease Associated Noncoding RNA in the Omics Era. Molecules. 2022; 27(15):4710. https://doi.org/10.3390/molecules27154710
Chicago/Turabian StyleFan, Jian. 2022. "Literature Mining of Disease Associated Noncoding RNA in the Omics Era" Molecules 27, no. 15: 4710. https://doi.org/10.3390/molecules27154710