Speech Recognition and Machine Learning: Current Trends and Future

A special issue of Big Data and Cognitive Computing (ISSN 2504-2289).

Deadline for manuscript submissions: closed (15 December 2023)

Special Issue Editors

National Institute of Information & Communications Technology (NICT), Advanced Speech Technology Lab, Tokyo, Japan
Interests: speech recognition; speech translation; speech synthesis

Guest Editor
1. Tsinghua University and Xinjiang University Joint Lab, Xinjiang, China
2. Speech and Audio Lab, Tsinghua University, Beijing, China
Interests: speech; music

Guest Editor
Anhui Province Key Laboratory of Multimodal Cognitive Computation, School of Computer Science and Technology, Anhui University, Hefei 230601, China
Interests: speech enhancement; speech recognition; speech signal processing

Special Issue Information

Dear Colleagues,

Automatic speech recognition (ASR) can support effective communication in global business activities. Deep neural networks (DNNs) have driven impressive progress in this area, and many researchers regard ASR as a solved problem. However, the reality is not as optimistic as some big companies advertise. For example, ASR services for low-resource languages, which more than 50% of the global population speaks, lag behind.

On the other hand, progress in natural language processing (NLP) and computer vision (CV) points to promising future directions, such as self-training models. Inspired by these techniques, novel algorithms are being proposed to solve complicated real-world problems. These applications and algorithms are deeply reshaping human–computer interaction.

This Special Issue aims to follow these influential trends. Papers that address innovative applications and algorithms related to next-generation speech and language processing are all welcome.

Dr. Sheng Li
Dr. Yi Zhao
Dr. Cunhang Fan
Guest Editors

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Big Data and Cognitive Computing is an international peer-reviewed open access monthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 1800 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • automatic speech recognition

Published Papers (1 paper)


Research

17 pages, 2423 KiB  
Article
Design Proposal for a Virtual Shopping Assistant for People with Vision Problems Applying Artificial Intelligence Techniques
by William Villegas-Ch, Rodrigo Amores-Falconi and Eduardo Coronel-Silva
Big Data Cogn. Comput. 2023, 7(2), 96; https://doi.org/10.3390/bdcc7020096 - 12 May 2023
Abstract
Accessibility is an increasingly important topic for Ecommerce, especially for individuals with vision problems. To improve their online experience, the design of a voice assistant has been proposed to allow these individuals to browse and shop online more quickly and efficiently. This voice assistant forms an intelligent system that can understand and respond to users’ voice commands. The design considers the visual limitations of the users, such as difficulty reading information on the screen or identifying images. The voice assistant provides detailed product descriptions and ideas in a clear, easy-to-understand voice. In addition, the voice assistant has a series of additional features to improve the shopping experience. For example, the assistant can provide product recommendations based on the user’s previous purchases and information about special promotions and discounts. The main goal of this design is to create an accessible and inclusive online shopping experience for the visually impaired. The voice assistant is based on a conversational user interface, allowing users to easily navigate an eCommerce website, search for products, and make purchases.
(This article belongs to the Special Issue Speech Recognition and Machine Learning: Current Trends and Future)