Symmetry/Asymmetry in Computer Vision and Image Processing

A special issue of Symmetry (ISSN 2073-8994). This special issue belongs to the section "Computer".

Deadline for manuscript submissions: closed (30 November 2023) | Viewed by 13860

Special Issue Editor


E-Mail Website
Guest Editor
1. Computer vision researcher at Huawei Technologies, Dublin 4, Ireland;
2. Department of Computer Science and Statistics, King Juan Carlos University, 28933 Móstoles, Spain
Interests: computer vision; image processing; machine intelligence and learning; biologically inspired machine vision; computational models

Special Issue Information

Dear Colleagues,

The visual relationship between human and machine in recent years is being redefined as we are witnessing an unprecedented rise of research interest in artificial vision and its applications.

Without vision, life as we know it would not be the same. It is the most informative and complex of our senses, extremely useful for our survival and entertainment. From manufacturing, medicine, defense and security, and the automotive industry to social interaction, autonomous navigation, and robotics, our understanding and interaction with the world is interwoven with visual signal processing we perform day to day. In parallel, constant hardware advancements in machines such as computers, cameras, portable devices, robots, and graphic cards, together with increasingly efficient machine learning algorithms, are further pushing the boundaries of what can be attained or expected in the future.

In this new world of discovery opening up before our eyes, machine vision encompasses a broad scope of emerging trends in vision technology (vision hardware and algorithms) across all fields and application areas. Therefore, researchers from areas related to machine learning and computer vision research are encouraged to contribute works in the following fields:

  • Image/signal processing techniques;
  • Video analysis;
  • Deep learning for vision;
  • Statistics and machine learning for vision;
  • Multi-spectral/hyperspectral analysis;
  • RGBD analysis;
  • Image and video compression;
  • Early and biologically inspired vision;
  • Model-based vision;
  • Motion, flow and tracking;
  • Segmentation, grouping and clustering methods;
  • Object detection, scene understanding and image/pattern classification;
  • Video analysis for action and event recognition;
  • Computational photography;
  • Stereo, calibration, geometric modelling, and processing;
  • Remote sensing;
  • Vision applications for quality assurance, medical diagnosis, manufacturing inspection, entertainment, animation industry etc.;
  • Vision applications for visualization, interaction, and graphics. 

Dr. Aristeidis Tsitiridis
Guest Editor

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Symmetry is an international peer-reviewed open access monthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Published Papers (8 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Research

23 pages, 2000 KiB  
Article
Blend of Deep Features and Binary Tree Growth Algorithm for Skin Lesion Classification
by Sunil Kumar, Vijay Kumar Nath and Deepika Hazarika
Symmetry 2023, 15(12), 2213; https://doi.org/10.3390/sym15122213 - 18 Dec 2023
Viewed by 1965
Abstract
One of the most frequently identified cancers globally is skin cancer (SC). The computeraided categorization of numerous skin lesions via dermoscopic images is still a complicated problem. Early recognition is crucial since it considerably increases the survival chances. In this study, we introduce [...] Read more.
One of the most frequently identified cancers globally is skin cancer (SC). The computeraided categorization of numerous skin lesions via dermoscopic images is still a complicated problem. Early recognition is crucial since it considerably increases the survival chances. In this study, we introduce an approach for skin lesion categorization where, at first, a powerful hybrid deep-feature set is constructed, and then a binary tree growth (BTG)-based optimization procedure is implemented using a support vector machine (SVM) classifier with an intention to compute the categorizing error and build symmetry between categories, for selecting the most significant features which are finally fed to a multi-class SVM for classification. The hybrid deep-feature set is constructed by utilizing two pre-trained models, i.e., Densenet-201, and Inception-v3, that are fine-tuned on skin lesion data. These two deep-feature models have distinct architectures that characterize dissimilar feature abstraction strengths. This effective deep feature framework has been tested on two publicly available challenging datasets, i.e., ISIC2018 and ISIC2019. The proposed framework outperforms many existing approaches and achieves notable {accuracy, sensitivity, precision, specificity} values of {98.50%, 96.60%, 97.84%, 99.59%} and {96.60%, 94.21%, 96.38%, 99.39%} for the ISIC2018 and ISIC2019 datasets, respectively. The proposed implementation of the BTG-based optimization algorithm performs significantly better on the proposed feature blend for skin lesion classification. Full article
(This article belongs to the Special Issue Symmetry/Asymmetry in Computer Vision and Image Processing)
Show Figures

Figure 1

12 pages, 1454 KiB  
Article
Cross-Camera Tracking Model and Method Based on Multi-Feature Fusion
by Peng Zhang, Siqi Wang, Wei Zhang, Weimin Lei, Xinlei Zhao, Qingyang Jing and Mingxin Liu
Symmetry 2023, 15(12), 2145; https://doi.org/10.3390/sym15122145 - 02 Dec 2023
Viewed by 1110
Abstract
Multi-camera video surveillance has been widely applied in crowd statistics and analysis in smart city scenarios. Most existing studies rely on appearance or motion features for cross-camera trajectory tracking, due to the changing asymmetric perspectives of multiple cameras and occlusions in crowded scenes, [...] Read more.
Multi-camera video surveillance has been widely applied in crowd statistics and analysis in smart city scenarios. Most existing studies rely on appearance or motion features for cross-camera trajectory tracking, due to the changing asymmetric perspectives of multiple cameras and occlusions in crowded scenes, resulting in low accuracy and poor tracking performance. This paper proposes a tracking method that fuses appearance and motion features. An implicit social model is used to obtain motion features containing spatio-temporal information and social relations for trajectory prediction. The TransReID model is used to obtain appearance features for re-identification. Fused features are derived by integrating appearance features, spatio-temporal information and social relations. Based on the fused features, multi-round clustering is adopted to associate cross-camera objects. Exclusively employing robust pedestrian reidentification and trajectory prediction models, coupled with the real-time detector YOLOX, without any reliance on supplementary information, an IDF1 score of 70.64% is attained on typical datasets derived from AiCity2023. Full article
(This article belongs to the Special Issue Symmetry/Asymmetry in Computer Vision and Image Processing)
Show Figures

Figure 1

19 pages, 2921 KiB  
Article
Audio-Visual Effects of a Collaborative Robot on Worker Efficiency
by Aljaž Javernik, Klemen Kovič, Iztok Palčič and Robert Ojsteršek
Symmetry 2023, 15(10), 1907; https://doi.org/10.3390/sym15101907 - 12 Oct 2023
Cited by 2 | Viewed by 794
Abstract
Collaborative workplaces are increasingly used in production systems. The possibility of direct collaboration between robots and humans brings many advantages, as it allows the simultaneous use of human and robotic strengths. However, collaboration between a collaborative robot and a human raises concerns about [...] Read more.
Collaborative workplaces are increasingly used in production systems. The possibility of direct collaboration between robots and humans brings many advantages, as it allows the simultaneous use of human and robotic strengths. However, collaboration between a collaborative robot and a human raises concerns about the safety of the interaction, the impact of the robot on human health, human efficiency, etc. Additionally, research is unexplored in the field of the collaborative robot’s audio-visual effects on the worker’s efficiency. Our study results contribute to the field of studying collaborative robots’ audio-visual effects on the worker’s behavior. In this research, we analyze the effect of the changing motion parameters of the collaborative robot (speed and acceleration) on the efficiency of the worker and, consequently, on the production process. Based on the experimental results, we were able to confirm the impact of robot speed and acceleration on the worker’s efficiency in terms of assembly time. We also concluded that the sound level and presence of a visual barrier between the worker and robot by themselves have no effect on the worker’s efficiency. The experimental part of the paper clearly identifies the impact of visualization on work efficiency. According to the results, the robot’s audio-visual effects play a key role in achieving high efficiency and, consequently, justifying the implementation of a collaborative workplace. Full article
(This article belongs to the Special Issue Symmetry/Asymmetry in Computer Vision and Image Processing)
Show Figures

Figure 1

21 pages, 18458 KiB  
Article
Symmetric Enhancement of Visual Clarity through a Multi-Scale Dilated Residual Recurrent Network Approach for Image Deraining
by Jameel Ahmed Bhutto, Ruihong Zhang and Ziaur Rahman
Symmetry 2023, 15(8), 1571; https://doi.org/10.3390/sym15081571 - 11 Aug 2023
Cited by 1 | Viewed by 861
Abstract
Images captured during rainy days present the challenge of maintaining a symmetrical balance between foreground elements (like rain streaks) and the background scenery. The interplay between these rain-obscured images is reminiscent of the principle of symmetry, where one element, the rain streak, overshadows [...] Read more.
Images captured during rainy days present the challenge of maintaining a symmetrical balance between foreground elements (like rain streaks) and the background scenery. The interplay between these rain-obscured images is reminiscent of the principle of symmetry, where one element, the rain streak, overshadows or disrupts the visual quality of the entire image. The challenge lies not just in eradicating the rain streaks but in ensuring the background is symmetrically restored to its original clarity. Recently, numerous deraining algorithms that employ deep learning techniques have been proposed, demonstrating promising results. Yet, achieving a perfect symmetrical balance by effectively removing rain streaks from a diverse set of images, while also symmetrically restoring the background details, is a monumental task. To address this issue, we introduce an image-deraining algorithm that leverages multi-scale dilated residual recurrent networks. The algorithm begins by utilizing convolutional activation layers to symmetrically process both the foreground and background features. Then, to ensure the symmetrical dissemination of the characteristics of rain streaks and the background, it employs long short-term memory networks in conjunction with gated recurrent units across various stages. The algorithm then incorporates dilated residual blocks (DRB), composed of dilated convolutions with three distinct dilation factors. This integration expands the receptive field, facilitating the extraction of deep, multi-scale features of both the rain streaks and background information. Furthermore, considering the complex and diverse nature of rain streaks, a channel attention (CA) mechanism is incorporated to capture richer image features and enhance the model’s performance. Ultimately, convolutional layers are employed to fuse the image features, resulting in a derained image. An evaluation encompassing seven benchmark datasets, assessed using five quality metrics against various conventional and modern algorithms, confirms the robustness and flexibility of our approach. Full article
(This article belongs to the Special Issue Symmetry/Asymmetry in Computer Vision and Image Processing)
Show Figures

Figure 1

27 pages, 9831 KiB  
Article
A Study on Perception of Visual–Tactile and Color–Texture Features of Footwear Leather for Symmetric Shoes
by Dan-Dan Xu, Chih-Fu Wu and Chung-Shing Wang
Symmetry 2023, 15(7), 1462; https://doi.org/10.3390/sym15071462 - 22 Jul 2023
Cited by 1 | Viewed by 1367
Abstract
The study applies Kansei engineering in analyzing the color and texture of leather footwear, utilizing neural network verification to mirror consumers’ visual and tactile imageries onto varieties of leather. This aids in the development of an advanced system for selecting leather footwear based [...] Read more.
The study applies Kansei engineering in analyzing the color and texture of leather footwear, utilizing neural network verification to mirror consumers’ visual and tactile imageries onto varieties of leather. This aids in the development of an advanced system for selecting leather footwear based on such impressions. Initially, representative word pairs denoting consumers’ visual and tactile perceptions of leather footwear were delineated. Post-evaluation of these perceptions through a sensibility assessment questionnaire was administered, using 54 samples of leather footwear provided by manufacturers, with each leather type codified in terms of visual and tactile sensibilities. Subsequently, a customized software algorithm was crafted to isolate the primary color and adhesiveness as color features from the leather sample images. Analyzing grayscale values of the images and using pixel neighborhood as a base, the associated calculation methods, such as LBP, SCOV, VAR, SAC, etc., were proposed to extract texture features from the images. The derived color and texture feature values were used as the input layer and the sensory vocabulary quantified values as the output layer. Backpropagation neural network training was conducted on 49 leather samples, with five leather samples used for testing, culminating in the verification of neural network training for three types and 17 combinations. The outcome was an optimal method for leather footwear Kansei engineering and neural network training, establishing a design process for leather footwear characteristics assisted by sensory vocabulary and a backpropagation neural network. Additionally, a computer-aided system for selecting leather footwear, based on these impressions, was designed and validated through footwear design. This study utilized symmetry in footwear design. By using the design of a single shoe to represent the imagery of a pair of symmetrical shoes, we verified whether the leather samples recommended by the leather imagery selection query system met the expected system input settings. Full article
(This article belongs to the Special Issue Symmetry/Asymmetry in Computer Vision and Image Processing)
Show Figures

Figure 1

23 pages, 7593 KiB  
Article
Anomaly Detection in Chest X-rays Based on Dual-Attention Mechanism and Multi-Scale Feature Fusion
by Dong Liu, Shuzhen Lu, Lingrong Zhang and Yaohui Liu
Symmetry 2023, 15(3), 668; https://doi.org/10.3390/sym15030668 - 07 Mar 2023
Cited by 1 | Viewed by 2060
Abstract
The efficient and automatic detection of chest abnormalities is vital for the auxiliary diagnosis of medical images. Many studies utilize computer vision and deep learning approaches involving symmetry and asymmetry concepts to detect chest abnormalities, and achieve promising findings. However, an accurate instance-level [...] Read more.
The efficient and automatic detection of chest abnormalities is vital for the auxiliary diagnosis of medical images. Many studies utilize computer vision and deep learning approaches involving symmetry and asymmetry concepts to detect chest abnormalities, and achieve promising findings. However, an accurate instance-level and multi-label detection of abnormalities in chest X-rays remains a significant challenge. Here, a novel anomaly detection method for symmetric chest X-rays using dual-attention and multi-scale feature fusion is proposed. Three aspects of our method should be noted in comparison with the previous approaches. We improved the deep neural network with channel-dimensional and spatial-dimensional attention to capture the abundant contextual features. We then used an optimized multi-scale learning framework for feature fusion to adapt to the scale variation in the abnormalities. Considering the influence of the data imbalance and other factors, we introduced a seesaw loss function to flexibly adjust the sample weights and enhance the model learning efficiency. The rigorous experimental evaluation of a public chest X-ray dataset with fourteen different types of abnormalities demonstrates that our model has a mean average precision of 0.362 and outperforms existing methods. Full article
(This article belongs to the Special Issue Symmetry/Asymmetry in Computer Vision and Image Processing)
Show Figures

Figure 1

20 pages, 8168 KiB  
Article
An Efficient Machine Learning-Based Model to Effectively Classify the Type of Noises in QR Code: A Hybrid Approach
by Jawad Rasheed, Ahmad B. Wardak, Adnan M. Abu-Mahfouz, Tariq Umer, Mirsat Yesiltepe and Sadaf Waziry
Symmetry 2022, 14(10), 2098; https://doi.org/10.3390/sym14102098 - 08 Oct 2022
Cited by 9 | Viewed by 1963
Abstract
Granting smart device consumers with information, simply and quickly, is what drives quick response (QR) codes and mobile marketing to go hand in hand. It boosts marketing campaigns and objectives and allows one to approach, engage, influence, and transform a wider target audience [...] Read more.
Granting smart device consumers with information, simply and quickly, is what drives quick response (QR) codes and mobile marketing to go hand in hand. It boosts marketing campaigns and objectives and allows one to approach, engage, influence, and transform a wider target audience by connecting from offline to online platforms. However, restricted printing technology and flexibility in surfaces introduce noise while printing QR code images. Moreover, noise is often unavoidable during the gathering and transmission of digital images. Therefore, this paper proposed an automatic and accurate noise detector to identify the type of noise present in QR code images. For this, the paper first generates a new dataset comprising 10,000 original QR code images of varying sizes and later introduces several noises, including salt and pepper, pepper, speckle, Poisson, salt, local var, and Gaussian to form a dataset of 80,000 images. We perform extensive experiments by reshaping the generated images to uniform size for exploiting Convolutional Neural Network (CNN), Support Vector Machine (SVM), and Logistic Regression (LG) to classify the original and noisy images. Later, the analysis is further widened by incorporating histogram density analysis to trace and target highly important features by transforming images of varying sizes to obtain 256 features, followed by SVM, LG, and Artificial Neural Network (ANN) to identify the noise type. Moreover, to understand the impact of symmetry of noises in QR code images, we trained the models with combinations of 3-, 5-, and 7-noise types and analyzed the classification performance. From comparative analyses, it is noted that the Gaussian and Localvar noises possess symmetrical characteristics, as all the classifiers did not perform well to segregate these two noises. The results prove that histogram analysis significantly improves classification accuracy with all exploited models, especially when combined with SVM, it achieved maximum accuracy for 4- and 6-class classification problems. Full article
(This article belongs to the Special Issue Symmetry/Asymmetry in Computer Vision and Image Processing)
Show Figures

Figure 1

28 pages, 4991 KiB  
Article
Machine Vision Approach for Diagnosing Tuberculosis (TB) Based on Computerized Tomography (CT) Scan Images
by Inayatul Haq, Tehseen Mazhar, Qandeel Nasir, Saqib Razzaq, Syed Agha Hassnain Mohsan, Mohammed H. Alsharif, Hend Khalid Alkahtani, Ayman Aljarbouh and Samih M. Mostafa
Symmetry 2022, 14(10), 1997; https://doi.org/10.3390/sym14101997 - 23 Sep 2022
Cited by 4 | Viewed by 2682
Abstract
Tuberculosis is curable, still the world’s second inflectional murderous disease, and ranked 13th (in 2020) by the World Health Organization on the list of leading death causes. One of the reasons for its fatality is the unavailability of modern technology and human experts [...] Read more.
Tuberculosis is curable, still the world’s second inflectional murderous disease, and ranked 13th (in 2020) by the World Health Organization on the list of leading death causes. One of the reasons for its fatality is the unavailability of modern technology and human experts for early detection. This study represents a precise and reliable machine vision-based approach for Tuberculosis detection in the lung through Symmetry CT scan images. TB spreads irregularly, which means it might not affect both lungs equally, and it might affect only some part of the lung. That’s why regions of interest (ROI’s) from TB infected and normal CT scan images of lungs were selected after pre-processing i.e., selection/cropping, grayscale image conversion, and filtration, Statistical texture features were extracted, and 30 optimized features using F (Fisher) + PA (probability of error + average correlation) + MI (mutual information) were selected for final optimization and only 6 most optimized features were selected. Several supervised learning classifiers were used to classify between normal and infected TB images. Artificial Neural Network (ANN: n class) based classifier Multi-Layer Perceptron (MLP) showed comparatively better and probably best accuracy of 99% with execution time of less than a second, followed by Random Forest 98.83%, J48 98.67%, Log it Boost 98%, AdaBoostM1 97.16% and Bayes Net 96.83%. Full article
(This article belongs to the Special Issue Symmetry/Asymmetry in Computer Vision and Image Processing)
Show Figures

Figure 1

Back to TopTop