Object Detection: Algorithms, Computations and Practices

A special issue of Mathematics (ISSN 2227-7390). This special issue belongs to the section "Computational and Applied Mathematics".

Deadline for manuscript submissions: 31 May 2024 | Viewed by 3331

Special Issue Editors

School of Automation, Southeast University, Nanjing 210096, China
Interests: image processing; applied machine learning; image generation; intelligent vision systems
School of Instrument Science and Engineering, Southeast University, Nanjing 210096, China
Interests: virtual reality; augmented reality; computer animation

Special Issue Information

Dear Colleagues,

We are pleased to invite you to submit your work to our journal. Object detection is the application of computer vision, pattern recognition and image processing and other technologies to detect instances of objects of a certain class within an image, and its application scenarios cover important areas such as intelligent security, traffic surveillance, scene understanding, and autonomous driving. In recent years, the rapid development of deep learning techniques has brought tremendous advances in the field of object detection. However, there are still some practical problems that have not yet been solved, such as the occlusions of object in pedestrian detection, scene text detection and recognition, few-shot object detection, tiny object detection, interpretable learning, face detection and recognition in complex real scenes, model drift and re-detection problems in object tracking, etc. Therefore, much efforts have to be engaged to remarkably improve the performance of object detection.

This Special Issue aims to discuss and solve the challenging problems related to object detection within the framework of deep learning. We invite authors to submit manuscripts that are highly related to the topics of this special issue and which have not been published before. The topics of interest include, but are not limited to:

  • Anchor and Anchor-free object detection;
  • Few-shot/zero-shot object detection;
  • Weak/semi/unsupervised object detection;
  • Long-tailed object detection;
  • Small object detection;
  • 3D object detection;
  • Object detection in challenging conditions;
  • Fusion of point cloud and images for object detection;
  • Large-scale datasets for object detection.

Dr. Siyu Xia
Dr. Libo Sun
Guest Editors

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Mathematics is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2600 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • object detection
  • computer vision
  • deep learning

Published Papers (3 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Research

14 pages, 11397 KiB  
Article
Improving the Performance of Object Detection by Preserving Balanced Class Distribution
by Heewon Lee and Sangtae Ahn
Mathematics 2023, 11(21), 4460; https://doi.org/10.3390/math11214460 - 27 Oct 2023
Viewed by 1212
Abstract
Object detection is a task that performs position identification and label classification of objects in images or videos. The information obtained through this process plays an essential role in various tasks in the field of computer vision. In object detection, the data utilized [...] Read more.
Object detection is a task that performs position identification and label classification of objects in images or videos. The information obtained through this process plays an essential role in various tasks in the field of computer vision. In object detection, the data utilized for training and validation typically originate from public datasets that are well-balanced in terms of the number of objects ascribed to each class in an image. However, in real-world scenarios, handling datasets with much greater class imbalance, i.e., very different numbers of objects for each class, is much more common, and this imbalance may reduce the performance of object detection when predicting unseen test images. In our study, thus, we propose a method that evenly distributes the classes in an image for training and validation, solving the class imbalance problem in object detection. Our proposed method aims to maintain a uniform class distribution through multi-label stratification. We tested our proposed method not only on public datasets that typically exhibit balanced class distribution but also on private datasets that may have imbalanced class distribution. We found that our proposed method was more effective on datasets containing severe imbalance and less data. Our findings indicate that the proposed method can be effectively used on datasets with substantially imbalanced class distribution. Full article
(This article belongs to the Special Issue Object Detection: Algorithms, Computations and Practices)
Show Figures

Figure 1

14 pages, 5734 KiB  
Article
Enhanced Non-Maximum Suppression for the Detection of Steel Surface Defects
by Seong-Hwan Kang, Vikas Palakonda, Il-Min Kim, Jae-Mo Kang and Sangseok Yun
Mathematics 2023, 11(18), 3898; https://doi.org/10.3390/math11183898 - 13 Sep 2023
Viewed by 1055
Abstract
Quality control in manufacturing equipment relies heavily on the detection of steel surface defects. Recently, there have been an increasing number of efforts in which object detection techniques have been utilized to achieve promising results in the detection of steel surface defects since [...] Read more.
Quality control in manufacturing equipment relies heavily on the detection of steel surface defects. Recently, there have been an increasing number of efforts in which object detection techniques have been utilized to achieve promising results in the detection of steel surface defects since the defect patterns can be considered objects. To enhance the detection performance in the object detection problem, the non-maximum suppression (NMS) step, which eliminates redundant boxes overlapped with a box having the greatest detection score, is essential. In this work, we propose a novel NMS to improve the detection method of steel surface defects. The proposed NMS approach is composed of three novel techniques: IoU regularization, threshold adjustment, and comparison rule modification to enhance the detection performance. To evaluate the performance of the proposed NMS, we carry out extensive numerical experiments using the YOLOv7 and EfficientDet models on the steel surface defect datasets, NEU-DET and GC10-DET. The experimental results demonstrate that the proposed NMS outperforms the conventional NMS methods in both quantitative and qualitative manners. Full article
(This article belongs to the Special Issue Object Detection: Algorithms, Computations and Practices)
Show Figures

Figure 1

13 pages, 2112 KiB  
Article
Illumination Removal via Gaussian Difference L0 Norm Model for Facial Experssion Recognition
by Xiaohe Li and Wankou Yang
Mathematics 2023, 11(12), 2667; https://doi.org/10.3390/math11122667 - 12 Jun 2023
Viewed by 650
Abstract
Face images in the logarithmic space can be considered as a sum of the texture component and lighting map component according to Lambert Reflection. However, it is still not easy to separate these two parts, because face contour boundaries and lighting change boundaries [...] Read more.
Face images in the logarithmic space can be considered as a sum of the texture component and lighting map component according to Lambert Reflection. However, it is still not easy to separate these two parts, because face contour boundaries and lighting change boundaries are difficult to distinguish. In order to enhance the separation quality of these to parts, this paper proposes an illumination standardization algorithm based on extreme L0 Gaussian difference regularization constraints, assuming that illumination is massively spread all over the image but illumination change boundaries are simple, regular, and sparse enough. The proposed algorithm uses an iterative L0 Gaussian difference smoothing method, which achieves a more accurate lighting map estimation by reserving the fewest boundaries. Thus, the texture component of the original image can be restored better by simply subtracting the lighting map estimated. The experiments in this paper are organized with two steps: the first step is to observe the quality of the original texture restoration, and the second step is to test the effectiveness of our algorithm for complex face classification tasks. We choose the facial expression classification in this step. The first step experimental results show that our proposed algorithm can effectively recover face image details from extremely dark or light regions. In the second step experiment, we use a CNN classifier to test the emotion classification accuracy, making a comparison of the proposed illumination removal algorithm and the state-of-the-art illumination removal algorithm as face image preprocessing methods. The experimental results show that our algorithm works best for facial expression classification at about 5 to 7 percent accuracy higher than other algorithms. Therefore, our algorithm is proven to provide effective lighting processing technical support for the complex face classification problems which require a high degree of preservation of facial texture. The contribution of this paper is, first, that this paper proposes an enhanced TV model with an L0 boundary constraint for illumination estimation. Second, the boundary response is formulated with the Gaussian difference, which strongly responds to illumination boundaries. Third, this paper emphasizes the necessity of reserving details for preprocessing face images. Full article
(This article belongs to the Special Issue Object Detection: Algorithms, Computations and Practices)
Show Figures

Figure 1

Back to TopTop