Deep Learning Approach to UAV Detection and Classification by Using Compressively Sensed RF Signal

Mo, Yongguang; Huang, Jianjun; Qian, Gongbin

doi:10.3390/s22083072

Open AccessArticle

Deep Learning Approach to UAV Detection and Classification by Using Compressively Sensed RF Signal

by

Yongguang Mo

¹

,

Jianjun Huang

^1,*

and

Gongbin Qian

²

¹

Guangdong Key Laboratory of Intelligent Information Precessing, College of Electronic and Information Engineering, ATR Key Laboratory, Shenzhen University, Shenzhen 518060, China

²

College of Electronic and Information Engineering, Shenzhen University, Shenzhen 518060, China

^*

Author to whom correspondence should be addressed.

Sensors 2022, 22(8), 3072; https://doi.org/10.3390/s22083072

Submission received: 17 March 2022 / Revised: 14 April 2022 / Accepted: 14 April 2022 / Published: 16 April 2022

(This article belongs to the Section Remote Sensors)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Recently, the frequent occurrence of the misuse and intrusion of UAVs has made it a research challenge to identify and detect them effectively, and relatively high bandwidth and pressure on data transmission and real-time processing exist when sampling UAV communication signals using the RF detection method. In this paper, firstly, for data sampling, we chose a compressed sensing technique to replace the traditional sampling theorem and used a multi-channel random demodulator to sample the signal; secondly, for the detection and identification of the presence, type, and flight pattern of UAVs, a multi-stage deep learning-based UAV identification and detection method was proposed by exploiting the difference in communication signals between UAVs and controllers under different circumstances. The data samples are first passed by detectors that detect the presence of UAVs, then classifiers are used to identify the type of UAVs, and finally flight patterns are judged by the corresponding classifiers, for which two neural network structures (DNN and CNN) are constructed by deep learning algorithms and evaluated and validated by a 10-fold cross-validation method, with the DNN network used for detectors and the CNN network for subsequent type and flying mode classification. The experimental results demonstrate, first, the effectiveness of using compressed sensing for sampling the communication signals of UAVs and controllers; and second, the detecting method with multi-stage DL detects higher efficiency and accuracy compared with existing detecting methods, detecting the presence, type, and flight model of UAVs with an accuracy of over 99%.

Keywords:

unmanned aerial vehicles; detection and identification; radio frequency; compressed sensing; deep learning

1. Introduction

Drones, also known as unmanned aerial vehicles (UAVs), are unmanned aircraft operated by radio-controlled equipment and self-contained programmable control devices or operated completely or intermittently and autonomously by an onboard computer. In recent years, the rapid development of the UAV industries has been driven by the maturing technologies in the fields of information, control, and communication, which has led to the gradual lowering of the threshold for the use of consumer- and industrial-grade UAVs, making them widely used in various civil and commercial applications, such as cinematography, agricultural inspection, and rescue search [1]. However, due to the lack of unified industry standards and norms, the threshold of access is low and most UAVs are operated in a “black flight” state. While bringing convenience to various industries, once there is abuse and illegal intrusion, it can seriously threaten major public events, important areas, airspace security, personal privacy, and national security, and even endanger human lives and cause property damage. For example, they can endanger the safety of important targets and affect the take-off and landing of passenger planes [2]; in addition, they may be used by lawless elements to spy on people’s privacy, smuggle drugs, or even become “suicide drones” by terrorists with bombs [3,4]. Though the government has performed a lot of work on drone regulation in recent years, including registration of real names, building a comprehensive monitoring platform for drone cloud systems, and setting up electronic fences, there are still a large number of hard-to-detect drones that need to be detected and identified by relevant monitoring technologies [5].

Given various problems caused by misuse and malicious intrusion of UAVs, the accurate detection and classification of UAVs are particularly important to ensure public and national security. Many experts and scholars have proposed many different UAV detection methods, which can be roughly divided into UAV detection algorithms based on acoustics [6,7], visual (image or video) [8,9], radar [10,11,12,13], and radio frequency [14,15]. Considering the first three methods in practical application, there are some limitations and deficiencies, and it is difficult to achieve a good detection effect. For example, the acoustic-based UAV detection method has such problems, such as susceptibility to interference from environmental or atmospheric noise and small detection range; the visual-based UAV detection method cannot be used in dim and severely obstructed vision scenarios; and the poor detection performance of the radar detection method is due to the low cross-sectional area of the UAV radar, which is easily obscured by obstacles such as walls and buildings [6,10]. In contrast, the RF fingerprint-based UAV detection method technique is based on listening to UAV communication signals by using the high-gain receiving antenna and high-sensitivity receiving system, which can solve the distance problems related to visual and acoustic techniques and the influence of relevant environmental factors, and the RF fingerprint detection method only requires a sufficient UAV dataset for training, which makes the RF detection method a promising solution.

UAVs are usually flying in the air to achieve a specific task. However, within the no-fly zone, there are techniques that need to be used to detect any drone intrusion in the area for security reasons. In this paper, firstly, for data acquisition, we choose to use compressive sampling [16,17] instead of the traditional sampling theorem since the UAV communication signal is sparse in nature, and for this reason, we construct a multi-channel random demodulation (MCRD) [18] sampling structure to sample and compress the UAV communication signal simultaneously. This reduces the signal bandwidth to some extent and relieves the pressure on data storage, transmission, and processing. Deep learning (DL) is a very popular field of machine learning in recent years, and unlike other machine learning algorithms, deep learning techniques are considered to be one of the most powerful and effective techniques for selecting, extracting, and analyzing features from the original dataset with relatively strong generalization capabilities. Therefore, we choose to use the deep learning approach to build neural networks for training and validation of the original UAV dataset.

In summary, to effectively and accurately detect and classify UAVs, we propose a UAV radio frequency fingerprint detection and classification algorithm based on compressed sensing and deep learning algorithms. The main contributions of this article are as follows:

According to the characteristics of the UAV communication RF signal, the compressed sensing technology is introduced, and its effectiveness is verified through experiments.
A deep learning UAV detection and classification network based on radio frequency compressed signals is constructed by using deep learning algorithms;
Filtering and feature extraction are performed on the compressed measurement signal, which improves the classification effect of UAV types and modes.

The rest of this paper is organized as follows: Section 2 mainly summarizes the related work and techniques; Section 3 focuses on the proposed sampling, pre-processing, and detecting methods; Section 4 describes the dataset, experiments, evaluation metrics, and experimental results used in this paper; Section 5 compares our method with other detecting methods using the same dataset; and Section 6 concludes and provides an outlook on future work.

2. Related Work

In this section, we mainly introduce some work related to the RF fingerprint-based UAV detecting method. In general, the RF-based UAV detecting method has two main parts: one is to collect the background RF signals, the RF signals of various types of UAVs, and build the raw dataset available for training; the second is to train and validate the original dataset of drones using the corresponding detection and classification methods to finish the detection and classification of UAVs.

2.1. Data Acquisition

For data acquisition of UAV RF signals, many papers nowadays basically use the traditional Nyquist sampling theorem for sampling as they do not take into account the sparse nature of UAV communication signals. In [19], Al-Sa’d et al. address the lack of an open-source dataset of radio frequency (RF) signals for UAV communication by using two NI USRP-2943R RF receivers to collect, analyze, and record raw RF signals from UAVs in different flight modes at a bandwidth of at least 80 MHz by the sampling theorem to construct a UAV RF database. The sampling theorem requires that the sampling rate of the signal must be greater than or equal to twice the maximum frequency to recover the original signal from the sampled signal. This means that it will have a sampling rate of 160 MHz or more, which is fully manageable by the sampling system, but there is still a large pressure on the storage, transmission, and processing of data for a UAV detection system with high requirements for lightweight and real-time [20]. However, we note that the RF signals communicated between the UAV and the controller are sparse, which means that it is perfectly possible to use compressed sensing techniques to sample them instead of the Nyquist sampling theorem. Moreover, introducing the CS framework to UAV signal acquisition has at least several advantages: (i) the number of samples collected from sensors is significantly reduced, (ii) the power consumption of the CS-based acquisition system is decreased, (iii) the processing time required by the algorithms required for refinement is reduced, and (iv) the observation bandwidth of the overall acquisition system is increased [21].

Compressed sensing, also known as compressive sampling [16], breaks the limits of the traditional sampling theorem so that the sampling rate of the signal is no longer limited by the signal bandwidth, but depends on the content of the information in the signal. It takes advantage of the sparsity of the signal to obtain the compressed measure of the signal at a sampling rate lower than the Nyquist rate, samples and compresses the signal at the same time, and then recovers the original signal by a reconstruction algorithm. In [21], for the problem of detecting, locating, and tracking mobile RF transmitters, Pasquale et al. detail several emerging areas of this new RF sensing system, and also propose and discuss a technical approach to use compressed sensing for RF sensing systems, and determine its feasibility. It is also pointed out that the use of CS techniques for RF sensing systems does not require a complete signal reconstruction from compressed measurements. In [22], to solve the problem of real-time UAV data return to the receiving station with limited bandwidth, Huang et al. utilize compressed sensing and matrix complementation techniques to achieve real-time transmission of UAV monitoring data, which greatly reduces the data return time while restoring the image almost identical to the original one. In [23], to correctly classify ground targets by UAVs (UAVs), Zhu et al. used the PCA algorithm to remove ground clutter and refined the micro-Doppler characteristics with compressed sensing to extract features, which improved the classification accuracy and achieved an overall accuracy of 92.5%. In [24], Gaigals et al., established the detection of UAV transmission signals by passive radar and compressed and sampled the UAV transmission signals by compressed sensing technique to investigate the effect of different antenna settings on the UAV detection quality. It is fully demonstrated that the compressed sensing technique is fully applicable to the RF signal sampling between the UAV and controller.

2.2. Detection and Classification Methods

Target detection methods have evolved and an increasing number of detection methods are proposed. Examples include some adaptive detectors proposed for the generalized likelihood ratio test (GLRT), the Rao test, the Wald test, or modifications to them [12]; the total Bregman divergence-based matrix information geometry (TBD-MIG) detector [11]; and the Signal Subspace Matching Detector (SSM) [13], etc. However, the RF signal targets for UAVs are currently mainly detected and classified by traditional machine learning or deep learning-based methods.

2.2.1. Traditional Method Based

The traditional machine learning-based UAV detecting method is simply to detect and classify the primary UAV dataset using traditional machine learning methods, such as kNN, XGboost, MLP, SVM, etc., to detect and classify the dataset.

In [25], Azuma et al. transformed the original signal to the wavelet domain in the detection phase and used a Naive Bayes Method to detect drones; in the classification phase, a set of the statistical features was extracted from the energy transient signal, then the important features were selected by Neighborhood Component Analysis (NCA), and finally the selected features were given back to several machine learning algorithms for classification, where the use of the kNN classification method achieved an accuracy rate of 96.3%. They experimented in [26] to achieve 98% classification accuracy using KNN under signal interference from Wi-Fi and Bluetooth transmitters at 25 db. In [27], Medaiyese et al. designed a machine learning RF-based DDI system with three machine learning models developed by the XGBoost algorithm, and experimentally verified that the low-frequency spectrum of the captured RF signal in the communication between the UAV and its flight controller as the input feature vector already contains enough information; and finally, using only the low-frequency spectrum as the input feature vector, the XGBoost algorithm was used to detect and identify the presence of an UAV, three types of UAVs, and corresponding flying modes with an average accuracy of 99.9%, 90.73%, and 70.09%, respectively. In [28], Ibrahim et al. proposed an improved UAV detection and identification method using an integrated learning approach based on hierarchical concepts combined with preprocessing and feature extraction stages of RF data. The detection method makes full use of the KNN and XGBoost methods in machine learning and is evaluated in the publicly available dataset of [15], which shows that the method can detect whether a UAV is flying in the region and can directly identify the type of UAV and corresponding flying modes with an average accuracy of about 99%.

2.2.2. Deep Learning Based

DL is a very popular field of machine learning in recent years, unlike traditional machine learning algorithms; firstly, it can select extracts as well as analyze features directly from the original dataset without relying on manual feature selection and extraction; and secondly, DL techniques can adapt to a variety of different datasets without affecting the classification performance, which means that its generalization ability is very powerful.

In [29], Al-Sa’d et al., for their dataset published in [19], designed three deep neural networks (DNN) for UAV presence, type, and flying modes detection and classification, respectively, which were validated by 10-fold cross-validation and evaluated using various metrics. The classification results show that the average accuracy of the DNN method in detecting the presence, type, and corresponding flying modes of UAVs reached 99.7%, 84.5%, and 46.8%, respectively, which fully demonstrated the feasibility of UAV detection and recognition through the dataset. In [30], Al-Emadi et al. similarly based on this dataset, tried to solve the UAV detection and classification problem by adding a feature extraction layer with a convolutional layer then a fully connected layer for detection and classification, which improved the accuracy of UAV detection and classification. In [31], Shi et al. built an RF UAV dataset including five types of UAV by recording two types of UAV with yunSDR software radio devices based on the dataset [19]. A 1D convolutional neural network was designed for detection and identification based on the characteristics of the sample data and CNN network. The final accuracy of detecting the presence of UAVs was 99.6%; the accuracy of classifying UAV types was 96.8% (6 types); and the identity rate of flying modes reached 94.4% (12 modes).

3. Proposed Sampling and Detection Method

In this section, we mainly introduce the sampling structure and data pre-processing method based on CS theory that we adapt according to the signal characteristics of the UAV RF signals; secondly, we introduce the neural network structure designed in this paper for detecting and classifying UAV RF fingerprints.

3.1. Detection and Classification Methods

3.1.1. Compressive Sensing

In this paper, the signal we sample is the RF signal x(t) of the session between the UAV and the controller. Although the x(t) signal is susceptible to various kinds of noise during communication, leading to a small percentage of zero elements in x(t), many elements are close to zero. It is continuously collected when it is sampled, but the session between the UAV and the controller is intermittent, so at the moment when there is no command transmission, the sampled signal is only noise, making the difference between the values of the elements of the x(t) signal very large. However, after filtering the signal x(t), x(t) is exhibiting sparsity in the frequency band; and when x(t) has sparsity, it just meets the prerequisite condition of compressive sensing, so the sampling of x(t) by compressive sensing is available.

With the development of compressed sensing technology, researchers have proposed a variety of sampling structures based on CS theory, whose representative results mainly include Random Filter (RF) [32], Random Sampling (RS) [33], Random Demodulation (RD) [34], and Modulated Wideband Converter (MWC) [35]. According to the characteristics of the x(t) signal, we constructed a sampling structure of multichannel random demodulation (MCRD) based on CS theory, as shown in Figure 1, which is composed of multiple channels, each of which can be considered as an independent RD system. The structure is similar to MWC, but the randomly mixed signal is different. The mixing function of MWC is periodic, while the mixed signal of MCRD is a random sequence, so only a random sequence needs to be generated in MCRD, and then modulated into different carrier frequencies.

Suppose the UAV communication signal is x(t) with a bandwidth of B. It is divided into m subbands, each with a bandwidth of B/m. Then the signals of these m subbands are multiplied with the random sequence separately and then sampled at low speed (sampling frequency lower than 2 B/m) after passing low-pass filtering to obtain compressed sampling for multiband random demodulation. The specific steps are as follows: the x(t) signal enters m channels at the same time, and in i-th) channel.

x(t) is multiplied with the mixed signal $p_{i} (t)$ time domain, and the spectrum of x(t) is frequency shifted to the lower frequency band; that, is the spectrum of x(t) is moved to the [−f/m, f/m] region, thus the mixed signal is:

$p_{i} (t) = 2 ε_{n} c o s (2 π \frac{i \cdot f}{m} t), t \in [\frac{n}{W}, \frac{n + 1}{W}), n = 0, 1, \dots W - 1, i = 0, 1, \dots, m - 1,$

(1)

where $ε_{n}$ is obtained with equal probability taking $\pm 1$ and W is the Nyquist rate.
After the ideal low pass filter h(t) with a cut off frequency of $f_{s} / 2$ ,the mixed output signal ${\tilde{x}}_{i} (t)$ will be filtered out of the high frequency signal and only the signal whose spectrum is shifted to the low frequency band (the narrow band signal with frequency in [ $-$ /2, $f_{s}$ /2]) will be retained, so the filtered signal is the signal of the i-th band in the original x(t).
Finally, the m group of low-speed sampling sequence $y_{i} (n)$ is obtained by sampling interval $T_{s}$ for the low-speed ADC sampling (ADC sampling satisfies the Nyquist theorem), when:

$y_{i} (n) = \int_{0}^{n \cdot T_{s}} x (τ) d τ \cdot \int_{0}^{n \cdot T_{s}} p_{i} (τ) h (n \cdot T_{s} - τ) d τ = Φ_{i} \cdot x (n),$

(2)

$Φ_{i} = \int_{0}^{n \cdot T_{s}} p_{i} (τ) h (n \cdot T_{s} - τ) d τ,$

(3)

In this paper, because the high-speed pseudo-random sequence (mixed signal

p_{i} (t)

is produced by taking

\pm 1

with equal probability, after low-pass filtering and low-speed sampling, the measurement matrix of the whole MCRD sampling structure becomes a random Bernoulli matrix.

Φ = [\begin{matrix} Φ_{0} & \dots & 0 \\ ⋮ & \begin{matrix} Φ_{1} \\ ⋱ \end{matrix} & ⋮ \\ 0 & \dots & Φ_{m - 1} \end{matrix}],

(4)

If the Nyquist sample of signal

{\tilde{x}}_{i} (t)

on a frequency band of the x(t) signal is

{\tilde{x}}_{i} (m)

with length M, then the compressed measurement signal

y_{i} (n)

in the corresponding ith channel in the MCRD is mathematically represented as the product of the observation matrix Φ ∈

R^{n \times m}

and

{\tilde{x}}_{i} (m)

.

y_{i} (n) = Φ_{i} \times x_{i} (m),

(5)

3.1.2. Data Pre-Processing

The RF signal x(t) communicated between the drone and the controller is sampled by the CS-based MCRD sampling structure to obtain the compressed measurement signal

y_{i} (n)

for each channel; however, the data are still further processed to make it easier to distinguish the characteristics of the data and to collocate each

y_{i} (n)

. The specific preprocessing is as follows.

Zero-centered the compressed measurement signal $y_{i} (n)$ , to remove mainly the zero-frequency component and the offset component.
Computing the $y_{i} (n)$ power spectral density $P_{i} (k)$ .

$P_{i} (k) = \frac{1}{N} \sum_{m = 0}^{N - 1} {[y_{i} (n) e^{j \frac{2 π m k}{N}}]}^{2},$

(6)

where N is the length of $y_{i} (n)$ , and $k \leq n$ .
The conversion signals of each channel are connected to create the complete spectrum. As the connection between each conversion signal obtained after MCRD sampling is the same. Therefore, only the connection of two adjacent channels is given here.

$P (k) = [P_{i} (k), c P_{i + 1} (k)],$

(7)

$c = \frac{\sum_{q = 0}^{Q} P_{i} (k) (N - q)}{\sum_{q = 0}^{Q} P_{i + 1} (k) (q)}, q = 0, 1, \dots, Q - 1$

(8)

where c is a normalization factor calculated as the ratio between the last Q samples of the previous power spectrum $P_{i} (k),$ and the first Q samples of the next power spectrum $P_{i + 1} (k)$ , and N is the total number of in $P_{i} (k)$ . c ensures that continuity of the spectrum is maintained between the two halves of the RF spectrum, as they are captured using different devices; however, the spectral bias is inevitable. Note that Q must be relatively small to successfully stitch the two spectra and large enough to average out any random fluctuations, e.g., Q = 10 for M = 2048.
Finally, only the connected power spectral density P needs to perform maximum normalization to vary all the values to within the interval from 0 to 1, which constitutes one piece of data for the input detection classification network.

3.2. Two/Multistage Classification Network

In this paper, we directly feature extraction and identify the compressed sampled signal without recovering the original signal through reconstruction algorithms. For such signals, feature extraction by conventional methods is relatively difficult, but DNN and CNN networks extract features by learning methods; the latter obviously being a little easier. Secondly, many recent research results [29,30,31] have shown that DNN and CNN networks are perfectly suitable for detecting and classifying RF signals. Therefore, the DNN and CNN network structures are chosen for detection and identification. Below, the CNN and proposed network structure is described in detail.

CNN is a class of DL techniques that has the unique feature of combining feature extraction and classification into a single model. It is composed of layers, each layer is a collection of neurons, and consists of three main layers: convolutional layer, pooling layer, and fully connected layer. The convolutional layer mainly extracts features from the input data; the pooling layer is used to reduce the computational complexity and cost by downsampling; and finally, the fully connected layer and the corresponding activation function are used to classify the features extracted from the previous layer. Thus, CNN provides a flexible architecture to increase and decrease the number of layers according to the need and the complexity of the data.

In this paper, we need to detect and classify the RF signals of communication between the UAV and controller. Within the no-fly zone, we need to detect the presence of the UAV, and identify the type of UAV and the current flying modes of that type of UAV. For this case, we can use a layered learning approach, where the layered model first determines the presence of the UAV; then, identifies the type of UAV based on that RF signal after the presence of the intruding UAV is determined; finally, the flying modes in which the UAV is located are further determined based on that type of RF signal. Given the different detection and classification complexities, we designed the following two different neural network models for them, respectively, as follows.

UAV Detection Network (DNN Network): mainly used in the no-fly area to detect the existence of UAVs; in other words, two types, with and without the presence of UAVs. Since these two signals are still relatively easy to distinguish in the spectrum, we constructed a DNN network consisting of five Dense layers, with the specific parameters shown in Table 1.

Table 1. UAV Detection Network.

Layer	Embedded Structure	Parameter	Activation
1	Input	(None, 2047)	-
2	Dense Dense Dense Dense	256 128 256 128	ReLU ReLU ReLU ReLU
3	Output Layer	2	sigmoid

2.: UAV Identification and Classification Network (CNN Network): mainly used to identify and classify the type and flying modes of UAVs. Firstly, after detecting an intruding UAV, the type of UAV (N class) is identified based on the RF signal, where N is the number of UAV type contained in the UAV dataset; secondly, after determining the type of UAV, then the motion mode under that type is further determined (4 Classes: on, connected and off; hovering; flying; and flying with video recording). For both cases, we use convolutional layers for feature extraction, pooling layers to reduce the information size, and finally fully connected layers for classification. A CNN network structure is designed, consisting of six 1D convolutional layers, each followed by a pooling layer, ending with two fully connected layers for further classification, interspersed with two Dropout layers to prevent overfitting, with the parameters shown in Table 2.

Table 2. UAV Identification and Classification Network.

Layer	Embedded Structure	Parameter	Activation
1	Input Layer	(None, 2047, 1)	-
2	Conv1D Max pooling	filters = 32, kernel size=6	ReLU
3, 4, 5	Conv1D Average pooling	filters = 64, kernel size = 3	ReLU
6, 7	Conv1D Average pooling	filters = 128, kernel size = 3	ReLU
8	Dropout Flatten	0.25	-
9	Dense Dropout Dense	256 0.22 128	ReLU
10	Output Layer	Type of Output	softmax

4. Experiments and Results

In this section, we first introduce the dataset used, followed by the detailed sampling process further in the simulation experiments, and finally the evaluation metrics and experimental results for performing the detection and classification.

4.1. Dataset

During the experiments, we choose the dataset publicly available in [19] to better compare with existing work. Based on this dataset, the compressed sampled dataset is simulated.

4.1.1. Raw Dataset

The dataset contains 227 segments of recorded RF signal strength data, each segment includes two parts: low-frequency signal

x_{l} (n)

and high-frequency signal

x_{h} (n)

; each part contains 1 million samples. They obtained from 3 UAVs (AR, Bebop, and Phantom) experiments with 10 types. There was 10.25 s of background RF activity data (no UAV), and 5.25 s of UAV communicated RF data. Besides the background RF activity, four different modes or states of the UAV are recorded: on, connected and off; hovering; flying; and flying with video recording, as shown in Table 3. The UAV is controlled by the controller, and the RF signal strength data are collected by two RF receivers that receive the first 40 MHZ (low-frequency signal) and the second 40 MHZ strength (high-frequency signal).

4.1.2. Simulation Dataset

In the dataset of [19], it is known that the UAV communication signals x(n) are all already acquired, which are split into two low-frequency signals

x_{l} (n)

and high-frequency signals

x_{h} (n)

with temporal continuity and sparsity, so that the number of channels m in the MCRD sampling structure of Section 3.1 is two when the random Bernoulli matrix becomes:

Φ = [\begin{matrix} Φ_{1} & 0 \\ 0 & Φ_{2} \end{matrix}],

(9)

For increasing the number of samples in the dataset, we divide the

x_{l} (n)

and

x_{h} (n)

signals into 100 segments and record each segment as signal

x_{l / h}_{i} (n)

with length

1 \times 10^{5}

. Based on the length of the

x_{l / h}_{i} (n)

signal and compressed measurement signal, we generated the 2048

\times 1 * 10^{5}

random Bernoulli matrix

Φ_{1 / 2}

. Then, the

x_{l / h}_{i} (n)

and

Φ_{1 / 2}

are substituted into the Equation (5) and low-pass filtered to obtain the corresponding low-dimensional observation signal with length 2048. After obtaining the low-dimensional observed signal

y_{l / h}_{i} (m)

, only feature extraction and spectral concatenation need to be performed according to the preprocessing link in Section 3.1. Note: To reduce the amount of data and considering that the power spectral density

P_{i} (k)

is symmetric here, it is sufficient to take half of each of the two channels

P_{i} (k)

for spectral connection.

4.1.3. Dataset Reliability Analysis

For preliminary validation of the method in this paper and to ensure the reliability of the dataset fed into the neural network, we statistically analyzed and initially examined the spectrum and correlation features of the RF signal dataset generated by simulated compressed sampling. After noise variation reduction and aggregation with smoothing of the spectrum, the average power spectra of the three given scenarios (Class 2, 4, and 10) in the dataset were obtained as shown in Figure 2. In two classes of scenarios, Class 1 represents the RF background activity in the absence of UAVs, and Class 2 represents the RF signal in the presence of UAVs. For four classes of scenarios, Class 1 represents RF background activity without UAV, Class 2 to 4 are Bebop, AR, and Phantom, respectively. For 10 classes, Class 1 represents RF background activity without UAV, Class 2 to 5 represents the four modes of Bebop, Class 6 to 9 represents the four modes of AR and Class 10 is Phantom.

As can be seen from Figure 2, there are certain differences between the average power spectra of different types of UAVs, and there are also certain differences between the average power spectra of different flight modes of the same type of UAV, but the differences are relatively small, so it is difficult to make accurate classification using traditional signal recognition methods; hence, we introduce convolutional neural networks to dig deeper into their intrinsic deep features for signal differentiation.

4.2. Counterpart Two/Multilevel Classification

For this dataset, our approach is shown in Figure 3. The raw RF signal from the drone is pre-processed with data to obtain the corresponding simulated data, which then enters the Detector in the learning phase. The output of this Detector on our RF signal informs whether there is a drone or no drone in the detected area. If there is a drone in the area, this data sample enters Classifier 0 to determine the type of drone detected (Bebop, AR, and Phantom 3). If the detected drone is Phantom 3, the output of the detection method is category number 9. Otherwise, the data sample enters Classifier 1, when the detected drone is Bebop, or Classifier 2, when the detected drone is AR. Classifiers 1 and 2 define the mode of the detected drone as one of the following based on four classes: on, connected and off; hovering; flying; and flying with video recording, where the classes for Bebop UAV are: 1 for on, connected and off mode; 2 for hover mode; 3 for flying mode; and 4 for flying with video recording mode. For the AR UAV, the modes are as follows: 5 for on, connected and off mode; 6 for hover mode; 7 for flying mode; and 8 for flying with video recording mode. The Detector uses the UAV detection network (DNN structure) from Table 1; the Classifier 0, 1, and 2 use the identification and classification network (CNN structure) from Table 2.

4.3. Detection and Classification

4.3.1. Assessment Indicators

In our process of challenging UAV detection and classification systems, we have added recall, error rate, precision, false-negative rate (FNR), false discovery rate (FDR), and F1 score to assess the goodness of the system, taking into account the fact that classes are unequally correlated with each other and that classes are unbalanced, in addition to using accuracy to assess the goodness of the system. These metrics are assessed and summarized in matrix form using a confusion matrix, which can be calculated from the equation below.

Accuracy = \frac{TP + TN}{TP + TN + FP + {FN}^{'}},

(10)

Precision = \frac{TP}{TP + {FP}^{'}},

(11)

Recall = \frac{TP}{TP + {FN}^{'}},

(12)

Error = 1 − Accuracy

(13)

FDR = 1 − Precision

(14)

FNR = 1 − Recall

(15)

F 1 score = 2 \times (\frac{Precision \times Recall}{Precision + Recall})

(16)

where TP, TN, FP and FN are true positives, true negatives, false positives, and false negatives, respectively.

4.3.2. Experimental Results

Within this subsection, we will experiment and discuss the neural network structure proposed in Section 3.2 with the parameters shown in Table 4. This paper uses 10-fold cross-validation to randomly divide the dataset into 10 subsets, with each remaining 9 subsets used for training, and 1 subset used for validation in 10 cycles.

UAV presence detection is actually a binary hypothesis problem, and for the detector (to detect the presence of UAV), the RF background activity without UAV signal and the RF signal with UAV are marked as zero and one, respectively, and then fed into the UAV detection network in Table 1 in Section 3.2. Using 10-fold cross-validation, 30 rounds are trained in batches of 10, and the last activation layer is chosen as a Sigmoid function as well as convergence using the MSE loss function. The final binary classification accuracy and F1 scores reach 100%, as shown in Figure 4a.

If the Detector identifies it as the UAV signal, we need to further classify its UAV type. The RF background activity signal without UAV signal, Bebop, AR, and Phantom are labeled as zero, one, two, and three, respectively; and finally, the data are fed into the CNN network in Table 2 in Section 3.2. By 10-fold cross-validation, 200 rounds are trained in batches of 20, and the last activation layer is chosen as a Softmax function as well as convergence using a Cross-Entropy loss function. The final quadruple classification accuracy was 99.6% and the F1 score was 99.6%, as shown in Figure 4b.

Finally, we further identify which mode the RF signal classified by the type classifier (Classifier 1) is in. For these two classifiers, we also use the CNN network structure from Table 2 in Section 3.2 for classification. The four patterns are also labeled and fed into the CNN network, which is trained for 300 rounds in batches of 20 using 10-fold cross-validation, with the last activation layer chosen as a Softmax function and a cross-entropy loss function for convergence. Finally, to better compare with other papers using this dataset, we took the background RF activity, the four types of AR/Bebop patterns, and the Phantom type, to form a 10-class confusion matrix with a classification accuracy of 99.3% and an F1 score of 99.3%, shown in Figure 4c.

Worth noting: the rows and columns of this matrix’s interior correspond to the predicted and true classes, respectively. The diagonal cells highlighted in green indicate correctly classified segments, while the non-diagonal cells indicate incorrectly classified segments, with the total number of segments and percentage of segments in each cell in bold. Next, the grey column on the far right shows Precision in green and FDR in red; the grey row at the bottom shows Recall in green and FNR in red; the blue cell in the bottom right corner of the figure shows Overall Precision in green and Error in red; the yellow column and row on the far left and top show the F1 score predicted for each class in green and it shows complementarity (completeness) in red; the orange cell in green indicates the average F1 score for all classes, and in red its complementarity.

5. Comparison with Other Methods

We compare the method proposed in this paper with other papers using the same dataset [27,28,29,30]. In paper [29], the authors used four fully connected layers to build a DNN network and utilized 10-fold cross-validation for training to verify the validity of the dataset; in paper [30], the authors tried to use CNN instead of DNN to solve the detection problem and obtain better accuracy; and in paper [27], the authors used only the lower band of RF signals and XGBoost algorithm to solve the classification problem. In the paper [28], the authors combine KNN and XGBoost to solve the pattern classification problem of UAVs using an integrated learning approach. Table 5 summarizes the performance of the five methods, and the results demonstrate that: firstly, the effectiveness of using compressed sensing techniques on sampling the communication signals between the UAV and the controller is verified; and secondly, it shows that the method used in this paper outperforms other learning methods in terms of accuracy, F1 score, and recall, at very low sampling rates.

6. Conclusions and Future Work

In this paper, we introduce the compressed sensing technique into the sampling of the session communication signal between the UAV and controller with a multi-channel random demodulator constructed. For the detection and identification of UAVs, we use the DNN and CNN networks designed in this paper for UAV detection and classification and finally achieve 100% accuracy for detecting UAV presence, 99.6% accuracy for identifying UAV types, and about 99.3% for UAV pattern classification. It is fully verified that the proposed signal sampling method and UAV detection algorithm in this paper is effective and can still obtain good detection results under a very low sampling rate. In future work, we intend to use multi-station detection to achieve UAV localization and tracking under the reduced pressure of data storage, transmission, and processing, through compressed sampling in this paper.

Author Contributions

Conceptualization, Y.M., J.H. and G.Q.; Funding acquisition, J.H. and G.Q.; Methodology, Y.M. and J.H.; Project administration, J.H. and G.Q.; Software, Y.M.; Writing—original draft, Y.M.; Writing—review and editing, Y.M.; and J.H. All authors will be informed about each step of manuscript processing including submission, revision, revision reminder, etc., via emails from our system or assigned Assistant Editor. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Shenzhen Science and Technology Projection (Grant No. 20200821152629001).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used in this study are available online under this website https://data.mendeley.com/datasets/f4c2b4n755/1 (accessed on 1 September 2021).

Conflicts of Interest

The authors declare no conflict of interest.

References

Aragón, R.V.; Castaño, C.R.; Correa, A.C. Impact and Technological Innovation of UAS/Drones in the World Economy: VI International Conference on Innovation and Trends in Engineering (CONIITI 2020). In Proceedings of the 2020 Congreso Internacional de Innovación y Tendencias en Ingeniería (CONIITI), Bogota, Colombia, 30 September–2 October 2020. [Google Scholar]
Wang, S. On the harm of black flight of drones to civil aviation flights. Econ. Trade Pract. 2017, 24, 337. (In Chinese) [Google Scholar]
Hao, Q.; Chen, L. Saudi Arabia Has Taken Control of Fires Caused by Houthi Attacks on Oil Fields That Have Not Affected Production. Available online: https://baijiahao.baidu.com/s?id=1642249467416436888&wfr=spider&for=pc (accessed on 15 April 2022).
Yemen’s Houthis Use Drones to Attack Multiple Targets in Saudi Arabia. Available online: https://baijiahao.baidu.com/s?id=1716994090457160779&wfr=spider&for=pc (accessed on 15 April 2022).
Xie, Y.; Ran, T. Urban-Security-Oriented Low Slow Small Target Detection. In Proceedings of the 2012 Second International Conference on Instrumentation, Measurement, Computer, Communication and Control, Harbin, China, 8–10 December 2012. [Google Scholar]
Hommes, A.; Shoykhetbrod, A.; Noetel, D.; Stanko, S.; Laurenzis, M.; Hengy, S.; Christnacher, F. Detection of acoustic, electro-optical and RADAR signatures of small unmanned aerial vehicles (Invited Paper). Proc. SPIE 2016, 9997. [Google Scholar] [CrossRef]
Bernardini, A.; Mangiatordi, F.; Pallotti, E.; Capodiferro, L. Drone detection by acoustic signature identification. Electron. Imaging 2017, 29, 60–64. [Google Scholar] [CrossRef]
Liu, H.; Wei, Z.; Chen, Y.; Pan, J.; Lin, L.; Ren, Y. Drone Detection Based on an Audio-Assisted Camera Array. In Proceedings of the 2017 IEEE Third International Conference on Multimedia Big Data (BigMM), Laguna Hills, CA, USA, 19–21 April 2017; pp. 402–406. [Google Scholar]
Unlu, E.; Zenou, E.; Riviere, N.; Dupouy, P.-E. Deep learning-based strategies for the detection and tracking of drones using several cameras. IPSJ Trans. Comput. Vis. Appl. 2019, 11, 7. [Google Scholar] [CrossRef] [Green Version]
Martelli, T.; Murgia, F.; Colone, F.; Bongioanni, C.; Lombardo, P. Detection and 3D localization of ultralight aircrafts and drones with a WiFi-based passive radar. In Proceedings of the International Conference on Radar Systems (Radar 2017), Belfast, UK, 23–26 October 2017; pp. 1–6. [Google Scholar] [CrossRef]
Hua, X.; Ono, Y.; Peng, L.; Cheng, Y.; Wang, H. Target Detection Within Nonhomogeneous Clutter Via Total Bregman Divergence-Based Matrix Information Geometry Detectors. IEEE Trans. Signal Process. 2021, 69, 4326–4340. [Google Scholar] [CrossRef]
Liu, J.; Liu, W.; Hao, C.; Orlando, D. Persymmetric Subspace Detectors with Multiple Observations in Homogeneous Environments. IEEE Trans. Aerosp. Electron. Syst. 2020, 56, 3276–3284. [Google Scholar] [CrossRef]
Wax, M.; Adler, A. Detection of the Number of Signals by Signal Subspace Matching. IEEE Trans. Signal Process. 2021, 69, 973–985. [Google Scholar] [CrossRef]
Taha, B.; Shoufan, A. Machine Learning-Based Drone Detection and Classification: State-of-the-Art in Research. IEEE Access 2019, 7, 138669–138682. [Google Scholar] [CrossRef]
Nguyen, P.; Truong, H.; Ravindranathan, M.; Nguyen, A.; Han, R.; Vu, T. Cost-Effective and Passive RF-Based Drone Presence Detection and Characterization. GetMobile Mob. Comput. Commun. 2018, 21, 30–34. [Google Scholar] [CrossRef]
Bourguignon, S.; Carfantan, H.; Idier, J. A Sparsity-Based Method for the Estimation of Spectral Lines from Irregularly Sampled Data. IEEE J. Sel. Top. Signal Process. 2007, 1, 575–585. [Google Scholar] [CrossRef]
Donoho, D.L. Compressed sensing. IEEE Trans. Inf. Theory 2006, 52, 1289–1306. [Google Scholar] [CrossRef]
Smaili, S.; Massoud, Y. A Multi-Channel Random Demodulation Reconfigurable Receiver. IEEE Wirel. Commun. Lett. 2014, 3, 561–564. [Google Scholar] [CrossRef]
Allahham, M.S.; Al-Sa’d, M.F.; Al-Ali, A.; Mohamed, A.; Khattab, T.; Erbad, A. DroneRF dataset: A dataset of drones for RF-based detection, classification, and identification. Data Brief 2019, 26, 104313. [Google Scholar] [CrossRef] [PubMed]
Nguyen, P.; Ravindranatha, M.; Nguyen, A.; Han, R.; Vu, T. Investigating Cost-effective RF-based Detection of Drones. In Proceedings of the 2nd Workshop on Micro Aerial Vehicle Networks, Systems, and Applications for Civilian Use (DroNet’16); Association for Computing Machinery: New York, NY, USA, 2016; pp. 17–22. [Google Scholar] [CrossRef]
Daponte, P.; De Vito, L.; Picariello, F.; Rapuano, S.; Tudosa, I. Compressed Sensing Technologies and Challenges for Aerospace and Defense RF Source Localization. In Proceedings of the 2018 5th IEEE International Workshop on Metrology for AeroSpace (MetroAeroSpace), Rome, Italy, 20–22 June 2018. [Google Scholar] [CrossRef]
Huang, L.; Li, X.; Huang, M.; Li, Q.; Zhao, B.; Sun, W.; Zhang, P. Compressed sampling techniques for UAV data return:opportunities and challenges. J. Shenzhen Univ. 2019, 36, 473–481. (In Chinese) [Google Scholar]
Zhu, L.; Zhang, S.; Ma, Q.; Zhao, H.; Chen, S.; Wei, D. Classification of UAV-to-Ground Targets Based on Enhanced Micro-Doppler Features Extracted via PCA and Compressed Sensing. IEEE Sens. J. 2020, 20, 14360–14368. [Google Scholar] [CrossRef]
Gaigals, G.; Vavilina, E. Simulation of compressed sensing based passive radar for drone detection. In Proceedings of the 2017 5th IEEE Workshop on Advances in Information, Electronic and Electrical Engineering (AIEEE), Riga, Latvia, 24–25 November 2017. [Google Scholar] [CrossRef]
Ezuma, M.; Erden, F.; Anjinappa, C.K.; Ozdemir, O.; Guvenc, I. Micro-UAV Detection and Classification from RF Fingerprints Using Machine Learning Techniques. In Proceedings of the 2019 IEEE Aerospace Conference, Big Sky, MT, USA, 2–9 March 2019. [Google Scholar] [CrossRef] [Green Version]
Ezuma, M.; Erden, F.; Anjinappa, C.K.; Ozdemir, O.; Guvenc, I. Detection and Classification of UAVs Using RF Fingerprints in the Presence of Wi-Fi and Bluetooth Interference. IEEE Open J. Commun. Soc. 2019, 1, 60–76. [Google Scholar] [CrossRef]
Medaiyese, O.; Syed, A.; Lauf, A.P. Machine Learning Framework for RF-Based Drone Detection and Identification System. In Proceedings of the 2021 2nd International Conference on Smart Cities, Automation & Intelligent Computing Systems (ICON-SONICS), Tangerang, Indonesia, 12–13 October 2021. [Google Scholar] [CrossRef]
Nemer, I.; Sheltami, T.; Ahmad, I.; Yasar, A.U.-H.; Abdeen, M.A.R. RF-Based UAV Detection and Identification Using Hierarchical Learning Approach. Sensors 2021, 21, 1947. [Google Scholar]
Al-Sa’D, M.; Al-Ali, A.; Mohamed, A.; Khattab, T.; Erbad, A. RF-based drone detection and identification using deep learning approaches: An initiative towards a large open source drone database. Futur. Gener. Comput. Syst. 2019, 100, 86–97. [Google Scholar] [CrossRef]
Al-Emadi, S.; Al-Senaid, F. Drone Detection Approach Based on Radio-Frequency Using Convolutional Neural Network. In Proceedings of the 2020 IEEE International Conference on Informatics, IoT, and Enabling Technologies (ICIoT), Doha, Qatar, 2–5 February 2020. [Google Scholar] [CrossRef]
Shi, H.-D.; Lu, H.; Bian, Z.-A. Deep convolutional network multi-target UAV signal detection method. J. Air Force Eng. Univ. 2021, 22, 29–34. (In Chinese) [Google Scholar]
Tropp, J.; Wakin, M.; Duarte, M.; Baron, D.; Baraniuk, R. Random Filters for Compressive Sampling and Reconstruction. In Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, Toulouse, France, 14–19 May 2006. [Google Scholar] [CrossRef] [Green Version]
Donoho, D.L.; Huo, X. Uncertainty principles and ideal atomic decomposition. IEEE Trans. Inf. Theory 2001, 47, 2845–2862. [Google Scholar] [CrossRef] [Green Version]
Kirolos, S.; Laska, J.; Wakin, M.; Duarte, M.; Baron, D.; Ragheb, T.; Massoud, Y.; Baraniuk, R. Analog-to-Information Conversion via Random Demodulation. In Proceedings of the IEEE Dallas/CAS Workshop on Design, Application, Integration and Software, Richardson, TX, USA, 29–30 October 2006. [Google Scholar] [CrossRef] [Green Version]
Mishali, M.; Eldar, Y.C. Blind Multiband Signal Reconstruction: Compressed Sensing for Analog Signals. IEEE Trans. Signal Process. 2009, 57, 993–1009. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Multi-channel Random Demodulation Sampling Structure.

Figure 2. Average power spectrum of the RF activities for 2, 4, and 10 classes scenarios: In (a), class 1 is for RF background activities and class 2 is for the drones RF communications (to be supplied to the UAV Detection Network). In (b), class 1 is for RF background activities and classes 2–4 are for the Bebop, AR, and Phantom drones (to be supplied to the UAV Identification Network). In (c), class 1 is for RF background activities, classes 2–5 are for the Bebop 4 different flight modes, classes 6–9 are for the AR 4 different flight modes; and lastly, class 10 is for the Phantom single flight mode (to be supplied to the UAV Classification Network).

Figure 3. Our approach consists of dividing the dataset into training and testing data stage and a learning stage with four classifiers to specify the class of any sample.

Figure 4. The output confusion matrix for 2-classes, 4-classes, and 10-classes scenario: Where (a) is the 2-classes output confusion matrix (detecting the presence of a drone); (b) is the 4-classes output confusion matrix (detecting the presence of a drone and identification its type); and (c) is confusion matrix of outputs for 10-classes scenario (detecting the presence of a drone, identification its type, and determining its flight mode).

Table 3. Composition of Drone-RF Dataset.

Class-2	Class-4	Class-10	Segments	Ratio
No UAV	No UAV	No UAV	41	18.06%
UAV	Bebop	On, connected, off	21	9.25%
		Hovering	21	9.25%
		Flying	21	9.25%
		Flying with video recording	21	9.25%
	AR	On, connected, off	21	9.25%
		Hovering	21	9.25%
		Flying	21	9.25%
		Flying with video recording	18	7.94%
	Phantom 3	On, connected, off	21	9.25%

Table 4. Selection of Experimental Parameters.

Experiments	Epochs	Batch Size	Activation Functions	Loss	Learning_Rate
Detector	30	10	Sigmoid	mse	Default
Classifier1	200	20	Softmax	CrossEntropy	Default
Classifier23	300	120	Softmax	CrossEntropy	0.00003

Table 5. Results of different sampling methods and machine learning.

Reference	Sampling	Approach	2-Classes	10-Classes
[29]	NS ¹	DNN	99.7%	46.80%
[30]	NS ¹	CNN	99.8%	59.20%
[27]	NS ¹	XGBoost	99.9%	70.09%
[28]	NS ¹	Hierarchical	99.5%	99.20%
Ours	CS ²	DNN/CNN	100%	99.30%

¹ is sampling theory, ² is Compressive Sensing.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Mo, Y.; Huang, J.; Qian, G. Deep Learning Approach to UAV Detection and Classification by Using Compressively Sensed RF Signal. Sensors 2022, 22, 3072. https://doi.org/10.3390/s22083072

AMA Style

Mo Y, Huang J, Qian G. Deep Learning Approach to UAV Detection and Classification by Using Compressively Sensed RF Signal. Sensors. 2022; 22(8):3072. https://doi.org/10.3390/s22083072

Chicago/Turabian Style

Mo, Yongguang, Jianjun Huang, and Gongbin Qian. 2022. "Deep Learning Approach to UAV Detection and Classification by Using Compressively Sensed RF Signal" Sensors 22, no. 8: 3072. https://doi.org/10.3390/s22083072

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Deep Learning Approach to UAV Detection and Classification by Using Compressively Sensed RF Signal

Abstract

1. Introduction

2. Related Work

2.1. Data Acquisition

2.2. Detection and Classification Methods

2.2.1. Traditional Method Based

2.2.2. Deep Learning Based

3. Proposed Sampling and Detection Method

3.1. Detection and Classification Methods

3.1.1. Compressive Sensing

3.1.2. Data Pre-Processing

3.2. Two/Multistage Classification Network

4. Experiments and Results

4.1. Dataset

4.1.1. Raw Dataset

4.1.2. Simulation Dataset

4.1.3. Dataset Reliability Analysis

4.2. Counterpart Two/Multilevel Classification

4.3. Detection and Classification

4.3.1. Assessment Indicators

4.3.2. Experimental Results

5. Comparison with Other Methods

6. Conclusions and Future Work

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI