Next Article in Journal
Specifying Inputs for the Computational Structure of a Surgical System via Optical Method and DLT Algorithm Based on In Vitro Experiments on Cardiovascular Tissue in Minimally Invasive and Robotic Surgery
Next Article in Special Issue
Research on the Applicability of Vibration Signals for Real-Time Train and Track Condition Monitoring
Previous Article in Journal
Simulation and Analysis of Mie-Scattering Lidar-Measuring Atmospheric Turbulence Profile
Previous Article in Special Issue
Effects of Wind Conditions on Wind Turbine Temperature Monitoring and Solution Based on Wind Condition Clustering and IGA-ELM
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Neural Algorithm for the Detection and Correction of Anomalies: Application to the Landing of an Airplane

LAAS-CNRS, Université de Toulouse, 7 Av. du Colonel Roche, 31400 Toulouse, France
*
Author to whom correspondence should be addressed.
Sensors 2022, 22(6), 2334; https://doi.org/10.3390/s22062334
Submission received: 22 February 2022 / Revised: 10 March 2022 / Accepted: 15 March 2022 / Published: 17 March 2022

Abstract

:
The location of the plane is key during the landing operation. A set of sensors provides data to get the best estimation of plane localization. However, data can contain anomalies. To guarantee correct behavior of the sensors, anomalies must be detected. Then, either the faulty sensor is isolated or the detected anomaly is filtered. This article presents a new neural algorithm for the detection and correction of anomalies named NADCA. This algorithm uses a compact deep learning prediction model and has been evaluated using real and simulated anomalies in real landing signals. NADCA detects and corrects both fast-changing and slow-moving anomalies; it is robust regardless of the degree of oscillation of the signals and sensors with abnormal behavior do not need to be isolated. NADCA can detect and correct anomalies in real time regardless of sensor accuracy. Likewise, NADCA can deal with simultaneous anomalies in different sensors and avoid possible problems of coupling between signals. From a technical point of view, NADCA uses a new prediction method and a new approach to obtain a smoothed signal in real time. NADCA has been developed to detect and correct anomalies during the landing of an airplane, hence improving the information presented to the pilot. Nevertheless, NADCA is a general-purpose algorithm that could be useful in other contexts. NADCA evaluation has given an average F-score value of 0.97 for anomaly detection and an average root mean square error (RMSE) value of 2.10 for anomaly correction.

1. Introduction

Anomaly detection is about finding patterns that do not adhere to what is considered normal behavior [1]. Abnormal events are a major problem as people’s lives can be at risk and companies as well as public institutions can suffer serious losses.
Fraudulent activity in the banking sector, deforestation in the environmental sector, cancer in the healthcare sector, fake news in the social media sector, hacker attacks in cybersecurity, malfunctions in the manufacturing sector, traffic jams in the transportation sector, etc. are some examples of anomalies. Some examples of anomaly detection in different fields are presented in [2,3,4,5,6].
Commercial aircraft flights are a good example where anomaly detection is very important. Although fault tolerant architectures are in place, anomaly detection is paramount to passivate faulty components. A faulty actuator can be switched to its sane redundant counterpart. A faulty sensor can be put aside from the data fusion process [7]. In particular, the location of an airplane is an essential piece of information during the landing process. It is obtained from a set of sensors that present redundancies and whose values are fused. Thus, each sensor involved in the data fusion must provide measures without anomalies.
Normally, the set of sensors consists of a global positioning system (GPS), an inertial reference system (IRS), an instrument landing system (ILS), and a radio-altimeter (RA). Typically, these sensors work properly with a specific accuracy and specific fusion techniques are applied to get a good estimate of the airplane’s location [7].
However, sensors can provide data with anomalies. Anomaly detection methods can be applied to guarantee optimal quality of measures. When an anomaly is detected, either the anomalous sensor is isolated or the detected anomaly is filtered.
This article presents a new algorithm named NADCA (Neural Algorithm for the Detection and Correction of Anomalies) to detect and correct anomalies in time series. This algorithm is a general-purpose algorithm, but it has been developed in the framework of a project in the field of aeronautics to detect and correct sensor anomalies during airplane landing.
NADCA uses a predictive model based on deep learning. More precisely, NADCA is based on a recurrent neural network (RNN) called Long Short-Term Memory (LSTM) [8].
Deep learning has been used with success for classification and prediction purposes [9]. In particular, different NN architectures have been successfully leveraged for time series analysis [9]. Deep learning has the ability to automatically discover complex features without having any domain knowledge. Consequently, NN is a good platform to solve the time series anomaly detection problem.
LSTM is a good choice for the prediction task of time series because it can deal with chronologically ordered sequences and can track long-term dependencies in these sequences. Like most NN-based algorithms, LSTM relies on the assumption that training and test data share similar statistics.
In [10], various deep learning models for anomaly detection, including prediction methods, are investigated. Their suitability for a given data set is also analyzed. A more recent review about deep anomaly detection is provided in [11]. This work reviews 12 diverse modeling perspectives on leveraging deep learning techniques for the detection of anomalies. It also discusses how these methods address some notorious anomaly detection challenges to demonstrate the importance of deep anomaly detection.
An anomaly detection technique based on LSTM is proposed in [12]. The model is trained using normal data. Then, the prediction error distribution between measure and prediction is computed. An error threshold allows to decide when the time series has a normal or anomalous behavior. An LSTM-based encoder-decoder for multi-sensor anomaly detection is presented in [13]. Another deep learning method to detect anomalies in time series combining wavelet transform and NNs is presented in [14]. In [15], LSTM is used for detecting anomalies in flight data. A set of eleven canonical anomalies is tested.
A more recent work uses convolutional neural networks (CNNs) to detect anomalies [16]. This approach allows to obtain a model that generalizes well without using a large number of examples during the learning process. This is possible as CNNs achieve a good parameter selection.
Autoencoders are NNs that learn to copy their input to their output. In [17], autoencoders are also used to detect anomalies.
Unlike the above deep learning methods, NADCA uses differences between consecutive measures to train a model. The model predicts a difference in each iteration. This difference added to the corresponding measure produces the prediction of the next measure. This approach is advantageous because the prediction does not depend on the accuracy of the sensor and reduces non-stationary aspects of the original time series. Moreover, the prediction of a single difference does not require a significant number of previous measurements. This fact reduces the necessary number of examples during training.
Another original aspect resides in the design of NADCA. NADCA allows data to be processed in a general way regardless of the degree of oscillation present in the sensor data. That is interesting because NADCA only predicts a sample and uses a small number of measures at each iteration.
The criterion for deciding whether a measure is an anomaly or not is also different. The algorithm compares a prediction with the corresponding measure and uses a threshold (U) to decide. The threshold can be fixed or adaptive depending on the nature of the data. The prediction is always obtained from a smooth signal, i.e., the signal is smoothed when it shows oscillations. A signal without oscillations is defined as a signal whose smoothed signal is the same as the original signal (more explanations in Section 2.6).
Predicting from a smooth signal makes the prediction error small and less than a constant. This means that the algorithm is robust for the detection and correction of anomalies regardless of the degree of oscillation of the signal.
When the signal has no oscillations, the threshold U is the maximum prediction error. When the signal has oscillations, U is the maximum distance among the samples between the smoothed signal and the raw values. In both cases, U is determined using a set of signals without anomalies. This approach detects both fast-changing and slow-moving anomalies.
Regarding anomaly detection in sensors during landing, the work of [18] stands out. In that thesis, the author provides a comparative analysis of several existing machine learning techniques to detect anomalies. The faulty sensor is isolated once the anomaly has been detected. The simulation of the sensors during landing is another important aspect of this work. In this way, data are easily obtained to test the algorithms.
Beyond the analysis of [18], an original aspect of our work is the use of an algorithm that allows the detection of anomalies together with their correction. Note that the NADCA algorithm is especially designed to deal with anomalies during the landing phase where airplanes normally do not have abrupt trajectory changes. During a sudden change of trajectory, NADCA could detect anomalies in all the sensors.
A more recent paper studies the stability of aircraft lateral movement during the ILS approach [19]. To estimate the lateral stability index, a gated recurrent unit (GRU) [20] is used where GRU is a simplified version of LSTM.
Concerning landing data, NADCA analyzes anomalies according to the X, Y, and Z axes of the runway reference system. The values of the sensors according to these reference axes can be coupled. When this occurs, the origin of the anomaly is unclear. However, the existence of coupling is not a problem for NADCA. NADCA detects and corrects the anomalies following the order X, Y, and Z. If an anomaly appears in any sensor coordinate, it is corrected before analyzing the next coordinate, since the latter can be a function of the first coordinate.
Each coordinate can be represented by a multichannel signal (a channel per sensor). NADCA uses a unique predictive model per coordinate. The prediction is carried out in a compact way, encouraging the sensors to help each other. The prediction on each sensor is used to detect and correct each anomaly. Ref. [21] also considers multichannel signals compactly but only to detect anomalies. It does not perform a correction of the anomaly, and it does not prevent possible coupling effects. In contrast to NADCA, the algorithm is unsupervised and does not need training.
From a technical point of view, NADCA has two important innovations. As explained, the algorithm compares a prediction with the corresponding measure and uses U to decide. This is also the basic behavior of an algorithm to detect anomalies using a predictive model. Anomalies that change abruptly, that is, in the time interval between two consecutive samples, are easily detected. However, there are many anomalies that vary more slowly. When this happens, anomaly detection algorithms that use this basic behavior fail. This occurs since the prediction is calculated from the closest previous measurement. NADCA solves this problem using a new strategy to calculate this prediction. It can even detect and correct drift anomalies. On the other hand, NADCA can also work with signals regardless of whether the signal has oscillations or not. A similar algorithm is applied for both types of signals. However, for signals with oscillations, an additional step is necessary to obtain a smoothed signal. The smoothed signal is created in real time and this is also a novel aspect.
To summarize, the advantages of our approach are as follows: it is suitable for working with multiple time series, it provides a compact model for all sensors, detection and correction of any anomaly is done at the same time, it is robust regardless of the degree of oscillation of the signals, it detects both fast-changing and slow-moving anomalies, it only needs a small number of measures at each iteration because it predicts one sample, the characteristics of the anomaly (e.g., type, duration, etc.) can be selected and sensor behavior can be analyzed, sensors with abnormal behavior do not need to be isolated because NADCA produces corrected values, it does not depend on the accuracy of the sensor, it can cope with simultaneous anomalies on different sensors, it can be implemented in real time, and it can detect the origin of any anomaly avoiding the coupling problem.
As far as we know, there is no other algorithm capable of detecting and correcting anomalies with all these advantages, especially when the algorithm is applied during the landing process.
This article is organized as follows. Section 2 reviews some basic concepts referring to the aircraft landing phase and to the neuronal tools used by NADCA. Section 3 describes the algorithm NADCA. Section 4 explains some elements of NADCA using real landings while Section 5 shows some examples of anomaly detection and correction using NADCA. Section 6 discusses the methodology and results. Finally, Section 7 concludes the article.

2. Background

This section reviews some important concepts for understanding NADCA, as well as for understanding the aircraft landing application.

2.1. Admissible Work Interval for Detecting and Correcting Anomalies during Landing

A coordinate system is placed at the origin of the runway (see Figure 1). The plane begins to land when it is almost aligned with the X axis of the runway. The landing ends when the plane makes contact with the runway. The NADCA algorithm works in that interval.

2.2. Sequence Prediction and Time Series

Supervised machine learning algorithms use a set of samples for the training process. Each sample is an observation or measure.
Machine learning algorithms can be used for sequence prediction. Sequence prediction involves predicting the next value for a given input sequence. In this case, the set of samples is different because a sequence describes a set of ordered measures (for example, measures ordered chronologically, i.e., times series). Consequently, the order of the samples used in the algorithms must be respected.
In this article, time series from a set of sensors are used. The concepts of time series and signal are used indistinctly. Predictions in times series are made with the help of a LSTM network.

2.3. LSTM Network

An LSTM network is a kind of RNN [9]. It attempts to model sequence-dependent behavior by feeding back the output of a NN layer at time t to the input of the same NN layer at time t + 1. LSTM propagates the information learned at a time t to the future. In general, a classic RNN likes to remember everything. By contrast, LSTM saves relevant information and forgets information that is not important.
LSTM architectures are not unique. Depending on the type of problem, some architectures perform better than others. Some architectures are as follows: vanilla, stacked, CNN, encoder-decoder, etc. [22,23]. We selected a Stacked architecture in which LSTM layers are stacked one on top of another into deep networks.
An LSTM network was used to create the predictive model of NADCA. This supervised algorithm predicts acceptably if it has been trained with a significant number of examples. Predictions are robust when the predictive model is used in time series with no oscillations.

2.4. Sensors, Signals, Location, and Coupling

During a landing, the complete set of signals with respect to the runway reference can be described by three multichannel signals: [XGPS, XIRS] for the X coordinate, [YILS, YGPS, YIRS] for the Y coordinate, and [ZILS, ZRA, ZGPS, ZIRS] for the Z coordinate. Each signal is denoted by the “CoordinateSensor” symbol.
The airplane’s GPS provides latitude, longitude, and altitude. These values represent the position of the airplane in geodesic coordinates (WGS84). The airplane location with respect to the runway (XGPS, YGPS, ZGPS) can be calculated by means of a coordinate system conversion. In a similar way, the airplane location provided by the IRS with respect to the runway (XIRS, YIRS, ZIRS) can be calculated.
The radio altimeter measures the aircraft altitude (HRA), i.e., the vertical distance between the aircraft and the ground. In order to get ZRA, one must apply a correction with respect to the relief under the aircraft, using a terrain database:
Z R A = H R A + H t e r r a i n
where Hterrain is the altitude of the terrain with respect to the runway threshold. The Hterrain value can be obtained using the XGPS or XIRS values.
The ILS is a ground-based system that emits signals along the vertical and lateral axis so that the aircraft can follow a line of reference named the localizer (LOC) in the lateral axis and the glideslope (GS) on the vertical axis. The ILS can be manipulated to obtain the airplane’s position coordinates with respect to the runway (YILS, ZILS). These values can be calculated using Equations (2) and (3). These equations provide a good approximation to the real values [18].
Y I L S = σ L O C   × s × L X L
where L is the runway length (usually 3500 m), s is the LOC sensitivity (usually 0.7 m/μA) and σ L O C   is the LOC deviation in μA. The X value can be obtained using the XGPS or XIRS values.
Z I L S = X + 300 × tan G P A + ρ G S
where GPA is the angle of reference (3°) and ρ G S is the noise of the GS. The X value can be obtained using the XGPS or XIRS values.
The GPS and IRS coordinates do not depend on the coordinates of other sensors. However, ZRA, YILS, and ZILS depend on the GPS or IRS. NADCA avoids this coupling because it detects and corrects anomalies following the order X, Y, and Z. An XGPS anomaly (or XIRS anomaly) is detected and corrected before the corresponding values are used to calculate ZRA, YILS, and ZILS.
Figure 2 shows the Z coordinate of four simulated time series (ZGPS, ZIRS, ZILS, and ZRA) during the landing process. Unlike the Z coordinate of GPS and IRS, the Z coordinate of ILS and RA is a signal with oscillations. A table, to the right of Figure 2, crosses the coordinates (according to the runway reference system) and signal for each sensor. In addition, the sensor coordinate cell indicates whether or not the signal has oscillations.
NADCA acts on each coordinate independently and takes into account whether the signal has oscillations or not.

2.5. Predictive Models

NADCA works on each X, Y, and Z axis independently. Therefore, there are three prediction models (PMX, PMY, and PMZ), one for each axis. Each predictive model only works with signals without oscillations. This means that for ILS and RA signals, a smoothed signal is constructed in real time before being used by the predictive model. A letter L is used to denote the corresponding smoothed signals. Working with smoothed signals guarantees a low and stable prediction error.
Figure 3 shows a predictive model for the Z axis denoted PMZ. It predicts using the multichannel signal (ZLILS, ZLRA, ZGPS, ZIRS) where ZLILS and ZLRA are the corresponding smooth signals of ZILS and ZRA. PMZ predicts a difference of consecutive measurements from a set of differences obtained from some previous measurements. In this example, the predictive model takes 15 measurements, or 14 differences for each sensor up to sample i. Then, an LSTM compact architecture predicts a difference of measurements at time i + 1 for each sensor. The prediction of the measurement at time i + 1 ( P i + 1 S e n s o r ) is equal to the predicted difference ( Δ i S e n s o r ) plus the measurement at time i ( M i S e n s o r ). Figure 3 also shows the difference prediction and measure prediction for GPS where the letter Z is not used for simplicity.
Likewise, NADCA uses a PMY that acts on [YL ILS, Y GPS, Y IRS] and a PMX that acts on [XGPS, XIRS]. The PMZ works with an LSTM network whose main architecture has 3 stacked layers with 300 cells per layer. Similar architectures are used for PMY and PMX.

2.6. Smoothing Data with the Savitzky–Golay Filter

The Savitzky–Golay filter (SG) [24] is a particular type of low-pass filter, well adapted for data smoothing.
The SG filter removes high frequency noise from data. It has the advantage of preserving the original shape and features of the signal better than other types of filtering approaches, such as moving average techniques. The main idea behind this approach is to make for each point a least-square fit with a polynomial of high order over an odd-sized window centered at the point.
This filter is useful for obtaining a smoothed signal from a signal with oscillations and is used for ILS and RA signals in our approach.

3. Neural Algorithm for the Detection and Correction of Anomalies (NADCA)

The main elements of NADCA are the following:
Sensor measurements (... Mi−1, Mi, Mi+1).
A reference Pi+1 using a predictive model PM.
A threshold U + .
The basic version of NADCA (see Figure 4), named NADCA-B, is summarized in Algorithm 1 as follows:
Algorithm 1: NADCA-B algorithm.
If the distance (absolute difference) between Mi+1 and Pi+1 is > U then “Anomaly”
  If “Anomaly” then “Anomaly Correction” using predictions.
else “No Anomaly”
In general, sensor data are non-stationary during landing. To work with stationary data, differences between consecutive data values are calculated. In this way, the predictive model predicts a difference Δi at each iteration i instead of a raw measure value. This prediction is hence independent of the sensor accuracy.
The difference Δ i is added to the measure M i to predict the measure at time i + 1. The closer the value of this prediction P i + 1 is to the measure M i + 1 , the better the prediction. The predictive model predicts a difference Δi from a set of previous differences PD = [ D i N D , …, D i 1 ] where ND is the number of differences used and D i 1 = M i M i 1 . The number of previous measures is denoted NM. For example, if NM = 15, then ND = 14.
NADCA-B is simple but not always effective in detecting and correcting any type of anomaly. The maximum prediction error between Pi+1 and Mi+1 must be small and less than a constant, but NADCA-B does not always produce such prediction error. To optimally detect and correct any anomaly, a generalization of NADCA-B is necessary. This generalization is explained according to how NADCA-B is used in signals without oscillations (NADCA-L) or in signals with oscillations (NADCA-O).

3.1. NADCA-L: Generalization of NADCA-B for Signals without Oscillations

Figure 5 explains in detail how NADCA-L detects and corrects anomalies using a generalization of NADCA-B.
This generalization means that the prediction at i + 1 can be approximated in different ways.
If P i + 1 S e n s o r = M i S e n s o r + Δ i is a good approximation of the real measure at time i+1, the following approximation P i + 1 S e n s o r = M i 1 S e n s o r + Δ i 1 + Δ i also offers a small prediction error. In general, P i + 1 S e n s o r = M i K S e n s o r + Δ i K + + Δ i where K is a positive integer indexing an initial measure I M = M i K S e n s o r . A more precise equation is as follows:
P i + 1 S e n s o r = I M + Δ C i K + + Δ C i n + + Δ C i 1 + ( Δ i + C i * )
where Δ C i n = Δ i n + C i * , C i * = 1 K C i K + + C i 1 , C n = M n + 1 S e n s o r P n + 1 S e n s o r is a prediction error for Δ n and n is an integer.
The C i * parameter represents a correction by the average of the prediction error on the K last time points. It works well for fast-changing anomalies (e.g., noise). However, slow-moving anomalies such as drift might not be well detected.
For a potential slow-moving anomaly, C i * is increased as i increases. The following equation shows that a drift-like anomaly starts at sample i-N if:
C i n c * C i n c 1 * > 0  
where 1 n c N and N < K. The value of N is fixed, e.g., N = 15. A new C i * * = C i N * is selected and is used to detect a potential slow-moving anomaly.
In general, C i * * is close to or equal to C i * when there is no anomaly or when there is a fast-changing anomaly. For a slow-moving anomaly, the value of C i * * is fixed using Equation (5) to detect the anomaly in the following iterations. Equation (4) allows to calculate P i + 1 (for simplicity, the exponent “sensor” has been omitted) using C i * . A new P i + 1 * * could also be obtained using C i * * instead of C i * in (4).
If the following condition is true
| M i + 1 P i + 1 | > U
then there is an anomaly (mainly a fast-moving anomaly). However, a slow-moving anomaly is detected if
M i + 1 P i + 1 + P i + 1 P i + 1 * * > U ,   that   is ,   M i + 1 P i + 1 + C i * C i * *   >   U .
Equation (7) is necessary since C i * * and C i * can move away at some point and however, this does not mean that a slow-moving anomaly is starting.
P i + 1 S e n s o r is a reference for NADCA-L at each iteration. The set of all predicted values { P i + 1 S e n s o r } can be denoted by RefSensor.
In addition, NADCA-L also uses Equation (4) for correcting an anomaly in real time once it has been detected. If the anomaly has a short duration, Equation (4) is good enough to make the correction. For a long duration anomaly, a small deviation might appear. In this case, given an anomaly starting at sample i, the following equation could be used to improve quality of the correction:
P j = P j +   α   × M
where j is a sample within the anomaly and M = ji. The parameter α can be determined experimentally (see Section 5.1).
The NADCA-L method is summarized in Algorithm 2 as follows:
Algorithm 2: NADCA-L algorithm.
Given a sample i, U, a set of NM measures [ M i N M 1 M i ], a set of K predictions [ Δ i K Δ i 1 ] and prediction errors [ C i K C i 1 ] for the set of measures [ M i K M i 1 ] and M i + 1 :
 1. Calculate the set of differences PD using the NM measures.
 2. Calculate Δ i using PM and PD.
 3. Calculate C i * and C i * * .
 4. Calculate P i + 1 S e n s o r and P i + 1 S e n s o r * * using (4).
 5. Calculate d i s 1 = | M i + 1 S e n s o r P i + 1 S e n s o r | and d i s 2 = M i + 1 S e n s o r P i + 1 S e n s o r + C i * C i * *
 6. If d i s 1     U and d i s 2   U then “No anomaly” at i + 1. Save ( Δ i , C i ) for the next iteration. Updating K ←K+1 allows the same IM to be used for the next iteration.
 7. If d i s 1 > U then “fast-changing anomaly” at i + 1. Correct the anomaly at i + 1 changing M i + 1 to P i + 1 . Save ( Δ i , C i ) for the next iteration. Updating KK+1 allows the same IM to be used for the next iteration.
 8. If d i s 2 > U and d i s 1 < U then “slow-moving anomaly” at i + 1. Correct the anomaly at i + 1 changing M i + 1 to P i + 1 . Save ( Δ i , C i ) for the next iteration. Updating KK+1 allows the same IM to be used for the next iteration.
NADCA-L works in real time. This means that steps 1–4 described above are calculated during the time difference between two consecutive samples (sampling period). Once M i + 1 S e n s o r is known, steps 5–8 allow to decide if there is anomaly or not (see Figure 6).

3.2. NADCA-O: Generalization of NADCA-B for Signals with Oscillations

Figure 7 explains in detail how NADCA-O detects and corrects anomalies in signals with oscillations.
In general, the predictive model applied to the raw data of a non-stationary oscillating signal does not have a small prediction error less than a constant. This characteristic is not good for detecting and correcting anomalies in a robust way. One solution is to find a smooth signal (L) from the raw data. Each prediction on this smoothed signal constitutes a reference to determine if there is an anomaly or not. As the smooth signal does not present oscillations, the prediction error is small and less than a constant (e.g., in Section 4.3.1, prediction errors are calculated. GPS and IRS envelopes are constant lines).
NADCA-O contains two steps: the determination of L in real time and the NADCA-L algorithm.
A SG filter is used to determine L in real time. The SG filter is a general approach where the smooth signal depends only on the sensor data.
Given a set of NT measures [ M i N T 1 M i ], the SG filter can be applied to obtain the corresponding smooth measures [ M i N T 1 L M i L ]. Typically, this process takes place offline. The SG filter uses a sliding window of, for example, about NS = 100 measurements ( N S N T ).
We want to apply the SG filter on a signal in real time where in the first iteration there are only NM samples (e.g., NM = 15) and for the next iterations, one sample per iteration is added. In general, the NM value is inferior to NS. To apply the SG filter in real time where only NM measurements are available in the first iteration, two changes are required. First, synthetic samples are added by repeating the set [ M 1 M N M ] until the selected NS value is reached. After some iterations, synthetic samples are not necessary, and for each sample i, the measures [ M i N M 1 M i ] are the last measures of the set [ M i N T 1 M i ]. Second, at i, the SG filter is applied using the set of measures [ M i N T L M i 1 L ;   M i ] to get M i L . Consequently, with both changes, the real-time SG filter result is of good quality, similar to an offline result.
With NADCA-O, the threshold U is the maximum distance between the prediction of the smooth signal P i + 1 L and the measurement of the original signal M i + 1 S e n s o r . The value of U is determined by selecting the maximum value for each sample from a set of normal landings. In general, U is not constant for all samples.
The NADCA-O is summarized in Algorithm 3 as follows:
Algorithm 3: NADCA-O algorithm.
Given a sample i, U, NT, a set of measures [ M i N M 1 M i ] and M i + 1 :
 1. Calculate [ M i N T 1 L M i L ] using NT measures and the SG filter. If [ M i N T 1 L M i 1 L ] is known, use the SG filter over the set [ M i N T 1 L M i 1 L ,   M i ].
 2. Calculate the set of differences PD using a set of NM measures [ M i N M 1 L M i L ].
 3. Calculate Δ i L using PM and PD.
 4. Use NADCA-L where P i + 1 L replaces P i + 1 S e n s o r .
NADCA-O works in real time. It means that steps 1–3 described above are calculated during the time difference between two consecutive samples. Once M i + 1 S e n s o r is known, step 4 allows to decide if there is anomaly or not.

4. NADCA for Real Landings

A set of 36 landings from the same airport was selected. Each landing had the following signals: [ZILS, ZRA, ZGPS, ZIRS] for the Z coordinate, [YILS, YGPS, YIRS] for the Y coordinate, and [XGPS, XIRS] for the X coordinate. The approach phase was filtered for each landing. These 36 landings form a real data set.
The data were useful to carry out the learning and validation process for the predictive model creation and to determine decision thresholds U that were used to decide if there was an anomaly or not. There was a predictive model for each coordinate. Likewise, each sensor had its U threshold for each coordinate.
The algorithm NADCA-L was used for XGPS, XIRS, YIRS, ZGPS, and ZIRS. The algorithm NADCA-O was used for YGPS, YILS, ZILS, and ZRA where L was created from the SG filter.
Section 4.1 shows some figures to visualize the sensor values of a real landing. These values are represented with the help of the runway coordinate system according to the X, Y, and Z axis.

4.1. Example of Real Landing

4.1.1. Z Axis

Figure 8 and Figure 9 show the GPS, IRS, ILS, and RA values of a real landing according to the Z axis. In Figure 9, the ILSL and RAL values are represented by a black line. Those values are the corresponding smoothed signals of ILS and RA using the SG filter.

4.1.2. Y Axis

Figure 10 shows the GPS, IRS, and ILS values of a real landing according to the Y axis. The GPS values are not exactly the expected values of a GPS sensor. Normally, a GPS sensor should give similar values to the GPSL signal. Consequently, a GPSL is required to process this pseudo-GPS (P_GPS) data. The GPSL and ILSL values are represented by a black line. Those values are the corresponding smoothed signals of P_GPS and ILS using the SG filter.

4.1.3. X Axis

Figure 11 shows a portion of IRS values as a function of GPS values of a real landing according to the X axis. This portion is not a perfect line at a 45 degree angle. In general, this angle increases as the plane approaches the runway.

4.2. Predictive Model Using Real Landings

In this section, three predictive models (PMZ, PMY, and PMX) for real data according to the X, Y, and Z axes are analyzed. Each predictive model only works with signals without oscillations. In this way, the convergence of the learning process is better and the anomaly detection process is more robust. On the other hand, data preparation is more laborious because signals with oscillations are smoothed using the SG filter.
Each predictive model was created using 30,554 examples for training and 15,050 examples for validation.

4.2.1. Z Axis

Figure 12 represents PMZ. This model uses the data from GPS, IRS, ILSL, and RAL. PMZ is a stacked LSTM model. For clarity, the Z coordinate has been omitted in the figure.
Each example used to create PMZ contains ND + 1 consecutive differences where the last difference is the target that the model should predict from a set of NM previous measurements (NM = 15). This set of examples was split into two parts. This was a train-validation split. The first part was used to create the LSTM model. The remaining examples were used to evaluate the model.
The selected LSTM network architecture has three LSTM layers and 300 cells per layer. Using this architecture, the learning process adapts the weights of network. To do this, a backpropagation algorithm was used together with the set of learning examples. This algorithm, in addition to the number of layers and cells per layer, requires some hyperparameters to be defined. Specifically, the optimization algorithm (used to train the network) is Adam’s algorithm and the loss function (used to evaluate the network that is minimized by the optimization algorithm) is mean squared error (mse). The number of epochs (an epoch is one pass through all samples in the training dataset and updating the network weights) is 70. The batch size (a batch is one pass through a subset of samples in the training dataset after which the network weights are updated) is 32. The activation function is Relu (an activation is required to allow the neural network the ability to model non-linear processes).
The network can be trained using the learning examples and simultaneously, it can also be evaluated with the help of the validation examples. This evaluation provides an estimate of the performance of the network at making predictions for unseen data in the future.
A positive evaluation means a good fit between the learning and validation sets. A good fit is a case where the performance of the model is good on both the training and validation sets. This can be evaluated from a plot (loss as a function of the number of epochs) where the train and validation losses decrease and stabilize around the same point. With this result, behaviors such as overfitting and underfitting are avoided. Figure 13 shows the training and validation loss meeting. The convergence of the curves is fast and stable. Similar results can be obtained using different sets of examples for a train-validation split.

4.2.2. Y Axis

Figure 14 represents PMY. This model used the data from GPSL, IRS, and ILSL. For clarity, the Y coordinate has been omitted in the figure. PMY is a stacked LSTM model. It has 3 layers of 300 cells each. The number of previous measurements is 15.
The convergence of the curves is fast and stable (see Figure 15).

4.2.3. X Axis

Figure 16 represents PMX. This model uses the data from GPS and IRS. For clarity, the X coordinate is omitted in the figure.
PMX is a stacked LSTM model. It has 3 layers of 440 cells each. The number of previous measurements is 50. The number of previous measures as well as the number of cells per layer were increased to achieve a better fit between the learning and validation sets (see Figure 17).
The validation and learning graphs crossed and slightly diverged from epoch 32. From this epoch, overfitting appeared. To avoid this, the PMX for epoch 32 was selected.
This PMX is not the best possible model. This means that this model gives a prediction error greater than an optimal solution. A higher number of real landings (i.e., more examples) should prevent overfitting and provide a better PMX.
As discussed in Section 4.3.3, this PMX provided a prediction error acceptable for the IRS. However, the prediction error is important for GPS data. Consequently, this model was only used to detect anomalies in XIRS.
NADCA was primarily tested on the Z and Y axes because they are more diverse and contain more complicated signals than the X axis. The X axis only contains signals without oscillations. However, the Z and Y axes have signals with and without oscillations. In addition, the signals without oscillations have non-standard behavior.

4.3. Thresholding Using Real Landings

This subsection explains the U thresholds for each sensor and coordinate. U represents a prediction error when the time series does not show oscillations. U represents a maximum error for each sample between a smooth signal L and the corresponding raw values when the time series shows oscillations. Each threshold is denoted as U C o o r d i n a t e S e n s o r .

4.3.1. Z Axis

Prediction errors are calculated using PMZ and data without anomalies.
Figure 18 shows the prediction error for ZGPS and ZIRS. R e f Z _ G P S and R e f Z _ G P S represent P i + 1 Z _ G P S and P i + 1 Z _ I R S value sets (for the Z coordinate), respectively. These values are altitudes.
The ZIRS threshold can be set to U Z I R S = 0.06. This result is good to detect anomalies. On the other hand, the ZGPS threshold can be set to U Z G P S = 1.2. This threshold is also small and acceptable to detect anomalies. However, U Z G P S is higher than U Z I R S . This means that ZGPS data may have minor anomalies.
For ILS, U Z I L S is the envelope of the maximum error between R e f Z _ I L S and Z I L S , where R e f Z _ I L S is the set of predicted values using Z I L S L (see Figure 19).
For RA, U Z R A is determined with the help of two envelopes, one envelope for positive differences and another for negative ones. Each envelope corresponds to the maximum error between R e f Z _ R A and Z R A , where R e f Z _ R A is the set of predicted values using Z R A L (see Figure 20).

4.3.2. Y Axis

Prediction errors are calculated using PMY and data without anomalies. The thresholds for P_GPS and IRS are a constant. U Y G P S = 14 is the envelope of the maximum error between R e f Y _ G P S and Y G P S where R e f Y _ G P S is the set of predicted values using Y G P S L . Y IRS is the only signal without oscillations. The maximum prediction error determines a threshold U Y G P S = 0.35.
For ILS, U Y I L S is the envelope of the maximum error between R e f Y _ I L S and Y I L S where R e f Y _ I L S is the set of predicted values using Y I L S L (see Figure 21).

4.3.3. X Axis

Prediction errors are calculated using PMX and data without anomalies. The thresholds for GPS and IRS are a constant because these are signals without oscillations. The maximum prediction error for IRS determines a threshold U X I R S = 0.35. It is good to detect anomalies. However, the maximum prediction error for GPS sets a threshold U X G P S = 14, too high to detect anomalies. The chosen PMX is not the best possible model.

5. Examples of Anomaly Detection and Correction

In this section, real and simulated anomalies in real landing signals are detected and corrected using NADCA. For anomalies of long duration, Equation (7) was used. Section 5.1 explains how the parameter α of Equation (8) was determined.

5.1. Determination of the Parameter α

The parameter α of Equation (8) can be determined using a relationship between α and C i * . This relationship was found experimentally using a set of different examples with anomalies. For each example, the best α and its corresponding C i * are selected. Figure 22 shows the result obtained for the GPS Z-coordinate.

5.2. Real Anomalies

This subsection presents two real anomalies that were detected and corrected by NADCA.

5.2.1. Scale Factor Anomaly

This anomaly affected ZGPS values for one landing. It is a small scale factor anomaly that was detected and corrected using NADCA-L (see Figure 23).

5.2.2. Noise Anomaly

This anomaly appeared at YILS. It can be interpreted as noise. This anomaly was detected and corrected using NADCA-O (see Figure 24).

5.3. Simulated Anomalies

This subsection presents some simulated anomalies that appear in different landings. Unlike real anomalies, simulated anomalies are evaluated using two parameters: F-score [25] and root mean square error (RMSE) [26].
F-score compares the binary plot of the detected anomaly (DBP) and the “True” binary plot (TBP) that represents where the anomaly was generated. The value varies between 0 and 1. The best result is 1. It is useful to evaluate anomaly detection in a simple way by a number.
Assume that an anomaly appears in the time interval [T1, T2]. RMSE calculates the error between the original signal without anomaly and the signal with anomaly correction in the interval [T1, T2]. It is useful to evaluate anomaly correction, especially in signals without oscillations.

5.3.1. Example 1: Landing with Bias in ZGPS and Noise in YILS

Figure 25 shows two anomalies on a specific landing. The bias anomaly in ZGPS is a simulated anomaly. The noise anomaly in YILS is a small real anomaly.
Table 1 shows the result for each signal of this landing using NADCA. There is a small anomaly in YILS. However, this anomaly was not artificially generated. Consequently, RMSE and F-Score calculation are not possible. There is an anomaly in ZGPS. This anomaly was artificially generated. The F-score is 1 because NADCA perfectly detects the anomaly. The RMSE is 0.57. This value is small. There are no anomalies in XGPS, XIRS, YGPS, YIRS, ZIRS, ZILS, or ZRA and consequently, the value of F-score and RMSE is N/A.

5.3.2. Example 2: Landing with Noise in ZGPS

Figure 26 shows a simulated noise anomaly on ZGPS.
Table 2 shows the result for each signal of the landing using NADCA. There is an anomaly in ZGPS. The F-score (see Table 2) is 0.99 because TBP is determined prior to detection without discontinuities and DBP has a no anomalous sample anomaly. That sample intersects the NADCA correction. The binary plot of the detected anomaly shows that sample.
The RMSE is 0.52. This value is small. There are no anomalies in XGPS, XIRS, YGPS, YIRS, YILS, ZIRS, ZILS, or ZRA and consequently, the value of F-score and RMSE is N/A.

5.3.3. Example 3: Landing with a Noisy Bias in ZGPS

Figure 27 shows an example of a simulated noisy bias anomaly on ZGPS. The F-score (see Table 3) is 1. In this example, the correction has to be precise in order to connect with the end of the anomaly.

5.3.4. Example 4: Landing with Drift in ZGPS

Figure 28 shows an example of a simulated drift anomaly on ZGPS. The F-score (see Table 4) is 0.87. This value is lower than 1 because the anomaly was detected 80 samples after the starting point of the anomaly. That is, the anomaly has a slow-moving variation and anomaly detection only occurs when Equation (7) is satisfied. The correction with a RMSE = 0.43 is of good quality.

5.3.5. Example 5: Landing with Anomaly in YGPS

Figure 29 shows an example of a simulated noisy bias anomaly on YGPS. The RMSE is 0.86 (see Table 5). The RMSE was calculated using the anomaly correction and the corresponding portion of the smoothed signal of the signal without anomaly. This calculation is different from the RMSE of a signal without oscillations. Thanks to the oscillations, other corrections are possible. Consequently, a higher RMSE value could also be an acceptable correction. The F-score is 1.

5.3.6. Example 6: Coupling with Anomaly in XIRS

Figure 30 shows, on the left side, a simple example of coupling between XIRS and YILS for a simulated anomaly in XIRS. The YILS values are calculated using Equation (2) where X = XIRS. A simulated anomaly appears in both XIRS and YILS. A small coupling between XIRS and ZRA is also present. The Hterrain value of Equation (1) was obtained using XIRS values.
NADCA works following the order X, Y, and Z. It detects and corrects the anomaly in XIRS and consequently the anomaly does not appear in YILS and ZRA. If NADCA correctly detects the anomaly in XIRS, then there is no coupling problem and NADCA knows that the source of the anomaly is in XIRS. The right side of Figure 30 shows the anomaly detection and correction on XIRS.
NADCA can also work after each sample has been generated for each signal, even if there is a coupling problem. Anomalies in XIRS, YILS, and ZRA could be detected and corrected. However, the source of the anomaly would not be clear.
Table 6 shows a F-score of 0.99 due to a non-anomalous sample and a RMSE = 0.61.

5.3.7. Example 7: Landing with Anomaly in YGPS

Figure 31 shows an example of a simulated drift anomaly on YGPS. The RMSE is 2.9 (see Table 7). The RMSE was calculated using the anomaly correction and the corresponding portion of the smoothed signal of the signal without anomaly.
The F-score is 0.86. This value is not 1 because NADCA can only detect the anomaly when the anomalous values leave the zone of normal oscillations.
Equation (5) is not the only criterion used to start analyzing a possible slow-moving anomaly. For signals with oscillations, such as the YGPS, consecutive raw data differences might be a better criterion than using the C i * parameter.

5.3.8. NADCA Overall Assessment

NADCA was evaluated using a set of 80 simulated sensor anomalies during landing. An average F-score value of 0.97 was obtained in relation to the detection of anomalies and an average root mean square error (RMSE) value of 2.10 regarding the correction of anomalies.
The average F-score value is very high. It does not reach the value 1 because, mainly, NADCA consumes some samples before detecting slow moving anomalies. The average RMSE value is acceptable. This could be lower considering, for example, a higher ND number (see Section 3 where ND = 14). However, a low ND is preferable. In this way, NADCA can start working as soon as possible. This is important since there are landings that do not last a long time.
Other strategies for correction could have been considered, for example, using algorithms described in [27]. However, preference has been given to using the same prediction algorithm that simultaneously allows both detecting and correcting anomalies with acceptable quality.

6. Discussion

NADCA is an algorithm for the detection and correction of anomalies in time series. The algorithm differentiates between time series with oscillations and without oscillations.
Three versions of NADCA have been described. NADCA-B is only useful for detecting some obvious anomalies, NADCA-L detects and corrects anomalies in signals without oscillations, and NADCA-O detects and corrects anomalies in signals with oscillations. NADCA-B can be seen as a particular case of NADCA-L. Furthermore, NADCA-L is a special case of NADCA-O.
NADCA is robust because the predictions are made on smoothed signals. When a time series has oscillations, the algorithm creates a smooth signal by using the SG filter. A smoothed signal guarantees a small prediction error less than a constant.
NADCA has been used for both simulated and real anomalies on real landings.
NADCA is applied following the order of the coordinates X, Y, and Z. In this way, if an anomaly appears in any sensor coordinate, it is corrected before analyzing the next coordinate since the latter can be a function of the previous coordinate. Consequently, coupling problems are avoided.
Regarding the thresholds that derive from a prediction error, we can compare U Z G P S = 1.2 and U Z I R S = 0.06 . One would expect them to be similar, which is not the case. This may originate from some samples in ZGPS that could be small anomalies. However, they may not be relevant.
The predictive model for the X axis is not the best to predict the behavior of XGPS. This comes from the fact that the model only combines two sensors and the number of landings used to create the model is small. On the other hand, for the Y and Z axes, despite the small number of landings, the models generalize well for the selected airport. This is so because each model uses more sensors in a compact way.
NADCA was developed primarily to detect and correct anomalies during the landing phase. During this phase, the plane does not make abrupt changes and therefore, NADCA detects anomalies related to the sensors’ operation. However, an abrupt change in the trajectory of the aircraft would generate changes in the sensor signals that would be considered anomalous. These changes usually happen during the approximation phase that has not been considered in this work.
It is uncertain whether each predictive model could correctly predict the behavior of the sensors for landings in another airport. This does not have to be the case, and therefore, it is left for future work to consider new landing data from various airports in order to create a predictive model that generalizes to any airport.

7. Conclusions

NADCA is a new algorithm for anomaly detection and correction in time series. The algorithm is robust because it differentiates between oscillating and non-oscillating time series and always makes predictions on smooth signals.
NADCA uses a predictive model based on an LSTM neural architecture. The predictions provide a reference. The difference between this reference and the raw values is compared with a specific threshold U to decide whether or not there is an anomaly. NADCA was tested in time series that describe the landing phase of an airplane with promising results. This algorithm guarantees the quality of measures during landing. Generalization to several airports could be considered if additional data sets from various airports were made available. Importantly, NADCA is a general-purpose algorithm that could also be used in other contexts. Future work will consider applying NADCA for applications in other domains.
The following points summarize the main conclusions of this paper:
  • NADCA is a new algorithm for anomaly detection and correction. Detection and correction are performed simultaneously.
  • NADCA uses a new prediction strategy to detect and correct both fast-changing and slow-moving anomalies.
  • NADCA distinguishes between signals with oscillations and without oscillations. The algorithm is similar for both types of signals, however, signals with oscillations require an additional step. This step consists of obtaining a smoothed signal in real time.
  • NADCA works in real time. It uses information from sensors in a compact way and only needs to predict one sample at each iteration.
  • NADCA evaluation has given an average F-score value of 0.97 for detection and an average RMSE value of 2.1 for correction.
  • The different examples in this article show the simultaneous detection and correction of both fast changing anomalies (e.g., Figure 27) and slow-moving anomalies (e.g., Figure 28). NADCA can deal with simultaneous anomalies in different sensors (e.g., Figure 25). Figure 30 shows how NADCA avoids the coupling problem.
  • Once the anomaly is detected, the corresponding sensor does not need to be isolated.

Author Contributions

Conceptualization, A.M.; methodology, A.M.; software, A.M.; validation, A.M.; formal analysis, A.M.; investigation, A.M.; resources, L.T.-M. and R.P.; data curation, E.C., P.R. and R.P.; writing—original draft preparation, A.M., L.T.-M., E.C. and P.R.; writing—review and editing, A.M., L.T.-M., E.C. and P.R.; supervision, L.T.-M.; project administration, L.T.-M. and R.P.; funding acquisition, L.T.-M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the French Direction Générale de l’Aviation Civile (DGAC), project COCOTIER (COncept de COckpit et Technologies Intégrées En Rupture, 2019–2022), Contract DGAC/DTA/SDC n°2019-08.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Acknowledgments

We thank Guillaume Alcalay (Airbus), Philippe Goupil (Airbus), and Damien Gonzalez (Capgemini) for their expertise in the aeronautics field, their help with the data, and their valuable comments about the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Anomaly Detection. Available online: https://en.wikipedia.org/wiki/Anomaly_detection (accessed on 10 March 2022).
  2. Spence, C.; Parra, L.; Sajda, P. Detection, synthesis and compression in mammographic image analysis with a hierarchical image probability model. In Proceedings of the IEEE Workshop on Mathematical Methods in Biomedical Image Analysis, Kauai, HI, USA, 9–10 December 2001; IEEE Computer Society: Washington, DC, USA, 2001; pp. 3–10. [Google Scholar]
  3. Aleskerov, E.; Freisleben, B.; Rao, B. Cardwatch: A neural network-based database mining system for credit card fraud detection. In Proceedings of the IEEE Computational Intelligence for Financial Engineering, New York, NY, USA, 24–25 March 1997; pp. 220–226. [Google Scholar]
  4. Fujimaki, R.; Yairi, T.; Machida, K. An approach to spacecraft anomaly detection problem using kernel feature space. In Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, Chicago, IL, USA, 21–24 August 2005; ACM Press: New York, NY, USA, 2005; pp. 401–410. [Google Scholar]
  5. Ducharlet, K.; Travé-Massuyès, L.; le Lann, M.-V.; Miloudi, Y. A Multi-phase Iterative Approach for Anomaly Detection and Its Agnostic Evaluation. In Trends in Artificial Intelligence Theory and Applications. Artificial Intelligence Practices; Springer: Cham, Switzerland, 2020; pp. 505–517. ISBN 978-3-030-55788-1. [Google Scholar]
  6. Dorise, A.; Travé-Massuyès, L.; Alonso, C.; Subias, A.; Vacher, F.; Baczkowsky, L. Anomaly Detection for Radiation Hardening of Space Electronics—Application of Machine Learning Algorithms on an Atmel SAM3X Microcontroller. In Proceedings of the Avionics, Data, Control and Software Systems—ADCSS 2020, ESA, Amsterdam, The Netherlands, 20–22 October 2020. [Google Scholar]
  7. Ifqir, S.; Combastel, C.; Zolghadri, A.; Alcalay, G.; Goupil, P.; Merlet, S. Multi-Sensor Data Fusion for Civil Aircraft IRS/GPS/ILS Integrated Navigation System. In Proceedings of the European Control Conference (ECC), Rotterdam, The Netherlands, 29 June–2 July 2021. [Google Scholar]
  8. Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
  9. Emmert-Streib, F.; Zhen, Y.; Han, F.; Shailesh, T.; Matthias, D. An Introductory Review of Deep Learning for Prediction Models with Big Data. Front. Artif. Intell. 2020, 3, 4. Available online: https://www.frontiersin.org/article/10.3389/frai.2020.00004 (accessed on 10 March 2022). [CrossRef] [PubMed] [Green Version]
  10. Chalapathy, R.; Chawla, S. Deep learning for anomaly detection: A survey. arXiv 2019, arXiv:1901.03407. [Google Scholar]
  11. Pang, G.; Shen, C.; Cao, L.; Van Den Hengel, A. Deep Learning for Anomaly Detection: A Review. ACM Comput. Surv. 2021, 54, 2. [Google Scholar] [CrossRef]
  12. Malhotra, P.; Vig, L.; Shroff, G.M.; Agarwal, P. Long short-term memory networks for anomaly detection in time series. In Proceedings of the European Symposium on Artificial Neural Networks (ESANN 2015), Bruges, Belgium, 22–24 April 2015. [Google Scholar]
  13. Malhotra, P.; Ramakrishnan, A.; Anand, G.; Vig, L.; Agarwal, P.; Shroff, G. LSTM-based encoder-decoder for multi-sensor anomaly detection. arXiv 2016, arXiv:1607.00148. [Google Scholar]
  14. Kanarachos, S.; Christopoulos, S.R.G.; Chroneos, A.; Fitzpatrick, M.E. Detecting anomalies in time series data via a deep learning algorithm combining wavelets, neural networks and hilbert transform. Expert Syst. Appl. 2017, 85, 292–304. [Google Scholar] [CrossRef]
  15. Nanduri, A.; Sherry, L. Anomaly detection in aircraft data using Recurrent Neural Networks (RNN). In Proceedings of the Integrated Communications Navigation and Surveillance (ICNS), Herndon, VA, USA, 19–21 April 2016; pp. 5C2-1–5C2-8. [Google Scholar]
  16. Munir, M.; Siddiqui, S.A.; Dengel, A.; Ahmed, S. Deepant: A deep learning approach for unsupervised anomaly detection in time series. IEEE Access 2019, 7, 1991–2005. [Google Scholar] [CrossRef]
  17. Amarbayasgalan, T.; Jargalsaikhan, B.; Ryu, K.H. Unsupervised novelty detection using deep autoencoders with density-based clustering. Appl. Sci. 2018, 8, 1468. [Google Scholar] [CrossRef] [Green Version]
  18. Wiegman, A.P. Fault-Tolerant Sensor Fusion for Aircraft Height Estimation. Master’s Thesis, Faculty of Engineering at Stellenbosch University, Stellenbosch, South Africa, 2018. [Google Scholar]
  19. Mori, R. Anomaly Detection and Cause Analysis During Landing Approach Using Recurrent Neural Network. J. Aerosp. Inf. Syst. 2021, 18, 679–685. [Google Scholar] [CrossRef]
  20. Cho, K.; van Merrienboer, B.; Bahdanau, D.; Bengio, Y. On the Properties of Neural Machine Translation: Encoder-Decoder Approaches. In Proceedings of the SSST-8, Doha, Qatar, 7 October 2014; Association for Computational Linguistics: Stroudsburg, PA, USA, 2014; pp. 103–111. [Google Scholar]
  21. Mur, A.; Dormido, R.; Vega, J.; Duro, N.; Dormido-Canto, S. Unsupervised Event Characterization and Detection in Multichannel Signals: An EEG application. Sensors 2016, 16, 590. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Hojjat, S.; Julianne, B.; Sharan, S.; Joseph, B.; Errol, C.; Shahrokh, V. Recent Advances in Recurrent Neural Networks. 2018. Available online: https://arxiv.org/pdf/1801.01078.pdf (accessed on 10 March 2022).
  23. Sherstinsky, A. Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) network. Phys. D Nonlinear Phenom. 2020, 404, 132306. [Google Scholar] [CrossRef] [Green Version]
  24. Savitzky, A.; Golay, M.J.E. Smoothing and differentiation of data by simplified least squares procedures. Anal. Chem. 1964, 36, 1627–1639. Available online: https://pubs.acs.org/doi/abs/10.1021/ac60214a047 (accessed on 10 March 2022). [CrossRef]
  25. F-Score. Available online: https://en.wikipedia.org/wiki/F-score (accessed on 10 March 2022).
  26. Root Mean Square Deviation. Available online: https://en.wikipedia.org/wiki/Root-mean-square_deviation (accessed on 10 March 2022).
  27. Li, M.-W.; Xu, D.-Y.; Geng, J.; Hong, W.-C. A ship motion forecasting approach based on empirical mode decomposition method hybrid deep learning network and quantum butterfly optimization algorithm. Nonlinear Dyn. 2022, 71, 2447–2467. [Google Scholar] [CrossRef]
Figure 1. Anomaly detection and correction zone during the landing of an airplane.
Figure 1. Anomaly detection and correction zone during the landing of an airplane.
Sensors 22 02334 g001
Figure 2. Example of simulated time series of the Z coordinate during the landing process: ZGPS, ZIRS, ZILS, and ZRA. On the right side, a table relates each sensor to each coordinate. The sensor coordinate cell shows whether or not the signal has oscillations. There is no signal if the cell is empty.
Figure 2. Example of simulated time series of the Z coordinate during the landing process: ZGPS, ZIRS, ZILS, and ZRA. On the right side, a table relates each sensor to each coordinate. The sensor coordinate cell shows whether or not the signal has oscillations. There is no signal if the cell is empty.
Sensors 22 02334 g002
Figure 3. PMZ prediction. In the lower part, an example of measure prediction for the GPS is explained.
Figure 3. PMZ prediction. In the lower part, an example of measure prediction for the GPS is explained.
Sensors 22 02334 g003
Figure 4. Main elements and basic behavior of NADCA. The red dot at time i + 1 is the measure prediction.
Figure 4. Main elements and basic behavior of NADCA. The red dot at time i + 1 is the measure prediction.
Sensors 22 02334 g004
Figure 5. NADCA-L: Generalization of NADCA-B for anomaly detection and correction in signals without oscillations.
Figure 5. NADCA-L: Generalization of NADCA-B for anomaly detection and correction in signals without oscillations.
Sensors 22 02334 g005
Figure 6. Steps of NADCA-L.
Figure 6. Steps of NADCA-L.
Sensors 22 02334 g006
Figure 7. NADCA-O: Generalization of NADCA for anomaly detection and correction in signals with oscillations.
Figure 7. NADCA-O: Generalization of NADCA for anomaly detection and correction in signals with oscillations.
Sensors 22 02334 g007
Figure 8. GPS and IRS for the Z coordinate (real values of a landing).
Figure 8. GPS and IRS for the Z coordinate (real values of a landing).
Sensors 22 02334 g008
Figure 9. ILS, ILSL, RA, and RAL for the Z coordinate (real values of a landing).
Figure 9. ILS, ILSL, RA, and RAL for the Z coordinate (real values of a landing).
Sensors 22 02334 g009
Figure 10. IRS, GPS, GPSL, ILS, and ILSL for the Y coordinate (real values of a landing).
Figure 10. IRS, GPS, GPSL, ILS, and ILSL for the Y coordinate (real values of a landing).
Sensors 22 02334 g010
Figure 11. IRS portion as a function of GPS (real values for the X coordinate).
Figure 11. IRS portion as a function of GPS (real values for the X coordinate).
Sensors 22 02334 g011
Figure 12. PMZ (real values).
Figure 12. PMZ (real values).
Sensors 22 02334 g012
Figure 13. Evaluation curve for the Z coordinate using real landing values.
Figure 13. Evaluation curve for the Z coordinate using real landing values.
Sensors 22 02334 g013
Figure 14. PMY (real values).
Figure 14. PMY (real values).
Sensors 22 02334 g014
Figure 15. Evaluation curve for the Y coordinate using real landing values.
Figure 15. Evaluation curve for the Y coordinate using real landing values.
Sensors 22 02334 g015
Figure 16. Predictive model for the X coordinate (real values).
Figure 16. Predictive model for the X coordinate (real values).
Sensors 22 02334 g016
Figure 17. Evaluation curve for the X coordinate using real landing values.
Figure 17. Evaluation curve for the X coordinate using real landing values.
Sensors 22 02334 g017
Figure 18. Envelopes for GPS and IRS using real values.
Figure 18. Envelopes for GPS and IRS using real values.
Sensors 22 02334 g018
Figure 19. Envelope for ILS using real values.
Figure 19. Envelope for ILS using real values.
Sensors 22 02334 g019
Figure 20. An envelope for the positive differences and another for the negative ones using real RA values.
Figure 20. An envelope for the positive differences and another for the negative ones using real RA values.
Sensors 22 02334 g020
Figure 21. Envelope for ILS using real values for the Y coordinate.
Figure 21. Envelope for ILS using real values for the Y coordinate.
Sensors 22 02334 g021
Figure 22. Relationship between α and C * (or C * * ).
Figure 22. Relationship between α and C * (or C * * ).
Sensors 22 02334 g022
Figure 23. Anomaly detected and corrected using NADCA-L. The small anomaly appears in red.
Figure 23. Anomaly detected and corrected using NADCA-L. The small anomaly appears in red.
Sensors 22 02334 g023
Figure 24. Anomaly detected and corrected using NADCA-O.
Figure 24. Anomaly detected and corrected using NADCA-O.
Sensors 22 02334 g024
Figure 25. Bias and small noise anomalies detected and corrected on a specific landing using NADCA.
Figure 25. Bias and small noise anomalies detected and corrected on a specific landing using NADCA.
Sensors 22 02334 g025
Figure 26. Noise anomaly detected and corrected on a specific landing using NADCA.
Figure 26. Noise anomaly detected and corrected on a specific landing using NADCA.
Sensors 22 02334 g026
Figure 27. Noisy bias anomaly detected and corrected on a specific landing using NADCA.
Figure 27. Noisy bias anomaly detected and corrected on a specific landing using NADCA.
Sensors 22 02334 g027
Figure 28. Drift anomaly detected and corrected on a specific landing using NADCA.
Figure 28. Drift anomaly detected and corrected on a specific landing using NADCA.
Sensors 22 02334 g028
Figure 29. Drift anomaly detected and corrected on a specific landing using NADCA.
Figure 29. Drift anomaly detected and corrected on a specific landing using NADCA.
Sensors 22 02334 g029
Figure 30. On the left side, coupling problem between XIRS and YILS for a drift anomaly on XIRS. On the right side, the anomaly detection and correction on XIRS.
Figure 30. On the left side, coupling problem between XIRS and YILS for a drift anomaly on XIRS. On the right side, the anomaly detection and correction on XIRS.
Sensors 22 02334 g030
Figure 31. Drift anomaly detected and corrected on a specific landing using NADCA.
Figure 31. Drift anomaly detected and corrected on a specific landing using NADCA.
Sensors 22 02334 g031
Table 1. Result for each signal after using NADCA.
Table 1. Result for each signal after using NADCA.
XGPSXIRSYGPSYIRSYILSZGPSZIRSZILSZRA
AnomalyNoNoNoNoYesYesNoNoNo
F-scoreN/AN/AN/AN/AN/A1N/AN/AN/A
RMSEN/AN/AN/AN/AN/A0.57N/AN/AN/A
Table 2. Result for each signal after using NADCA.
Table 2. Result for each signal after using NADCA.
XGPSXIRSYGPSYIRSYILSZGPSZIRSZILSZRA
AnomalyNoNoNoNoNoYesNoNoNo
F-scoreN/AN/AN/AN/AN/A0.99N/AN/AN/A
RMSEN/AN/AN/AN/AN/A0.52N/AN/AN/A
Table 3. Result for each signal after using NADCA.
Table 3. Result for each signal after using NADCA.
XGPSXIRSYGPSYIRSYILSZGPSZIRSZILSZRA
AnomalyNoNoNoNoNoYesNoNoNo
F-scoreN/AN/AN/AN/AN/A0.99N/AN/AN/A
RMSEN/AN/AN/AN/AN/A0.43N/AN/AN/A
Table 4. Result for each signal after using NADCA.
Table 4. Result for each signal after using NADCA.
XGPSXIRSYGPSYIRSYILSZGPSZIRSZILSZRA
AnomalyNoNoNoNoNoYesNoNoNo
F-scoreN/AN/AN/AN/AN/A0.87N/AN/AN/A
RMSEN/AN/AN/AN/AN/A0.43N/AN/AN/A
Table 5. Result for each signal after using NADCA.
Table 5. Result for each signal after using NADCA.
XGPSXIRSYGPSYIRSYILSZGPSZIRSZILSZRA
AnomalyNoNoYesNoNoNoNoNoNo
F-scoreN/AN/A 1N/AN/AN/AN/AN/AN/A
RMSEN/AN/A0.86N/AN/AN/AN/AN/AN/A
Table 6. Result for each signal after using NADCA.
Table 6. Result for each signal after using NADCA.
XGPSXIRSYGPSYIRSYILSZGPSZIRSZILSZRA
AnomalyNoYesNoNoNoNoNoNoNo
F-scoreN/A0.99N/AN/AN/AN/AN/AN/AN/A
RMSEN/A0.61N/AN/AN/AN/AN/AN/AN/A
Table 7. Result for each signal after using NADCA.
Table 7. Result for each signal after using NADCA.
XGPSXIRSYGPSYIRSYILSZGPSZIRSZILSZRA
AnomalyNoNoYesNoNoNoNoNoNo
F-scoreN/AN/A0.84N/AN/AN/AN/AN/AN/A
RMSEN/AN/A2.9N/AN/AN/AN/AN/AN/A
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Mur, A.; Travé-Massuyès, L.; Chanthery, E.; Pons, R.; Ribot, P. A Neural Algorithm for the Detection and Correction of Anomalies: Application to the Landing of an Airplane. Sensors 2022, 22, 2334. https://doi.org/10.3390/s22062334

AMA Style

Mur A, Travé-Massuyès L, Chanthery E, Pons R, Ribot P. A Neural Algorithm for the Detection and Correction of Anomalies: Application to the Landing of an Airplane. Sensors. 2022; 22(6):2334. https://doi.org/10.3390/s22062334

Chicago/Turabian Style

Mur, Angel, Louise Travé-Massuyès, Elodie Chanthery, Renaud Pons, and Pauline Ribot. 2022. "A Neural Algorithm for the Detection and Correction of Anomalies: Application to the Landing of an Airplane" Sensors 22, no. 6: 2334. https://doi.org/10.3390/s22062334

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop