Machine Learning for Self-Coherent Detection Short-Reach Optical Communications

Wu, Qi; Xu, Zhaopeng; Zhu, Yixiao; Zhang, Yikun; Ji, Honglin; Yang, Yu; Qiao, Gang; Liu, Lulu; Wang, Shangcheng; Liang, Junpeng; Wei, Jinlong; Li, Jiali; He, Zhixue; Zhuge, Qunbi; Hu, Weisheng

doi:10.3390/photonics10091001

Open AccessEditor’s ChoiceReview

Machine Learning for Self-Coherent Detection Short-Reach Optical Communications

by

Qi Wu

^1,2

,

Zhaopeng Xu

^2,*

,

Yixiao Zhu

¹,

Yikun Zhang

¹,

Honglin Ji

²,

Yu Yang

²,

Gang Qiao

²,

Lulu Liu

²,

Shangcheng Wang

²,

Junpeng Liang

²,

Jinlong Wei

²,

Jiali Li

²,

Zhixue He

²,

Qunbi Zhuge

^1,2 and

Weisheng Hu

^1,2

¹

State Key Laboratory of Advanced Optical Communication System and Networks, Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai 200240, China

²

Peng Cheng Laboratory, Shenzhen 518055, China

^*

Author to whom correspondence should be addressed.

Photonics 2023, 10(9), 1001; https://doi.org/10.3390/photonics10091001

Submission received: 8 August 2023 / Revised: 24 August 2023 / Accepted: 29 August 2023 / Published: 31 August 2023

(This article belongs to the Special Issue Machine Learning Applied to Optical Communication Systems)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Driven by emerging technologies such as the Internet of Things, 4K/8K video applications, virtual reality, and the metaverse, global internet protocol traffic has experienced an explosive growth in recent years. The surge in traffic imposes higher requirements for the data rate, spectral efficiency, cost, and power consumption of optical transceivers in short-reach optical networks, including data-center interconnects, passive optical networks, and 5G front-haul networks. Recently, a number of self-coherent detection (SCD) systems have been proposed and gained considerable attention due to their spectral efficiency and low cost. Compared with coherent detection, the narrow-linewidth and high-stable local oscillator can be saved at the receiver, significantly reducing the hardware complexity and cost of optical modules. At the same time, machine learning (ML) algorithms have demonstrated a remarkable performance in various types of optical communication applications, including channel equalization, constellation optimization, and optical performance monitoring. ML can also find its place in SCD systems in these scenarios. In this paper, we provide a comprehensive review of the recent progress in SCD systems designed for high-speed optical short- to medium-reach transmission links. We discuss the diverse applications and the future perspectives of ML for these SCD systems.

Keywords:

optical fiber communication; self-coherent detection; machine learning; short-reach transmission

1. Introduction

With the advent of the 6G era, the Internet of Things, and the metaverse, there has been an explosive growth in data traffic in recent years, which poses higher requirements for current optical interconnects in terms of capacity and reliability. Coherent detection transceivers were introduced in 2006, which have been widely utilized in optical communication systems spanning thousands of kilometers, such as transoceanic, transcontinental, and metropolitan networks. In a coherent system, a local oscillator (LO) laser is employed to linearly map the received optical field into the electrical domain. Linear mapping allows for the obtainment of the amplitude, phase, and polarization information of the optical signal and for compensation against a number of transmission impairments, including fiber chromatic dispersion (CD), nonlinearity, random polarization rotation, and polarization mode dispersion using advanced digital signal processing (DSP) techniques [1,2,3,4,5,6,7,8,9]. Consequently, coherent detection enables large-capacity and high-spectral-efficiency (SE) long-haul optical communications.

On the other hand, short-to-medium distance optical networks mainly encompass data-center interconnects, passive optical networks, mobile front-haul, and industrial internet. These networks typically involve a great number of optical connections, making cost sensitivity a crucial factor for the deployed optical devices [10]. However, the utilization of LO in coherent detection necessitates temperature control circuits at the receiver to align with the frequency of the laser at the transmitter. This significantly increases the manufacturing cost of optical devices and hinders the deployment of coherent transceivers in cost-sensitive and large-scale short-to-medium distance optical links [11]. Furthermore, since the LO and the transmitter laser are different, phase noise and frequency offset estimation need to be performed in DSP, leading to the increased power consumption of the DSP chips. In contrast, direct detection systems have a natural structural advantage over coherent detection systems as they do not require a narrow-linewidth and high-stable LO at the receiver. This eliminates the need for complicated temperature control circuits, frequency offset estimation, and carrier phase recovery [11]. As a result, the manufacturing cost of direct detection transceivers is lower, making them promising for short-to-medium distance optical networks over the past decade.

The intensity modulation and direct detection (IMDD) scheme, as a classic direct detection system, encodes information directly onto the optical intensity. At the receiver, the optical intensity is converted into photocurrents through square-law detection using a single photodetector (PD), achieving the mapping from the optical domain to the electrical domain. While the IM-DD system is simple and practical, its transmission performance is limited by power fading caused by fiber CD [12]. The frequency-selective fading limits its applications for high data rates or long-distance transmission [12].

To address the issue of power fading, researchers have proposed to use vestigial sideband (VSB) modulation systems. One approach is to utilize an optical filter to eliminate one of the sidebands of the real-valued double-sideband (DSB) signal [13], reducing the influence of the fiber CD. While VSB modulation enhances the system’s resistance to CD, it also introduces nonlinear impairments due to the presence of an incompletely suppressed sideband. As a result, single-sideband (SSB) modulation systems without vestigial components have been developed as an alternative [14,15,16,17], which can be achieved using IQ modulators or optical frequency shifters. To further improve the electrical SE and transmission capacity beyond the SSB systems, single-polarization phase retrieval (PR) receiver [18,19,20,21], carrier-assisted differential detection (CADD) receiver [10,22,23], and asymmetric self-coherent detection (ASCD) receiver [24] have been proposed to achieve linear detection of complex-valued DSB signals, effectively doubling the electrical SE with respect to SSB and IM-DD systems. Additionally, polarization-division-multiplexing can double the capacity and SE of single-polarization direct detection systems. However, the random birefringence of the fiber leads to polarization rotation, resulting in polarization fading [25,26,27,28,29,30] in direct detection systems with a co-propagating optical carrier. In order to deal with this effect, Stokes-vector receiver (SVR) [25] and Jones-space field recovery (JSFR) [30] schemes have been proposed. SVR performs polarization rotation in Stokes space, allowing for up to three-dimensional real-valued modulation. The JSFR scheme, however, first recovers the optical field and then performs polarization rotation in Jones space, enabling four-dimensional modulation including the amplitude and phase of two polarizations [30]. The above-mentioned schemes in which the optical carrier and the signal are transmitted together, allowing for phase- or polarization-diversity, are commonly known as self-coherent detection (SCD) systems. SCD systems recover the optical field in the receiver DSP, allowing compensation for the CD similar to coherent detection. The power fading effect induced by the traditional IMDD channel will no longer be a problem in SCD systems.

Although SCD has numerous advantages, there are still several issues in SCD systems that need to be addressed, such as signal-to-signal beating interference (SSBI) and optical field reconstruction. In the past decade, machine learning (ML) technology has rapidly advanced, and its applications have spread across various fields, including image recognition [31], natural language processing [32], medical diagnosis [33], and optical fiber communications [34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78]. ML techniques often achieve a higher accuracy or lower complexity compared to traditional approaches in many scenarios. In optical fiber communications, ML has been extensively studied and has shown a promising performance in optical performance monitoring [34,35], modulation format recognition [36,37], channel equalization [38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72], and constellation shaping [73,74]. In this paper, we provide a comprehensive overview of the application of ML techniques in SCD communication systems, with a particular focus on their applications in nonlinear impairment compensation, IQ imbalance correction, PR, polarization demultiplexing, and optical signal processing. In Chapter 2, we provide a brief introduction to the principles and challenges of various self-coherent systems. In Chapter 3, we provide extensive applications, as well as a detailed analysis of the performance of ML techniques in SCD systems. Finally, in Chapter 4, we summarize the findings and provide an outlook on the future development of ML technology in SCD systems. All the abbreviations used in this paper are listed in Appendix A.

2. SCD Systems

2.1. VSB System

To mitigate the impact of CD-induced power fading impairment, a VSB system is proposed, employing a simple receiver configuration as depicted in Figure 1a. This receiver setup requires only a single PD and an analog-to-digital converter (ADC). The modulated optical spectrum of the VSB signal is shown in Figure 1b. The most commonly-used method in VSB systems is to employ an optical filter to remove unwanted spectral components in the optical domain. By selectively filtering out specific frequency components, the spectral shape of the VSB signal can be modified [13], allowing for effective suppression of the vestigial sideband. In addition to optical filters, VSB modulation can also be achieved through dual-drive Mach–Zehnder modulators (MZMs) and radio frequency delays. In the VSB system, the dominant impairment originates from the unfiltered residual spectral components, which can be expressed as in [21]:

E (t) = C + S_{s} (t) + S_{r} (t)

(1)

{|E (t)|}^{2} = {|C|}^{2} + C^{*} (S_{s} (t) + S_{r} (t)) + C {(S_{s} (t) + S_{r} (t))}^{*} + {|S_{s} (t) + S_{r} (t)|}^{2}

(2)

where C, S_s(t), and S_r(t) denote the co-propagating optical carrier, the desired SSB signal, and the residual sideband signal, respectively. The superscript * denotes conjugate operation. After square-law detection as shown in Equation (2), the mirror image of residual components can cause nonlinear distortion to the signal, which can be compensated for by using nonlinear equalizers such as neural networks (NNs).

2.2. SSB System

To suppress the residual signal S_r(t) and nonlinear impairment, an SSB system is proposed based on the Hilbert transformation and IQ electrical-to-optical modulator [14,15]. The Hilbert transformation enables us to remove the unwanted sideband, generating a complex-valued electrical signal which drives the IQ modulator to convert into an optical signal, as shown in Figure 1c. The receiver structure of the SSB system is the same as the VSB system. After optical-to-electronic conversion, the dominant distortion is called SSBI, denoted by |S_s(t)|². For the SSB system, the received signal can be expressed as in [21]:

{|E (t)|}^{2} = {|C|}^{2} + C^{*} S_{s} (t) + C S_{s} {(t)}^{*} + {|S_{s} (t)|}^{2}

(3)

Fortunately, for the SSB signal, the impact of SSBI can be mitigated by employing phase recovery algorithms based on the minimum phase condition [79,80,81] or deep learning techniques. These methods help in recovering the phase information lost in optical-to-electronic conversion and enable the compensation of CD in the DSP, avoiding the impact of power fading.

2.3. PR Receiver

Although the resistance to CD is improved in VSB and SSB systems, the electrical SE of these systems is the same as the IM-DD system, defined as the achieved date rate divided by the electrical bandwidth of the receiver. To increase the SE, a PR receiver [18,19,20,21] is proposed to detect a complex-valued DSB signal, as shown in Figure 2a. The PR receiver consists of two PDs and one dispersive element, as shown in Figure 2b. The two detected photocurrents i₁(t) and i₂(t) are expressed as [82]

i_{1} (t) = {|C + S_{d} (t)|}^{2}, i_{2} (t) = {|(C + S_{d} (t)) \otimes h_{D} (t)|}^{2},

(4)

where S_d(t) and h_D(t) are the DSB signal and the transfer function of the dispersive element. Using a fully-connected convolutional NN (CNN), or other nonlinear equalization algorithms, the optical field could be reconstructed in the receiver DSP [83,84]. Note that the PR receivers also enable to recover the phase of optical SSB signal, which will be introduced in Section 3.3.

2.4. CADD

Another kind of receiver used to detect a complex-valued DSB signal is the CADD receiver [10]. The receiver structure is shown in Figure 2c, consisting of one optical hybrid, one PD, and two balanced photodetectors (BPDs), which is more complex than the PR receiver. However, it can achieve a higher modulation bandwidth and electrical SE than the PR receiver. In the receiver DSP, certain SSBI iterative cancellation algorithms and deep NNs are also used for optical field reconstruction [10,85]. With the help of ML techniques, the channel parameters such as the optical delay values and the carrier-to-signal power ratio (CSPR) can be optimized accurately to achieve a better system performance than the SSBI iterative cancellation algorithm.

2.5. SVR

The direct detection system has been pursuing polarization division multiplexing because it can double its capacity and SE. However, for the optical field where the signal and carrier are transmitted together, the optical signal suffers from polarization fading after passing through a polarization beam splitter (PBS). Polarization fading can result in the failure of optical field recovery on random X- or Y-polarization, making it hard to achieve polarization demultiplexing using multi-input multi-output (MIMO) equalization. Thus, the famous SVR [25] was proposed to combat polarization fading in Stokes space. The receiver structure is shown in Figure 3a, where three received Stokes vectors S₁, S₂, and S₃ are used to address the polarization rotation. The transmitted Stokes vectors could be recovered using S₁, S₂, and S₃ and a de-rotation matrix. Thus, the polarization diversity of the DD system is successfully accomplished.

2.6. JSFR

Although the polarization fading issue is solved, at most three modulation dimensions are supported in the real-valued three-dimensional Stokes space. Great efforts are made to exploit the fourth modulation dimension, but these fail to compensate for CD. More recently, the JSFR scheme was proposed to realize polarization diversity for a direct detection system with a co-propagating optical carrier, as shown in Figure 3b. It utilizes the optical coupler to mix the two polarizations to eliminate the impact of the polarization fading effect. The generalized receiver (GR) in this scheme can be implemented using one PD, PR receiver, CADD, and ASCD, according to different modulation formats. Using JSFR, the amplitudes and phases of both X- and Y-polarizations can be recovered, which provides the potential of realizing high-SE and large-capacity optical interconnects for short-reach optical networks. For these polarization-diverse SCD systems, ML can be used to handle the coupling between the polarization modes, namely polarization tracking and polarization mode demultiplexing [82,85].

3. ML Techniques in SCD System

In this section, we will introduce the applications of ML techniques in SCD systems including nonlinearity compensation [86,87], IQ imbalance correction [88], PR in SSB [89,90,91], optical field recovery in PR receiver [83,84] and CADD schemes [82,92], and polarization tracking and demultiplexing in JSFR schemes [85]. In addition, the transfer learning [93,94,95] technique has been employed to realize fast remodeling in SSB system, which could be scalable to other DD systems. Finally, we briefly introduce the photonics neuromorphic computing [96] used in SCD systems to extract the phase information and demodulate the quadrature amplitude modulation (QAM) formats.

3.1. Nonlinear Compensation

3.1.1. Fiber Nonlinearity

In optical communication, the electrical field evolution of light in a single-mode fiber can be described by the well-known nonlinear Schrödinger equation (NLSE) [1], which takes the following form:

\frac{\partial A}{\partial z} + \frac{i β_{2}}{2} \frac{\partial^{2} A}{\partial t^{2}} = - \frac{α}{2} A + i γ {|A|}^{2} A,

(5)

where z, α, β₂, and γ are, respectively, the propagation distance, the loss coefficient, the group-velocity dispersion (or second-order dispersion) coefficient, and the fiber nonlinear Kerr coefficient. The NLSE is a nonlinear partial differential equation that does not have an analytical solution, and the nonlinear parameter γ describes the effects of self-phase modulation and cross-phase modulation. In the case of SCD systems, the transmitted optical field has a strong optical carrier, making it more susceptible to fiber nonlinear impairments. It is widely known that NNs have powerful nonlinear fitting capabilities. Therefore, researchers have proposed the use of NNs to compensate for fiber nonlinearity, including various types of NNs such as artificial neural networks (ANNs) [86], long short-term memory networks (LSTMs) [87], and others, showing a better performance compared to traditional digital back-propagation and perturbation algorithms. LSTMs are a specific type of recurrent NN (RNN) model designed to mitigate the vanishing gradient problem commonly encountered in traditional RNNs. LSTMs have proven to be effective tools for mitigating transmission impairments, including both linear and nonlinear distortions, making them valuable for various applications in signal processing and communication systems. In [87], a linear network-assisted LSTM is proposed to mitigate the fiber nonlinearity in the wavelength-division-multiplexing (WDM) SSB system. Figure 4 depicts the architecture of a linear network-assisted LSTM.

The output

\hat{y}

can be expressed as [87]:

\hat{y} = W_{L} X + W_{N L} B i - L S T M (X_{N L}) + b_{N L},

(6)

where X, X_NL, W_L, W_NL, Bi-LSTM, and b_NL are, respectively, the linear input vector, nonlinear vector, the weight matrix for the fully-connected layer of the linear module, the weight matrix for the nonlinear module, the one-layer Bi-LSTM network operations, and the bias vector of the nonlinear modules. Compared to conventional Bi-LSTM, the linear network-assisted LSTM achieves a significant improvement in terms of the Q-factor while also significantly reducing computation complexity.

3.1.2. Device Nonlinearity

Apart from the fiber nonlinearity, another nonlinear impairment comes from the electro-optic modulation. When the dual-drive MZM or IQ modulator is used for complex-valued QAM formats, the modulation nonlinearity will be enhanced with an increase in the peak-to-peak voltage. Figure 5a shows the bias point of the MZM and the modulation nonlinearity induced by the function of sin(·). Additionally, other device nonlinearity such as the responsibility curve of PD also deteriorates the system performance. In scenarios involving multiple nonlinear impairments, traditional methods face challenges in accurately estimating the channel parameters and compensating for the mixed nonlinear effects. However, ML demonstrates its excellent capability for parameter optimization in such complex channels. In [86], a sparsely connected ANN is proposed to address the fiber nonlinearity and modulation nonlinearity. The principle of ANN pruning is shown in Figure 5b. A weight threshold is set, and connections with weights below this threshold are pruned, thereby reducing the complexity of the NN. The pruned sparsely connected ANN is shown in Figure 5c. By implementing this method, the number of connections in the ANN is reduced by an order of magnitude, while maintaining the bit-error-rate (BER) performance without significant degradation.

3.1.3. SSBI Cancellation

Unlike coherent detection, the direct detection system does not utilize LO and BPD to cancel the common-mode component inside the photocurrent, known as SSBI. Therefore, for direct detection systems, SSBI generated by the PD becomes the primary impairment limiting the system’s transmission capacity. As observed from the fourth term in Equation (3), SSBI takes the form of a quadratic term of the original signal. The spectra of the signal and its SSBI are depicted in Figure 6a, illustrating that the bandwidth of SSBI is twice that of the original signal in the electrical domain. Consequently, SSBI distorts the signal, degrading system performance. Certain methods have been proposed to handle SSBI in direct detection systems such as the Volterra nonlinear equalization and SSBI iterative mitigation methods. Additionally, ML methods such as NNs can also play a significant role in SSBI cancellation. Compared to traditional algorithms, an NN-based equalizer offers tremendous improvements in SSBI elimination, improving the performance of the transmission system. In [81], a soft-combined ANN was proposed and its structure is shown in Figure 6b. The output of the soft-combined ANN is an average of the outputs of all of the ANNs. The results reveal that the soft-combined ANN exhibits a superior performance compared to a single ANN in compensating for both linear and nonlinear SSBI impairments in the signal. Remarkably, this improved performance is achieved while maintaining the same symbol length of the required training sequence.

3.2. IQ Imbalance Correction

For complex QAM modulation, IQ imbalance and crosstalk can lead to an incorrect signal decision resulting in a degraded BER performance. Therefore, in classical DSP steps, IQ orthogonalization algorithms are commonly used for compensation. In optical communication systems where both nonlinear impairments and IQ imbalance exist, the traditional DSP algorithms used for compensation can be replaced by a MIMO-ANN. The joint compensation approach, which addresses both types of impairments simultaneously, generally yields superior results compared to using separate compensation for each impairment individually. In [88], a MIMO-ANN is proposed to compensate for the fiber nonlinearity, SSBI, and IQ imbalance simultaneously. Figure 7 displays its structure, consisting of two ANNs. The in-phase and quadrature components, X_I and X_Q, and their delay copies are fed into these two ANNs. Y_I and Y_Q are the outputs of the MIMO-ANN. In order to minimize the cost function, the back-propagation algorithm is employed to update the weights and biases of layers. After the training processes, the optimized ANNs are used to equalize the received data. The experimental results confirm the outstanding performance of the MIMO-ANN scheme in mitigating interference between two orthogonal signals.

3.3. PR and Optical Field Recovery

In a DD system, the phase information of the optical signal is lost during envelope detection by the PD while the intensity information is retained in the photocurrent. To recover the phase information, the Kramers–Kronig (KK) receiver algorithm was proposed for DD in 2016 [79], which relies on the minimum phase condition. If the SSB signal satisfies the minimum phase condition, the phase can be extracted from the intensity information using a Hilbert transformation. To successfully apply the KK receiver algorithm, a high CSPR is required. However, achieving a high CSPR comes with certain challenges. It introduces an additional sensitivity penalty and increases the impact of nonlinear fiber propagation effects. These factors need to be carefully considered when implementing the KK receiver algorithm in DD systems. To alleviate the CSPR requirement, a supervised learning CNN model was proposed in [89,90] to emulate the KK algorithm for the PR task. The architecture of the NN model is illustrated in Figure 8a. The input of the NN is the received photocurrent, namely |E(t)|² in Equation (2). The down-sampling blocks, labeled as D_i (i = 1, 2, 3), consist of a convolutional layer followed by the Rectified Linear Unit (ReLU) activation function. The up-sampling blocks, labeled as U_i (i = 1, 2, 3), incorporate a combination of convolutional layers, transposed convolutional layers, and ReLU activation functions. In this NN model, the target outputs are selected as the in-phase and quadrature components, rather than the amplitude and phase. Through simulations, it has been demonstrated that the ML-based PR scheme accurately reconstructs the phase of a modulation phase signal even at weak carrier power levels. This ML-based approach relaxes the CSPR requirement and improves the receiver sensitivity compared to the original KK algorithm. Overall, the proposed NN model provides a promising solution for PR, leveraging the power of deep learning techniques to enhance the performance of SCD systems.

In addition to constructing SSB signals that do not satisfy the minimum phase condition, PR receivers and their corresponding algorithms can be utilized to restore the phase of the optical field. It can be applied to phase recovery of SSB signals or complex-valued DSB signals. Gerchberg–Saxton algorithm is most commonly employed for PR [18,19,20,21], but it requires multiple iterations to converge. For SSB signals, the received two optical photocurrents (i₁(t) and i₂(t)) can be fed as inputs to an NN [91], as shown in Figure 8b. This NN consists of eight convolutional blocks aimed at down-sampling and up-sampling. The outputs of the NN are the real and imaginary parts of the optical field. By implementing this NN to achieve PR, the required dispersion value of the dispersive element is significantly decreased and the computational complexity is also reduced by 30%. Most importantly, the SSB signal no longer requires a strong optical carrier to satisfy the minimum phase condition. With the same Erbium-doped fiber amplifier launch power, it is possible to increase the number of WDM channels and reduce nonlinear fiber impairments, which potentially provides a larger capacity.

On the other hand, the optical and electrical SE could be improved if the PR receiver is utilized to detect the complex-valued DSB optical signal. The deep-learning-enabled direct detection scheme [83,84] was proposed to recover the optical field at a low CSPR, which is shown in Figure 9. Similarly, the inputs are two samples of photocurrents. The NN based on deep residual learning blocks consists of two convolutional layers and several residual modules. Its output is the desired phase information of the optical field. Residual learning is a technique that introduces shortcut connections into the traditional CNN structure, providing benefits in terms of training speed and network performance. The deep residual network architecture is built around stacked residual blocks, with each block consisting of two convolutional layers and a shortcut connection. The shortcut connections enable the direct propagation of information from one layer to another, bypassing intermediate layers. The integration of shortcut connections and stacked residual blocks improves the training efficiency and enables the effective learning of deep CNN models. This architecture has been proven highly effective in various computer vision tasks, enabling the construction of deeper networks without the issues of vanishing or exploding gradients. In [83], the residual learning technique is applied to accurately recover the transmitted signal in the presence of a large SSBI under the low CSPR condition. Compared with the conventional SSBI cancellation scheme, the deep-learning-enabled DD receiver shows a significant reduction of 8 dB in the optimum CSPR when detecting a complex-valued DSB signal.

3.4. Polarization Demultiplexing

For polarization-multiplexed optical communication systems, random birefringence in optical fibers can lead to random coupling between polarization modes. Therefore, at the receiver end, DSP algorithms are required to accomplish polarization demultiplexing. Additionally, the coupling of polarization states varies over time, necessitating algorithms with the ability to track polarization changes. In the phase- and polarization-diverse JSFR scheme, a MIMO-NN was proposed [81,85] to simultaneously achieve linear optical field recovery, polarization demultiplexing, and non-linear SSBI mitigation. The receiver structure, along with the MIMO-NN, is depicted in Figure 10. The MIMO-NN consists of four layers and takes the six digital waveforms as inputs. It first extracts the in-phase and quadrature components of the dual-polarization optical field. Then, the MIMO-NN performs polarization mode demultiplexing by utilizing the inverse matrix of the polarization rotation unitary matrix. This integrated scheme enables the reconstruction of the optical field, the demultiplexing of the polarization modes, and the mitigation of nonlinear SSBI effects. By harnessing the capabilities of the MIMO-NN, the receiver achieves the detection of four-dimensional modulated signals, encompassing the amplitudes and phases of both polarizations. This advanced technique significantly enhances the SE of the DD system, approaching the performance levels of coherent detection systems.

3.5. Fast Remodeling

Transfer learning (TL) refers to the process of leveraging knowledge and experience gained from previous tasks to improve performance on new target tasks. In TL, the source task and the target task may not be consistent, meaning that they may differ in terms of data distribution, input/output spaces, or even objectives. In optical fiber communications, to reduce the number of training symbols and epochs, TL has been introduced and proven to enable fast remodeling [93,94], nonlinear equalization, and optical signal-to-noise ratio estimation [95]. In [93], a TL-assisted ANN was proposed for a multi-channel nonlinearity mitigation scheme in an SSB system. In the case of multi-channel transmission where multiple channels co-propagate in the same fiber, there exists a correlation of nonlinear distortion. This means that the nonlinear effects introduced by one channel can impact the other channels. Understanding and accounting for this correlation is crucial in designing and optimizing multi-channel transmission systems. By considering the correlation of nonlinear distortion, more accurate modeling and compensation techniques can be developed to mitigate the impact of nonlinearities and improve the overall system performance.

The principle of TL-ANN for multiple channels is shown in Figure 11. At the initial stage, an ANN is trained using labeled training data that have been collected. Once the initial training is complete, the prior distribution of parameters from the trained source model is transferred to accelerate the remodeling process. This parameter transfer avoids the need for re-initialization in the retraining method. By leveraging the learned knowledge from the source model, the remodeling process can be accelerated and potentially achieve a better performance. Subsequently, a few samples are used to train the parameters of the TL-ANN so that it can converge and accurately compensate for the impairments in the current channel. The experimental results show that the required training epochs can be reduced by 80% without BER performance degradation, saving considerable computational complexity.

3.6. Optical Signal Processing

Photonic NNs, also known as optical NNs, are a class of NNs that utilize the principles of photonics to perform signal processing in the optical domain. Instead of relying on traditional electronic components, these networks employ optical elements for computation and communication. One specific implementation of photonic NNs is photonic reservoir computing (RC), which is an ML framework that utilizes a fixed, random dynamical system called the reservoir to process data. In the case of a photonic RC, the reservoir is implemented using photonic components and principles. In [96], a recurrent optical spectrum slicing (ROSS) neuromorphic accelerator was proposed to realize an SCD receiver. This network aimed to extract phase information and demodulate QAM formats while simultaneously mitigating CD. The structure of the neuromorphic receiver based on ROSS is illustrated in Figure 12.

At the receiver side, a semiconductor optical amplifier compensates for the transmission and insertion losses of the integrated chip. The structure includes three recurrent nodes, each consisting of a Mach–Zehnder delay interferometer (MZDI) in a loop equipped with variable optical attenuators, phase shifters, and optical delays. PDs follow these nodes and are then connected to transimpedance amplifiers and ADCs. The subsequent DSP includes a light-based RNN for each quadrature. This configuration enables the extraction of phase information, demodulation of QAM formats, and effective mitigation of CD using the photonic components and principles employed in the ROSS structure. The photonic RC contributes to reducing the power consumption associated with high-bandwidth PDs/ADCs and heavy digital equalization algorithms.

4. Conclusions

This paper introduced the challenges and advancements in SCD systems and reviewed the application of ML techniques in addressing these challenges. The utilization of ML algorithms has exhibited promising results in compensating for various impairments such as fiber nonlinearity, IQ imbalance, SSBI, PR, polarization demultiplexing, and fast channel remodeling. CNNs, LSTMs, sparsely connected ANNs, and MIMO-NNs have been successfully employed to achieve accurate nonlinear impairment compensation and efficient signal processing. Furthermore, transfer learning has been utilized to reduce training time and improve modeling in multi-channel scenarios, while the residual learning method combined with a CNN has been proven effective for optical field recovery. Additionally, the emergence of photonic NNs, such as photonic reservoir computing, harnesses the advantages of photonics for information processing in SCD systems. The integration of ML techniques into SCD systems has resulted in significant enhancements in modulation dimensions, SE, transmission performance, and capacity. Integrating machine learning into direct detection systems may also raise costs due to specialized hardware needs for efficient computation. The actual impact varies with the technology advancements and performance benefits gained. However, further research is necessary to optimize ML models, explore novel network architectures, and address practical implementation challenges to fully leverage the potential of ML in SCD systems. In the context of SCD systems, machine-learning techniques are increasingly favored for tasks such as optical field recovery or phase retrieval: tasks that traditional nonlinear equalization algorithms struggle to achieve. Regarding challenges linked to applying ML in SCD systems, these involve concerns about computational complexity and hardware requirements, especially for ASIC chips. Consequently, it is essential to focus future endeavors on exploring and resolving the intricacies of ML algorithms to facilitate their practical implementation.

In summary, the combination of SCD systems and ML techniques holds tremendous promise for enabling high-capacity, cost-effective, and reliable optical communication networks in the 6G era and beyond. The advancements in ML algorithms offer new avenues for overcoming the challenges and improving the overall performance of SCD systems.

Author Contributions

Conceptualization, Q.W., Z.X., Y.Z. (Yixiao Zhu), Q.Z. and W.H.; methodology, Q.W., Z.X. and Y.Z. (Yixiao Zhu); investigation, Q.W., Y.Z. (Yikun Zhang), H.J., Y.Y., G.Q., L.L., S.W., J.L. (Junpeng Liang), J.W., J.L. (Jiali Li) and Z.H.; writing—original draft preparation, Q.W.; writing—review and editing, Q.W., Z.X. and Y.Z. (Yixiao Zhu). All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by National Natural Science Foundation of China (62271305).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. This table gives the abbreviations and their definitions used in the paper.

Abbreviation	Definition	Abbreviation	Definition
LO	Local oscillator	CD	Chromatic dispersion
DSP	Digital signal processing	IM-DD	Intensity modulation and direct detection
SE	Spectral efficiency	PD	Photodetector
VSB	Vestigial sideband	DSB	Double-sideband
PR	Phase retrieval	SSB	Single-sideband
CADD	Carrier-assisted differential detection	ASCD	Asymmetric self-coherent detection
SVR	Stokes-vector receiver	JSFR	Jones-space field recovery
SCD	Self-coherent detection	SSBI	Signal-to-signal beating interference
ML	Machine learning	ADC	Analog-to-digital converter
MZM	Mach–Zehnder modulator	NN	Neural network
CNN	Convolutional neural network	BPD	Balanced photodetector
CSPR	Carrier-to-signal power ratio	PBS	Polarization beam splitter
MIMO	Multi-input multi-output	GR	Generalized receiver
QAM	Quadrature amplitude modulation	NLSE	Nonlinear Schrödinger equation
ANN	Artificial neural network	LSTM	Long short-term memory network
RNN	Recurrent neural network	WDM	Wavelength-division-multiplexing
BER	Bit-error-rate	KK	Kramers–Kronig
ReLU	Rectified Linear Unit	TL	Transfer learning
RC	Reservoir computing	ROSS	Recurrent optical spectrum slicing
MZDI	Mach–Zehnder delay interferometer

References

Kikuchi, K. Fundamentals of Coherent Optical Fiber Communications. J. Light. Technol. 2016, 34, 157–179. [Google Scholar] [CrossRef]
Cvijetic, M. Coherent and Nonlinear Lightwave Communications; Artech House: Boston, MA, USA, 2016; ISBN 13-978-0890065907. [Google Scholar]
Batagelj, B.; Janyani, V.; Tomazic, S. Research challenges in optical communications towards 2020 and beyond. Inf. MIDEM 2014, 44, 177–184. [Google Scholar]
Ip, E.M.; Kahn, J.M. Fiber Impairment Compensation Using Coherent Detection and Digital Signal Processing. J. Light. Technol. 2009, 28, 502–519. [Google Scholar] [CrossRef]
Uzunidis, D.; Logothetis, M.; Stavdas, A.; Hillerkuss, D.; Tomkos, I. Fifty Years of Fixed Optical Networks Evolution: A Survey of Architectural and Technological Developments in a Layered Approach. Telecom 2022, 3, 619–674. [Google Scholar] [CrossRef]
Savory, S.J.; Gavioli, G.; Killey, R.I.; Bayvel, P. Electronic compensation of chromatic dispersion using a digital coherent receiver. Opt. Express 2007, 15, 2120–2126. [Google Scholar] [CrossRef]
Winzer, J.; Essiambr, R. Advanced Optical Modulation Formats. Proc. IEEE 2006, 94, 952–985. [Google Scholar] [CrossRef]
Zhong, K.; Zhou, X.; Huo, J.; Yu, C.; Lu, C.; Lau, A.P.T. Digital Signal Processing for Short-Reach Optical Communications: A Review of Current Technologies and Future Trends. J. Light. Technol. 2018, 36, 377–400. [Google Scholar] [CrossRef]
Wu, Q.; Zhu, Y.; Cheng, Z.; Yin, L.; Hu, W. Spectrally sliced heterodyne coherent receiver with halved electrical bandwidth. Chin. Opt. Lett. 2022, 20, 090601-1-090601-6. [Google Scholar] [CrossRef]
Shieh, W.; Sun, C.; Ji, H. Carrier-assisted differential detection. Light. Sci. Appl. 2020, 9, 1–9. [Google Scholar] [CrossRef]
Che, D.; Sun, C.; Shieh, W. Maximizing the spectral efficiency of Stokes vector receiver with optical field recovery. Opt. Express 2018, 26, 28976–28981. [Google Scholar] [CrossRef]
Che, D.; Hu, Q.; Shieh, W. Linearization of Direct Detection Optical Channels Using Self-Coherent Subsystems. J. Light. Technol. 2016, 34, 516–524. [Google Scholar] [CrossRef]
Zhang, J.; Yu, J.; Li, X.; Wei, Y.; Wang, K.; Zhao, L.; Zhou, W.; Kong, M.; Pan, X.; Liu, B.; et al. 100 Gbit/s VSB-PAM-n IM/DD transmission system based on 10 GHz DML with optical filtering and joint nonlinear equalization. Opt. Express 2019, 27, 6098–6105. [Google Scholar] [CrossRef] [PubMed]
Li, Z.; Erkilinc, M.S.; Shi, K.; Sillekens, E.; Galdino, L.; Thomsen, B.C.; Bayvel, P.; Killey, R.I. SSBI Mitigation and the Kramers–Kronig Scheme in Single-Sideband Direct-Detection Transmission with Receiver-Based Electronic Dispersion Compensation. J. Light Technol. 2017, 35, 1887–1893. [Google Scholar] [CrossRef]
Wu, Q.; Zhu, Y.; Hu, W. Low-complexity and Non-iterative SSBI Decomposition and Cancellation Algorithm for SSB Direct Detection System. In Proceedings of the Optical Fiber Communication Conference, San Diego, CA, USA, 6–10 March 2022; p. M2H.6. [Google Scholar]
Bo, T.; Kim, B.; Yu, Y.; Kim, D.; Kim, H. Generation of Broadband Optical SSB Signal Using Dual Modulation of DML and EAM. J. Light Technol. 2021, 39, 3064–3071. [Google Scholar] [CrossRef]
Bo, T.; Kim, H. Generalized model of optical single sideband generation using dual modulation of DML and EAM. Opt. Express 2020, 28, 28491–28501. [Google Scholar] [CrossRef]
Chen, H.; Fontaine, N.K.; Gene, J.M.; Ryf, R.; Neilson, D.T.; Raybon, G. Dual Polarization Full-Field Signal Waveform Reconstruction Using Intensity Only Measurements for Coherent Communications. J. Light. Technol. 2020, 38, 2587–2597. [Google Scholar] [CrossRef]
Gerchberg, R. A practical algorithm for the determination of phase from image and diffraction plane pictures. Optik 1972, 35, 237–246. [Google Scholar]
Karar, A.; Falou, A.; Barakat, J.; Gürkan, Z.; Zhong, K. Recent Advances in Coherent Optical Communications for Short-Reach: Phase Retrieval Methods. Photonics 2023, 10, 308. [Google Scholar] [CrossRef]
Wu, Q.; Zhu, Y.; Zhuge, Q.; Hu, W. Carrier-Assisted Phase Retrieval with Multiple Projections. J. Light. Technol. 2022, 36, 465–468. [Google Scholar] [CrossRef]
Qin, P.; Bai, C.; Wang, Z.; Xu, H.; Yang, L.; Zhang, Y.; Ge, P.; Luo, X. Offset double sideband carrier assisted differential de-tection with field recovery at low carrier-to-signal power ratio. Opt. Express 2022, 30, 48112–48132. [Google Scholar] [CrossRef]
Qin, P.; Bai, C.; Xu, H.; Yang, L.; Zhang, Y.; Luo, X.; Chi, X.; Zhang, R.; Li, P.; Wu, Y. Offset double-sideband signal field re-covery at low CSPR using filter-assisted direct detection. Opt. Express 2022, 30, 48112–48132. [Google Scholar] [CrossRef]
Li, X.; O’sullivan, M.; Xing, Z.; Alam, S.; Mousa-Pasandi, M.E.; Plant, D.V. Asymmetric self-coherent detection. Opt. Express 2021, 29, 25412–25427. [Google Scholar] [CrossRef] [PubMed]
Che, D.; Li, A.; Chen, X.; Hu, Q.; Wang, Y.; Shieh, W. Stokes Vector Direct Detection for Linear Complex Optical Channels. J. Light. Technol. 2015, 33, 678–684. [Google Scholar] [CrossRef]
Kikuchi, K. Quantum Theory of Noise in Stokes Vector Receivers and Application to Bit Error Rate Analysis. J. Light. Technol. 2020, 38, 3164–3172. [Google Scholar] [CrossRef]
Qian, D.; Cvijetic, N.; Hu, J.; Wang, T. 108 Gb/s OFDMA-PON with Polarization Multiplexing and Direct Detection. J. Light. Technol. 2009, 28, 484–493. [Google Scholar] [CrossRef]
Ji, H.; Li, J.; Li, X.; Dong, S.; Xu, Z.; Su, Y.; Shieh, W. Complementary Polarization-Diversity Coherent Receiver for Self-Coherent Homodyne Detection with Rapid Polarization Tracking. J. Light. Technol. 2022, 40, 2773–2779. [Google Scholar] [CrossRef]
Wu, Q.; Zhu, Y.; Zhuge, Q.; Hu, W. Dual-Carrier-Assisted Phase Retrieval for Polarization Division Multiplexing. J. Light. Technol. 2022, 40, 7297–7306. [Google Scholar] [CrossRef]
Wu, Q.; Zhu, Y.; Jiang, H.; Zhuge, Q.; Hu, W. Optical Field Recovery in Jones Space. J. Light. Technol. 2022, 41, 66–74. [Google Scholar] [CrossRef]
Krizhevsky, A.; Sutskever, I.; Hinton, G. ImageNet Classification with Deep Convolutional Neural Networks. In Proceedings of the Advances in Neural Information Processing Systems, Lake Tahoe, NV, USA, 3–16 December 2012; pp. 1097–1105. [Google Scholar]
Otter, D.W.; Medina, J.R.; Kalita, J.K. A Survey of the Usages of Deep Learning for Natural Language Processing. IEEE Trans. Neural Netw. Learn. Syst. 2021, 32, 604–624. [Google Scholar] [CrossRef]
Richens, J.; Lee, C.; Johri, S. Improving the accuracy of medical diagnosis with causal machine learning. Nat. Commun. 2022, 11, 3923. [Google Scholar] [CrossRef]
Musumeci, F.; Rottondi, C.; Nag, A.; Macaluso, I.; Zibar, D.; Ruffini, M.; Tornatore, M. An Overview on Application of Machine Learning Techniques in Optical Networks. IEEE Commun. Surv. Tutor. 2018, 21, 1383–1408. [Google Scholar] [CrossRef]
Rottondi, C.; Barletta, L.; Giusti, A.; Tornatore, M. Machine-learning method for quality of transmission prediction of unes-tablished lightpaths. J. Opt. Commun. Netw. 2018, 10, A286. [Google Scholar] [CrossRef]
Khan, F.N.; Zhou, Y.; Lau, A.P.T.; Lu, C. Modulation format identification in heterogeneous fiber-optic networks using artificial neural networks. Opt. Express 2012, 20, 12422–12431. [Google Scholar] [CrossRef] [PubMed]
Saif, W.S.; Esmail, M.A.; Ragheb, A.M.; Alshawi, T.A.; Alshebeili, S.A. Machine Learning Techniques for Optical Performance Monitoring and Modulation Format Identification: A Survey. IEEE Commun. Surv. Tutor. 2020, 22, 2839–2882. [Google Scholar] [CrossRef]
Eriksson, T.A.; Bulow, H.; Leven, A. Applying Neural Networks in Optical Communication Systems: Possible Pitfalls. IEEE Photon-Technol. Lett. 2017, 29, 2091–2094. [Google Scholar] [CrossRef]
Chen, Z.; Wang, W.; Zou, D.; Ni, W.; Luo, D.; Li, F. Real-Valued Neural Network Nonlinear Equalization for Long-Reach PONs Based on SSB Modulation. IEEE Photon-Technol. Lett. 2022, 35, 167–170. [Google Scholar] [CrossRef]
Xu, Z.; Ji, H.; Yang, Y.; Qiao, G.; Wu, Q.; Li, J.; Lu, W.; Liu, L.; Wang, S.; Wei, J.; et al. Mixed-Precision Integer-Arithmetic-Only Neural Network-Based Equalizers for DML-Based Short-Reach IM/DD Systems. In Proceedings of the 2023 European Conference on Optical Communication (ECOC), Glasgow, Scotland, 1–5 October 2023; p. Th.A.3.4. [Google Scholar]
Xu, Z.; Dong, S.; Manton, J.H.; Shieh, W. Low-Complexity Multi-Task Learning Aided Neural Networks for Equalization in Short-Reach Optical Interconnects. J. Light. Technol. 2021, 40, 45–54. [Google Scholar] [CrossRef]
Giacoumidis, E. Blind Nonlinearity Equalization by Machine Learning based Clustering for Single-and Multi-Channel Coherent Optical OFDM. J. Light. Technol. 2018, 36, 721–727. [Google Scholar] [CrossRef]
Zibar, D.; Piels, M.; Jones, R.; Schaeffer, C.G. Machine Learning Techniques in Optical Communication. J. Light. Technol. 2016, 34, 1442–1452. [Google Scholar] [CrossRef]
Lu, X.; Qiao, L.; Zhou, Y.; Yu, W.; Chi, N. An I-Q-Time 3-dimensional post-equalization algorithm based on DBSCAN of machine learning in CAP VLC system. Opt. Commun. 2019, 430, 299–303. [Google Scholar] [CrossRef]
Nguyen, T.; Mhatli, S.; Giacoumidis, E.; Van Compernolle, L.; Wuilpart, M.; Mégret, P. Fiber Nonlinearity Equalizer Based on Support Vector Classification for Coherent Optical OFDM. IEEE Photon-J. 2016, 8, 2. [Google Scholar] [CrossRef]
Giacoumidis, E.; Mhatli, S.; Stephens, M.F.; Tsokanos, A.; Wei, J.; McCarthy, M.E.; Doran, N.J.; Ellis, A.D. Reduction of Non-linear Inter-Subcarrier Intermixing in Coherent Optical OFDM by a Fast Newton-based Support Vector Machine Nonlinear Equalizer. J. Light. Technol. 2017, 35, 2391–2397. [Google Scholar] [CrossRef]
Chagnon, M.; Karanov, B.; Schmalen, L. Experimental demonstration of a dispersion tolerant end-to-end deep learning-based im-dd transmission system. In Proceedings of the European Conference on Optical Communication (ECOC), Roma, Italy, 23–27 September 2018; pp. 1–3. [Google Scholar]
Katumba, A.; Yin, X.; Dambre, J.; Bienstman, P. A Neuromorphic Silicon Photonics Nonlinear Equalizer for Optical Com-Munications with Intensity Modulation and Direct Detection. J. Light. Technol. 2019, 37, 2232–2239. [Google Scholar] [CrossRef]
Haigh, P.A.; Ghassemlooy, Z.; Rajbhandari, S.; Papakonstantinou, I.; Popoola, W. Visible Light Communications: 170 Mb/s Using an Artificial Neural Network Equalizer in a Low Bandwidth White Light Configuration. J. Light. Technol. 2014, 32, 1807–1813. [Google Scholar] [CrossRef]
Tanimura, T.; Hoshida, T.; Kato, T.; Watanabe, S.; Morikawa, H. Data-analytics-based optical performance monitoring technique for optical transport networks. In Proceedings of the Optical Fiber Communication Conference, San Diego, CA, USA, 11–15 March 2018; p. Tu3E.3. [Google Scholar]
Xu, Z.; Sun, C.; Manton, J.H.; Shieh, W. Joint Equalization of Linear and Nonlinear Impairments for PAM4 Short-Reach Direct Detection Systems. IEEE Photon-Technol. Lett. 2021, 33, 425–428. [Google Scholar] [CrossRef]
You, Y.; Jiang, Z.; Janz, C. Machine Learning-Based EDFA Gain Model. In Proceedings of the 2018 European Conference on Optical Communication (ECOC), Rome, Italy, 23–27 September 2018; pp. 1–3. [Google Scholar]
Morais, R.M.; Pedro, J. Machine Learning Models for Estimating Quality of Transmission in DWDM Networks. J. Opt. Commun. Netw. 2018, 10, D84–D99. [Google Scholar] [CrossRef]
Yankov, M.; Ros, F.; Moura, U.; Carena, A.; Zibar, D. Flexible Raman Amplifier Optimization Based on Machine Learn-ing-Aided Physical Stimulated Raman Scattering Model. J. Light. Technol. 2017, 35, 1887–1893. [Google Scholar]
Panayiotou, T.; Savva, G.; Shariati, B.; Tomkos, I.; Ellinas, G. Machine learning for QoT estimation of unseen optical network states. In Proceedings of the Optical Fiber Communication Conference 2019, San Diego, CA, USA, 3–7 March 2019; p. Tu2E.2. [Google Scholar]
Jimenez, T.; Aguado, J.C.; de Miguel, I.; Duran, R.J.; Angelou, M.; Merayo, N.; Fernandez, P.; Lorenzo, R.M.; Tomkos, I.; Abril, E.J. A Cognitive Quality of Transmission Estimator for Core Optical Networks. J. Light. Technol. 2013, 31, 942–951. [Google Scholar] [CrossRef]
Xu, Z.; Sun, C.; Ji, T.; Ji, H.; Manton, J.; Shieh, W. Cascade Recurrent Neural Network Enabled 100-Gb/s PAM4 Short-Reach Optical Link Based on DML. In Proceedings of the Optical Fiber Communication Conference, San Diego, CA, USA, 8–12 March 2020; p. W2A.45. [Google Scholar]
Sambo, N.; Pointurier, Y.; Cugini, F.; Valcarenghi, L.; Castoldi, P.; Tomkos, I. Light path Establishment Assisted by Offline QoT Estimation in Transparent Optical Networks. J. Opt. Commun. Netw. 2010, 2, 928–937. [Google Scholar] [CrossRef]
Seve, E.; Pesic, J.; Delezoide, C.; Bigo, S.; Pointurier, Y. Learning process for reducing uncertainties on network parameters and design margins. J. Opt. Commun. Netw. 2018, 10, A298–A306. [Google Scholar] [CrossRef]
Wang, D.; Zhang, M.; Li, J.; Li, Z.; Li, J.; Song, C.; Chen, X. Intelligent constellation diagram analyzer using convolutional neural network-based deep learning. Opt. Express 2017, 25, 17150–17166. [Google Scholar] [CrossRef] [PubMed]
Musumeci, F.; Rottondi, C.E.M.; Corani, G.; Shahkarami, S.; Cugini, F.; Tornatore, M. A Tutorial on Machine Learning for Failure Management in Optical Networks. J. Light. Technol. 2019, 37, 4125–4139. [Google Scholar] [CrossRef]
Varughese, S.; Lippiatt, D.; Richter, T.; Tibuleac, S.; Ralph, S.E. Identification of Soft Failures in Optical Links Using Low Complexity Anomaly Detection. In Proceedings of the Optical Fiber Communication Conference (OFC) 2019, San Diego, CA, USA, 3–7 March 2019; p. W2A.46. [Google Scholar]
Shahkarami, S.; Musumeci, F.; Cugini, F.; Tornatore, M. Machine-Learning-Based Soft-Failure Detection and Identification in Optical Networks. In Proceedings of the Optical Fiber Communications Conference and Exposition (OFC 2018), San Diego, CA, USA, 11–15 March 2018; pp. 1–3. [Google Scholar] [CrossRef]
Khan, F.N.; Fan, Q.; Lu, C.; Lau, A.P.T. An Optical Communication’s Perspective on Machine Learning and Its Applications. J. Light. Technol. 2019, 37, 493–516. [Google Scholar] [CrossRef]
Vela, A.P.; Ruiz, M.; Fresi, F.; Sambo, N.; Cugini, F.; Meloni, G.; Poti, L.; Velasco, L.; Castoldi, P. BER Degradation Detection and Failure Identification in Elastic Optical Networks. J. Light. Technol. 2017, 35, 4595–4604. [Google Scholar] [CrossRef]
Qin, P.; Bai, C.; Qi, Q.; Xu, H.; Yang, F.; Li, P.; Zhang, Y. Analytical Study and Noise Mitigation for Complex-Valued Optical DSB Transmission with Carrier-Assisted Differential Detection Receiver. IEEE Access 2023, 11, 31443–31455. [Google Scholar] [CrossRef]
Zhang, K.; Fan, Y.; Ye, T.; Tao, Z.; Oda, S.; Tanimura, T.; Akiyama, Y.; Hoshida, T. Fiber nonlinear noise-to-signal ratio esti-mation by machine learning. In Proceedings of the Optical Fiber Communication Conference (OFC) 2019, San Diego, CA, USA, 3–7 March 2019; p. Th2A.45. [Google Scholar]
Schaedler, M.; Kuschnerov, M.; Pachnicke, S.; Bluemm, C.; Pittala, F.; Changsong, X. Subcarrier Power Loading for Coherent Optical OFDM optiMized by Machine Learning. In Proceedings of the Optical Fiber Communication Conference, San Diego, CA, USA, 3–7 March 2019. [Google Scholar]
Wang, C.; Fu, S.; Xiao, Z.; Tang, M.; Liu, D. Long Short-Term Memory Neural Network (LSTM-NN) Enabled Accurate Optical Signal-to-Noise Ratio (OSNR) Monitoring. J. Light. Technol. 2019, 37, 4140–4146. [Google Scholar] [CrossRef]
Kamalov, V.; Jovanovski, L.; Vusirikala, V.; Zhang, S.; Yaman, F.; Nakamura, K.; Inoue, T.; Mateo, E.; Inada, Y. Evolution from 8QAM live Traffic to PS 64-QAM with Neural-Network Based Nonlinearity Compensation on 11000 km Open Subsea Cable. In Proceedings of the Optical Fiber Communication Conference, San Diego, CA, USA, 11–15 March 2018. [Google Scholar]
Wang, Z.; Yang, A.; Guo, P.; He, P. OSNR and nonlinear noise power estimation for optical fiber communication systems using LSTM based deep learning technique. Opt. Express 2018, 26, 21346–21357. [Google Scholar] [CrossRef]
Jones, R.; Eriksson, T.; Yankov, M.; Zibar, D. Deep Learning of Geometric Constellation Shaping Including Fiber Nonlinearities. In Proceedings of the 2018 European Conference on Optical Communication (ECOC), Rome, Italy, 23–27 September 2018; pp. 1–3. [Google Scholar]
Jones, R.; Yankov, M.; Zibar, D. End-to-end learning for GMI optimized geometric constellation shape. In Proceedings of the 2018 European Conference on Optical Communication (ECOC), Dublin, Ireland, 22–26 September 2019; pp. 1–3. [Google Scholar]
Khan, F.N.; Zhong, K.; Zhou, X.; Al-Arashi, W.H.; Yu, C.; Lu, C.; Lau, A.P.T. Joint OSNR monitoring and modulation format identification in digital coherent receivers using deep neural networks. Opt. Express 2017, 25, 17767–17776. [Google Scholar] [CrossRef]
Jarajreh, M.A.; Giacoumidis, E.; Aldaya, I.; Le, S.T.; Tsokanos, A.; Ghassemlooy, Z.; Doran, N.J. Artificial Neural Network Nonlinear Equalizer for Coherent Optical OFDM. IEEE Photon-Technol. Lett. 2015, 27, 387–390. [Google Scholar] [CrossRef]
Khan, F.N.; Yu, Y.; Tan, M.C.; Yu, C.; Lau, A.P.T.; Lu, C. Simultaneous OSNR monitoring and modulation format identification using asynchronous single channel sampling. In Proceedings of the Asia Communications and Photonics Conference 2015, Hong Kong, China, 19–23 November 2015; p. AS4F.6. [Google Scholar]
Tanimura, T.; Hoshida, T.; Kato, T.; Watanabe, S.; Morikawa, H. Convolutional Neural Network-Based Optical Performance Monitoring for Optical Transport Networks. J. Opt. Commun. Netw. 2019, 11, A52–A59. [Google Scholar] [CrossRef]
Antonelli, C.; Mecozzi, A.; Shtaif, M.; Chen, X.; Chandrasekhar, S.; Winzer, P.J. Kramers-Kronig coherent receiver. Optica 2016, 3, 1220–1227. [Google Scholar] [CrossRef]
Bo, T.; Kim, H. Kramers-Kronig receiver operable without digital upsampling. Opt. Express 2018, 26, 13810–13818. [Google Scholar] [CrossRef] [PubMed]
Bo, T.; Kim, H. Toward Practical Kramers-Kronig Receiver: Resampling, Performance, and Implementation. J. Light. Technol. 2018, 37, 461–469. [Google Scholar] [CrossRef]
Ming, H.; Zhang, L.; Yang, F.; Ruan, X.; Zhang, F. High-speed SSB transmission with a silicon MZM and a soft combined artificial neural network-based equalization. Opt. Lett. 2020, 45, 2066–2069. [Google Scholar] [CrossRef] [PubMed]
Wu, Q.; Li, X.; Zhu, Y.; Chen, C.; Xu, Z.; Li, J.; He, Z.; Hu, W. Single-Wavelength Net 1 Tb/s Transmission in SMF and 6.4 Tb/s in Weakly-Coupled 7-Core MCF Using a Phase-and Polarization-Diverse Direct Detection Receiver in Jones spacey. In Proceedings of the 2023 OptoElectronics and Communications Conference (OECC), Shanghai, China, 2–6 July 2023. [Google Scholar]
Li, X.; An, S.; Ji, H.; Li, J.; Shieh, W.; Su, Y. Deep-learning-enabled high-performance full-field direct detection with dispersion diversity. Opt. Express 2022, 30, 11767–11788. [Google Scholar] [CrossRef]
Li, X.; Li, J.; An, S.; Liu, H.; Shieh, W.; Su, Y. Deep-learning-enabled Direct Detection with Reduced Computational Complexity and High Electrical-spectral-efficiency. J. Light. Technol. 2023, 41, 5495–5502. [Google Scholar] [CrossRef]
Wu, Q.; Zhu, Y.; Jiang, H.; Fu, M.; Zhang, Y.; Zhuge, Q.; Hu, W. Four-dimensional direct detection with Jones space optical full field recovery. arXiv 2022, arXiv:2212.14587. [Google Scholar]
Wan, Z.; Li, J.; Shu, L.; Luo, M.; Li, X.; Fu, S.; Xu, K. Nonlinear equalization based on pruned artificial neural networks for 112-Gb/s SSB-PAM4 transmission over 80-km SSMF. Opt. Express 2018, 26, 10631–10642. [Google Scholar] [CrossRef]
Fang, X.; Chen, X.; Yang, F.; Zhang, L.; Zhang, F. 6.4Tb/s SSB WDM Transmission Over 320km SSMF with Linear Network-Assisted LSTM. Photonic. Tech. Lett. 2021, 33, 1407–1410. [Google Scholar] [CrossRef]
An, S.; Zhu, Q.; Li, J.; Ling, Y.; Su, Y. 112-Gb/s SSB 16-QAM signal transmission over 120-km SMF with direct detection using a MIMO-ANN nonlinear equalizer. Opt. Express 2019, 27, 12794–12805. [Google Scholar] [CrossRef]
Orsuti, D.; Antonelli, C.; Chiuso, A.; Santagiustina, M.; Mecozzi, A.; Galtarossa, A.; Palmieri, L. Phase Retrieval Receiver Based on Deep Learning for Minimum-Phase Signal Recovery. In Proceedings of the 2018 European Conference on Optical Com-munication (ECOC), Basel, Switzerland, 18–22 September 2022; pp. 1–3. [Google Scholar]
Orsuti, D.; Antonelli, C.; Chiuso, A.; Santagiustina, M.; Mecozzi, A.; Galtarossa, A.; Palmieri, L. Deep Learning-Based Phase Retrieval Scheme for Minimum-Phase Signal Recovery. J. Light. Technol. 2023, 41, 578–592. [Google Scholar] [CrossRef]
Orsuti, D.; Cappelletti, M.; Santagiustina, M.; Galtarossa, A.; Palmieri, L. Edge-carrier-assisted Phase-Retrieval Based on Deep Learning Enabling low CSPR and low Applied Dispersion Values. In Proceedings of the Optical Fiber Communication Con-ference, San Diego, CA, USA, 5–9 March 2023; p. M2F.3. [Google Scholar]
Zhu, Y.; Li, L.; Fu, Y.; Hu, W. Symmetric carrier assisted differential detection receiver with low-complexity signal-signal beating interference mitigation. Opt. Express 2020, 28, 19008–19022. [Google Scholar] [CrossRef] [PubMed]
Zhang, J.; Xia, L.; Zhu, M.; Hu, S.; Xu, B.; Qiu, K. Fast remodeling for nonlinear distortion mitigation based on transfer learning. Opt. Lett. 2019, 44, 4243–4246. [Google Scholar] [CrossRef] [PubMed]
Xu, Z.; Sun, C.; Ji, T.; Ji, H.; Manton, J.; Shieh, W. Transfer learning aided neural networks for nonlinear equalization in short-reach direct detection systems. In Proceedings of the Optical Fiber Communication Conference, San Diego, CA, USA, 8–12 March 2020; p. T4D.4. [Google Scholar]
Xia, L.; Zhang, J.; Hu, S.; Zhu, M.; Song, Y.; Qiu, K. Transfer learning assisted deep neural network for OSNR estimation. Opt. Express 2019, 27, 19398–19406. [Google Scholar] [CrossRef]
Sozos, K.; Deligiannidis, S.; Mesaritakis, C.; Bogris, A. Self-Coherent Receiver Based on a Recurrent Optical Spectrum Slicing Neuromorphic Accelerator. J. Light. Technol. 2023, 41, 2666–2674. [Google Scholar] [CrossRef]

Figure 1. (a) Receiver structure for VSB and SSB systems. (b) Optical spectrum of a VSB signal. (c) Optical spectrum of an SSB signal.

Figure 2. (a) Optical spectrum of a complex-valued DSB signal. (b) Phase retrieval receiver. D is the dispersive element. (c) Receiver of the CADD scheme.

Figure 3. (a) Stokes-vector receiver. PBS: polarization beam splitter; BPD: balanced photodetectors. (b) Receiver structure of JSFR scheme. GR: generalized receiver, which could be a PD, PR receiver, CADD receiver, or ASCD receiver.

Figure 4. Conceptual illustration of a linear network-assisted LSTM.

Figure 5. (a) Transfer function of MZM. (b) Principle for ANN pruning. (c) Sparsely connected ANN.

Figure 6. (a) Electrical spectra of a typical direct detection signal and its SSBI. (b) Structure of a soft-combined ANN.

Figure 7. Block diagram of the MIMO-ANN.

Figure 8. (a) Temporal CNN architecture with a detailed view of the down-sampling (D) and up-sampling (U) blocks. Conv/TConv 1D: 1D convolutional/transposed convolutional layer. (b) NN architecture for phase retrieval receiver.

Figure 9. Time-domain data channels in the deep CNN.

Figure 10. Receiver of the JSFR scheme concatenated with a four-layer NN used for polarization demultiplexing. PBS: polarization beam splitter; OC: 3 × 3 optical coupler; D: dispersive element; PD: photodetector.

Figure 11. Schematic diagram of TL-assisted nonlinear compensation in a multichannel scheme.

Figure 12. The neuromorphic receiver based on ROSS concatenated with a light RNN. SOA: semiconductor optical amplifier; TIA: transimpedance amplifier; ADC: analog-to-digital converter. Three recurrent nodes comprised from an MZDI in a loop equipped with a variable optical attenuator, phase shifters, and optical delays.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wu, Q.; Xu, Z.; Zhu, Y.; Zhang, Y.; Ji, H.; Yang, Y.; Qiao, G.; Liu, L.; Wang, S.; Liang, J.; et al. Machine Learning for Self-Coherent Detection Short-Reach Optical Communications. Photonics 2023, 10, 1001. https://doi.org/10.3390/photonics10091001

AMA Style

Wu Q, Xu Z, Zhu Y, Zhang Y, Ji H, Yang Y, Qiao G, Liu L, Wang S, Liang J, et al. Machine Learning for Self-Coherent Detection Short-Reach Optical Communications. Photonics. 2023; 10(9):1001. https://doi.org/10.3390/photonics10091001

Chicago/Turabian Style

Wu, Qi, Zhaopeng Xu, Yixiao Zhu, Yikun Zhang, Honglin Ji, Yu Yang, Gang Qiao, Lulu Liu, Shangcheng Wang, Junpeng Liang, and et al. 2023. "Machine Learning for Self-Coherent Detection Short-Reach Optical Communications" Photonics 10, no. 9: 1001. https://doi.org/10.3390/photonics10091001

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Machine Learning for Self-Coherent Detection Short-Reach Optical Communications

Abstract

1. Introduction

2. SCD Systems

2.1. VSB System

2.2. SSB System

2.3. PR Receiver

2.4. CADD

2.5. SVR

2.6. JSFR

3. ML Techniques in SCD System

3.1. Nonlinear Compensation

3.1.1. Fiber Nonlinearity

3.1.2. Device Nonlinearity

3.1.3. SSBI Cancellation

3.2. IQ Imbalance Correction

3.3. PR and Optical Field Recovery

3.4. Polarization Demultiplexing

3.5. Fast Remodeling

3.6. Optical Signal Processing

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI