Review

Recent Progress of Neuromorphic Computing Based on Silicon Photonics: Electronic–Photonic Co-Design, Device, and Architecture

1 Thrust of Microelectronics of Function Hub, The Hong Kong University of Science and Technology (Guangzhou), Nansha, Guangzhou 511400, China
2 Faculty of Engineering, The University of Hong Kong, Hong Kong 999077, China
3 Research Center for Intelligent Optoelectronic Computing, Zhejiang Laboratory, Hangzhou 311121, China
* Author to whom correspondence should be addressed.
Photonics 2022, 9(10), 698; https://doi.org/10.3390/photonics9100698
Submission received: 20 August 2022 / Revised: 16 September 2022 / Accepted: 19 September 2022 / Published: 27 September 2022
(This article belongs to the Special Issue Emerging Frontiers in Silicon Photonics)

Abstract

The rapid development of neural networks has led to tremendous applications in image segmentation, speech recognition, medical image diagnosis, and many other fields. Among various hardware implementations of neural networks, silicon photonics is considered one of the most promising approaches due to its CMOS compatibility, accessible integration platforms, mature fabrication techniques, and abundant optical components. In addition, neuromorphic computing based on silicon photonics can provide massively parallel processing and high-speed operation with low power consumption, thus enabling further exploration of neural networks. Here, we focused on the development of neuromorphic computing based on silicon photonics, introducing this field from the perspective of electronic–photonic co-design and presenting the architecture and algorithm theory. Finally, we discussed the prospects and challenges of neuromorphic silicon photonics.

1. Introduction

In the past few years, neural networks on conventional computers have been unable to meet the requirements of computational speed and energy consumption due to the memory-wall limitation. In the field of electronic hardware, researchers continue to create deeper and more complex neural network architectures to exploit the potential of electronic hardware platforms [1,2]. As a result, many innovations in processing units can unlock the capabilities of traditional electronic systems. For instance, by aggregating a large number of processing cores, the GPU (graphics processing unit) has exceptionally high parallel computing capability, significantly better than the CPU (central processing unit), which greatly facilitates the development of deep learning [3].
As artificial intelligence evolves, the demand for high performance, energy efficiency, and higher bandwidth in deep learning is endless. As the exponential scaling of electronic transistors, highlighted by Moore’s Law, approaches its physical limits, conventional silicon-based electronic components have gradually hit a bottleneck of unsustainable performance growth. A variety of emerging electronic devices have been proposed for neural networks; memristors [4], phase-change memories (PCMs) [5], ferroelectric random-access memories (FeRAMs) [6], and magnetic random-access memories (MRAMs) [7] are examples of innovative non-volatile memories with high processing speed, large storage capacity, and extended endurance, and they have been used to demonstrate neuromorphic computing more efficiently than conventional electronic components. Nevertheless, electronic connections fundamentally suffer from harsh trade-offs between bandwidth and interconnectivity, which limits the development of high-speed neuromorphic computing.
In the last few years, photonics has started to gain tremendous attention in academia because of the light-speed data processing and parallel transmission that can be achieved at every level of integrated photonic circuits. Unlike electrons, light has more dimensions, such as wavelength, polarization, and spatial mode, which makes approaches to neuromorphic computing or deep learning more creative and feasible. Furthermore, the mature and advanced technology of silicon photonics provides a perfect platform for large-scale photonic fabrication and integration. Amongst all recent schemes, silicon photonics has been regarded as one of the most promising technologies for neuromorphic computing.
Silicon photonics takes advantage of existing CMOS compatibility; therefore, it can be integrated with available CMOS circuits without the need for additional complex processes. Research on neuromorphic computing based on silicon photonics has made rapid progress [8,9], and various photonic devices have been used for this purpose. Mach–Zehnder interferometers (MZIs), micro-ring resonators (MRRs), microcombs, plasma photonic crystals, and phase-change materials (PCMs) have made implementations of optical neural networks and neuromorphic computing based on silicon photonics more feasible and imaginative. In silicon photonics, electronic–photonic co-design is currently one of the most promising schemes for the photonic implementation of neural networks. Moreover, the wide range of emerging applications and devices compatible with silicon photonics, such as MRRs, microcombs, and PCMs, provides the possibility of co-design with more mature electronics. In recent years, numerous architectures and algorithms have been proposed to implement neuromorphic photonic processors, and all these efforts show new possibilities and directions for photonic neural networks.
Here, we reviewed the progress made over the past few years in this rapidly evolving field, including the electronic–photonic co-design, devices, architectures, and algorithms for neuromorphic computing based on silicon photonics. First, we investigated the microstructure of neurons from the perspective of electronic–photonic co-design and explored the various trade-offs between all-optical and optoelectronic implementations of neurons. Then, we discussed three types of devices—PCMs, soliton microcombs, and metasurfaces—and highlighted their essential roles in this area. Subsequently, we focused on photonic neural networks and corresponding algorithms at the system level. Finally, we provided perspectives for further improvement of neuromorphic computing based on silicon photonics.

2. Electronic–Photonic Co-Design

Silicon photonics utilizes basic CMOS fabrication techniques and combines electronic and photonic circuits. Most of the early work on optical neural networks in silicon photonics utilized both optics and electronics. In this section, we reviewed different approaches to implementing micro-architecture functionality with silicon photonic devices and discussed the difference between electronic–photonic co-design and all-optical neuromorphic computing.

2.1. Weight

The weighting function is essential to mimic a biological synapse since changing weights is the primary function of learning in a neural network. As learning continues, these parameters are adjusted toward the values that produce the correct output. In silicon photonics, MRRs are a common method for adjusting the weight value and were first employed to implement the weighting function and matrix multiplication [10]. In 2014, a silicon photonic architecture called “broadcast-and-weight” (see Figure 1a) was proposed and demonstrated to realize tunable weighted connections [11].
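To make the weighting operation concrete, the following is a minimal behavioural sketch (not a device-level model) of a broadcast-and-weight style weighted sum, written under the common assumption that each WDM channel carries one input, a tuned MRR sets its drop-port transmission, and a balanced photodetector pair converts the drop/through split into a signed weight in [−1, 1]. All function and variable names are illustrative.

```python
import numpy as np

def mrr_weighted_sum(inputs, weights):
    """Behavioural sketch of a broadcast-and-weight neuron (illustrative only).

    inputs  : optical power on each WDM channel (non-negative).
    weights : desired signed weights in [-1, 1]; each is realized by one MRR
              whose drop/through split feeds a balanced photodetector pair.
    Returns the photocurrent-domain weighted sum.
    """
    inputs = np.asarray(inputs, dtype=float)
    weights = np.clip(np.asarray(weights, dtype=float), -1.0, 1.0)
    # Map a signed weight w to a drop-port transmission t in [0, 1]:
    # balanced detection yields i = (t - (1 - t)) * P = (2t - 1) * P = w * P.
    t_drop = (weights + 1.0) / 2.0
    i_drop = t_drop * inputs              # current from the drop-port detector
    i_through = (1.0 - t_drop) * inputs   # current from the through-port detector
    return np.sum(i_drop - i_through)     # balanced subtraction, then summation

# Example: three WDM channels, mixed positive and negative weights.
print(mrr_weighted_sum([1.0, 0.5, 0.2], [0.8, -0.3, 0.5]))  # ~0.75
```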
Devices using phase change materials allow for the storage of up to eight levels of data in a single unit that can be adjusted by light pulses [14]. Chalcogenide Ge2Sb2Te5 (GST) is a well-studied phase-change material that has been shown to enable photonic synapses in spiking neurons. Figure 1c shows the scheme for integrating photonic synapses with similar function to neural synapses, which contains multiple PCM islands for precise control of synaptic weights [12]. This study also demonstrates that employing PCM islands in combination with a tapered waveguide structure is more effective than using a regular non-tapered waveguide. However, the application of GST is limited by its high optical absorption and phase transition time. In order to overcome the high optical absorption of GST, Ting Yu et al. adopted Sb2S3 to set the synaptic weights [15]. After the linear weighted addition, the optical signals interact with the GST layer nonlinearly, as shown in Figure 1e,f. Their results show the advantage of noise robustness and potential sub-picosecond delay of embedding this material on a silicon photonics neural network platform.
In summary, the implementation of the weighting function in silicon photonics without electro-optical conversion relies on non-volatile materials. The advantages of using all-optical weighting functions without electrical–optical conversions are evident in many ways, such as high bandwidth and high speed. Nevertheless, optical weights in photonic synapses come at the cost of a large area, which is one of the key indicators for evaluating the practicality of optical schemes. For instance, a conventional memristor-based synapse can be as small as 10 nm when implemented electrically [16], whereas the suggested synapse in reference [12] is as large as 6 μm by 1 μm. Compared to all-optical methods, neurons with the same functionality and precision can be three to four orders of magnitude smaller in area when implemented electrically.

2.2. Summation

In addition to the straightforward sum of inputs and weights, the summation function can be more complex; for example, it can also be a minimum, maximum, majority, product, or one of several normalization procedures. The selected network design and paradigm determine the exact algorithm for merging neural inputs, and the summation function is easy to implement in software. However, the summation function is significant in an optical artificial neural network since it strongly affects scalability. In most previous work, summation requires signal conversion because photodetectors are essential for combining inputs in this function [10]; therefore, the capability of micropillar semiconductor lasers, DFB lasers, and VCSELs to implement the summation function optically has been investigated [17].
PCM-based implementations can support noncoherent neuron summation functions in the all-optical domain [18]; however, scalability is limited in current all-optical methods since the optical summation functions are not compatible with WDM. Furthermore, an all-optical implementation must support a large number of optical inputs at the same time; photodetectors, as the mainstream method, provide a simple implementation but still suffer from low reliability, high noise sensitivity, and low integration density.

2.3. Activation

The activation function can be implemented using electronic or photonic methods, while the effective expressiveness of an artificial neural network depends on the nonlinearity of its activation function.
Currently, the photonic activation function is still in its preliminary stage. For instance, some researchers still realize it through digital computers and feed the modulated optical signals to the next layer. In such an OEO (optical–electrical–optical) activation function, the frequent electro-optical conversion processes introduce substantial delay and power consumption, which limits the speed of the neural network. Therefore, Ian A. D. Williamson et al. proposed a hardware nonlinear optical activation that does not require repeated bidirectional optoelectronic signal conversions, as shown in Figure 2a. In this activation function, most of the signal power remains in the optical domain because only a small portion of the input optical signal is converted into an analog electrical signal used to intensity-modulate the original optical signal [19]. In addition, optical modulators based on combined MRR [20] and SOA [21] structures are also suitable for realizing nonlinear activation; for example, as shown in Figure 2b, the proposed layout exploits a Semiconductor Optical Amplifier (SOA)-based sigmoid activation within a fiber loop. Vertical-cavity surface-emitting lasers (VCSELs) are also great candidates for activation functions, and they exhibit many profound advantages [22], such as easy integration into 2D or 3D arrays, low power consumption, and high coupling efficiency to optical fibers. In reference [23], J. Robertson et al. discussed experimentally and theoretically the dynamics of spikes in VCSELs with respect to controlled suppression. Furthermore, according to reference [24], excitation and suppression can be achieved in VCSELs using dual-polarization injection.
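As an illustration of the OEO-free idea described above, the sketch below implements a simplified, intensity-only model of an electro-optic activation in the spirit of reference [19]: a small fraction alpha of the input power is tapped and detected, and the resulting photocurrent drives a phase shifter that modulates the remaining light. The exact transfer shape and the parameter values are assumptions for illustration, not the published device response.

```python
import numpy as np

def electro_optic_activation(power_in, alpha=0.1, gain=np.pi, phi_bias=0.0):
    """Simplified intensity-only model of an electro-optic activation.

    A fraction `alpha` of the input power is tapped and detected; the resulting
    photocurrent drives a phase shifter / interferometer that modulates the
    remaining (1 - alpha) fraction. Parameter names and the exact transfer
    shape are illustrative, not taken from a specific device.
    """
    power_in = np.asarray(power_in, dtype=float)
    phase = gain * alpha * power_in + phi_bias   # detected power sets the phase
    return (1.0 - alpha) * power_in * np.cos(phase / 2.0) ** 2

# Sweeping the input power traces out a nonlinear (non-monotonic) response.
x = np.linspace(0.0, 4.0, 9)
print(np.round(electro_optic_activation(x), 3))
```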
Figure 2c,d shows a PCM-based all-optical neuron [18], where the PCM is placed on the MRR and acts as the activation unit. As the state of the PCM on the waveguide crossing changes, the transmission response undergoes a considerable change, approximating a ReLU function.

2.4. STDP

The learning capability is required to build neural networks with both spiking and traditional neurons. In spiking neurons, Spike-Timing-Dependent Plasticity (STDP) learning is inherently asynchronous and online, which contrasts with more conventional learning functions; STDP changes the weight of a synapse according to the precise timing of individual pre-synaptic and post-synaptic spikes [25]. Functionally, STDP implements the Hebbian learning rule, in which the strength of neural connections is determined by the correlations between pre- and post-synaptic activity. SOAs and MRRs are both promising candidates for implementing STDP learning. A photonic approach to implementing STDP using an SOA and an Electro-Absorption Modulator (EAM) was demonstrated experimentally in reference [26]. STDP-based unsupervised learning has also been theoretically explored for principal component analysis (PCA). Moreover, high-order passive MRRs were demonstrated as an alternative photonic STDP approach [27]: the intracavity effect induces a power difference at the MRR output from which the inter-spike interval is calculated, and since the proposed scheme is passive, it has lower power consumption than SOA-based schemes.
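For reference, the weight update that the photonic schemes above emulate is the standard exponential STDP window; the sketch below states it explicitly with illustrative amplitudes and time constants, and it is not a model of any particular SOA/EAM or MRR implementation.

```python
import numpy as np

def stdp_delta_w(t_pre, t_post, a_plus=0.01, a_minus=0.012,
                 tau_plus=20e-3, tau_minus=20e-3):
    """Generic exponential STDP window (illustrative values, times in seconds).

    If the pre-synaptic spike precedes the post-synaptic spike (dt > 0) the
    synapse is potentiated; otherwise it is depressed. This is the textbook
    rule underlying the photonic STDP schemes discussed above.
    """
    dt = t_post - t_pre
    if dt > 0:
        return a_plus * np.exp(-dt / tau_plus)   # potentiation
    return -a_minus * np.exp(dt / tau_minus)     # depression

w = 0.5
for t_pre, t_post in [(0.000, 0.005), (0.050, 0.045)]:  # (pre, post) spike times
    w = np.clip(w + stdp_delta_w(t_pre, t_post), 0.0, 1.0)
print(round(w, 4))
```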

2.5. All-Optical versus Optoelectronic Neurons Implementation

As discussed previously, most previous work on ONNs was implemented with optoelectronic hardware [28]. However, in optoelectronic hardware, considerable power is consumed in electrical-to-optical and optical-to-electrical conversion because the device operates largely in the electrical domain for the summation and activation functions. For instance, photodetectors are frequently used to convert optical signals to electrical outputs, which imposes restrictions on the speed and power efficiency of ONNs. Moreover, O/E/O neurons rely on modulators that utilize the nonlinearity of the electro-optic transfer function. Nevertheless, modulators and photodetectors are susceptible to noise, and noise accumulation can seriously affect the accuracy and energy consumption of ONNs based on optoelectronic hybrid hardware.
All-optical implementation seems to be a promising approach to addressing the problems of optoelectronic hybrid hardware. Compared to the electronic implementation, all-optical neurons usually rely on semiconductor carrier dynamics or the photosensitivity exhibited by many materials. The most obvious advantage is that the optical signal flow in all-optical neurons does not require any conversion; therefore, they are inherently faster than O/E/O schemes. Meanwhile, all-optical schemes using passive optical components can be easily integrated with CMOS technology. The photonic implementation also provides the advantages of high bandwidth in photonic communication and low complexity in implementing the nonlinearity. However, many challenges remain with all-optical neurons, and cascadability is a key one for photonic implementations. Designing all-optical neurons requires more efficient optical devices because of their insertion loss; in some all-optical implementations, achieving cascadability requires compensating for the power consumption at the system level [29].
From a more practical perspective, electronic–photonic co-designed hardware enables neuromorphic computing and remains one of the core research routes until the required optical devices overcome the current challenges. Nevertheless, a gap between electronic and photonic neural networks has always existed because most architectures are designed for a specific platform rather than for optical hardware. Moreover, photonics endows traditional neural networks in the electrical domain with many unique benefits that are difficult or even impossible to implement using electrical devices, even though we acknowledge that the capabilities of ONNs still lag far behind electrical neural networks. This is one of the core reasons for the intensive research on ONNs and their optical components.
Here, we propose a possible framework for an optoelectronic hybrid AI computing chip that is within reach today. As shown in Figure 3, the framework consists of three parts: an optical engine, an analog electronic part, and a digital ASIC or DSP. The optical engine and DSP require some analog electronic components in order to interact with one another because they operate with signals of different strengths. The DSP sends signals to the electrical driver block, which amplifies them and uses them to drive the optical engine. The DSP or ASIC chip has a series of processing units that recover, decode, and error-correct the data streams after compensating for various transmission impairments. Various applications might need slightly different DSP layouts or may not need all of the processing modules. A far better fit between these parts can be achieved by co-designing the photonic integrated circuit (PIC) and the DSP chip. The trade-offs between different DSP and PIC characteristics can be identified more precisely with a co-design technique, which also enhances system-level performance optimization.

3. Devices

In this section, we reviewed some important devices based on solitons, PCMs, and metasurfaces that can be employed to implement ONNs. As shown before, MRRs and photodetectors are used to achieve electrical-to-optical and optical-to-electrical conversion, respectively. In addition to these three kinds of devices, other components based on silicon photonics, such as lasers, couplers, and modulators, are also critical parts of optical circuits and neural networks. As for on-chip lasers, both VCSELs and microdisk lasers support the design of scalable neural networks, although many researchers still employ off-chip lasers to build neural networks. Moreover, waveguides are of great importance in silicon photonics because they are the equivalent of metal wires in the electrical domain; how to minimize optical loss, including propagation loss and bending loss, is under intensive research. In addition, MRRs, microdisks, and MZIs are widely employed to design modulators, switches, and filters [9]. Although silicon photonics can now be considered a mature technology platform relative to the needs of optical neural networks, coupling light into and out of silicon photonic components with high efficiency remains a challenge.
Recently, devices that utilize PCM, soliton microcombs, and metasurfaces have been of great interest in photonic neuromorphic computing. In reference [30], the concept of time–wavelength multiplexing for ONNs was introduced, and the Kerr microcomb was applied to implement a photonic perceptron. In 2021, Xu et al. demonstrated a universal optical vector convolutional accelerator based on simultaneously interleaving temporal, wavelength, and spatial dimensions enabled by an integrated microcomb source [8]. Meanwhile, Feldmann et al. proposed a scheme for an integrated photonic tensor core using phase-change-material memory arrays and soliton microcombs [31]. Moreover, metasurfaces demonstrate a different dimension for on-chip optical neural networks [32]. Metasurfaces with subwavelength resonators are used to manipulate the wavefront of light, allowing the miniaturization of free-space and bulky systems for diffractive neural networks (DNN). By using soliton microcombs, PCMs, and metasurfaces, these novel methods based on silicon photonics enable optical neuromorphic computing, providing an effective way to break through previous bottlenecks in machine learning in electronics.

3.1. Soliton Microcombs

3.1.1. Basic Science

Optical frequency combs (OFCs) have traditionally been based on mode-locked or fiber lasers; however, recent advances have demonstrated OFCs generated in Kerr-nonlinear optical microresonators, commonly referred to as microcombs. A partial history of the development of the optical frequency comb can be seen in Figure 4. A microcomb depends mainly on dissipative Kerr solitons (DKSs); solitons are stable waveforms that retain their shape when propagating within a dispersive medium [33]. Optical solitons have particle-like properties, and multiple solitons can form a variety of bound states through interactions. On a fundamental level, solitons can be found in a plethora of nonlinear systems. In terms of nonlinear dynamics and phenomena, DKSs, as a type of soliton, have attracted significant attention and demonstrated a wide range of applications such as RF photonics, parallel coherent LiDAR, optical frequency synthesizers, and photonic neuromorphic computing [34,35,36]. As shown in Figure 5a, a microresonator driven by a CW pump is widely used to build a Kerr soliton microcomb. By balancing loss and gain in the active medium while simultaneously balancing nonlinearity and dispersion, and remaining in the anomalous group-velocity dispersion (GVD) regime, high-Q microresonators can support fully mode-locked comb states called DKSs [37]. Figure 5b shows the interactions of Kerr combs with other coexistent nonlinear effects, such as nonlinear photon scattering and second-order nonlinear processes, which are associated with light–matter interaction in high-Q microcavity platforms [38].
Microcombs are still little studied for photonic computing; however, recent advances show that they could be a promising candidate as a chip-scale light source. Optical parametric oscillation induced by the Kerr nonlinearity in a microcavity was first reported in 2004 [39]. Figure 4 illustrates significant developments in soliton microcombs. In 2003, the first silicon laser using a silicon waveguide as a gain medium was demonstrated [40]. Then, in 2005, Alexander W. Fang et al. reported the first electrically pumped hybrid silicon laser [41]. As an indispensable component of a fully integrated silicon photonic circuit, recent research on on-chip silicon lasers has also paved the way for future integrated neuromorphic computing circuits [42]. In parallel, in 2007, an optical frequency comb using an approach different from the comb-like mode structure of mode-locked lasers was first generated experimentally through the interaction between a continuous-wave pump laser of known frequency and the modes of a monolithic ultra-high-Q microresonator via the Kerr nonlinearity [43]. In reference [44], a method for generating soliton microcombs was first proposed and is still widely used today. By scanning the pump frequency or the cavity resonance from blue to red (or from red to blue when a heater tunes the resonance), the pump laser can reach the DKS regime and thus generate soliton microcombs. Recently, an integrated turnkey soliton microcomb was demonstrated and theoretically explained [45]. The microcomb, co-integrated with a pump laser and with optical isolation removed, operates at CMOS-compatible frequencies as low as 15 GHz, providing significant advantages for high-volume production, and this integration can also be exploited by integrated neuromorphic computing circuits, as discussed previously [45].
Figure 4. The development history of microcombs and some significant events. (a) Reprinted with permission from Ref. [40], ©2004 Optica Publishing Group. (b) Reprinted with permission from Ref. [42], Copyright © 2005, Springer Nature Limited. (c) Reprinted with permission from Ref. [43]. Copyright © 2007, Nature Publishing Group. (d) Reprinted with permission from [44]. Copyright © 2019, Springer Nature Limited. (e) Reprinted with permission from [45]. Copyright © 2020, Springer Nature Limited.

3.1.2. Computing Based on Soliton Microcomb

Recent research has shown the potential of leveraging soliton microcombs to implement neural networks [8,31]. Before the use of soliton microcombs, various multiplexing methods for realizing parallel synapses had already been applied successfully in ONNs. For example, photonic reservoir computing employs time-domain multiplexing to build large-scale input layers with many nodes [46]. However, the training and scalability of time-division-multiplexed networks are limited in current schemes, whereas introducing microcomb sources into ONNs allows wavelength, time, and spatial multiplexing to be combined simultaneously, addressing the issues discussed above.
Figure 5. Computing Based on Soliton Microcomb. (a) Overview of high-Q resonator platforms for the generation of Kerr frequency combs [38]. (b) Illustration of Kerr comb interaction with other cubic and quadratic nonlinear effects [38]. (c) Sketch of the multiplexed all-optical MVM [31]. (d) Optical micrograph of a high-Q Si3N4 photonic-chip-based microresonator used for frequency comb generation [31]. (e) Optical micrograph of a fabricated 16 × 16 [31]. (a,b) Reprinted with permission from Ref. [38]. © 2021 Wiley-VCH GmbH. (c–e) Reprinted with permission from Ref. [31]. Copyright 2021, The author(s), under exclusive license to Springer Nature Limited.
In reference [30], a single perceptron based on a soliton microcomb was reported operating at 11 billion (10^9) operations per second, with 49 synapses. Based on wavelength multiplexing over 49 microcomb wavelengths combined with temporal multiplexing, the input nodes of this perceptron differ from those of conventional ONNs because they are defined temporally: the symbols are multiplexed and then routed according to their location in time. Following this work, Xu et al. demonstrated a photonic convolutional accelerator (CA) using the concept of time–wavelength multiplexing [8], in which they combined the CA front end with a fully connected neuron layer to form an optical CNN. Chromatic dispersion is used to apply progressive time delays to the wavelength-multiplexed optical signals, which are then combined in the time dimension. Using the microcomb source, the CA is capable of speeds up to 11.3222 TOPS and can process 250,000-pixel images with a single processing core.
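The essence of the time–wavelength multiplexed convolution can be captured in a few lines: each kernel tap is carried by one comb line, dispersion delays line k by k symbol periods, and a photodetector sums the delayed, weighted copies, yielding a sliding dot product. The following behavioural model (device effects ignored, names illustrative) verifies that this procedure reproduces an ordinary 1-D convolution.

```python
import numpy as np

def tw_multiplexed_convolution(signal, kernel):
    """Behavioural model of a time-wavelength multiplexed convolution.

    Each kernel tap is encoded on one comb line (wavelength); the input symbol
    stream modulates all lines in parallel; chromatic dispersion delays line k
    by k symbol periods; a photodetector sums the delayed, weighted copies.
    The result is a sliding dot product, i.e. a 1-D convolution.
    """
    signal = np.asarray(signal, dtype=float)
    kernel = np.asarray(kernel, dtype=float)
    out = np.zeros(len(signal) + len(kernel) - 1)
    for k, w in enumerate(kernel):            # one comb line per tap
        out[k:k + len(signal)] += w * signal  # dispersion = k-symbol delay
    return out

x = np.array([1.0, 2.0, 3.0, 4.0])
h = np.array([0.5, -1.0, 0.25])
print(tw_multiplexed_convolution(x, h))
print(np.convolve(x, h))  # identical result from the standard convolution
```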
Feldmann et al. built an integrated photonic hardware accelerator using phase-change-material memory and soliton microcombs [31], as shown in Figure 5e, where the processor encodes data onto the on-chip microcomb teeth and performs matrix–vector multiplication using the non-volatile configuration of the PCM array of integration units. In this approach, the hybrid integrated microcomb and the PCMs integrated onto waveguides enable in-memory photonic computing with WDM capability; moreover, the parallelized implementation is CMOS-compatible and promises higher bandwidth, lower power, and operation at the speed of light, since the convolution is performed as a passive transmission measurement, making it possible to process an entire image in a single step.
Combined with recent advances in soliton microcombs, ONNs fully integrated on a chip are potentially viable. However, beyond soliton microcombs, solitons can be exploited to implement other computing systems. For example, a variety of logic devices such as half-subtractors, comparators, and AND, OR, XOR, and NOT gates can be implemented with solitons due to their elastic interactions and controllability [47,48]. Nevertheless, these implementations are inherently digital and would forgo the advantages of photonics in analog computing. Moreover, reference [49] theoretically proposed a soliton-based reservoir computing scheme, exploring the possibility of using interacting soliton chains as a reservoir.

3.2. Devices Based on PCM

Devices combined with PCMs are of great interest in building advanced neural networks based on silicon photonics. As shown in Figure 6, the past decades have witnessed the development of PCMs. At present, PCMs are one of the most mature and widely investigated materials. Among emerging non-volatile memory devices, PCM-based optical neural networks are promising to address the limitations of electrical neuromorphic computing.

3.2.1. Basic Science

The basic principle of PCM devices is to induce a phase change between the amorphous and crystalline states of the PCM by applying thermal, optical, or electrical pulses. During this process, the electrical conductivity of the PCM switches between the low-conductivity amorphous state and the high-conductivity crystalline state, corresponding to the low-reflectivity, high-resistance state (HRS) and the high-reflectivity, low-resistance state (LRS), respectively. The crystallization process is generally referred to as SET, which requires the PCM to be exposed to a long (hundreds of nanoseconds) voltage or laser pulse. To reset the PCM, i.e., to achieve amorphization, short, high-intensity laser or electrical pulses sufficient to melt the phase-change layer are used. Owing to these properties, PCMs were first used as storage media for rewritable optical disks such as high-density digital versatile disks (HD-DVDs) and Blu-ray disks, as well as for RAM, as shown in Figure 6.
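The multilevel programming principle described above can be summarized with a toy model: repeated partial-SET pulses increase the crystalline fraction step by step, and the waveguide transmission interpolates between the amorphous and crystalline values. The numbers below (8 levels, transmission range) are purely illustrative; real cells require calibrated pulse energies and show drift and nonlinear level spacing.

```python
import numpy as np

class PCMCellModel:
    """Toy multilevel PCM cell: crystalline fraction sets optical transmission.

    Values (8 levels, transmission range) are illustrative only; real devices
    are programmed with calibrated pulse energies and exhibit drift and
    nonlinear level spacing.
    """
    def __init__(self, levels=8, t_amorphous=0.95, t_crystalline=0.55):
        self.levels = levels
        self.t_am, self.t_cr = t_amorphous, t_crystalline
        self.state = 0                 # 0 = fully amorphous (after RESET)

    def set_pulse(self):               # long, moderate pulse: crystallize one step
        self.state = min(self.state + 1, self.levels - 1)

    def reset_pulse(self):             # short, intense pulse: melt-quench to amorphous
        self.state = 0

    def transmission(self):
        frac = self.state / (self.levels - 1)
        return (1 - frac) * self.t_am + frac * self.t_cr

cell = PCMCellModel()
for _ in range(3):
    cell.set_pulse()
print(round(cell.transmission(), 3))   # partial crystallization -> intermediate level
```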
However, the introduction of PCMs into photonic neuromorphic computing poses new challenges for the most widely used PCMs, namely the Ge–Sb–Te (GST) family, which were originally selected according to the criteria of low electrical resistivity and high optical absorption; in nanophotonic waveguides, low optical loss and high phase-change speed become more critical because of the proximity of the optical mode to the PCM. Recently proposed PCMs such as GeTe [50], GSST [51], Sb2Se3, and Sb2S3 [52,53] can address these emerging problems in photonic PCM applications; in particular, Sb2Se3 and Sb2S3 offer near-zero loss in both the amorphous and crystalline states [54].

3.2.2. Phase Change Materials for Integrated Photonics Computing

As discussed previously, the large refractive-index change induced by the phase transition is the main criterion for employing PCMs in photonic devices, while non-volatile tuning allows for low energy overhead in silicon photonic devices.
Phase-change memory, the forerunner of phase-change-material applications, has a long history since it was first reported by Yamada and his colleagues [55]. In 2012, Pernice et al. demonstrated an all-optical phase-change memory by depositing a GST film on a Si3N4 ring resonator structure on silicon, as shown in Figure 7a. Later, Rios et al. performed a complete analysis of GST memory devices [56] and showed that the resonance wavelength, the Q-factor, and the extinction ratio could be used to retrieve the state of the GST, as shown in Figure 7d,e. These two works paved the way for future research on phase-change memory. In recent years, numerous novel methods for implementing phase-change memory have been presented one after another, including Si3N4 multi-ring resonators with GST patches coupled to a single waveguide and the use of PWM for switching the PCM.
After the first report of phase-change memory, the possibility of applying PCMs to neuromorphic computation was investigated, and PCMs have since been continuously explored and utilized to implement neuromorphic computing [56,57,58]. A method for creating scalable PCM-based optical synaptic networks using silicon-based ring resonators with GST patches was proposed [59], which differs from the scheme of creating PCM synapses on a single waveguide [12] (as shown in Figure 1c). By better incorporating wavelength-division multiplexing, Feldmann et al. designed a spiking synaptic network that achieves both supervised and unsupervised self-learning, as described above, also using a waveguide implementation with PCM integrated on top [18].
In order to reconfigure phase-change photonic devices, it is frequently necessary to adjust the intensity [14] and pulse shape [60] of an incident light wave. Resonant structures [14] are frequently utilized to enable wavelength-selective operation and improved modulation depths. However, a comparable feature is not present in polarization space, which limits the addressability of distinct elements in cascaded systems. Hence, June et al. proposed a hybridized-active-dielectric structure in a nanowire configuration, in which GST, as the active material, undergoes a phase change when power- and polarization-modulated laser pulses are applied, while silicon acts as the dielectric [61]. Polarization-selective dielectric resonances of the Si cavity modify the total absorption in the GST layer. With up to five independent, reconfigurable, non-volatile levels of electrical conductivity addressed by polarization-division demultiplexing, they demonstrated matrix–vector multiplication (MAC-type operations) based on the hybridized-active-dielectric structure, with the input polarization as the tunable vector element, which unlocks an additional route in phase-change photonics.
Given the drawbacks of GST in photonic PCM networks, GSST, Sb2S3, and other alternative materials have been developed to address these problems. In 2021, Yu and his colleagues used Sb2S3 instead of GST in some crucial parts of the network, such as the weighting function [15]. They used two different transitions within the chalcogenide for optical synaptic weighting and all-optical nonlinear thresholding to enable a low-loss, high-speed all-optical deep neural network. GSST was also demonstrated as a promising candidate: in reference [62], Volker utilized MZIs with GSST on both arms, using electrical switching instead of the optical approach. Recently, the integrated photonic tensor core for parallel convolutional processing based on GST demonstrated a scalable, high-speed (10^12 MAC operations per second) and low-power (17 fJ/MAC) scheme by using the integrated microcomb, as discussed previously [31].
Figure 7. Phase change materials for integrated photonics computing. (ac) Schematic overview of the proposed memory element [63]. (d) Optical microscopy image of an array with 25 different memory elements [64]. (e) Three-dimensional scheme of the platform using partially etched ridge waveguides in silicon [64]. (ac) Reprinted with permission from Ref. [63], AIP Publishing. (d,e) Reprinted with permission from Ref. [64]. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

3.3. Metasurfaces

Metasurfaces consist of subwavelength-spaced building blocks with inhomogeneously distributed optical resonators; when light passes through them, the abrupt phase shifts introduced by the resonators create phase discontinuities that reshape and redirect the optical wavefront. Optical computing benefits from metasurfaces since they provide multiparametric optical modulation in a single element, which is competitive and promising for miniaturizing and integrating bulky free-space optical systems, such as optical integrators [65], differentiators [66], and diffractive neural networks [67].

3.3.1. Basic Science

Metasurfaces are two-dimensional metamaterials, which were initially studied by designing meta-atoms and their various spatial arrangements with the aim of generating unusual effective-medium coefficients and investigating the accompanying anomalous physical phenomena [68]. However, metasurfaces are gradually replacing metamaterials as a research hotspot because they are compatible with standard lithography and nanoimprinting techniques and can be easily tuned dynamically.
As discussed above, metasurfaces consist of hundreds of subwavelength structural units, each of which generates a spherical wave packet that forms a new electromagnetic wavefront at the metasurface–air interface [69]. Hence, by adjusting the shape, size, and orientation of the metasurface units, metasurfaces allow precise control over beam propagation, divergence, and information encoding [70]. A limitation lies in the wave-manipulation functionalities being fixed after design and fabrication; in this context, progressive research has been devoted to implementing reconfigurable metasurfaces with external tuning [71,72]. As shown in Figure 8, electronic control [73], light control [74], temperature control [75], mechanical control, and power control are the common methods employed by tunable metasurface structures [71]. Moreover, multifunctionality is also a challenge for metasurfaces, which can be addressed by segmented or interleaved metasurfaces [76,77] and by adding complex transmission profiles using the linearity of the Fourier transform [78].
Optical metasurfaces can operate in the visible and near-infrared spectral ranges. Because electromagnetic waves in this wavelength range are commonly used in imaging applications, optical metasurfaces in this band are of greater interest than those operating in the mid- and far-infrared ranges [82]. Moreover, the feasibility of low-cost, scalable manufacturing of metasurfaces using techniques such as NIL, DUV, and CL, depending on the specific design, provides a promising prospect for their application [79,80,81].

3.3.2. Computing Based on Metasurfaces

Metasurfaces can be used to build a meta-system that implements more complex applications, such as optical analog computing and diffractive deep neural network (D2NN), in both free-space and integrated on-chip ways. Here, we mainly talked about integrated metasurface systems because the metasurface design for integrated photonics platforms follows the same guidelines as free-space metasurface design.
Integrated metasurfaces enable optical analog computing by exploiting Fourier-transform properties. For instance, light propagation in planar waveguides can be controlled by in-plane one-dimensional metasurfaces, or metalines, and leveraged to implement mathematical operations. Figure 9a shows a three-layer meta-system that can perform spatial differentiation of the input signal, and Figure 9b shows the on-chip metalens structure within the system [83]. Moreover, the 1D metalens has a numerical aperture of up to 2.14, allowing light to be focused to within 10 μm with less than 1 dB loss. In reference [84], an on-chip convolution operation was demonstrated based on a silicon metasurface designed with inverse-design optimization; the proposed convolver performed spatial convolution on the provided function at two wavelengths, 1000 and 1550 nm.
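Mathematically, the spatial differentiation performed by such a meta-system corresponds to multiplying the field's spatial spectrum by ik_x between two Fourier-transforming sections. The short numerical check below demonstrates that operation itself; it is not a model of the metasurface.

```python
import numpy as np

def spatial_derivative_fourier(field, dx):
    """First spatial derivative computed in the Fourier domain.

    This is the mathematical operation a differentiating meta-system realizes
    optically (a transfer function proportional to i*k_x applied between two
    Fourier-transforming sections); the code is a numerical check, not a
    model of the metasurface itself.
    """
    field = np.asarray(field, dtype=complex)
    kx = 2.0 * np.pi * np.fft.fftfreq(field.size, d=dx)
    return np.fft.ifft(1j * kx * np.fft.fft(field))

x = np.linspace(0, 2 * np.pi, 256, endpoint=False)
f = np.sin(3 * x)
df = spatial_derivative_fourier(f, x[1] - x[0]).real
print(np.allclose(df, 3 * np.cos(3 * x), atol=1e-6))  # True
```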
Metasurfaces also provide an exceptional solution to the issue of miniaturized integration of ONNs. Early studies showed several methods to implement diffractive deep neural networks (D2NNs) based on free-space optics [85]. Integrated metasystems require less power and footprint and also allow for more information capacity due to the multimode nature of diffraction. Zarei et al. demonstrated an integrated photonic neural network based on on-chip cascaded one-dimensional (1D) metasurfaces at a wavelength of 1.55 µm [86]. Problems remain in many respects, such as the varying effective refractive index of the same slot at different positions and input angles, and mutual interference between adjacent slots of different lengths; to address these issues, an optical deep learning framework was proposed [87]. In this framework, the pre-trained neuron values are physically translated into phase delays, and the relevant phase delays are created by altering the size of the silicon slots. In combination with phase-change materials, metasurfaces can enable a prototypical optical convolutional neural network that utilizes the phase transition of GST to control the waveguide spatial modes with very high precision, up to 64 levels in modal contrast [88].
Two-dimensional metasurfaces have also been used to build on-chip integrated neuromorphic systems [32], where the metasurface devices use a polarization-multiplexing scheme and can perform on-chip multi-channel sensing and multitasking in the visible. This method provides a promising avenue for expanding the field of optical neural networks because many metasurface multiplexing schemes, such as polarization, wavelength, and vortex multiplexing, can be applied to all-optical neural networks.

4. Architecture and Algorithm

4.1. Implementation by Interference of Light

Mach–Zehnder interferometers (MZIs), which are extensively utilized in optical modulators [89], optical communication [90], and photonic computing [91], are crucial components of optical interference-based networks. By using attenuators and phase shifters on the MZI arms to control the weights, thereby changing the phase and amplitude of the optical signal, the MZI can act as a natural matrix-multiplication unit without fundamental loss.
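For concreteness, a lossless MZI with two 50:50 couplers, an internal phase theta, and an external phase shifter phi realizes a programmable 2 × 2 unitary; the sketch below uses one common sign/phase convention (several equivalent ones exist) and simply verifies unitarity.

```python
import numpy as np

def mzi_transfer(theta, phi):
    """2x2 transfer matrix of an MZI built from two 50:50 couplers with an
    internal phase `theta` and an external phase shifter `phi` (one common
    convention among several; the overall phase reference is arbitrary)."""
    bs = np.array([[1, 1j], [1j, 1]]) / np.sqrt(2)   # 50:50 coupler
    inner = np.diag([np.exp(1j * theta), 1.0])       # internal phase arm
    outer = np.diag([np.exp(1j * phi), 1.0])         # external phase shifter
    return outer @ bs @ inner @ bs

T = mzi_transfer(theta=0.7, phi=1.3)
print(np.allclose(T.conj().T @ T, np.eye(2)))        # unitary (lossless): True
```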
In 2017, Shen et al. utilized Singular Value Decomposition (SVD)-based methodology to implement the coherent optical computing architecture [28], as shown in Figure 10a,b. Although SVD-based matrix multiplication was demonstrated by reference [92], this was the first practical and advanced demonstration using a Photonic Integrated Circuit (PIC), where the layers of this architecture consist of an Optical Interference Unit (a cascaded MZI mesh using SVD) and an Optical Nonlinearity Unit (implemented with digital electronics).
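The mapping used by such coherent architectures can be sketched numerically: the weight matrix is factorized as W = UΣV†, the two unitaries are each assigned to an MZI mesh, and Σ to a line of attenuators or amplifiers. The snippet below checks only the linear-algebra step; decomposing U and V† into individual MZI phase settings is a further step not shown.

```python
import numpy as np

# Sketch of the SVD mapping used by coherent MZI architectures: a weight
# matrix W is factorized as W = U @ diag(s) @ Vh, where the unitaries U and
# Vh are each realized by a mesh of MZIs and diag(s) by attenuators or gain.
rng = np.random.default_rng(0)
W = rng.standard_normal((4, 4))

U, s, Vh = np.linalg.svd(W)
x = rng.standard_normal(4)               # input vector (optical amplitudes)

y_direct = W @ x
y_photonic = U @ (s * (Vh @ x))          # Vh-mesh -> attenuators -> U-mesh
print(np.allclose(y_direct, y_photonic)) # True
```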
Although MZIs have shown promising potential in overcoming the bottlenecks of state-of-the-art electronics, they still suffer from a larger footprint than their electronic counterparts and from the accumulation of phase errors; hence, different approaches have recently been proposed to reduce the area cost of MZI meshes. The authors of reference [93] designed a slimmed architecture using a sparse tree network block, a single unitary block, and a diagonal block for each neural network layer, as shown in Figure 10c. They co-designed the optical hardware and the software training implementation and achieved area savings of 15–38% for MZI-based ONNs of various sizes. Another work demonstrated a similar idea that uses the fast Fourier transform (FFT) to reduce area cost [94] (see Figure 10d), where the structured neural network is friendly to hardware implementation while greatly reducing the computational complexity, offering motivation to prune SVD-based ONNs. In this work, the authors used an optical FFT (OFFT) and its inverse (OIFFT) to implement a structured neural network with circulant matrix representation and pruned the weight matrices by group lasso regularization.
In silicon photonics, it is less common to implement noncoherent architectures with MZIs, whereas in optical fiber systems MZIs often work as intensity modulators [95], allowing the construction of a recurrent neural network in combination with an arbitrary waveform generator. Returning to silicon photonics, RNNs can likewise be built using SOA-MZI units, as shown in Figure 2b, in which the SOA is placed on the arms of the MZI, acting as a wavelength converter via cross-gain modulation [21] and enabling all-optical WDM functionality.

4.2. Implementation by Resonance of Light

The schemes realized by the resonance of light are noncoherent and can be implemented using microrings for WDM matrix–vector multiplication (WDM-MVM). The “broadcast-and-weight” architecture based on the WDM concept was first introduced by Tait in 2014 [11], where optical signals are modulated in parallel and reconfigured by tuning MRs that operate only at specific wavelengths. The WDM concept provides a feasible direction for constructing neuromorphic computing architectures based on microrings, since the compactness of microrings can significantly reduce the footprint and improve integration.
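Extending the single weighted sum sketched in Section 2.1, a behavioural sketch of the WDM matrix–vector multiplication is given below: the input vector is broadcast on N wavelengths to M MRR weight banks, and each bank's balanced detector produces one output element. Device effects such as inter-channel crosstalk and finite resonator finesse are deliberately ignored, and the names are illustrative.

```python
import numpy as np

def broadcast_and_weight_mvm(x, W):
    """Behavioural sketch of a WDM matrix-vector multiply.

    The input vector x is broadcast on N wavelengths to M weight banks; the
    MRR bank of output neuron m applies row W[m, :] and a balanced detector
    sums the result. Signed weights are assumed to be realized as in the
    weighting sketch above; device effects are ignored.
    """
    x = np.asarray(x, dtype=float)
    W = np.clip(np.asarray(W, dtype=float), -1.0, 1.0)
    return np.array([np.dot(row, x) for row in W])  # one detector per bank

x = np.array([0.2, 0.9, 0.4])
W = np.array([[0.5, -0.1, 0.3],
              [-0.7, 0.6, 0.2]])
print(broadcast_and_weight_mvm(x, W))   # same as W @ x for in-range weights
```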
“Broadcast-and-weight” architectures are widely used to implement MLPs, CNNs, and SNNs, and several variations have been proposed. The “hitless weight-and-aggregate” architecture, which aims to overcome weight corruption caused by thermal effects, isolates each weight and tunes it independently [96], as shown in Figure 11a; the microring silicon photonic architecture was co-designed with an FPGA, providing a way to construct large-scale matrix multiplication using MRRs in the wavelength domain and reducing the system decomposition complexity. Combining MRs with memristors to implement a CNN is a novel approach based on the MR weight bank [97], in which the convolution layer mainly consists of a weight resistor array, a photonic weight bank, and an SRAM buffer. The MR bank receives weights through memristors and stores them in the SRAM buffer. The Convlight architecture proposed in this work is the first scalable memristor-integrated photonic CNN accelerator of its kind. Furthermore, Xu et al., as discussed before, proposed using microcombs to enable broad-bandwidth convolutional ONNs [8]; the schematic of the photonic CNN is shown in Figure 11b.
SNNs can be implemented with or without the “broadcast-and-weight” protocol. Reference [98] explored the possibility of implementing an SNN based on the “broadcast-and-weight” protocol. Beyond that architecture, in 2019, Feldmann et al. used microrings integrated with PCM cells to implement an all-optical spiking neurosynaptic network [18], which controls the propagation of light through the ports of the PCM cells by simply altering the PCM’s state. RC can also be implemented with MRs; for example, reference [99] built a 4 × 4 swirl reservoir on a silicon platform with nodes consisting of nonlinear microring resonators. An analysis of performance on nonlinear Boolean problems was presented to show the capability of this reservoir, which differs from conventional swirl-topology reservoir architectures in setting its nodes near instability.

4.3. Algorithm

Previously, most ONN training was performed with digital electronics, as backpropagation in the silicon photonic domain remains a challenge: the nonlinearity applied in the backward direction must be the gradient of the forward nonlinearity. Another problem with online training is that each local optical parameter, such as the light intensity in a given section, must be probed within the integrated photonic circuit.
In order to address the second problem, Hughes et al. proposed a photonic analog of the backpropagation algorithm using adjoint variable methods, as shown in Figure 12c–e [100]. The adjoint variable method is a technique previously utilized in the optimization and inverse design of photonic structures [101], and because it is derived from Maxwell’s equations, this training algorithm is effective for integrated photonic ANNs as well as other photonic platforms.
Statistical optimization tools can be alternatives to backpropagation; for example, a genetic algorithm and particle swarm optimization were used to train the hyperparameters and optimize the weights in reference [102], as shown in Figure 12a,b. These two algorithms are both gradient-free and are effective for classification tasks on different datasets in a trained ONN. Bayesian optimization has also been used for reservoir computing in several simulations [103] because it provides a better grasp of the significance of the various hyperparameters. Figure 12f–i shows Bayesian optimization on a toy 1D problem.
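To indicate the flavour of such gradient-free training, the sketch below uses a minimal (1+1) evolutionary search over the tunable photonic parameters, standing in for the genetic and particle-swarm optimizers of reference [102]; the loss function, parameter vector, and hyperparameters are all hypothetical.

```python
import numpy as np

def gradient_free_train(loss_fn, theta0, sigma=0.1, iters=200, seed=0):
    """Minimal (1+1) evolutionary search, in the same gradient-free spirit as
    the genetic and particle-swarm optimizers used for ONN training. Here
    `theta` stands for the tunable photonic parameters (e.g. phase-shifter
    settings); only loss evaluations, not gradients, are required.
    """
    rng = np.random.default_rng(seed)
    theta = np.asarray(theta0, dtype=float)
    best = loss_fn(theta)
    for _ in range(iters):
        candidate = theta + sigma * rng.standard_normal(theta.shape)
        value = loss_fn(candidate)
        if value < best:                 # keep the mutation only if it helps
            theta, best = candidate, value
    return theta, best

# Toy example: recover hypothetical target phases by minimizing a squared
# error "measured" at the chip output.
target = np.array([0.3, -1.2, 2.0])
loss = lambda th: float(np.sum((th - target) ** 2))
theta, best = gradient_free_train(loss, np.zeros(3))
print(np.round(theta, 2), round(best, 4))
```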
In RC networks, nonlinearity inversion can be realized: in the training method of reference [105], the reservoir states are estimated through a single photodetector at the output, which includes an approximate inversion of the photodetector nonlinearity. This restores the ability to observe the state of the reservoir, which is necessary for many linear training algorithms, even though all-optical RC networks use fewer photodetectors (a single detector receives the weighted sum of all the optical signals). Moreover, as discussed, STDP is a crucial and popular synaptic-weight plasticity model used to simulate the synaptic plasticity between neurons, and thus unsupervised learning using the STDP mechanism to train spiking neural networks (SNNs) is the most common approach in recent work.
Moreover, how to reduce and resolve inherent optoelectronic stochasticity and non-ideality are both key issues for photonic integrated circuits. On the one hand, the accumulation of errors in photonic systems, due to their analog nature and the abundance of optoelectronic noise, can drastically affect their performance; on the other hand, manufacturing variations and thermal crosstalk limit their practicality and performance. Hence, Wu et al. proposed a photonic generative adversarial network (GAN) and corresponding noise-aware training approaches [104]. After training with noise-aware methodologies, namely the input-compensatory approach (IC-GAN) and the kernel-weight-compensatory approach (WC-GAN), the photonic generative network can not only withstand but even profit from a certain degree of hardware noise, as shown in Figure 12j–l. Previous research also showed several offline noise-aware training schemes, including injecting noise into layer inputs, synaptic weights, and preactivations [106]. Furthermore, the first self-calibrated photonic chip was demonstrated by Xu and his colleagues [107]. As discussed, self-calibration is imperative in both electronics and photonics because it can guarantee stable device performance. The self-calibration technique incorporates an optical reference path into the PIC and uses the Kramers–Kronig relation to recover the phase response from amplitude measurements, achieving a fast-converging self-calibration algorithm.
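A minimal sketch of the noise-injection idea behind such noise-aware training is given below: Gaussian perturbations are added to the weights and the detected output during the forward pass, so that parameters found by training remain accurate under deployment-time noise. The noise model and magnitudes are illustrative simplifications, not those of references [104,106].

```python
import numpy as np

def noisy_forward(W, x, weight_sigma=0.05, output_sigma=0.01, rng=None):
    """One noise-aware forward pass: Gaussian perturbations are injected into
    the synaptic weights and the detected output, mimicking (in a simplified
    way) the hardware non-idealities targeted by noise-aware training; the
    noise magnitudes are illustrative.
    """
    rng = rng or np.random.default_rng()
    W_noisy = W + weight_sigma * rng.standard_normal(W.shape)
    y = W_noisy @ x
    return y + output_sigma * rng.standard_normal(y.shape)

# Training with such a forward pass (averaging the loss over noise samples)
# encourages parameters that remain accurate under deployment-time noise.
rng = np.random.default_rng(1)
W = rng.standard_normal((2, 3))
x = np.array([0.5, -0.2, 1.0])
print(noisy_forward(W, x, rng=rng))
```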
In addition to the methods above, forward propagation in ONNs is much easier to implement and can be computed in constant time with very low power consumption. However, although it is simple in form and convenient to use, it can only be executed at a faster rate in some simple ONNs or very deep RNNs.

5. Outlook and Discussion

In this paper, we reviewed in detail the recent advances achieved in neuromorphic computing based on silicon photonics, presenting an overview of micro-architecture functionalities, devices, architectures, and algorithms. Silicon photonics, which has been explored for a long time, shows great promise for implementing neuromorphic computing since it offers sufficient integration and maturity for current photonic computing. Hence, neuromorphic silicon photonics is an emerging field that combines the speed and parallelism of photonics with the adaptiveness of deep learning, and it can theoretically be orders of magnitude ahead of traditional electronics. The utilization of new concepts such as WDM, novel devices such as PCMs, soliton microcombs, and metasurfaces, viable fabrication techniques, and advanced algorithms can allow for unprecedented developments in the next generation of optical neural networks. Here, we summarize the current challenges and point out possible opportunities for the materialization of future optical neural networks.
Electronic–photonic co-design: The electronic–photonic co-designed neural network is the more practical route for current ANNs until a competitive alternative to electronic control emerges in the optical domain. Although monolithic fabrication provides excellent opportunities to integrate electronics and photonics on the same substrate, the latency and power consumption of the electronic components pose challenges for electronic controllers. In ONNs, the controller should manage photonic devices and maintain stable operation of the neurons in real time, at high speed and with high efficiency.
Lasers: Light generation is an essential requirement for an optical computing system. Light sources are required for supplying the modulated input signals or carrying out the nonlinear activation in optical computing, from straightforward operations such as MVM to complex computations such as AI algorithms. However, at face value, silicon appears to be a poor choice for applications such as light sources, modulators, and photodetectors because of its low electro-optic (EO) coefficients and indirect bandgap. Thanks to an abundance of research and development from both academia and industry, on-chip silicon lasers have been developed using hybrid integration, heterogeneous integration based on wafer bonding, and monolithic integration based on direct epitaxial growth. Alternatively, due to the inherently inhomogeneously broadened gain profiles of QD lasers [108], they can achieve the coveted microcomb emission with high-power flat-top spectra, making them ideally suited for WDM architectures [109].
On-chip integration: Note that on-chip ONNs are the mainstream in current research because mature CMOS technology has enormous advantages for large-scale and highly integrated ONNs. However, the cost of on-chip optical networks is extremely high in terms of capital, labor, and technology requirements. For instance, coherent architectures, which are promising candidates for integrated fabrication, are hampered by the issues with MZIs, namely their large area requirement and phase-noise corruption. In addition, on-chip integration suffers from lifetime instability due to thermal crosstalk and manufacturing process variations for many other photonic devices such as MRs and PCMs. Moreover, silicon on-chip lasers are susceptible to environmental factors and remain an obstacle in academia.
Training: As discussed before, the training process is usually completed on digital computers in many works. On the one hand, it is crucial to devise an effective training method for current ONNs that can work in the optical domain in real time. On the other hand, exploring photonic architectures that can efficiently support neural network training is promising but challenging, as backpropagation imposes additional requirements on current photonic neural networks. The training of networks based on scattering and diffraction of light can serve as a reference for silicon-based neuromorphic photonics. Recent work using metasurfaces to implement diffractive neural networks has taken an important step and shown possibilities for future training methods [32]. Moreover, the photonic accelerator based on memristor hybrid hardware supports backpropagation [110], which can also serve as a useful example.
Scalability: Scalability is the most evident gap between ONNs and electronic ANNs. The advances achieved in ONNs are undeniable, but most demonstrations remain small-scale compared with electronic ANNs, which can have millions of weight parameters. The most direct approach to this problem is to optimize and improve the optical components themselves. In addition, network structures better suited to optics need to be proposed to reduce network complexity; these could pave the way toward larger photonic networks, as illustrated by the parallelism sketch below.
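The parallelism argument behind scaling via WDM can be illustrated with the following minimal numerical sketch, which assumes an idealized, lossless broadcast-and-weight model introduced here only for illustration: each input is carried on its own wavelength, each output neuron applies a bank of microring transmissions (one per wavelength), and a single photodetection per output sums all weighted channels, so widening the network adds wavelengths rather than sequential operations.

```python
import numpy as np

rng = np.random.default_rng(2)

n_wavelengths = 16   # input dimension = number of WDM channels
n_neurons = 4        # output dimension = number of weight banks / detectors

x = rng.uniform(0, 1, n_wavelengths)                  # optical power carried on each wavelength
T = rng.uniform(0, 1, (n_neurons, n_wavelengths))     # microring transmissions (weights in [0, 1])

# Each detector integrates the total power of its weighted WDM comb in one shot:
y = np.array([np.sum(T[k] * x) for k in range(n_neurons)])

assert np.allclose(y, T @ x)   # i.e., a full matrix-vector multiply per parallel detection step
print(y)
```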
Looking ahead, numerous difficulties remain to be addressed. Nonetheless, photonic neural networks have found applications in many fields beyond the reach of traditional computing technology, such as intelligent signal processing, high-performance computing, nonlinear programming, and control, and these promising prospects motivate further exploration of optical neural networks.

Author Contributions

Conceptualization, R.X.; writing—original draft preparation, B.X.; writing—review and editing, B.X., Y.H., Y.F., Z.W., S.Y., and R.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Key R&D Program of China (2021ZD0109904) and a start-up fund from the Hong Kong University of Science and Technology (Guangzhou).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Sherstinsky, A. Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) Network. Phys. D Nonlinear Phenom. 2020, 404, 132306. [Google Scholar] [CrossRef]
  2. Alzubaidi, L.; Zhang, J.; Humaidi, A.J.; Al-Dujaili, A.; Duan, Y.; Al-Shamma, O.; Santamaría, J.; Fadhel, M.A.; Al-Amidie, M.; Farhan, L. Review of Deep Learning: Concepts, CNN Architectures, Challenges, Applications, Future Directions. J. Big Data 2021, 8, 53. [Google Scholar] [CrossRef] [PubMed]
  3. Akopyan, F.; Sawada, J.; Cassidy, A.; Alvarez-Icaza, R.; Arthur, J.; Merolla, P.; Imam, N.; Nakamura, Y.; Datta, P.; Nam, G.-J.; et al. TrueNorth: Design and Tool Flow of a 65 mW 1 Million Neuron Programmable Neurosynaptic Chip. IEEE Trans. Comput. Des. Integr. Circuits Syst. 2015, 34, 1537–1557. [Google Scholar] [CrossRef]
  4. Thomas, A.; Niehörster, S.; Fabretti, S.; Shepheard, N.; Kuschel, O.; Küpper, K.; Wollschläger, J.; Krzysteczko, P.; Chicca, E. Tunnel Junction Based Memristors as Artificial Synapses. Front. Neurosci. 2015, 9, 241. [Google Scholar] [CrossRef]
  5. Kalikka, J.; Akola, J.; Jones, R.O. Simulation of Crystallization in Ge2Sb2Te5: A Memory Effect in the Canonical Phase-Change Material. Phys. Rev. B 2014, 90, 184109. [Google Scholar] [CrossRef]
  6. Morozovska, A.N.; Kalinin, S.V.; Yelisieiev, M.E.; Yang, J.; Ahmadi, M.; Eliseev, E.A.; Evans, D.R. Dynamic Control of Ferroionic States in Ferroelectric Nanoparticles. Acta Mater. 2022, 237, 118138. [Google Scholar] [CrossRef]
  7. Zheng, Y.; Wu, Y.; Li, K.; Qiu, J.; Han, G.; Guo, Z.; Luo, P.; An, L.; Liu, Z.; Wang, L.; et al. Magnetic Random Access Memory (MRAM). J. Nanosci. Nanotechnol. 2007, 7, 117–137. [Google Scholar] [CrossRef]
  8. Xu, X.; Tan, M.; Corcoran, B.; Wu, J.; Boes, A.; Nguyen, T.G.; Chu, S.T.; Little, B.E.; Hicks, D.G.; Morandotti, R.; et al. 11 TOPS Photonic Convolutional Accelerator for Optical Neural Networks. Nature 2021, 589, 44–51. [Google Scholar] [CrossRef]
  9. Sunny, F.P.; Taheri, E.; Nikdast, M.; Pasricha, S. A Survey on Silicon Photonics for Deep Learning. ACM J. Emerg. Technol. 2021, 17, 1–57. [Google Scholar] [CrossRef]
  10. Tait, A.N.; de Lima, T.F.; Zhou, E.; Wu, A.X.; Nahmias, M.A.; Shastri, B.J.; Prucnal, P.R. Neuromorphic Photonic Networks Using Silicon Photonic Weight Banks. Sci. Rep. 2017, 7, 7430. [Google Scholar] [CrossRef] [Green Version]
  11. Tait, A.N.; Nahmias, M.A.; Shastri, B.J.; Prucnal, P.R. Broadcast and Weight: An Integrated Network For Scalable Photonic Spike Processing. J. Light. Technol. 2014, 32, 4029–4041. [Google Scholar] [CrossRef]
  12. Cheng, Z.; Ríos, C.; Pernice, W.H.P.; Wright, C.D.; Bhaskaran, H. On-Chip Photonic Synapse. Sci. Adv. 2017, 3, e1700160. [Google Scholar] [CrossRef] [PubMed]
  13. Teo, T.Y.; Ma, X.; Pastor, E.; Wang, H.; George, J.K.; Yang, J.K.W.; Wall, S.; Miscuglio, M.; Simpson, R.E.; Sorger, V.J. Programmable Chalcogenide-Based All-Optical Deep Neural Networks. Nanophotonics 2022, 11, 4073–4088. [Google Scholar] [CrossRef]
  14. Ríos, C.; Stegmaier, M.; Hosseini, P.; Wang, D.; Scherer, T.; Wright, C.D.; Bhaskaran, H.; Pernice, W.H.P. Integrated All-Photonic Non-Volatile Multi-Level Memory. Nat. Photon. 2015, 9, 725–732. [Google Scholar] [CrossRef]
  15. Yu, T.; Ma, X.; Pastor, E.; George, J.; Wall, S.; Miscuglio, M.; Simpson, R.; Sorger, V. All-Chalcogenide Programmable All-Optical Deep Neural Networks. arXiv 2021, arXiv:2102.10398. [Google Scholar]
  16. Yang, J.J.; Strukov, D.B.; Stewart, D.R. Memristive Devices for Computing. Nat. Nanotechnol. 2012, 8, 13–24. [Google Scholar] [CrossRef]
  17. Robertson, J.; Wade, E.; Hurtado, A. Electrically Controlled Neuron-Like Spiking Regimes in Vertical-Cavity Surface-Emitting Lasers at Ultrafast Rates. IEEE J. Sel. Top. Quantum Electron. 2019, 25, 5100307. [Google Scholar] [CrossRef]
  18. Feldmann, J.; Youngblood, N.; Wright, C.D.; Bhaskaran, H.; Pernice, W.H.P. All-Optical Spiking Neurosynaptic Networks with Self-Learning Capabilities. Nature 2019, 569, 208–214. [Google Scholar] [CrossRef]
  19. Williamson, I.A.D.; Hughes, T.W.; Minkov, M.; Bartlett, B.; Pai, S.; Fan, S. Reprogrammable Electro-Optic Nonlinear Activation Functions for Optical Neural Networks. IEEE J. Sel. Top. Quantum Electron. 2019, 26, 7700412. [Google Scholar] [CrossRef]
  20. Amin, R.; George, J.K.; Sun, S.; de Lima, T.F.; Tait, A.N.; Khurgin, J.B.; Miscuglio, M.; Shastri, B.J.; Prucnal, P.R.; El-Ghazawi, T.; et al. ITO-Based Electro-Absorption Modulator for Photonic Neural Activation Function. APL Mater. 2019, 7, 081112. [Google Scholar] [CrossRef]
  21. Mourgias-Alexandris, G.; Dabos, G.; Passalis, N.; Totovi, A.; Tefas, A.; Pleros, N. All-Optical WDM Recurrent Neural Networks with Gating. IEEE J. Sel. Top. Quantum Electron. 2019, 26, 6100907. [Google Scholar] [CrossRef]
  22. Robertson, J.; Wade, E.; Kopp, Y.; Bueno, J.; Hurtado, A. Toward Neuromorphic Photonic Networks of Ultrafast Spiking Laser Neurons. IEEE J. Sel. Top. Quantum Electron. 2019, 26, 7700715. [Google Scholar] [CrossRef]
  23. Robertson, J.; Deng, T.; Javaloyes, J.; Hurtado, A. Controlled Inhibition of Spiking Dynamics in VCSELs for Neuromorphic Photonics: Theory and Experiments. Opt. Lett. 2017, 42, 1560–1563. [Google Scholar] [CrossRef] [PubMed]
  24. Xiang, S.; Zhang, Y.; Guo, X.; Wen, A.; Hao, Y. Photonic Generation of Neuron-Like Dynamics Using VCSELs Subject to Double Polarized Optical Injection. J. Light. Technol. 2018, 36, 4227–4234. [Google Scholar] [CrossRef]
  25. Finelli, L.A.; Haney, S.; Bazhenov, M.; Stopfer, M.; Sejnowski, T.J. Synaptic Learning Rules and Sparse Coding in a Model Sensory System. PLoS Comput. Biol. 2008, 4, e1000062. [Google Scholar] [CrossRef]
  26. Toole, R.; Tait, A.N.; de Lima, T.F.; Nahmias, M.A.; Shastri, B.J.; Prucnal, P.R.; Fok, M.P. Photonic Implementation of Spike-Timing-Dependent Plasticity and Learning Algorithms of Biological Neural Systems. J. Light. Technol. 2015, 34, 470–476. [Google Scholar] [CrossRef]
  27. Mesaritakis, C.; Skontranis, M.; Sarantoglou, G.; Bogris, A. Micro-Ring-Resonator Based Passive Photonic Spike-Time-Dependent-Plasticity Scheme for Unsupervised Learning in Optical Neural Networks. In Proceedings of the 2020 Optical Fiber Communications Conference and Exhibition (OFC), San Diego, CA, USA, 8–12 March 2020; p. t4c.2. [Google Scholar] [CrossRef]
  28. Shen, Y.; Harris, N.C.; Skirlo, S.; Prabhu, M.; Baehr-Jones, T.; Hochberg, M.; Sun, X.; Zhao, S.; Larochelle, H.; Englund, D.; et al. Deep Learning with Coherent Nanophotonic Circuits. Nat. Photon. 2017, 11, 441–446. [Google Scholar] [CrossRef]
  29. Giewont, K.; Hu, S.; Peng, B.; Rakowski, M.; Rauch, S.; Rosenberg, J.C.; Sahin, A.; Stobert, I.; Stricker, A.; Nummy, K.; et al. 300-Mm Monolithic Silicon Photonics Foundry Technology. IEEE J. Sel. Top. Quantum Electron. 2019, 25, 8200611. [Google Scholar] [CrossRef]
  30. Xu, X.; Tan, M.; Wu, J.; Boes, A.; Corcoran, B.; Nguyen, T.G.; Chu, S.T.; Little, B.E.; Morandotti, R.; Mitchell, A.; et al. Photonic Perceptron Based on a Kerr Microcomb for High-Speed, Scalable, Optical Neural Networks. Laser Photon. Rev. 2020, 14, 2000070. [Google Scholar] [CrossRef]
  31. Feldmann, J.; Youngblood, N.; Karpov, M.; Gehring, H.; Li, X.; Stappers, M.; Gallo, M.L.; Fu, X.; Lukashchuk, A.; Raja, A.S.; et al. Parallel Convolutional Processing Using an Integrated Photonic Tensor Core. Nature 2021, 589, 52–58. [Google Scholar] [CrossRef]
  32. Luo, X.; Hu, Y.; Ou, X.; Li, X.; Lai, J.; Liu, N.; Cheng, X.; Pan, A.; Duan, H. Metasurface-Enabled on-Chip Multiplexed Diffractive Neural Networks in the Visible. Light. Sci. Appl. 2022, 11, 158. [Google Scholar] [CrossRef] [PubMed]
  33. Kippenberg, T.J.; Gaeta, A.L.; Lipson, M.; Gorodetsky, M.L. Dissipative Kerr Solitons in Optical Microresonators. Science 2018, 361. [Google Scholar] [CrossRef] [PubMed]
  34. Wu, J.; Xu, X.; Nguyen, T.G.; Chu, S.T.; Little, B.E.; Morandotti, R.; Mitchell, A.; Moss, D.J. RF Photonics: An Optical Microcombs’ Perspective. IEEE J. Sel. Top. Quantum Electron. 2018, 24, 6101020. [Google Scholar] [CrossRef]
  35. Riemensberger, J.; Lukashchuk, A.; Karpov, M.; Weng, W.; Lucas, E.; Liu, J.; Kippenberg, T.J. Massively Parallel Coherent Laser Ranging Using a Soliton Microcomb. Nature 2020, 581, 164–170. [Google Scholar] [CrossRef]
  36. Spencer, D.T.; Drake, T.; Briles, T.C.; Stone, J.; Sinclair, L.C.; Fredrick, C.; Li, Q.; Westly, D.; Ilic, B.R.; Bluestone, A.; et al. An Optical-Frequency Synthesizer Using Integrated Photonics. Nature 2018, 557, 81–85. [Google Scholar] [CrossRef] [PubMed]
  37. Chang, L.; Liu, S.; Bowers, J.E. Integrated Optical Frequency Comb Technologies. Nat. Photon. 2022, 16, 95–108. [Google Scholar] [CrossRef]
  38. Lin, G.; Song, Q. Kerr Frequency Comb Interaction with Raman, Brillouin, and Second Order Nonlinear Effects. Laser Photon. Rev. 2022, 16, 2100184. [Google Scholar] [CrossRef]
  39. Kippenberg, T.J.; Spillane, S.M.; Vahala, K.J. Kerr-Nonlinearity Optical Parametric Oscillation in an Ultrahigh-Q Toroid Microcavity. Phys. Rev. Lett. 2004, 93, 083904. [Google Scholar] [CrossRef]
  40. Boyraz, O.; Jalali, B. Demonstration of a Silicon Raman Laser. Opt Express 2004, 12, 5269. [Google Scholar] [CrossRef]
  41. Fang, A.W.; Park, H.; Cohen, O.; Jones, R.; Paniccia, M.J.; Bowers, J.E. Electrically Pumped Hybrid AlGaInAs-Silicon Evanescent Laser. Opt. Express 2006, 14, 9203. [Google Scholar] [CrossRef]
  42. Rong, H.; Jones, R.; Liu, A.; Cohen, O.; Hak, D.; Fang, A.; Paniccia, M. A Continuous-Wave Raman Silicon Laser. Nature 2005, 433, 725–728. [Google Scholar] [CrossRef] [PubMed]
  43. Del’Haye, P.; Schliesser, A.; Arcizet, O.; Wilken, T.; Holzwarth, R.; Kippenberg, T.J. Optical Frequency Comb Generation from a Monolithic Microresonator. Nature 2007, 450, 1214–1217. [Google Scholar] [CrossRef] [PubMed]
  44. Herr, T.; Brasch, V.; Jost, J.D.; Wang, C.Y.; Kondratiev, N.M.; Gorodetsky, M.L.; Kippenberg, T.J. Temporal Solitons in Optical Microresonators. Nat. Photon. 2014, 8, 145–152. [Google Scholar] [CrossRef]
  45. Shen, B.; Chang, L.; Liu, J.; Wang, H.; Yang, Q.-F.; Xiang, C.; Wang, R.N.; He, J.; Liu, T.; Xie, W.; et al. Integrated Turnkey Soliton Microcombs Operated at CMOS Frequencies. In Proceedings of the CLEO: Science and Innovations, Virtual, 11–15 May 2020; p. SF3O.4. [Google Scholar] [CrossRef]
  46. Antonik, P.; Marsal, N.; Brunner, D.; Rontani, D. Human Action Recognition with a Large-Scale Brain-Inspired Photonic Computer. Nat. Mach. Intell. 2019, 1, 530–537. [Google Scholar] [CrossRef]
  47. Sadegh-Bonab, S.; Alipour-Banaei, H. A Novel Proposal for an All-Optical 2-Bit Adder/Subtractor Based on Photonic Crystal Ring Resonators. Photon. Nanostructures-Fundam. Appl. 2020, 39, 100777. [Google Scholar] [CrossRef]
  48. Ghadi, A. All-Optical Computing Circuits Half-Subtractor and Comparator Based on Soliton Interactions. Optik 2021, 227, 166079. [Google Scholar] [CrossRef]
  49. Silva, N.A.; Ferreira, T.D.; Guerreiro, A. Reservoir Computing with Solitons. New J. Phys. 2021, 23, 023013. [Google Scholar] [CrossRef]
  50. Stegmaier, M.; Ríos, C.; Bhaskaran, H.; Wright, C.D.; Pernice, W.H.P. Nonvolatile All-Optical 1 × 2 Switch for Chipscale Photonic Networks. Adv. Opt. Mater. 2017, 5, 1600346. [Google Scholar] [CrossRef]
  51. Zhang, Y.; Chou, J.B.; Li, J.; Li, H.; Du, Q.; Yadav, A.; Zhou, S.; Shalaginov, M.Y.; Fang, Z.; Zhong, H.; et al. Broadband Transparent Optical Phase Change Materials for High-Performance Nonvolatile Photonics. Nat. Commun. 2019, 10, 4279. [Google Scholar] [CrossRef] [Green Version]
  52. Jia, W.; Menon, R.; Sensale-Rodriguez, B. Unique Prospects of Phase Change Material Sb2Se3 for Ultra-Compact Reconfigurable Nanophotonic Devices. Opt. Mater. Express 2021, 11, 3007. [Google Scholar] [CrossRef]
  53. Lawson, D.; Hewak, D.W.; Muskens, O.L.; Zeimpekis, I. Time-Resolved Reversible Optical Switching of the Ultralow-Loss Phase Change Material Sb2Se3. J. Opt. 2022, 24, 064013. [Google Scholar] [CrossRef]
  54. Fang, Z.; Zheng, J.; Saxena, A.; Whitehead, J.; Chen, Y.; Majumdar, A. Non-Volatile Reconfigurable Integrated Photonics Enabled by Broadband Low-Loss Phase Change Material. Adv. Opt. Mater. 2021, 9, 2002049. [Google Scholar] [CrossRef]
  55. Yamada, N.; Ohno, E.; Akahira, N.; Nishiuchi, K.; Nagata, K.; Takao, M. High Speed Overwritable Phase Change Optical Disk Material. Jpn. J. Appl. Phys. 1987, 26, 61. [Google Scholar] [CrossRef]
  56. Liu, B.; Wei, T.; Hu, J.; Li, W.; Ling, Y.; Liu, Q.; Cheng, M.; Song, Z. Universal Memory Based on Phase-Change Materials: From Phase-Change Random Access Memory to Optoelectronic Hybrid Storage. Chin. Phys. B 2021, 30, 058504. [Google Scholar] [CrossRef]
  57. Nisar, M.S.; Yang, X.; Lu, L.; Chen, J.; Zhou, L. On-Chip Integrated Photonic Devices Based on Phase Change Materials. Photonics 2021, 8, 205. [Google Scholar] [CrossRef]
  58. Wang, X.; Qi, H.; Hu, X.; Yu, Z.; Ding, S.; Du, Z.; Gong, Q. Advances in Photonic Devices Based on Optical Phase-Change Materials. Molecules 2021, 26, 2813. [Google Scholar] [CrossRef]
  59. Chakraborty, I.; Saha, G.; Sengupta, A.; Roy, K. Toward Fast Neural Computing Using All-Photonic Phase Change Spiking Neurons. Sci. Rep. 2018, 8, 12980. [Google Scholar] [CrossRef]
  60. Li, X.; Youngblood, N.; Ríos, C.; Cheng, Z.; Wright, C.D.; Pernice, W.H.; Bhaskaran, H. Fast and Reliable Storage Using a 5 Bit, Nonvolatile Photonic Memory Cell. Optica 2018, 6, 1–6. [Google Scholar] [CrossRef]
  61. Lee, J.S.; Farmakidis, N.; Wright, C.D.; Bhaskaran, H. Polarization-Selective Reconfigurability in Hybridized-Active-Dielectric Nanowires. Sci. Adv. 2022, 8, eabn9459. [Google Scholar] [CrossRef]
  62. Miscuglio, M.; Meng, J.; Yesiliurt, O.; Zhang, Y.; Prokopeva, L.J.; Mehrabian, A.; Hu, J.; Kildishev, A.V.; Sorger, V.J. Artificial Synapse with Mnemonic Functionality Using GSST-Based Photonic Integrated Memory. In Proceedings of the 2020 International Applied Computational Electromagnetics Society Symposium (ACES), Monterey, CA, USA, 27–31 July 2020; pp. 1–3. [Google Scholar] [CrossRef]
  63. Pernice, W.H.P.; Bhaskaran, H. Photonic Non-Volatile Memories Using Phase Change Materials. Appl. Phys. Lett. 2012, 101, 171101. [Google Scholar] [CrossRef]
  64. Rios, C.; Hosseini, P.; Wright, C.D.; Bhaskaran, H.; Pernice, W.H.P. On-Chip Photonic Memory Elements Employing Phase-Change Materials. Adv Mater 2014, 26, 1372–1377. [Google Scholar] [CrossRef]
  65. Babashah, H.; Kavehvash, Z.; Koohi, S.; Khavasi, A. Integration in Analog Optical Computing Using Metasurfaces Revisited: Toward Ideal Optical Integration. J. Opt. Soc. Am. B 2017, 34, 1270. [Google Scholar] [CrossRef]
  66. Sol, J.; Smith, D.R.; del Hougne, P. Meta-Programmable Analog Differentiator. Nat. Commun. 2022, 13, 1713. [Google Scholar] [CrossRef]
  67. Lin, X.; Rivenson, Y.; Yardimci, N.T.; Veli, M.; Luo, Y.; Jarrahi, M.; Ozcan, A. All-Optical Machine Learning Using Diffractive Deep Neural Networks. Science 2018, 361, 1004–1008. [Google Scholar] [CrossRef]
  68. Spägele, C.; Tamagnone, M.; Kazakov, D.; Ossiander, M.; Piccardo, M.; Capasso, F. Multifunctional Wide-Angle Optics and Lasing Based on Supercell Metasurfaces. Nat. Commun. 2021, 12, 3787. [Google Scholar] [CrossRef]
  69. Burckel, D.B.; Wendt, J.R.; Eyck, G.A.T.; Ginn, J.C.; Ellis, A.R.; Brener, I.; Sinclair, M.B. Micrometer-Scale Cubic Unit Cell 3D Metamaterial Layers. Adv. Mater. 2010, 22, 5053–5057. [Google Scholar] [CrossRef]
  70. Sun, S.; Yang, K.-Y.; Wang, C.-M.; Juan, T.-K.; Chen, W.T.; Liao, C.Y.; He, Q.; Xiao, S.; Kung, W.-T.; Guo, G.-Y.; et al. High-Efficiency Broadband Anomalous Reflection by Gradient Meta-Surfaces. Nano Lett. 2012, 12, 6223–6229. [Google Scholar] [CrossRef]
  71. Zahra, S.; Ma, L.; Wang, W.; Li, J.; Chen, D.; Liu, Y.; Zhou, Y.; Li, N.; Huang, Y.; Wen, G. Electromagnetic Metasurfaces and Reconfigurable Metasurfaces: A Review. Front. Phys. 2021, 8, 593411. [Google Scholar] [CrossRef]
  72. Hu, J.; Bandyopadhyay, S.; Liu, Y.; Shao, L. A Review on Metasurface: From Principle to Smart Metadevices. Front. Phys. 2021, 8, 586087. [Google Scholar] [CrossRef]
  73. Chen, K.; Feng, Y.; Monticone, F.; Zhao, J.; Zhu, B.; Jiang, T.; Zhang, L.; Kim, Y.; Ding, X.; Zhang, S.; et al. A Reconfigurable Active Huygens’ Metalens. Adv. Mater. 2017, 29, 1606422. [Google Scholar] [CrossRef]
  74. Cong, L.; Srivastava, Y.K.; Zhang, H.; Zhang, X.; Han, J.; Singh, R. All-Optical Active THz Metasurfaces for Ultrafast Polarization Switching and Dynamic Beam Splitting. Light. Sci. Appl. 2018, 7, 28. [Google Scholar] [CrossRef] [PubMed]
  75. Rahmani, M.; Xu, L.; Miroshnichenko, A.E.; Komar, A.; Camacho-Morales, R.; Chen, H.; Zárate, Y.; Kruk, S.; Zhang, G.; Neshev, D.N.; et al. Reversible Thermal Tuning of All-Dielectric Metasurfaces. Adv. Funct. Mater. 2017, 27, 1700580. [Google Scholar] [CrossRef]
  76. Tang, S.; Cai, T.; Xu, H.-X.; He, Q.; Sun, S.; Zhou, L. Multifunctional Metasurfaces Based on the “Merging” Concept and Anisotropic Single-Structure Meta-Atoms. Appl. Sci. 2018, 8, 555. [Google Scholar] [CrossRef]
  77. Maguid, E.; Yulevich, I.; Veksler, D.; Kleiner, V.; Brongersma, M.L.; Hasman, E. Photonic Spin-Controlled Multifunctional Shared-Aperture Antenna Array. Science 2016, 352, 1202–1206. [Google Scholar] [CrossRef]
  78. Rubin, N.A.; D’Aversa, G.; Chevalier, P.; Shi, Z.; Chen, W.T.; Capasso, F. Matrix Fourier Optics Enables a Compact Full-Stokes Polarization Camera. Science 2019, 365. [Google Scholar] [CrossRef]
  79. Yoon, G.; Lee, D.; Rho, J. Demonstration of Equal-Intensity Beam Generation by Dielectric Metasurfaces. J. Vis. Exp. 2019, 148, e59066. [Google Scholar] [CrossRef]
  80. Yoon, G.; Kim, J.; Mun, J.; Lee, D.; Nam, K.T.; Rho, J. Wavelength-Decoupled Geometric Metasurfaces by Arbitrary Dispersion Control. Commun. Phys. 2019, 2, 129. [Google Scholar] [CrossRef]
  81. Mudachathi, R.; Tanaka, T. Up Scalable Full Colour Plasmonic Pixels with Controllable Hue, Brightness and Saturation. Sci. Rep. 2017, 7, 1199. [Google Scholar] [CrossRef] [Green Version]
  82. Yoon, G.; Tanaka, T.; Zentgraf, T.; Rho, J. Recent Progress on Metasurfaces: Applications and Fabrication. J. Phys. D Appl. Phys. 2021, 54, 383002. [Google Scholar] [CrossRef]
  83. Wang, Z.; Li, T.; Soman, A.; Mao, D.; Kananen, T.; Gu, T. On-Chip Wavefront Shaping with Dielectric Metasurface. Nat. Commun. 2019, 10, 3547. [Google Scholar] [CrossRef]
  84. Liao, K.; Gan, T.; Hu, X.; Gong, Q. AI-Assisted on-Chip Nanophotonic Convolver Based on Silicon Metasurface. Nanophotonics 2020, 9, 3315–3322. [Google Scholar] [CrossRef]
  85. Qian, C.; Lin, X.; Lin, X.; Xu, J.; Sun, Y.; Li, E.; Zhang, B.; Chen, H. Performing Optical Logic Operations by a Diffractive Neural Network. Light. Sci. Appl. 2020, 9, 59. [Google Scholar] [CrossRef] [PubMed]
  86. Zarei, S.; Marzban, M.; Khavasi, A. Integrated Photonic Neural Network Based on Silicon Metalines. Opt. Express 2020, 28, 36668. [Google Scholar] [CrossRef] [PubMed]
  87. Fu, T.; Zang, Y.; Huang, H.; Du, Z.; Hu, C.; Chen, M.; Yang, S.; Chen, H. On-Chip Photonic Diffractive Optical Neural Network Based on a Spatial Domain Electromagnetic Propagation Model. Opt. Express 2021, 29, 31924. [Google Scholar] [CrossRef]
  88. Wu, C.; Yu, H.; Lee, S.; Peng, R.; Takeuchi, I.; Li, M. Programmable Phase-Change Metasurfaces on Waveguides for Multimode Photonic Convolutional Neural Network. Nat. Commun. 2021, 12, 96. [Google Scholar] [CrossRef] [PubMed]
  89. Liao, L.; Samara-Rubio, D.; Morse, M.; Liu, A.; Hodge, D.; Rubin, D.; Keil, U.; Franck, T. High Speed Silicon Mach-Zehnder Modulator. Opt. Express 2005, 13, 3129–3135. [Google Scholar] [CrossRef]
  90. Su, T.; Zhang, M.; Chen, X.; Zhang, Z.; Liu, M.; Liu, L.; Huang, S. Improved 10-Gbps Uplink Transmission in WDM-PON with RSOA-Based Colorless ONUs and MZI-Based Equalizers. Opt. Laser Technol. 2013, 51, 90–97. [Google Scholar] [CrossRef]
  91. Shokraneh, F.; Geoffroy-Gagnon, S.; Nezami, M.S.; Liboiron-Ladouceur, O. A Single Layer Neural Network Implemented by a 4 × 4 MZI-Based Optical Processor. IEEE Photon. J. 2019, 11, 4501612. [Google Scholar] [CrossRef]
  92. Miller, D.A.B. Self-Configuring Universal Linear Optical Component. Photon. Res. 2013, 1, 1–15. [Google Scholar] [CrossRef]
  93. Shibuya, T.; Zhao, Z.; Liu, D.; Li, M.; Ying, Z.; Zhang, L.; Xu, B.; Yu, B.; Chen, R.T.; Pan, D.Z. Hardware-Software Co-Design of Slimmed Optical Neural Networks. In Proceedings of the 24th Asia and South Pacific Design Automation Conference, Tokyo, Japan, 21–24 January 2019; pp. 705–710. [Google Scholar] [CrossRef]
  94. Gu, J.; Zhao, Z.; Feng, C.; Liu, M.; Chen, R.T.; Pan, D.Z. Towards Area-Efficient Optical Neural Networks: An FFT-Based Architecture. In Proceedings of the 2020 25th Asia and South Pacific Design Automation Conference, Beijing, China, 13–16 January 2020; pp. 476–481. [Google Scholar] [CrossRef]
  95. Paquot, Y.; Duport, F.; Smerieri, A.; Dambre, J.; Schrauwen, B.; Haelterman, M.; Massar, S. Optoelectronic Reservoir Computing. Sci. Rep. 2012, 2, 287. [Google Scholar] [CrossRef]
  96. Cheng, Q.; Kwon, J.; Glick, M.; Bahadori, M.; Carloni, L.P.; Bergman, K. Silicon Photonics Codesign for Deep Learning. Proc. IEEE 2020, 108, 1261–1282. [Google Scholar] [CrossRef]
  97. Dang, D.; Dass, J.; Mahapatra, R. ConvLight: A Convolutional Accelerator with Memristor Integrated Photonic Computing. In Proceedings of the 2017 IEEE 24th International Conference on High Performance Computing (HiPC), Jaipur, India, 18–21 December 2017; pp. 114–123. [Google Scholar] [CrossRef]
  98. Shiflett, K.; Wright, D.; Karanth, A.; Louri, A. PIXEL: Photonic Neural Network Accelerator. In Proceedings of the 2020 IEEE International Symposium on High Performance Computer Architecture (HPCA), San Diego, CA, USA, 22–26 February 2020; pp. 474–487. [Google Scholar] [CrossRef]
  99. Coarer, F.D.-L.; Sciamanna, M.; Katumba, A.; Freiberger, M.; Dambre, J.; Bienstman, P.; Rontani, D. All-Optical Reservoir Computing on a Photonic Chip Using Silicon-Based Ring Resonators. IEEE J. Sel. Top. Quantum Electron. 2018, 24, 7600108. [Google Scholar] [CrossRef]
  100. Hughes, T.W.; Minkov, M.; Shi, Y.; Fan, S. Training of Photonic Neural Networks through in Situ Backpropagation and Gradient Measurement. Optica 2018, 5, 864. [Google Scholar] [CrossRef]
  101. Hughes, T.; Veronis, G.; Wootton, K.P.; England, R.J.; Fan, S. Method for Computationally Efficient Design of Dielectric Laser Accelerator Structures. Opt. Express 2017, 25, 15414–15427. [Google Scholar] [CrossRef] [Green Version]
  102. Zhang, T.; Wang, J.; Dan, Y.; Lanqiu, Y.; Dai, J.; Han, X.; Sun, X.; Xu, K. Efficient Training and Design of Photonic Neural Network through Neuroevolution. Opt. Express 2019, 27, 37150–37163. [Google Scholar] [CrossRef]
  103. Antonik, P.; Marsal, N.; Brunner, D.; Rontani, D. Bayesian Optimisation of Large-Scale Photonic Reservoir Computers. Cogn. Comput. 2021, 1–9. [Google Scholar] [CrossRef]
  104. Wu, C.; Yang, X.; Yu, H.; Peng, R.; Takeuchi, I.; Chen, Y.; Li, M. Harnessing Optoelectronic Noises in a Photonic Generative Network. Sci. Adv. 2022, 8, eabm2956. [Google Scholar] [CrossRef]
  105. Freiberger, M.; Katumba, A.; Bienstman, P.; Dambre, J. Training Passive Photonic Reservoirs with Integrated Optical Readout. IEEE Trans. Neural Netw. Learn. Syst. 2019, 30, 1943–1953. [Google Scholar] [CrossRef]
  106. Moon, S.; Shin, K.; Jeon, D. Enhancing Reliability of Analog Neural Network Processors. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 2019, 27, 1455–1459. [Google Scholar] [CrossRef]
  107. Xu, X.; Ren, G.; Feleppa, T.; Liu, X.; Boes, A.; Mitchell, A.; Lowery, A.J. Self-Calibrating Programmable Photonic Integrated Circuits. Nat. Photon. 2022, 16, 595–602. [Google Scholar] [CrossRef]
  108. Wan, Y.; Xiang, C.; Guo, J.; Koscica, R.; Kennedy, M.; Selvidge, J.; Zhang, Z.; Chang, L.; Xie, W.; Huang, D.; et al. High Speed Evanescent Quantum-Dot Lasers on Si. Laser Photon. Rev. 2021, 15, 2100057. [Google Scholar] [CrossRef]
  109. Wan, Y.; Inoue, D.; Jung, D.; Norman, J.C.; Shang, C.; Gossard, A.C.; Bowers, J.E. Directly Modulated Quantum Dot Lasers on Silicon with a Milliampere Threshold and High Temperature Stability. Photon. Res. 2018, 6, 776. [Google Scholar] [CrossRef]
  110. Dang, D.; Chittamuru, S.V.R.; Pasricha, S.; Mahapatra, R.; Sahoo, D. BPLight-CNN: A Photonics-Based Backpropagation Accelerator for Deep Learning. ACM J. Emerg. Technol. 2021, 17, 1–26. [Google Scholar] [CrossRef]
Figure 1. Weighting function in ONNs. (a) Concept of a broadcast-and-weight network with modulators used as neurons. MRR: microring resonator, BPD: balanced photodiode, LD: laser diode, MZM: Mach–Zehnder modulator, WDM: wavelength-division multiplexer [10]. (b) Micrograph of a 4-node recurrent broadcast-and-weight network with 16 tunable microring (MRR) weights and fiber-to-chip grating couplers [10]. (c) Schematic of the integrated photonic synapse resembling the function of the neural synapse [12]. (d) Scanning electron microscope image of the active region of the photonic synapse [12]. (e) The weighting and summation mechanisms are based on a cascade of Sb2S3-SiN hybrid photonic switches that serve the same function as an optical counterpart of an FPGA (Field Programmable Gate Array) [13]. (f) The NLAF (Non-linear Activation Function) module consists of a single-mode hybrid silicon waveguide [13]. (a,b) Adapted with permission from Ref. [10]. CC BY 4.0. (c,d) Adapted with permission from Ref. [12]. CC BY 4.0. (e,f) Adapted with permission from Ref. [13]. CC BY 4.0.
Figure 2. Activation function in ONNs. (a) Schematic of the proposed optical-to-optical activation function [19]. (b) Experimental setup used for the evaluation of the 4-input WDM RNN (Recurrent neural network) [21]. (c,d) Schematic of the network realized in this study, consisting of several pre-synaptic input neurons and one post-synaptic output neuron connected via PCM synapses [18]. (a) Reprinted with permission from Ref. [19]. Copyright © 2020, IEEE. (b) Reprinted with permission from Ref. [21]. Copyright © 2020, IEEE. (c,d) Reprinted with permission from Ref. [18], Springer Nature Limited.
Figure 3. A possible framework for optoelectronic-hybrid AI computing.
Figure 6. The development history of phase change materials.
Figure 8. Basic science of metasurfaces. (a) Reprinted with permission from Ref. [69]. Copyright © 2010 WILEY-VCH Verlag GmbH Co. KGaA, Weinheim. (b) Reprinted with permission from Ref. [70]. Copyright © 2012, American Chemical Society. (c) Reprinted with permission from Ref. [73]. © 2017 WILEY-VCH Verlag GmbH Co. KGaA, Weinheim. (d) Adapted from Ref. [74]. CC BY 4.0. (e) Reprinted with permission from Ref. [75]. © 2017 WILEY-VCH Verlag GmbH Co. KGaA, Weinheim. (f) Reprinted with permission from Ref. [79], JoVE. (g) Reprinted with permission from Ref. [80]. CC BY 4.0. (h) Reprinted with permission from Ref. [81]. CC BY 4.0.
Figure 9. Computing Based on Metasurfaces. (a) Top view of an on-chip metalens structure. The input light is fed through the waveguide on the left. As the light passes through the microlens, the output is collected through single-mode waveguides at different spatial locations [83]. (b) The SEM image of the three-layer meta-system, which can perform the spatial differentiation of the input signal [83]. (c,d) Overall view of the SEM image of the on-chip nanophotonic convolver and characteristic structure of this device [84]. (e–g) Top-view, oblique-view, and false-color cross-sectional view of the scanning electron microscope (SEM) images, respectively, of the MDNN [32]. (a,b) Adapted from Ref. [83]. CC BY 4.0. (c,d) Adapted from Ref. [84]. CC BY 4.0. (e–g) Adapted from Ref. [32]. CC BY 4.0.
Figure 10. Implementation by interference of light. (a) Optical micrograph illustration of the experimentally demonstrated OIU [28]. (b) Schematic representation of the two-layer ONN experiment [28]. (c) Proposed slimmed layer implementation [93]. (d) Schematic diagram of a single layer of the proposed architecture [94]. (a,b) Reprinted with permission from Ref. [28]. Copyright © 2017, Nature Publishing Group. (c) Reprinted with permission from Ref. [93], ACM. (d) Reprinted with permission from Ref. [94]. Copyright © 2020 IEEE.
Figure 11. Implementation by resonance of light. (a) Hitless weight-and-aggregation architecture for an M × N vector-matrix multiplier [96]. (b) Experimental schematic of the optical CNN [8]. (a) Reprinted with permission from Ref. [96]. Copyright © 2020 IEEE. (b) Reprinted with permission from Ref. [8]. Copyright © 2021, The author(s), under exclusive license to Springer Nature Limited.
Figure 12. Algorithm. (a) The flowcharts of the learning algorithms for the ONNs based on GA (genetic algorithm) [102]. (b) The flowcharts of the learning algorithms for the ONNs based on PSO (particle swarm optimization) [102]. (c–e) Schematic illustration of the proposed method for experimental measurement of gradient information [100]. (f–i) Illustration of the Bayesian optimization on a toy 1D problem [103]. (j–l) Generating handwritten numbers with GAN: forty-nine images (size, 14 × 14 pixels) generated by NF-GAN (noise free–trained GAN), IC-GAN (input-compensatory GAN), and WC-GAN (weight-compensatory GAN), respectively [104]. (a,b) Adapted from Ref. [102]. CC BY 4.0. (c–e) Adapted from Ref. [100]. CC BY 4.0. (f–i) Reprinted with permission from Ref. [103]. Copyright © 2021, Springer Science+Business Media, LLC, part of Springer Nature. (j–l) Adapted from Ref. [104]. CC BY 4.0.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
