A 2D Heterostructure-Based Multifunctional Floating Gate Memory Device for Multimodal Reservoir Computing

Zha, Jiajia; Xia, Yunpeng; Shi, Shuhui; Huang, Haoxin; Li, Siyuan; Qian, Chen; Wang, Huide; Yang, Peng; Zhang, Zhuomin; Meng, You; Wang, Wei; Yang, Zhengbao; Yu, Hongyu; Ho, Johnny C.; Wang, Zhongrui; Tan, Chaoliang

Published in:
Advanced Materials

Published: 18/01/2024

Document Version:
Final Published version, also known as Publisher's PDF, Publisher's Final version or Version of Record

License:
CC BY-NC

Publication record in CityU Scholars:
Go to record

Published version (DOI):
10.1002/adma.202308502

Publication details:

Citing this paper
Please note that where the full-text provided on CityU Scholars is the Post-print version (also known as Accepted Author Manuscript, Peer-reviewed or Author Final version), it may differ from the Final Published version. When citing, ensure that you check and use the publisher's definitive version for pagination and other details.

General rights
Copyright for the publications made accessible via the CityU Scholars portal is retained by the author(s) and/or other copyright owners and it is a condition of accessing these publications that users recognise and abide by the legal requirements associated with these rights. Users may not further distribute the material or use it for any profit-making activity or commercial gain.

Publisher permission
Permission for previously published items are in accordance with publisher's copyright policies sourced from the SHERPA RoMEO database. Links to full text versions (either Published or Post-print) are only available if corresponding publishers allow open access.

Take down policy
Contact lbscholars@cityu.edu.hk if you believe that this document breaches copyright and provide us with details. We will remove access to the work immediately and investigate your claim.
A 2D Heterostructure-Based Multifunctional Floating Gate Memory Device for Multimodal Reservoir Computing

Jiajia Zha, Yunpeng Xia, Shuhui Shi, Haoxin Huang, Siyuan Li, Chen Qian, Huide Wang, Peng Yang, Zhuomin Zhang, You Meng, Wei Wang, Zhengbao Yang, Hongyu Yu, Johnny C. Ho,* Zhongrui Wang,* and Chaoliang Tan*

The demand for economical and efficient data processing has led to a surge of interest in neuromorphic computing based on emerging two-dimensional (2D) materials in recent years. As a rising van der Waals (vdW) p-type Weyl semiconductor with many intriguing properties, tellurium (Te) has been widely used in advanced electronics/optoelectronics. However, its application in floating gate (FG) memory devices for information processing has never been explored. Herein, an electronic/optoelectronic FG memory device enabled by Te-based 2D vdW heterostructure for multimodal reservoir computing (RC) is reported. When subjected to intense electrical/optical stimuli, the device exhibits impressive nonvolatile electronic memory behaviors including $\approx 10^8$ extinction ratio, $\approx 100$ ns switching speed, $>4000$ cycles, $>4000$-s retention stability, and nonvolatile multibit optoelectronic programmable characteristics. When the input stimuli weaken, the nonvolatile memory degrades into volatile memory. Leveraging these rich nonlinear dynamics, a multimodal RC system with high recognition accuracy of 90.77% for event-type multimodal handwritten digit-recognition is demonstrated.

1. Introduction

Ultrathin two-dimensional (2D) materials, featured with atomically thin thickness and dangling-bond-free layered surface can be easily integrated into a wide range of van der Waals (vdW) heterostructures for advanced electronics/optoelectronics with desired functionalities, thus shedding light on extending Moore’s law and breaking the von Neumann bottleneck in the era of big data.[1–3] Reservoir computing (RC) based on 2D materials highlighting low training complexity and high energy efficiency offers a promising solution to the ever-increasing data volume.[4–9] Researchers have leveraged the ferroelectricity in 2D $\alpha$-phase indium selenide ($\alpha$-In$_2$Se$_3$) to construct a stackable RC system for hierarchical information processing,[6] and also realized a RC system for image recognition with...
low-energy consumption using the charge-trap memory in the 2D molybdenum disulfide (MoS$_2$) channel.\cite{7} Moreover, an in-sensor RC system for language learning has been reported based on the photoelectricity in 2D tin sulfide (SnS).\cite{8} However, the data generated by smart devices in our daily lives is typically multimodal, posing a huge challenge to existing RC systems that are dedicated to single modality learning. Promisingly, Liu et al. reported a multimodal RC system based on 2D $\alpha$-In$_2$Se$_3$ for multimodal handwritten digit recognition recently.\cite{9} However, in their simulation, the input signals are limited to nonevent signals, and their device cannot serve as long-term memory unit.

The construction of 2D RC systems has largely relied on devices in the diode geometry, while emerging three-terminal transistor–geometry memory devices show potential as an ideal data processing platform due to their versatile control parameters and high compatibility with traditional complementary metal-oxide-semiconductor (CMOS) devices.\cite{10-12} Although 2D material-based three-terminal memory devices include ferroelectric field-effect transistors (FeFETs) and floating gate (FG) memory devices, the latter typically presents greater application prospects with its ultrafast operation speed,\cite{10,11} facile extension in the vertical dimension,\cite{13} and the achievement of logic-in-memory in 2D FG transistor-based computing systems.\cite{14,15} While electrical control is the most common method to operate 2D FG devices, optical sensitivity is indispensable for realizing multimodal (RC). The choice of the FG material is therefore vital to endow 2D FG memory units with both electrical and optical operations. Although optoelectronic memory has been reported among heterostructures with graphene, 2D perovskites, and MoS$_2$ nanoflakes as the FG layers, their best-known feature is the nonvolatile optoelectronic memory, rather than the nonlinear fading behavior after switching off the optical pulse.\cite{16-19} Tellurium (Te) is a recently rediscovered vdW semiconductor that features a one-dimensional (1D) crystal structure and presents many interesting properties, including thickness-dependent bandgap, thermoelectricity, and piezoelectricity.\cite{20,21} It has been used to construct various advanced electronics and optoelectronics, such as p-type field-effect transistors (FETs) with high room-temperature hole mobility, short-wave infrared photodetectors, three-dimensional (3D) monolithic circuits, and memory units in in-sensor RC at the optical communication band.\cite{22-26} Moreover, as a Weyl semiconductor, the peculiar band structure and spin properties originating from its composition of parallel 1D spiral molecular chains enable its applications in building low energy-consumption valley transistors.\cite{27,28} Moreover, Te demonstrates exceptional potential as a FG material for developing 2D FG memory units due to its distinct band splitting\cite{29,30} and rapid charge recombination at shallow trap states.\cite{31} This material combining with MoS$_2$ channel offers an added advantage of exhibiting nonlinear electronic/optoelectronic memory which occurs following the attenuation of stimulus intensity.

In this work, we first present a MoS$_2$/hexagonal boron nitride (h-BN)/Te heterostructure on silicon dioxide/silicon (SiO$_2$/Si) substrate, where MoS$_2$ serves as the semiconducting channel, h-BN functions as the tunnel dielectric, Te functions as the FG, and SiO$_2$ and heavily doped Si are used as a gate oxide and a control gate, respectively. We report on the nonvolatile/volatile electrically and optically controlled memory behaviors of this device by carefully tuning the stimulus intensity. Under intense electrical pulse stimulation, the fabricated device exhibits superior nonvolatile electronic memory performance including fast switching speed ($\approx$100 ns), a high extinction ratio ($\approx$10$^3$), long cycle (>4000 cycles), and retention (>4000 s) stability, which are comparable to the most advanced 2D FG memory devices. Similarly, under intense optical pulse illumination, this device presents multibit nonvolatile optoelectronic memory behavior. More interestingly, when the stimulus intensity is weakened, we observe nonlinear fading memory behavior attributed to the charge trapping states in MoS$_2$. Based on these rich, dynamic memory behaviors, we design a RC system for multimodal signal processing, which achieves a high accuracy of 90.77% on the event-based handwritten digit dataset, even surpassing the nonevent digits classification using 2D material-based RC. Our demonstration of this multifunctional 2D FG device will enrich the field of 2D electronics/optoelectronics and thus pave the way for future smart computing systems at the edge.

2. Results and Discussion

2.1. Fabrication and Characterization of the MoS$_2$/h-BN/Te Heterostructure

In this paper, we present a 2D vdW heterostructure-based electronic/optoelectronic FG memory device, which consists of Te as the FG, h-BN as the tunnel dielectric, multilayer MoS$_2$ as the channel, and SiO$_2$/heavily doped Si (p++ Si) as the gate oxide/control gate, respectively, as illustrated in Figure 1a.

The source/drain electrodes were defined by using standard electron beam lithography (EBL) and thermal evaporation of chromium/gold (Cr/Au) (8/60 nm). The false-color scanning electron microscope (SEM) image in Figure 1b shows the general layout of the MoS$_2$/h-BN/Te heterostructure on the SiO$_2$/p++ Si (SiO$_2$: 50 nm) substrate. The thickness of each material (MoS$_2$: $\approx$4.35 nm; h-BN: $\approx$12.81 nm; Te: $\approx$27.37 nm) was characterized using atomic force microscope (AFM), as shown in Figure S1 (see the Supporting Information). The heterostructure was stacked from the drop-casting solution-synthesized Te nanoflakes onto the SiO$_2$/Si substrate,\cite{23} followed by transferring freshly exfoliated h-BN and MoS$_2$ nanoflakes step by step via a dry transfer method.\cite{33} Details on material preparation and heterostructure fabrication can be found in the Experimental Section and Figure S2 (Supporting Information). Raman spectra were used to identify the involved vdW nanoflakes, as shown in Figure 1c. The Raman spectrum of Te revealed three peaks (E$_1$: 93.3 cm$^{-1}$, A$_1$: 120.6 cm$^{-1}$, E$_2$: 142.0 cm$^{-1}$), which are consistent with previous literature. Additionally, the lattice vibration mode (E$_{2g}$: 1367.4 cm$^{-1}$) in h-BN nanoflakes further supports these findings.\cite{22,23,33} The frequency difference of 24.5 cm$^{-1}$ between the E$_{2g}$ (383.3 cm$^{-1}$) and A$_{1g}$ (407.8 cm$^{-1}$) modes in the Raman spectrum indicates the multilayer nature of MoS$_2$.\cite{34} Furthermore, the Raman spectrum measured on the final MoS$_2$/h-BN/Te heterostructure confirms its constitution. To confirm the device structure and demonstrate the clean and flat interfaces between the vdW materials, the cross-section transmission electron microscopy (TEM) was conducted. The left panel in Figure 1d shows the scanning transmission electron microscopy (STEM) on the cross-section cut from the fabricated heterostructure via
focused ion beam (FIB), where the sharp interface clearly reveals the high-quality fabrication of the heterostructure. The flatness at the atomic scale suggests a low defect density on material surfaces. The vdW heterostructure composition is further demonstrated by the homogenous elemental distribution shown in energy-dispersive spectrometry (EDS) in the right panel of Figure 1d.

2.2. Nonvolatile Electronic/Optoelectronic Memory Triggered By Intense Electrical/Optical Stimuli

To comprehensively evaluate the nonvolatile electronic memory performance of the MoS$_2$/h-BN/Te heterostructure-based FG memory device, we characterized the memory window, switching speed, and retention stability. Figure 2a shows the transfer
Figure 2. Nonvolatile electronic memory. a) Transfer curves with varying $V_{cg, max}$ from 5 to 15 V. $V_{ds} = 0.1$ V. The magnitude of the corresponding gate leakage currents is indicated by the right $y$-axis. b) Extracted memory window $\Delta V$ as a function of $V_{cg, max}$. c) Output curves characterized in a small electrical bias under different control gate voltages. d) Switching speed test of the FG memory device. The program and erase voltages are $+12$ and $-12$ V, respectively. $V_{ds} = 0.1$ V. e) Cycling endurance of the FG memory device. The applied program/erase voltages are $\pm 10$ V/0.2 s. f) Retention stability of the FG memory device. The high-/low-resistance states are switched by $\pm 5$ V/1 s electrical pulses. g) Energy band diagram of the FG memory device in different electronic memory states.

curves under different gate voltages. The threshold voltages vary significantly in the forward and backward sweeping directions, which fully demonstrate the charge trapping ability of the Te FG. To reduce the effect of contact resistance, we extracted the memory window from the transfer curves presented in Figure 2b using the Y-function method.$^{[35,36]}$ Note S1 (Supporting Information) details the extraction process of $V_T$. As shown in Figure S3 (Supporting Information), the extracted threshold voltage values are $V_{Forward}/V_{Backward} = -14.33/14.19$ V when sweeping the control gate voltage from $-15$ to $+15$ V/+15 to $-15$ V, respectively. The memory window, defined as $\Delta V = V_{Backward} - V_{Forward} = 28.52$ V corresponds to a ratio of 95.1% between the memory window range (W.R.) and the sweeping range (S.R.). More threshold voltage values, memory windows, and M.R./S.R. ratios under different control gate voltages are summarized in Table S1 (Supporting Information). The trapped charge density in the Te FG at $\Delta V = 28.52$ V can be estimated by $n = \Delta V \epsilon_{h-BN} \epsilon_0 / q d_{SiO_2} = 1.185 \times 10^{13}$ cm$^{-2}$, where $\epsilon_{h-BN}$, $\epsilon_0$, q, and $d_{SiO_2}$ are the relative permittivity of h-BN, permittivity of vacuum, elementary charge, and thickness of SiO$_2$, respectively.$^{[37]}$ This value is about two orders of magnitude higher than that in multilayer graphene, which fully implies the superiority of Te as the FG in FG memory devices.$^{[38,39]}$ Also, the linearly increased memory window versus maximum control gate voltage as shown in Figure 2b, indicates the strong tuning ability of the applied control gate voltage on charge storing in the Te FG. The output characteristic curves under different control gate voltages measured at a small bias voltage ($V_{ds} = \pm 10$ mV)
are given in Figure 2c. Due to the electric field strength of the control gate voltages being weakened by the stored charge in the Te FG, the channel current exhibits only two distinct states (i.e., high-/low-resistance states). Moreover, the linear I–V curve suggests that a good contact has been achieved during fabrication. The atomically sharp and clean interfaces between the vdW materials in this FG memory unit enable the extremely fast operations. The ultrafast switching speed (±100 ns) of the fabricated device is demonstrated in Figure 2d. The detailed characterization process is illustrated in Figure S4 (Supporting Information). In the measurement, the electrical pulse amplitude for both programming/erasing (P/E) was kept at 12 V. In the programming process, electrical pulses with different pulse widths increasing from 10 ns to 1 μs were applied to control gate to switch the device (Figure S4a, Supporting Information). Based on the results, it can be concluded that the +12 V/100 ns pulse can slightly induce electron tunneling into the Te FG, while the +12 V/500 ns pulse can completely switch the device from a low-resistance state to a high-resistance state. In the erasing process, the −12 V/50 ns pulse is sufficient to “erase” the electrons stored in Te (Figure S4b, Supporting Information). The asymmetric switching speed in the programming/erasing processes is attributed to the band movement under positive/negative external electric fields. In this study, applying a +12 V control gate pulse induced a downward movement for both Te and MoS2 energy bands relative to h-BN, resulting in a higher effective tunneling barrier. Conversely, a −12 V electrical pulse induced an upward movement for both Te and MoS2 energy bands, leading to a lower effective tunneling barrier. The lower effective tunneling barrier for electrons resulted in a faster erasing operation. The ultrafast programming/erasing electrical pulses with different pulse width (widths ranging from 10 ns–1 μs) were generated by the B1500A semiconductor parameter analyzer and pulse waveforms were characterized by a digital oscilloscope (Tektronix TBS 1102B-EDU), as shown in Figure S5 (Supporting Information). To better evaluate the charge trapping ability in Te FG, we measured the direct threshold voltage after erasing and programming operations in near-zero control gate sweep, which is shown in Figure S6 (Supporting Information). We first applied a −10 V/1 s electrical pulse to erase the memory unit to the original state-1 and then applied a +10 V/1 μs electrical pulse to program the memory unit to state-0. We can observe that the threshold voltage shifts left over time and the state-0 in 4000 s is still clearly distinct from the original state-1, which fully demonstrates the considerable charge trapping ability of Te FG.

The FG device’s cycling endurance is presented in Figure 2e, where it exhibits highly reproducible switching between low- and high-resistance states over 4000 cycles under periodic programming and erasing trains (P: +12 V/0.2 s, E: −12 V/0.2 s). The dynamic P/E behavior is illustrated in Figure S7 (Supporting Information). Two distinct states (high-resistance state (LRS) and low-resistance state (HRS)) retaining for over 4000 s presented in Figure 2f demonstrate the long-term memory stability of the FG device. Moreover, the high write and read margin (10⁶) between LRS and HRS corroborates the considerable charge storage capacity of Te FG. On the other hand, after delicately controlling the charge quantity tunneling through h-BN dielectric by carefully tuning the electrical pulse width and amplitude at each time, our device can also store multibit information within the write and read margin, which is shown in Figure S8 (Supporting Information), where 150 distinct potentiation/depression states are programmed by negative/positive electrical pulse trains (±5 V/1 μs), respectively.

The working mechanism of the device under electrical operation is illustrated in Figure 2g. The left panel depicts the energy band diagrams of different materials before being brought into contact, which is also supported by the Kelvin probe microscopy (KPFM) results (Figure S9, Supporting Information.). Figure S9a (Supporting Information) provides the optical microscopy image of a typical device for KPFM measurement, which is composed of MoS2, h-BN, Te nanoflakes with similar thicknesses used in other devices. Figure S9c (Supporting Information) depicts the surface potential profile obtained along the light blue line in KPFM mapping of the overlapped area (Figure S9b, Supporting Information) and from which, we can conclude that the potential different between multilayer MoS2 and Te nanoflake is very small. The band diagram of the device under programming operation is provided by the middle panel, where electrons in the MoS2 channel will tunnel through the h-BN dielectric and be stored in the Te FG under the positive electrical pulse applied to the control gate. The resulting positive electric field will lead to the off-state in MoS2. The right panel describes the erasing operation, where the stored electrons will tunnel back to the MoS2 driven by the negative electrical pulse and recover the channel to the on-state.

The Fowler–Nordheim (F–N) plot was conducted to further evaluate the tunneling barrier for electrons at the h-BN/Te interface, and the results are shown in Figure S10 (Supporting Information). The stacked h-BN/Te heterostructure used for the F–N measurement is depicted in Figure S10a (Supporting Information), with the tunneling area (S = 93.75 μm²) indicated by the purple dashed line. Figure S10b,c (Supporting Information) illustrates the measurement set-up and the band diagrams for direct/F–N tunneling, respectively. The temperature-independent I–V curve in Figure S10d (Supporting Information) demonstrates the quantum tunneling across the h-BN dielectric. Figure S10e (Supporting Information) shows the tunneling current (ι) measured under forward bias at room temperature in logarithm scale. By replotted it as a F–N plot, i.e., ln (ι/E²) versus 1/E, the barrier height can be determined by the intercept (Figure S10f and its inset, Supporting Information). The value extracted by the F–N plot (Φbarrier = 2.79 eV) is close to that (±2 eV) given by Anderson’s rule (Note S2 for more details, Supporting Information). The control gate voltage at which the F–N tunneling begins can be estimated by Vcg = VFn(Δdoxide + ΔdBN/ΔdBN) ≈ 10 V, assuming similar dielectric constants for h-BN and SiO2. It can be concluded that direct tunneling and F–N tunneling dominate at Vcg < 10 V and Vcg > 10 V, respectively.

To further elucidate the electronic memory operations in the FG memory unit, we established an ideal physical model based on the equivalent circuit diagram (Figure S11, Supporting Information), please refer to Note S3 (Supporting Information) for detailed discussion. This model indicates that the electric intensity in h-BN dielectric when using 50 nm SiO2 dielectric is around 2 times of that when using 300 nm SiO2 dielectric, that helps explain why the amplitude of control gate voltage (12 V) for ultrafast operations used in this work is much smaller than other
Figure 3. Nonvolatile optoelectronic memory. a) Photocurrent evolution triggered by optical pulse trains with different wavelength (blue: 450 nm, green: 532 nm, red: 635 nm). The laser power is kept as ≈0.72 nW. b) Nonvolatile optoelectronic memory tuned by illuminating laser pulse trains on the FG memory device. Each pulse train consists of five laser pulses. Photocurrent evolution tuned by laser pulse trains with c) different laser powers, d) different pulse numbers, and e) different frequencies (duty ratio = 90%). The read bias is kept as 0.1 V in all the measurements. f) Energy band diagram of the FG memory device in different optoelectronic memory states.

reports (≈20–30 V).\textsuperscript{10,11} Moreover, this physical model can also be used to predict the ideal operation time in our FG memory device, which is shown in Figure S12 (Supporting Information). In erasing process, the ideal operation time is ≈8.5 ns under the control gate voltage of −12 V; in programming process, the ideal operation time is ≈17.2 ns under the control gate voltage of +12 V. These results suggest that the FG memory device performance can be further improved by optimizing the fabrication process.

This device exhibits nonvolatile multibit optoelectronic memory, as demonstrated in Figure 3; and Figures S13 and S14 (Supporting Information). Initially, red (635 nm), green (532 nm), and blue (450 nm) lights have been used to characterize the optoelectronic memory, and the results are presented in Figure 3a. The findings indicate that the FG device, based on the vdW MoS\textsubscript{2}/h-BN/Te heterostructure, has an optimal photoresponse to green light at the same laser power level (≈0.72 nW); leading to the selection of 532 nm laser as the light source in subsequent measurements unless otherwise stated. Figure 3b illustrates the dynamic light-erase conductance evolution under periodic exposure to green laser pulses, with the laser pulse width, frequency, and power set to 100 ms, 1 Hz, and 0.2817 nW, respectively. Prior to each optical erasing, the MoS\textsubscript{2} channel was programmed using an electrical pulse to the high-resistance state. Subsequently, the MoS\textsubscript{2} conductance increases with increasing the number of optical pulse trains (each train includes 5 laser pulses) until saturation.
Moreover, the multibit nonvolatile memory can be tuned by adjusting laser parameters. Figure 3c demonstrates the realization of 3-bit optical memory under various laser powers, where the laser pulse width and interval are 0.1 and 9.9 s, respectively. The dynamic multistate optical memory is presented in Figure 3d, where the channel current exhibits different resistance states by controlling the laser pulse numbers. The laser power is fixed at 0.2397 nW, the pulse frequency is 1 Hz, and the pulse width is 0.1 s. Varying the laser frequency also leads to an increase in the channel current with continuous exposure to laser pulses, as shown in Figure 3e. Figure S13 (Supporting Information) depicts the multilevel optical memory states, where more than 60 states are clearly distinguished. The long-term optical memory retention is demonstrated in Figure S14 (Supporting Information), where Figure S14a–d (Supporting Information) presents the channel current evolution triggered by different amounts of laser pulses. The insets provide the stepwise current evolution in the optical erasing processes. For comparison, the current evolution in the dark is given in Figure S14e (Supporting Information). As summarized in Figure S14f (Supporting Information), the multibit optical memory states with a large read margin of over 105 can maintain for over 1000 s. In the optical erasing process, the applied laser frequency, pulse width, and power are 1 Hz, 100 ms, and 0.3220 nW, respectively.

The working mechanism of the optoelectronic memory is illustrated in Figure 3f. Prior to each optical erasing operation, the FG device is programmed to a high-resistance state using a proper electrical pulse; left and middle panels illustrate the energy band diagram. Under illumination, photogenerated electron/hole pairs will accumulate in the MoS2 channel. Part of the holes hold a certain probability to tunnel through the h-BN dielectric and neutralize the stored electrons in Te FG. This diminishes the effective electric field, as depicted in the right panel.

To highlight the advantages of using Te as the charge trapping layer in 2D FG devices, we compared the nonvolatile electronic/optoelectronic performance of our device with other reports, and the results are presented in Table 1. Our FG memory unit exhibits comparable electronic programming/erasing capabilities to those of the most advanced devices reported in the literature. Furthermore, its impressive optoelectronic memory behavior suggests potential applications in future in-sensor memory computing systems.

Moreover, Te can be utilized as an internal gate to enable a MoS2 transistor, as demonstrated by the basic electrical characterization presented in Figure S15 (Supporting Information). Figure S15a (Supporting Information) illustrates the schematic of the MoS2 transistor with the h-BN nanoflake serving as the gate dielectric. The linear output curves under small bias in Figure S15b (Supporting Information) indicate good contact between the source/drain electrodes and the channel material. The negligible hysteresis window displayed in Figure S15c (Supporting Information) confirms the clean interface between the MoS2 and h-BN nanoflakes, consistent with the cross-section TEM results. The subthreshold swing (SS) values were further extracted from its transfer curves and the values are 69.08 and 64.95 mV dec–1 in forward and backward sweeping directions, respectively (Figure S15d, Supporting Information). These values are very close to the ideal limitation at room temperature (60 mV dec–1) and help illustrate the good performance of Te FET.

2.3. Volatile Electronic/Optoelectronic Memory Triggered by Mild Electrical/Optical Stimuli

Charge trapping states are quite common in MoS2 nanoflakes obtained by mechanical exfoliation, and they will bring up the current relaxation under mild electrical/optical stimuli thus laying the foundation for the MoS2-based synaptic devices. Here we also demonstrate that the volatile electronic/optoelectronic memory behaviors could be achieved in our multifunctional FG memory unit with using Te as the FG.

The volatile optoelectronic memory is illustrated in Figure 4a–f and Figure S16 (Supporting Information). Figure 4a,b compares the optoelectronic current evolution triggered by laser pulses of different intensities. When an intense laser pulse is applied, the channel current reaches a higher value and then stabilizes. In contrast, when a weak optical pulse is used, the photocurrent quickly recovers to its initial state. Figure S16 (Supporting Information) shows the short-term memory states triggered by an optical pulse train with mild intensity. The laser power, pulse width, and interval are 0.1340 nW, 0.1 s, and 0.9 s, respectively. Figure 4c depicts the volatile optoelectronic memory behaviors triggered by optical pulse trains with varying pulse numbers. As the pulse numbers increase, both the peak value of the excitatory postsynaptic current (EPSC) and the recovery time will also increase. Figure 4d presents the EPSC triggered by optical pulse trains with different pulse intervals, demonstrating that the EPSC can be adjusted by controlling the stimulation parameters. The paired-pulse facilitation (PPF) effect triggered by optical stimulation is illustrated in Figure 4e and the optoelectronic PPF ratio ΔA2/ΔA1 × 100% is provided as a function of the optical pulse interval is provided in Figure 4f. The PPF behavior triggered by paired optical pulses with different intervals is shown in Figure S17 (Supporting Information), from the bottom up, the interval increases from 1 to 10 s. The results are fit to a double exponential decay function: PPF = C1 × e−Δ1/τ1 + C2 × e−Δ2/τ2, where the blue line in Figure 4f, where τ1 = 2.4554 s and τ2 = 2.4560 s, respectively.

After weakening the electrical stimuli intensity, our FG device also exhibits volatile electronic memory behavior. In Figure S18 (Supporting Information), we present the short-term memory state triggered by an electrical pulse with a mild intensity of −2 V and a pulse width of 100 ms. In contrast to the nonvolatile memory behavior triggered by intense electrical pulses, the EPSC in this case recovers to its initial state within several seconds. In Figure 4g, we illustrate the PPF behavior triggered by paired electrical pulses with different intervals, ranging from 0.1 to 1 s from the bottom up. The pulse amplitude and width are −2 V and 10 ms, respectively. Figure 4h provides an example of the PPF effect triggered by paired electrical pulse with an interval of 0.2 s. We define the electronic PPF ratio as ΔA2/ΔA1 × 100% and present it as a function of the electrical pulse interval in Figure 4i. The results can be fitted by the double exponential decay function: PPF = C1 + C2 × e−Δ1/τ1 + C2 × e−Δ2/τ2, where τ1 = 0.0672 s and τ2 = 0.3274 s, respectively.

In addition, we also fabricated a device by carefully eliminating overlapping between the source/drain electrodes and Te nanoflake as the control experiment and this device exhibit similar memory behaviors obtained on the device throughout the
Table 1. Comparison between different nonvolatile electronic/optoelectronic 2D FG devices.

<table>
<thead>
<tr>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>ReS&lt;sub&gt;2&lt;/sub&gt;/h-BN/graphene</td>
<td>83.3%/60 V</td>
<td>10&lt;sup&gt;8&lt;/sup&gt;</td>
<td>1 µs</td>
<td>&gt;400</td>
<td>&gt;4000 s</td>
<td>N.A.</td>
<td>N.A.</td>
<td>N.A.</td>
<td>[53]</td>
</tr>
<tr>
<td>InSe/h-BN/graphene</td>
<td>80.0%/40 V</td>
<td>10&lt;sup&gt;10&lt;/sup&gt;</td>
<td>21 ns</td>
<td>&gt;2000</td>
<td>&gt;5400 s</td>
<td>N.A.</td>
<td>N.A.</td>
<td>N.A.</td>
<td>[10]</td>
</tr>
<tr>
<td>MoS&lt;sub&gt;2&lt;/sub&gt;/h-BN/graphene</td>
<td>75.7%/35 V</td>
<td>10&lt;sup&gt;6&lt;/sup&gt;</td>
<td>20 ns</td>
<td>&gt;1300</td>
<td>&gt;5300 s</td>
<td>N.A.</td>
<td>N.A.</td>
<td>N.A.</td>
<td>[11]</td>
</tr>
<tr>
<td>MoS&lt;sub&gt;2&lt;/sub&gt;/h-BN/graphene</td>
<td>N.A.</td>
<td>N.A.</td>
<td>N.A.</td>
<td>N.A.</td>
<td>10&lt;sup&gt;6&lt;/sup&gt;</td>
<td>&gt;30000 s</td>
<td>18</td>
<td>N.A.</td>
<td>N.A.</td>
</tr>
<tr>
<td>PtS&lt;sub&gt;2&lt;/sub&gt;/h-BN/graphene</td>
<td>62.5%/40 V</td>
<td>10&lt;sup&gt;7&lt;/sup&gt;</td>
<td>N.A.</td>
<td>&gt;1000</td>
<td>&gt;1000 s</td>
<td>N.A.</td>
<td>N.A.</td>
<td>74</td>
<td>[54]</td>
</tr>
<tr>
<td>Graphene/h-BN/MoS&lt;sub&gt;2&lt;/sub&gt;</td>
<td>64.3%/70 V</td>
<td>10&lt;sup&gt;6&lt;/sup&gt;</td>
<td>N.A.</td>
<td>&gt;10000 s</td>
<td>10&lt;sup&gt;6&lt;/sup&gt;</td>
<td>&gt;10000 s</td>
<td>13</td>
<td>N.A.</td>
<td>[55]</td>
</tr>
<tr>
<td>ReS&lt;sub&gt;2&lt;/sub&gt;/h-BN/MoS&lt;sub&gt;2&lt;/sub&gt;</td>
<td>80.0%/50 V</td>
<td>10&lt;sup&gt;7&lt;/sup&gt;</td>
<td>N.A.</td>
<td>&gt;2000</td>
<td>&gt;10000 s</td>
<td>N.A.</td>
<td>&gt;800 s</td>
<td>3</td>
<td>[19]</td>
</tr>
<tr>
<td>SnS&lt;sub&gt;2&lt;/sub&gt;/h-BN/graphene</td>
<td>81.0%/40 V</td>
<td>10&lt;sup&gt;6&lt;/sup&gt;</td>
<td>N.A.</td>
<td>&gt;600</td>
<td>&gt;4000 s</td>
<td>10&lt;sup&gt;4&lt;/sup&gt;</td>
<td>N.A.</td>
<td>&gt;50</td>
<td>[17]</td>
</tr>
<tr>
<td>MoS&lt;sub&gt;2&lt;/sub&gt;/h-BN/Te</td>
<td>95.1%/15 V</td>
<td>10&lt;sup&gt;8&lt;/sup&gt;</td>
<td>P/E&lt;sup&gt;b)&lt;/sup&gt; 500 ns/100 ns</td>
<td>&gt;4000</td>
<td>&gt;4000 s</td>
<td>&gt;10&lt;sup&gt;5&lt;/sup&gt;</td>
<td>&gt;1000 s</td>
<td>&gt;60</td>
<td>This work</td>
</tr>
</tbody>
</table>

<sup>a</sup> N.A.: Not applicable; <sup>b</sup> P/E: Programming/Erasing.
Figure 4. Volatile electronic/optoelectronic memory. a) Volatile optoelectronic memory behavior triggered by a weak optical pulse. b) Nonvolatile optoelectronic memory behavior triggered by an intense optical pulse. c) Volatile optoelectronic memory behavior stimulated by pulse trains with different pulse numbers. d) Volatile optoelectronic memory behavior stimulated by pulse trains with different pulse intervals. e) PPF effect induced by paired optical pulse. f) Optoelectronic PPF ratio defined as $\frac{\Delta A_2}{\Delta A_1} \times 100\%$ as the function of the optical pulse interval. g) Current evolution triggered by paired electrical pulse with different intervals. h) PPF effect induced by paired electrical pulse. i) Electronic PPF ratio defined as $\frac{\Delta A_2}{\Delta A_1} \times 100\%$ as the function of the electrical pulse interval.

manuscript (Figure S19, Supporting Information). This helps confirm the origin of the multifunctional memory behaviors is the used vdW materials and the effect of the metal electrode is trivial.

2.4. Multimodal Reservoir Computing

We leveraged the nonlinear dynamics of the FG device to stimulate a multimodal RC system. The Te FG device serves as the memory unit for classifying the N-MNIST database (Neuromorphic-Modified National Institute of Standards and Technology database). Figure 5a depicts the electronic/optoelectronic FG device used for electrical/optical signal sensing and processing. Using this device, we designed a RC system capable of directly learning and inferring from electrical/optical signals without additional dedicated transducers. RC systems typically comprise of three components: the input layer, the intermediate representation or reservoir, and the output layer. Only the weights of the output layer require training, reducing the training burden compared to traditional compared to traditional artificial neural networks (ANNs).[51,52] However, the challenge of processing multimodal signals with RC systems lies in feature extraction and multisensory fusion. In conventional RC systems, multiple reservoirs are required for feature extraction because each reservoir can only respond to one type of input signal. These extracted features for different modalities are then integrated and classified by the readout layer, as shown in Figure 5b. In contrast, our electronic/optoelectronic FG device demonstrates an effective response to electrical/optical signals simultaneously, allowing for a mixed-input RC system that achieves both feature extraction and multisensory fusion within
a single reservoir (Figure 5c). Reducing the need for features extractions and multisensory fusion will greatly benefit the energy and time consumption, representing a significant advancement in the multimodal RC method over traditional RC systems.

The multimodal signal processing capability of our multimodal RC system enables it to be applied in long-distance communication technology, such as augmented reality and surveillance. It is well known the transmission speeds for sound signal and image signal usually vary a lot, in that case, the sound and image information encoded into electrical and optical spikes, respectively, need to be processed asynchronously. Besides processing synchronous electrical and optical signal in long-distance communication, our multimodal RC system will also find their applications in brain science, where the multimodal signal encoding is a must. To evaluate its performance in such scenarios, our multimodal RC system is applied to conduct the task of digit-recognition based on the N-MNIST dataset. The N-MNIST dataset was captured using a dynamic vision sensor (DVS) that was mounted on a motorized pan-tilt unit, which performed a saccade movement to capture images from frame-based static image datasets (MNIST). A sample from the N-MNIST dataset can be seen in Figure 5d. The event stream of the N-MNIST dataset has a slightly higher spatial resolution of 34 × 34 pixels compared to that of the MNIST dataset. Each recording was converted into 5 frames, and only the positive channel was used for every input. This frameless dataset contains additional temporal information while retaining similar spatial information to the original dataset. For this task,
we assume that the upper half of the digits can only be sensed as electrical signals, and the lower half can only be sensed as visual signals, represented by optical and electrical pulses, respectively. The intensity of each pixel of an input frame denotes whether there is an electrical or optical pulse applied to the corresponding FG reservoir device in the sensor array. The possible inputs to one FG reservoir device across the 5 temporal frames form 5-bit vectors, corresponding to 12 illumination patterns (Figures S20–23, Supporting Information). The stability and repeatability of single device is demonstrated in Figure S24 (Supporting Information). We have sampled the response to each pattern (5-bit electrical pulse patterns and 5-bit optical pulse patterns) in the single device for several times and the mean/median and quartiles are visualized in a grouped box chart. The small current variations reveal the high stability of our device and its application potential in real multimodal RC system. The final currents of the FG reservoir devices in the array after receiving the five frames were used to perform pattern recognition through the trained software readout layer (Figure S25, Supporting Information).

The research findings reveal that when stimulated by single-modal electrical or optical pulses, only half of the digit was fed to the reservoir, resulting in unsatisfactory accuracies of 80.04% and 83.01%, respectively, as depicted in Figure 5i. However, a mixed-input fed to the reservoir produced a high recognition accuracy of 90.77% (indicated by the green dot in Figure 5i), even surpassing the software baseline (89.19%, indicated by the blue dot in Figure 5i). Figure 5e illustrates the confusion matrix in classifying the N-MNIST test set with mixed input, while Figure 5f presents the initial and final weights distribution in the readout layer, the latter shows a quasirandom distribution. Furthermore, Figure 5h demonstrates the 2D distribution of the extracted feature vectors produced by the multimodal reservoir using linear discriminant analysis (LDA) for dimensionality reduction. These encoded features have been effectively clustered in the 2D feature space, indicating the success of the nonlinear transformations by the volatile electronic/optoelectronic memory behavior triggered by weak stimuli in our device.

Finally, a comparison between RC and conventional ANNs for the N-MNIST-based digit-recognition task is made in terms of training cost, as shown in Figure 5g. The results demonstrate that a conventional two-layer fully connected neural network with 900 hidden units (same as the number of reservoir nodes) requires a significant number of weights to be trained (521 000), which grows quickly with the network going deeper. However, this number remains constant (9000) for RC, further confirming the advantage of RC for real-time edge learning.

To further demonstrate the superiority of our multimodal RC system, we compared the classification accuracies achieved by using different methods (mixed optical and electrical input for the full and half digit dataset, single electrical input for the full and half digit dataset, and single optical input for the full and half digit dataset), which is shown in Figure S26 (Supporting Information). It can be found that the classification accuracy with mixed electrical and optical input for the full digit dataset is higher than single electrical or optical input for the full digit dataset and that also helps demonstrate the superiority of our multimodal RC system.

3. Conclusions

In summary, we have fabricated a 2D electronic/optoelectronic FG memory device with Te as the charge-trapping layer for multimodal RC. Thanks to the superior charge trapping ability of Te, this device exhibited excellent nonvolatile electronic memory behavior when stimulated by intense electrical/optical pulses, including a high extinction ratio (≈105), a close to unit (95.1%) ratio between the memory window range and sweeping range, a fast switching speed (≤100 ns), a long cycle (>4000 cycles), and retention (>4000 s) stability, or the long-term optoelectronic memory behavior (>1000 s) and nonvolatile multibit (>60) optoelectronic programmable characteristics. These performances are comparable to the most advanced 2D electronic/optoelectronic FG devices. We also demonstrated that the device's memory could degrade into volatile memory after weakening the input stimuli due to the charge trapping in MoS2 channel. Everaging the device's rich dynamics, we simulated a multimodal RC network that achieved a high recognition accuracy of 90.77% in the event-based N-MNIST digit-recognition task, parallel to that of the software baseline. This work offers new possibilities for advanced electronics/optoelectronics in the 2D family and provides design insights for future smart signal processing systems at the edge.

4. Experimental Section

Heterostructure Preparation and Device Fabrication: Te nanoflakes were synthesized using a hydrothermal method and deposited onto a SiO2/Si substrate via drop-casting.32,23 The Si substrate covered by 50 nm thermal oxide (SiO2) was purchased from Silicon Valley Microelectronics, Inc., where the Si was heavily doped by boron in p-type and its resistivity is less than 0.005 ohm cm. The h-BN and MoS2 nanoflakes were mechanically exfoliated on the polydimethylsiloxane (PDMS) directly from their maternal crystals, which were purchased from 2D Semiconductor Inc., and then transferred step-by-step onto the bottom heterostructure using a dry transfer method.23 The electrodes were defined using a standard electron beam lithography (EBL) process (TESCAN, VEGA3), followed by the thermal evaporation of Cr/Au (8/60 nm).

Device Characteristics: The heterostructures were imaged using an optical microscope (Nikon, EBLIPSE LV100ND) and the false-color SEM image was processed using MOUNTAINS8 software based on the original SEM image captured from TESCAN, VEGA3. The height profiles of the vdW nanoflakes were characterized by AFM (Bruker, Dimension Icon with Scan Asyst), and the Raman spectra were obtained at room temperature using a Renishaw Raman Microscope with a polarized laser at a wave-length of 532 nm. Electronic memory behaviors were characterized using a B1500A semiconductor parameter analyzer, and the waveform of the electrical pulses was measured with a digital oscilloscope (Tektronix TBS 1102B-EDU). Optoelectronic memory behaviors were characterized on a custom-built platform under ambient conditions, where monochrome lasers (450, 532, and 635 nm) were combined with an optical chopper and guided to the device via an optical fiber. Channel current was monitored using an Agilent 4155C semiconductor analyzer (Agilent Technologies, Santa Clara, CA). In measuring the current evolution triggered by 5-bit electrical pulse trains, 32 (5-bit) electrical pulse trains (“00000” to “11111”) “0” represents the electrical pulse amplitude is zero and “1” represents the electrical pulse amplitude is nonzero) were generated by the B1500A semiconductor analyzer and the current evolution in MoS2 channel was monitored by Agilent 4155C semiconductor analyzer; similarly, in characterizing the current evolution triggered by 5-bit optical pulse trains, 32 (5-bit) optical pulse trains (“00000” to “11111”) “0” represents the absence of a laser pulse to the device and “1” represents the presence of a laser pulse to the device) were generated by a monochrome laser (532 nm)
modulated by a mechanical chopper. For each pulse train, after 5 consecutive laser pulses hit the device, then the light path was manually cut off.

Simulation of the Multimodal Reservoir Computing System with the FG Devices: The whole spike train in N-MNIST dataset can be represented as a $34 \times 34 \times 2 \times T_s$ sized spike train, where $T_s$ refers to the height and width of the sensing field, $T_D$ denotes the length of recording time, and $2$ indicates that the DVS camera records the spike events in two channels—On and Off. In the experiments, the On channel was only used. To preprocess neuromorphic datasets, the event-to-frame integrating method was used and each recording was converted to $5$ frames. The height and width of each sample were cropped from $34$ to $30$. The simulated multimodal RC system was created with $900$ electrically/optically operated memory devices. The conductance programming by both electrical and optical stimuli was calibrated using measurement data acquired on a single device. The readout layer consisted of a $900$ sized spiketrain, where "34" refers to the height and \( \times \) width of the sensing field, \( 34 \) denotes the length of recording time, and \( 2 \) denoted the length of recording time, and \( 2 \) denotes the length of recording time, and \( 2 \) denotes the length of recording time.


