# Fileset

[nn5c06174_si_001.pdf](https://mdr.nims.go.jp/filesets/1fe36db9-94c2-4b74-a70c-2698b7331654/download)

## Creator

[Daiki Nishioka](https://orcid.org/0000-0002-3369-7700), [Hina Kitano](https://orcid.org/0009-0008-9132-0275), [Wataru Namiki](https://orcid.org/0000-0003-4053-7366), Satofumi Souma, [Kazuya Terabe](https://orcid.org/0000-0003-3988-3456), [Takashi Tsuchiya](https://orcid.org/0000-0002-6950-6160)

## Rights

[Creative Commons BY Attribution 4.0 International](https://creativecommons.org/licenses/by/4.0/)

## Other metadata

[Two Orders of Magnitude Reduction in Computational Load Achieved by Ultrawideband Responses of an Ion-Gating Reservoir](https://mdr.nims.go.jp/datasets/470dca15-9f14-4adc-b3cf-8f5554a9ac93)

## Fulltext

1 Supporting information Two Orders of Magnitude Reduction in Computational Load Achieved by Ultra-wideband Responses of Ion-Gating Reservoir  Daiki Nishioka1*, Hina Kitano2,3, Wataru Namiki2, Satofumi Souma4, Kazuya Terabe2 and Takashi Tsuchiya2,3*   1International Center for Young Scientists (ICYS), National Institute for Materials Science (NIMS), 1-1 Namiki, Tsukuba, Ibaraki, 305-0044, Japan. 2Research Center for Materials Nanoarchitectonics (MANA), NIMS, 1-1 Namiki, Tsukuba, Ibaraki, 305-0044, Japan. 3Department of Applied Physics, Faculty of Advanced Engineering, Tokyo University of Science, Katsushika, Tokyo 125-8585, Japan 4Department of Electrical and Electronic Engineering, Kobe University, Kobe 657-8501, Japan   *Email: TSUCHIYA.Takashi@nims.go.jp, NISHIOKA.Daiki@nims.go.jp  2 Note S1: Effect of Channel Length on the Modulation Range of Drain Current (On/Off Ratio) In this section, we examine how the channel length of the device affects the modulation range of the drain current, i.e., the on/off ratio. Figure S1(a) shows normalized transport characteristics obtained from the devices presented in Fig. 1c, corresponding to Drain0 through Drain2. These devices share a channel width of 30 µm and have channel lengths of 5 µm, 20 µm, and 100 µm, respectively. To focus on the on/off ratio of each channel, the horizontal axis represents the relative gate voltage with respect to the Dirac point, defined as VG' = VG - VDirac, and the vertical axis shows the normalized increase in drain current relative to the value at VG' = 0 V. As a result, we observed that shorter channels tend to exhibit a suppressed increase in drain current in response to gate voltage. For instance, the on/off ratio z at a given gate voltage (VG' = -1.35 V) was found to increase with the channel length L, as shown in Figure S1(b). To explain this behavior, we describe the drain current ID using a composite resistance model, as illustrated in Figure S1(c). The total channel resistance Rtot is expressed as the sum of the VG'-independent lead resistance RL (primarily due to contact and lead electrodes) and the VG'-dependent channel resistance RC: 𝑅!"!(𝑉#) =  𝑅$ + 𝑅%(𝑉#)  (S1) In this model, for shorter channels, RC becomes smaller, making the contribution of RL relatively larger. Therefore, even if RC decreases under gate bias, the overall increase in ID is limited as long as RL remains dominant. Let RC,off and RC,on be the channel resistances in the off-state (VG' = 0 V) and on-state (VG' = -1.35 V), respectively. The channel-length dependence of the on/off ratio z(L) can then be expressed as: 𝑧(𝐿) =𝑅$  +  𝑅%,"''𝑅$  +  𝑅%,"(   (S2) The channel resistances can further be expressed using their respective resistivities: 𝑧(𝐿) =𝑅$  +𝜌%,"''𝑊 𝐿𝑅$  +𝜌%,"(𝑊 𝐿   (S3) Here, rC,off and rC,on are the resistivities of the channel in the off and on states, respectively. The red line in Figure S1(b) shows the fitting result of the experimental data (gray dots) using the above model (Equation S3), indicating that the model accurately captures the channel-length dependence of the on/off ratio. Here, RL was treated as a length-independent constant, and the parameters RL, rC,off and rC,on were numerically determined to be 128.1 Ω, 1842 Ω, and 203.2 Ω, respectively.   3  Figure S1: (a) Normalized transfer characteristics for devices with a fixed channel width of 30 µm and channel lengths of 5 µm, 20 µm, and 100 µm (Drain0–Drain2), plotted against the relative gate voltage VG’ =VG−VDirac. The vertical axis shows the increase in drain current relative to VG’ =0 V. The dashed line indicates VG’ =-1.35 V where on/off ratios were calculated. (b) Channel-length dependence of the on/off ratio z evaluated at a fixed VG’, showing a monotonic increase with increasing channel length. The red dashed curve represents the fitting based on the composite resistance model (Eq. S3). (c) Schematic illustration of the composite resistance model, where the total channel resistance Rtot is composed of the gate-independent lead resistance RL and the gate-dependent channel resistance RC(VG).  Next, we used the obtained RL to model the gate-voltage dependence of the drain current and validate the model. Using the composite conductance Gtot = 1/ Rtot, the drain current is expressed as: 𝐼) (𝑉#) =  𝐺!"!(𝑉#)𝑉)  =𝐺$𝐺%(𝑉#)𝐺$  +  𝐺%(𝑉#) 𝑉)  (S4) where GL = 1/ RL is the lead conductance, and GC = 1/ RC is the channel conductance. Ideally, GC = a|VG'|, showing linear dependence on gate voltage. However, due to the voltage drop along the channel induced by the source–drain bias VD, the potential distribution varies within the channel. In particular, near VG' ~ 0 V, both n-type and p-type regions may coexist, leading to a broadening of the conductance profile and preventing the emergence of a sharp V-shaped response. To account for this behavior, we constructed a segmented channel model in which the channel is divided into N = L/l sub-regions of length l = 20 nm, as illustrated in Figure S2. Let VC,i be the local channel potential in the i-th segment (i=1,2,…,N). The effective gate voltage applied to that segment is V’G,eff = VG' - VC,i, and the partial conductance Gi is defined as: 𝐺* = 𝑎 ⋅ 5𝑉#,+'', 5 +  𝑏 =  𝑎 ⋅ 5𝑉#, − 𝑉%,*5 +  𝑏 (S5) Here, 𝑎 = (𝑊 𝑙⁄ ) ∙ 𝜇𝐶-)$ is the slope of conductance change with effective gate voltage, and b is the minimum conductance at V’G,eff = 0 V. Due to thermally excited carriers at room temperature, b > 0. Assuming experimental condition of VD = 0.5 V, the local channel potential VC,i was defined using a linear profile: 𝑉%,* =𝑉)𝑁⋅ ?𝑖 −12A −𝑉)2(S6) 87654321Normalized drain current-1.5 -1.0 -0.5 0.0 0.5VG'  = VG - VDirac /V L = 5 µm L = 20 µm L = 100 µmRL RC(VG)VDID(VG) = VD / Rtot(VG)Lead Channel87654321On/off ratio z12080400Chanel length L /µm Experiment Modela b c 4 It should be noted that this definition ensures VC,i = 0V at the center of the channel (i=N/2), thereby preventing an artificial shift of the Dirac point caused by the introduction of VD. This is consistent with the use of the relative gate voltage VG’, which already compensates for the Dirac point shift. The total channel conductance GC is then given by the series combination of the segment conductances: 𝐺% = CD1𝐺*.*/0E10= CD1 𝑎 ⋅ 5𝑉#, − 𝑉%,*5 +  𝑏.*/0E10(S7) Substituting into Equation S4, the drain current response becomes: 𝐼)(𝑉#,) =𝐺2 ⋅ G∑1 𝑎 ⋅ 5𝑉#, − 𝑉%,*5 +  𝑏.*/0 I10𝐺$ + G∑1 𝑎 ⋅ 5𝑉#, − 𝑉%,*5 +  𝑏.*/0 I10 𝑉) (S8) Using this model, we performed fitting to the measured data. The value of RL was taken from the previous model (Equation S3), while a and b were treated as fitting parameters. Here, b corresponds to the minimum conductance near the Dirac point, and a is a coefficient related to the channel geometry, carrier mobility, and EDL capacitance as described above. Theoretically, based on a carrier mobility of ~3000 cm2/V·s (from Hall measurements) and an EDL capacitance of ~1 µF/cm2, the estimated value of a is approximately 4.5, which is consistent with the fitting results shown in Table S1.  Figure S2: Segmented channel model used to describe the broadened conductance behavior near the Dirac point. The channel is divided into N=L/l segments of length l=20 nm, with each segment assigned a local potential VC,i defined by a linear voltage drop under VD=0.5 V. The effective gate voltage in each segment is VG’−VC,i, and the local conductance is given by Gi=a∣VG’−VC,i ∣+b.  The red curves in Figures S3(a–c) illustrate the transport characteristics reproduced using Equation S8 and RL = 128.1 Ω. The comparison with the experimental data (gray dots) shows that the model successfully captures the saturation behavior of ID caused by the lead resistance. In contrast, the ElectrolytePositionPotentialVG’V ’G,effGDS0 LChannelVC,iii-1 i+1 ……20 nm……Gi Gi+1Gi-1 V ’G,effGiba 5 simplified model without RL (i.e., ID = GC·VD) fails to match the experimental results, particularly under large negative VG'. This discrepancy becomes more pronounced in short-channel devices, where the contribution of RL is relatively significant. These results strongly support the validity of our model and indicate that the observed channel-length dependence of the on/off ratio in the devices arises from the relative contribution of the lead resistance.  Figure S3: Comparison of the modeled (red curves) and experimental (gray dots) drain current characteristics for devices with different channel lengths of (a) 5 µm, (b) 20 µm and (c) 100 µm. The model incorporates the lead resistance RL and the segmented channel conductance to reproduce the observed saturation behavior. Dashed curves represent the simplified model without RL, which fails to capture the measured response, especially under large negative VG’.   Table S1: Fitting parameters a and b for the segmented conductance model. Channel Parameter a Parameter b L = 5 µm 7.54 0.135 L = 20 µm 5.61 0.294 L = 100 µm 3.76 0.206  43210Drain current /mA-1.2 -0.8 -0.4 0.0VG' /VL = 5 µm Experiment Model Model (w/o- RL)2.52.01.51.00.50.0Drain current /mA-1.2 -0.8 -0.4 0.0VG' /VL = 20 µm Experiment Model Model (w/o- RL)8006004002000Drain current /µA-1.2 -0.8 -0.4 0.0VG' /VL = 100 µm Experiment Model Model (w/o- RL)a b c 6  Figure S4: Optical microscope image of the Hall bar-type graphene channel. The Hall measurements shown in Figure 1e were performed using an EDLT fabricated by placing ion-gel and Au foil on this Hall bar-type channel. I+ and I- are the current terminals, V+ and V- are the voltage terminals, and VH is the terminal for Hall voltage measurements.   50 µmGrapheneCr/AuI+ I-V+ V-VH 7 Note S2: Simulation-Based Reproduction and Interpretation of 100~ns Switching Behavior The experimentally observed switching response on the order of 99 ns can be quantitatively explained by incorporating electrostatic screening effects into numerical simulations based on the Nernst-Planck-Poisson (NPP) equations as follows. In this framework, the NPP equations are solved to obtain the time evolution of ionic concentrations (both anions and cations), fluxes, and electrostatic potential within the ion gel, where the presence of the graphene channel is taken into account by calculating the charge distribution using a tight-binding (TB) method combined with a quantum capacitance model. The electrostatic potential is obtained by solving the Poisson equation over the entire simulation domain, including the graphene layer, under the boundary condition that a gate voltage is applied to the opposite side of the ion gel. Then the channel current in graphene is calculated using the Boltzmann transport theory. The detailed methodology has been reported previously[1]. In the present study, we explicitly consider the electrostatic screening within the ion gel, which acts as the gate dielectric. Given the relatively high ionic conductivity of the gel, s = 6 mS/cm, it is reasonable to assume that the electric field inside the electrolyte is dynamically suppressed via ionic motion, which is captured by solving the time-dependent NPP equations. That is, we consider a Debye-length-dependent additional term in the Poisson equation as 𝑑3𝜙(𝑧)𝑑𝑧3= −𝜌(𝑧)𝜀+(𝜙(𝑧) − 𝜙4567)𝜆)3   (S9) along the ion-gel thickness direction (normal to the graphene plane), where the second term on the right-hand side effectively accounts for the electrostatic screening arising from mobile ions under the assumption of linearized Boltzmann statistics, with the Debye screening length 𝜆) = Q𝜀𝑘8𝑇/2𝑛𝑒3 estimated based on the ion density n. In the above Poisson’s equation, 𝜙4567  denotes the bulk electrostatic potential determined by the charge neutrality condition, and is assumed to be moderately shifted toward the graphene channel potential near the graphene interface (within a few nanometers) due to asymmetric electrostatic coupling and incomplete ionic screening.  In the actual simulation, we assume the ion density 𝑛 = 3.23 × 1039 m1: and the diffusion constant 𝐷 = 1.5 × 10100 m3/s for both anion and cation. These parameter choices are consistent with the experimentally observed conductivity s ≈ 6 mS/cm through the Nernst-Einstein relation:  𝜎 =2𝑒3𝑛𝐷/𝑘;𝑇  at the room temperature. The permittivity is assumed as 𝜀 = 10𝜀<. Figure S5 shows the time evolution of the normalized graphene channel current in response to a pulsed gate voltage applied to a graphene FET with a 500 μm-thick ion gel as the gate dielectric. The pulse width is 5 μs, with the base (off-state) gate voltage set to Vb = -0.1 V and the input (on-state) gate voltage set to Vin = -0.55 V. As can be seen from the result, a switching response on the order of 100 ns, comparable to the experimental observation, is successfully reproduced. The agreement between the experimental and simulation results suggests that the relaxation time on the order of 100 ns can be interpreted as 𝜏 = 𝑑-)$3 /𝐷, where the typical width of the electric double layer is assumed to be dEDL =1.2 nm and the diffusion constant is taken as 𝐷 = 1.5 × 10100 m3/s  as mentioned above, yielding t ≈ 100 ns.   8  Figure S5: Time evolution of the normalized graphene channel current in response to a pulsed gate voltage applied to a graphene FET with a 500 μm-thick ion gel as the gate dielectric. The inset shows a magnified view of the time window from 0 to 0.5 μs, highlighting the fast switching response on the order of 100 ns.      9 Note S3: CEDL Evaluation The electric double-layer capacitance CEDL of the ion-gel/graphene EDLT was 0.91 µF/cm2 in the p-type region and 0.88 µF/cm2 in the n-type region. These values were calculated by applying the widely used relationship =>=?!= 𝐶-)$ (the differential capacitance of the EDL) to the changes in Hall density and electron density obtained from Hall measurements as a function of VG2-4). Figure S6 shows the carrier density dependence obtained from Hall measurements, with linear fitting performed over the p-type region (from VG-VDirac=-1.525 V to -0.425 V) and the n-type region (from VG-VDirac=0.375 V to 1.275 V). From these fittings, CEDL was calculated, and as shown in Figure S6, the fitting lines match well across a wide VG range. This result indicates that CEDL has little dependence on the applied VG. The CEDL values calculated here are consistent with experimentally and theoretically estimated values for graphene EDLTs using ionic liquids3-5). The relatively low CEDL<1 µF/cm2, compared to the typical ~10 µF/cm2 for ionic liquid-based EDLs6), arises from the limitation imposed by the series quantum capacitance Cq5).   Figure S6: Carrier density changes in the graphene channel as a function of the applied gate voltage obtained from Hall measurements. The red and blue lines represent linear fittings in the p-type and n-type regions, respectively, and the slopes of these fittings were used to calculate the electric double-layer capacitance CEDL.  86420Carrier density / ×1012 cm-2-1.0 0.0 1.0VG−VDirac  / V!"!#!= 0.91  µF cm"⁄!"!#!= 0.88  µF cm"⁄ 10  Figure S7: Conceptual design of vertically stacked ion-gating transistors with multi-timescale responses. (a) Schematic illustration of a vertical organic electrochemical transistor (vOECT) composed of stacked channel materials with different ion diffusion coefficients Dion, enabling engineered temporal dynamics across layers. (b) Vertically integrated graphene-based architecture, where varying the channel area and using multilayer graphene allows integration of multiple timescales. Ion trapping in multilayers may further enhance tunability.  DSubstrateSDElectrolyteGHigh Dion ChannelLow Dion Channel…Mobile ionFastSlow DSubstrateSDElectrolyteGSmall ChannelLarge Channel…Mobile ionFastSlowLayered Graphenea b 11  Figure S8: Influence of channel geometry on relaxation time. (a)–(h) Optical microscope images of graphene channels and source/drain electrodes for individually fabricated devices. The channel width is fixed at 30 µm, and the channel lengths L are 5 µm, 10 µm, 20 µm, 50 µm, 100 µm, 200 µm, 500 µm, and 1000 µm, respectively. The white boxes in the images indicate a scale bar of 500 µm. (i) Channel resistance of graphene as a function of channel length. (j) Overview and (k) magnified view of the normalized drain current responses to pulsed gate voltages for each device. (l) Relaxation time as a function of channel length. 1.20.80.40.0Normalized drain current1.51.00.50.0-0.5Time /ms Channel length  5 µm 10 µm 20 µm 50 µm 100 µm 200 µm 500 µm 1000 µm 2.01.61.20.8VG /V2.01.61.20.8VG /VL :1000 µmL :500 µmL :200 µmL :100 µmL :50 µmL :20 µmL :10 µmL :5 µm1.20.80.40.0Normalized drain current40200Time /µs10-6246810-524Relaxation time /s2 4102 41002 410002Channel length /µm2520151050Chennel resistance /kΩ10008006004002000Channel length /µm1-1/ea b c de f g hij kl 12 Note S4: Detailed calculation method for information processing capacity Here, we describe the detailed procedure for calculating the information processing capacity (IPC). The IPC is evaluated based on the reservoir's ability to reconstruct a target signal ym, which is defined as follows, from a random input sequence u(k): 𝑦@(𝑘) =h𝑃A",$[𝑢(𝑘 − 𝑑)]B=/<  (S10) In this expression, m is the index representing a specific polynomial term; d is the delay; D is the maximum delay; and nm,d denotes the polynomial order associated with a given index and delay. The polynomial Pn' represents the n'-th order orthogonal polynomial generated via the Gram-Schmidt process, and is defined as: 𝑃A%[𝑢(𝑘 − 𝑑)] = 𝑢A%(𝑘 − 𝑑) −D𝑐*CA%D𝑃*[𝑢(𝑘 − 𝑑)]A10*/<  (S11) 𝑐*CA%D =∑ 𝑃*[𝑢(𝑘 − 𝑑)]𝑢A%(𝑘)E&'('F/0∑ 𝑃*[𝑢(𝑘 − 𝑑)]3E&'('F/0  (S12) with P0 = 1. For example, the first- and second-order polynomials are given by: 𝑃0[𝑢(𝑘 − 𝑑)] = 𝑢(𝑘 − 𝑑) − 𝑢n  (S13) 𝑃3[𝑢(𝑘 − 𝑑)] = 𝑢3(𝑘 − 𝑑) +𝑢n𝑢3nnn − 𝑢:nnn𝑢3nnn − 𝑢n3𝑢(𝑘 − 𝑑) +𝑢n𝑢:nnn − 𝑢3nnn3𝑢3nnn − 𝑢n3  (S14) where 𝑢A,nnnn = 0E&'('∑ 𝑢A%(𝑘)E&'('F/0  (n’=1,2,…). Higher-order polynomials such as P3 and beyond can be defined recursively using the above relation. Next, for each index m (corresponding to a specific combination of delay and polynomial order), we perform a regression task in which the reservoir is trained to reconstruct the target signal ym defined above. Since ym is a product of multiple polynomials of degree nm,d, the total degree n of the target function can be written as: 𝑛 =D 𝑛@,=B=/<  (S15) Let y(k) denote the output of the reservoir. The accuracy of this reconstruction task is evaluated based on the error between ym(k) and y(k), and is defined as the component-wise capacity Cm: 𝐶@ = 1 −∑ [𝑦@(𝑘) − 𝑦(𝑘)]3E&'('F/0∑ [𝑦@(𝑘)]3E&'('F/0  (S16) By performing the above reconstruction task for all index values m, the overall IPC can be obtained. In addition, by focusing only on the components with a specific polynomial degree n, the sum of Cm over the corresponding indices m(n) is defined as the degree-specific capacity Cn. The total IPC, Ctot,  13 is then given by the sum of Cm over all indices: 𝐶A = D 𝐶@@∈@(A)  (S17) 𝐶!"! =D𝐶@@  (S18) As an illustrative example, we describe the step-by-step calculation of the first-order capacity C1 under the optimal condition for the NARMA2 task (T = 100 µs). For n = 1, the target signal ym(k) is given by Eq. (S10) as: 𝑦@(𝑘) = 𝑃<[𝑢(𝑘 − 0)] × 𝑃<[𝑢(𝑘 − 1)] × 𝑃<[𝑢(𝑘 − 2)] × … × 𝑃<p𝑢q𝑘 − (𝑑 − 1)rs × 𝑃0[𝑢(𝑘 − 𝑑)] × 𝑃<p𝑢q𝑘 − (𝑑 + 1)rs × …× 𝑃<[𝑢(𝑘 − 𝐷)]  = 𝑃0[𝑢(𝑘 − 𝑑)] (S19) Using Eq. (S13), the final form of the target with n = 1 can be written as: 𝑦@(𝑘) = 𝑢(𝑘 − 𝑑) − 𝑢n  (S20) This reconstruction task essentially examines whether the reservoir can “recall” the value of the input sequence u(k-d) with a delay d. This corresponds closely to the widely known concept of memory capacity (MC) in the field of reservoir computing, though slight differences exist—such as the inclusion of d = 0 and the method used for evaluating task performance. Figures S9 (a) to (c) shows representative examples of the target ym(k) and the reservoir output y(k) at various delays d = 0, 7, 10. As can be observed, the reservoir accurately reconstructs the target at small delays, whereas the reconstruction deteriorates as d increases. This is because the task becomes more difficult as the reservoir is required to retrieve older information from the input history. The dependence of the task accuracy (i.e., component-wise capacity Cm) on delay d is summarized in Figure S9 (d). The observed decrease in Cm with increasing d represents the device’s “forgetting curve”, indicating the extent to which the reservoir can “linearly” retrieve past information. The first-order capacity C1, corresponding to n = 1, is then obtained by summing the partial capacities as follows: 𝐶0 = D 𝐶@@∈@(A/0)= D𝐶@(A/0)(𝑑)B=/<  (S21) To prevent underestimation of C1, the maximum delay D was set to a sufficiently large value (e.g., D = 40). Furthermore, to avoid overestimation of the IPC, the values of C1 shown in the main text were processed using a surrogate-based thresholding method, as described in the Methods section; specifically, any Cm below a predetermined threshold was set to zero.  14  Figure S9: First-order reconstruction task. Representative examples of the target P1[u(k-d)] and reservoir output y(k) for various delays (a)d=0, (b) d=7 and (c) d=10 in the first-order reconstruction task (n=1). The reservoir accurately reconstructs the target at short delays but exhibits degraded performance as the delay increases. (d) Delay dependence of the component-wise capacity Cm in the first-order task. The monotonic decay of Cm with increasing delay illustrates the forgetting curve of the reservoir and reflects its linear memory capability.   Next, we describe the procedure for calculating the second-order capacity C2, corresponding to n = 2. A representative example of a second-order target ym is given by: 𝑦@(𝑘) = 𝑃3[𝑢(𝑘 − 𝑑)] = 𝑢3(𝑘 − 𝑑) +𝑢n𝑢3nnn − 𝑢:nnn𝑢3nnn − 𝑢n3𝑢(𝑘 − 𝑑) +𝑢n𝑢:nnn − 𝑢3nnn3𝑢3nnn − 𝑢n3  (S22) As in the n = 1 case, a reconstruction task is conducted for each delay d, and the second-order capacity is computed based on the task accuracy. This task evaluates the reservoir's ability to reconstruct waveforms that are quadratic transformations of the input signal u(k-d). Figures S10 (a) to (c) shows examples of the target ym(k) and the reservoir output y(k) for d = 0, 6, 10. Figure S10 (d) also presents the dependence of the component-wise capacity Cm on delay d. Similar to the linear case shown in Figure S9 (d), Cm decreases as d increases, reflecting the increasing difficulty of the task. However, in this task, the reservoir must possess not only memory but also nonlinear transformation capability. As a result, the decay of Cm with increasing d is noticeably faster compared to the linear (first-order) case. This trend becomes more pronounced as the polynomial degree n increases, indicating that tasks requiring both memory and nonlinearity are inherently more challenging. 10-1Value10-1Value10-1Value250240230220210200Time step k1.00.80.60.40.20.0Component-wise capacity  Cm12840Delay dTarget: P1[u(k-d)] Output y(k)d = 2d = 7d = 10abcdabc 15  Figure S10: Second-order reconstruction task with single delay. Examples of target P2[u(k-d)] and output y(k) waveforms for the second-order reconstruction task for various delays (a) d=0, (b) d=6 and (c) d=10. The reservoir performance declines with increasing delay due to the added nonlinearity requirement. (d) Delay dependence of the component-wise capacity Cm in the Second-order task.   Another class of second-order targets ym is given by: 𝑦@(𝑘) = 𝑃0[𝑢(𝑘 − 𝑑0)] × 𝑃0[𝑢(𝑘 − 𝑑3)] = {𝑢(𝑘 − 𝑑0) − 𝑢n }{𝑢(𝑘 − 𝑑3) − 𝑢n }  (S23) In this case, the task involves reconstructing second-order targets generated by the product of two first-order polynomials, P1[u(k-d1)] and P1[u(k-d2)], each with different delays d1 and d2 (d1 < d2). Figure S11 summarizes the dependence of Cm on both d1 and d2. As in the previous case, Cm decreases as either delay increases, again reflecting the growing difficulty of reconstructing temporally distant nonlinear features. For n = 2, the second-order capacity C2 is calculated as the sum of component-wise capacities from the two types of tasks described above.  0.50.0-0.5Value0.50.0-0.5Value0.50.0-0.5Value250240230220210200Time step k1.00.80.60.40.20.0Component-wise capacity  Cm12840Delay dTarget: P2[u(k-d)] Output y(k)d = 2d = 6d = 10abcdabc 16  Figure S11: Second-order reconstruction task with two delays. Delay dependence of Cm in the second-order task based on ym(k)=P1[u(k-d1)]⋅P1[u(k-d2)] (d1<d2). As either d1 or d2 increases, the task becomes more difficult and capacity decreases, reflecting the joint challenge of memory and nonlinear transformation.   For higher-order capacities (n ≥ 3), the procedure is extended in a similar manner. For example, for n = 3, the IPC is evaluated using tasks that reconstruct targets defined by P3[u(k-d)], P2[u(k-d1)]×P1[u(k-d2)] (d1≠d2), and P1[u(k-d1)]×P1[u(k-d2)]×P1[u(k-d3)] (d1 < d2 < d3). By evaluating reconstruction performance for various combinations of polynomial orders and time delays, IPC provides a comprehensive measure of both the memory and nonlinear processing capability of a given reservoir. Ideally, the maximum nonlinearity order and maximum delay should be set to sufficiently large values to fully characterize the reservoir’s capability. However, increasing these values leads to a combinatorial explosion in the number of reconstruction tasks. Therefore, in this study, the maximum polynomial degree was limited to six, and the maximum delay D was set individually for each n, as summarized in Table S2. Under this condition, the total number of target indices M is 3.4×103. Table S3 shows representative IPC values (after surrogate-based thresholding) under selected conditions.  1 2 3 4 5 6 7 8 9 100123456789Delay d2Delay d1Cm 17 Table S2: Maximum delay D values used for each polynomial degree n in the IPC calculation. Larger n corresponds to smaller maximum delays to manage computational complexity while retaining sufficient evaluation coverage. Degree n Maximum delay D 1 40 2 30 3 20 4 15 5 15 6 11   Table S3: Representative values of information processing capacity after surrogate-based thresholding. Condition C1 C2 C3 C4 C5 C6 Ctot T=50 µs / din=1 8.24 19.4 23.1 16.5 12.2 9.48 88.9 T=50 µs / din=2 13.0 26.5 22.3 13.9 9.74 7.70 93.2    18 Note S5: Detailed calculation method for normalized mean squared error Here, we describe the detailed procedure for calculating the normalized mean squared error (NMSE). NMSE was computed over all discrete time steps k, based on the squared error between the reservoir output y(k) and the target signal yt(k). As defined below, NMSE is obtained by normalizing the mean squared error (MSE) with the variance of the target: NMSE =MSE𝜎3  (S24) MSE =1𝑇)J!JD [𝑦!(𝑘) − 𝑦(𝑘)]3E&'('F/0  (S25) Here, TData is the data length; s 2 denotes the variance of the target signal, and is defined as: 𝜎3 =1𝑇)J!JD [𝑦!(𝑘) − 𝑦!y ]3E&'('F/0  (S26) 𝑦!y =1𝑇)J!JD 𝑦!(𝑘)E&'('F/0  (S27) As an illustrative example, we show the NMSE computation in the test phase of the NARMA2 task under optimal conditions (T = 100 µs). Figure S12 displays both the target yt(k) and the reservoir output y(k) over the 800-step time series, with the temporal squared error [yt(k) – y(k)]2 plotted on the right vertical axis. The MSE is obtained by summing the squared error over all time steps (gray shaded area in the figure) and dividing by the number of steps (TData = 800). In this case, MSE was found to be 4.09×10-6. In this study, we adopted NMSE as the error metric to enable fair comparisons with other PRC systems. Unlike raw MSE, NMSE accounts for the variability of the target signal by normalizing with its variance (Eq. S24). As described in the main text, the resulting NMSE under these conditions was 7.35×10-3.   19  Fig S12: NMSE computation in the NARMA2 task. Target signal yt(k), reservoir output y(k), and squared error [yt(k) – y(k)]2 over 800 time steps. The shaded area corresponds to the temporal squared error used to compute MSE, which is then normalized by the target variance to obtain NMSE.  It is worth noting that in the field of PRC, the term “NMSE” is sometimes used to refer to two different normalization schemes. One is the definition adopted in this study (Eq. S24), while the other normalizes MSE by the mean squared value of the target signal rather than its variance. Therefore, when comparing NMSE values across different studies, it is essential to carefully verify the underlying definition. In this work, only NMSE values defined equivalently to Eq. S24 were used for performance comparisons with other PRC implementations.  1.00.80.60.40.20.0[ yt(k) – y(k) ]2  (×10-4)8006004002000Time step k0.350.300.250.200.150.10y t(k),  y(k) Target yt(k)      Prediction y(k)  Temporal squared error (right axis) 20  Figure S13: Gate voltage dynamic range selection for evaluating ambipolar behavior. Schematic illustration of how varying the minimum gate voltage Vin from 0.2 V to 1.8 V at a fixed base voltage Vb=1.8 V alters the dynamic range ΔV=∣Vb−Vin∣ of the gate pulse. This range effectively selects the operating region of the device's transfer characteristics, thereby enabling control over the contribution of ambipolar transport in information processing.    Figure S14: Influence of ambipolar transport on the reservoir’s mapping function. 3D plots of reservoir states as a function of current input u(k) and past state X(k-1). (a, b) Under ΔV=1.8 V, X5 and X6 exhibit distinct nonlinear surfaces, reflecting the contribution of ambipolar behavior. (c, d) Under ΔV=0.2 V, both states show linear, monotonic behavior, indicating suppressed ambipolarity. These results demonstrate that ambipolar transport enhances the diversity and nonlinearity of the system’s mapping functions. 1.41.21.00.80.6I D0 /mA1.51.00.50.0VG /V1.41.21.00.80.6I D0 /mA1.51.00.50.0VG /VΔV=1.8 VΔV=0.2 Va bu(k)0.50110X5(k-1)X5(k)u(k)0.50110X6(k-1)X6(k)u(k)0.50110X6(k-1)X6(k)u(k)0.50110X5(k-1)X5(k)a bc d 21  Figure S15: Dependence of IPC and effective reservoir size on the gate voltage dynamic range ΔV. Both the total information processing capacity and the effective reservoir size Neff (calculated via principal component analysis) increase with ΔV. In particular, higher-order nonlinear capacities (n ≥ 3) show strong dependence on ΔV, indicating that the ambipolar behavior of graphene significantly contributes to the realization of nonlinear transformations in reservoir computing.  806040200Ctot1.61.20.80.4∆V /V4035302520Neff Neff 6th order 5th order 4th order 3rd order 2nd order  1st order 22 Note S6: ESN Setup for the Mackey-Glass Prediction Task To evaluate the efficiency of the developed PRC, we compared it with a well-tuned Echo State Network (ESN) in the 10-step-ahead prediction task for time series generated by the Mackey-Glass (MG) equation, as shown in Figure 7g. This section describes the setup and optimization method of the ESN.  In this study, we adopted a standard ESN model7), where the state evolution is described by the following equation (S28):   𝒙(𝑘) = 𝒇[𝑾𝐫𝐞𝐜𝒙(𝑘 − 1) +𝑾𝐢𝐧𝒖(𝑘)]  (S28) Here, x is the reservoir state vector, f is the mapping function, Wrec is the recurrent weight matrix, Win is the input weight matrix, and u is the input. The mapping function f was set to the hyperbolic tangent. Wrec was defined as a random matrix with connection density d, and Win was defined as a random matrix scaled by a parameter ain (each element of Win ranges from -ain to +ain). We fixed the reservoir size N = 40 and used a grid search to determine the optimal parameters, including the spectral radius of Wrec, the connection density d, and the scaling factor ain of Win. To account for performance variation due to the randomness of the weight matrices, each condition was tested 500 times, and the average root mean square error (RMSE) for the 10 step-ahead prediction task was used as the performance score. From the grid search results, as shown in Figure S16, the optimal parameters were determined to be d=0.42, r=0.48, and ain=0.3.   In the computation load versus RMSE plot shown in Figure 7g, these parameters were fixed while varying N. For each N, the task was executed 500 times, and the average RMSE was plotted. The computational load in the ESN is quantified as the number of floating-point operations (FLOPs) required for inference, as shown in Equations S2 and S3. These represent the operations required for reservoir state evolution (equation S28) and the linear combination in the readout network (equation 1), respectively:   Reservoir layer ∶  FLOPs = 2𝑁3 + 2𝑁P𝑁 + 𝑁 − 𝑁P  (S29) Readout layer ∶  FLOPs = 2𝑁Q𝑁 − 𝑁Q + 1  (S30) Here, Ny and Nu denote the output and input dimensions, respectively. Equations S29 and S30 describe the FLOPs per discrete time step, and their sum represents the total FLOPs required for inference in the ESN.   23  Figure S16: Grid search for ESN optimization. Test error dependence on (a) ain, (b) r, and (c) d for the 10-step-ahead prediction task of the MG equation. The red arrows indicate the optimal conditions.   0.0123456780.1RMSE1.00.90.80.70.6Spectral radius r5 x 10-24321RMSE0.50.40.30.20.1Input scaling ain0.0124680.124RMSE0.50.40.30.20.1Connection densitiy dr = 0.48d = 0.42ain = 0.3d = 0.42ain = 0.3r = 0.48ab c 24   Figure S17: Raman spectrum of the graphene channel. The intensity ratio of the G band to the 2D band confirms that the graphene channel is monolayer8).     Figure S18: Surrogate method for IPC calculation. (a) Distribution of surrogate capacities Csur,m under conditions T=100 µs and din=2. The threshold Ath was set as 1.5 times the maximum surrogate capacity. (b) Component-wise capacities Cm for all indices under the same conditions. The red line indicates the threshold Ath obtained in (a). (c) Cm values for all indices after setting values below the threshold to zero. The IPC values presented in the main text were filtered using this procedure across all conditions to prevent overestimation9).   Intensity2800240020001600Wavenumber /cm-1G band2D band0.200.150.100.050.00Csur,m3210Index m (×104)1.00.80.60.40.20.0Cm3210Index m (×104)1.00.80.60.40.20.0Cm3210Index m (×104)a b cAth = 1.5×max(Csur,m) 25  Figure S19: Calculation method of effective reservoir size Neff using PCA. The plot shows the number of principal components p and the cumulative contribution ratio rc(p) under conditions T=50 µs and din=2. The inset expands the region near rc(p)=100%, where the smallest p that exceeds the threshold rth is defined as Neff.   Supplementary references 1) Arihori, K., Ogawa, M., Souma, S., Sato-Iwanaga, J. and Suzuki, M.-A., “Transient performance analysis of graphene FET gated via ionic solid by numerical simulations based on tight-binding method and Nernst--Planck--Poisson equations,” J. Appl. Phys. 130, 084302 (2021). 2) Gerischer, H., McIntyre, R., Scherson, D. & Storck, W. Density of the electronic states of graphite: derivation from differential capacitance measurements. J. Phys. Chem. 91, 1930–1935 (1987). 3) Kornyshev, A. A. Double-layer in ionic liquids: paradigm change? J. Phys. Chem. B 111, 5545–5557 (2007). 4) Lockett, V., Sedev, R., Ralston, J., Horne, M. & Rodopoulos, T. Differential capacitance of the electrical double layer in imidazolium-based ionic liquids: influence of potential, cation size, and temperature. J. Phys. Chem. C. 112, 7486–7495 (2008). 5) Yamaguchi, T. et al. Low-temperature transport properties of holes introduced by ionic liquid gating in hydrogen-terminated diamond surfaces. J. Phys. Soc. Jpn. 82, 074718 (2013). 6) Uesugi, E., Goto, H., Eguchi, R. et al. Electric double-layer capacitance between an ionic liquid and few-layer graphene. Sci. Rep. 3, 1595 (2013). 7) Lukoševičius, M., & Jaeger, H. Reservoir computing approaches to recurrent neural network training. Comput. Sci. Rev. 3, 127-149 (2009). 8) Ferrari, A. C., et al. Raman spectrum of graphene and graphene layers. Phys. Rev. Lett. 97, 187401 (2006). 9) Tsunegi, S., et al. Information Processing Capacity of Spintronic Oscillator. Adv. Intell. Syst. 5 2300175 (2023). 1.00.90.80.70.60.5Cumulative contribution ratio r c(p)1002 4 6 81012 4 6 8102Number of principal components p1.00000.99990.9998r c(p)52484440prthNeff=45