<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>npj (Nature Partner Journals) Quantum Information (2020).
URL: https://www.nature.com/articles/s41534</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.1007/978-3-540-44838-9_53</article-id>
      <title-group>
        <article-title>Machine Learning-Aided Optimal Control of a Qubit Subjected to External Noise</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Riccardo Cantone</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Shreyasi Mukherjee</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Luigi Giannelli</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Elisabetta Paladino</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giuseppe A. Falci</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>CNR-IMM</institution>
          ,
          <addr-line>Via S. Sofia 64, 95123, Catania</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Dipartimento di Fisica e Astronomia “Ettore Majorana”, Università di Catania</institution>
          ,
          <addr-line>Via S. Sofia 64, 95123 Catania</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Istituto Nazionale di Fisica Nucleare, Sezione di Catania</institution>
          ,
          <addr-line>95123, Catania</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>1</volume>
      <abstract>
        <p>We apply a machine-learning-enhanced greybox framework to a quantum optimal control protocol for open quantum systems. Combining a whitebox physical model with a neural-network blackbox trained on synthetic data, the method captures non-Markovian noise efects and achieves gate fidelities above 90% under Random Telegraph and Ornstein-Uhlenbeck noise. Critical issues of the approach are discussed. This work showcases an attention-based machine-learning-enhanced greybox framework for quantum optimal control, designed to improve the manipulation of open quantum systems subject to complex noise [1, 2]. Quantum control [3] is essential for quantum technologies such as computation, communication, and sensing, yet achieving robust control remains challenging when the system interacts with non-Markovian environments that are hard or even impossible to characterize. The proposed greybox model combines: • a whitebox component that captures the analytically tractable portion of the system dynamics using known physical principles; • a blackbox component implemented via neural networks, trained to learn the unmodelled efects of the environment from data.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>2. System and Model</title>
      <p>We consider a single qubit subject to classical dephasing noise along the -axis and driven by external
control fields. In the interaction picture, the dynamics is described by the time-dependent Hamiltonian
 () = ctrl() +  ()  ,
where  is the coupling strength between the qubit and the noise, and  () is a classical stochastic
process modeling dephasing noise [5]. Specifically, in this work we consider  () to be either a Random
Telegraph Noise (RTN) process or an Ornstein-Uhlenbeck (OU) process.</p>
      <p>The control Hamiltonian ctrl() implements a drive along the  and -axes</p>
      <p>ctrl() = ()  + () ,
each control field  (), with  ∈ {, }, consisting of 5 Gaussian-shaped pulses.</p>
      <p>Two diferent stochastic processes were considered and compared, namely an RTN process and an
OU process. They are characterised by their power spectrum (), which is the Fourier transform of
the two-point correlation function ⟨ () ()⟩, and in both cases has a Lorentzian shape [4, 6]
() ≈</p>
      <p>4
4 2 + 2
(1)
where  is the switching rate for the RTN process and 1/ is the correlation time of the OU process.
The two stochastic processes difer since the latter is Gaussian, while the former is not, and it is known
that this has an impact on dynamic protocols of protection against noise, such as spin-echo [4, 7].</p>
    </sec>
    <sec id="sec-3">
      <title>3. The Machine Learning Model</title>
      <p>The proposed greybox model integrates analytical knowledge of the quantum system with a
transformerbased neural network. This hybrid architecture includes two components:
• A whitebox part, which enforces the known unitary dynamics of the driven qubit and the associated
measurement process;
• A blackbox neural network, trained to model the influence of the environment on the system’s
evolution.</p>
      <p>Inputs and Outputs The model takes as input the amplitudes of five Gaussian control pulses applied
along each of the  and  axes, for a total of ten real parameters. The pulse widths and positions are
ifxed. The output consists of six gate fidelities, each associated with a diferent target from a universal
set of single-qubit gates.</p>
      <p>Model Architecture The blackbox core is a lightweight transformer encoder. It processes the input
pulse parameters and predicts a set of noise-related parameters that are fed into whitebox layers
implementing:
• Hamiltonian construction and time evolution based on discretized control fields;
• Expectation value calculation over a tomographically complete set of initial states;
• Process matrix reconstruction and fidelity estimation.</p>
      <p>In addition, the model includes specialized output heads that refine the predicted expectation values
before computing the final fidelities.</p>
      <p>Training Strategy Only the blackbox layers contain trainable parameters. The network is trained
using the Adam optimizer to minimize the mean squared error across the six predicted fidelities. Training
is supervised and based on synthetic data generated by simulating noisy quantum dynamics. Whitebox
constraints ensure physically consistent predictions throughout.</p>
      <p>A schematic overview of the architecture is shown in Fig. 1.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Results and open problems</title>
      <p>Separate models were trained across varying values of the coupling strength . This provides insight
on the efectiveness of the graybox approach as a function of the Markovianity of the quantum map
describing time evolution under the efect of noise, which in this case is parametrized by the ratio
/ [8, 4].</p>
      <p>RTN Case. The model showed low training and test MSE across all gates, with prediction errors
increasing with  but remaining in the 10− 2–10− 3 range, indicating robust generalisation. As an
emulator in the optimal control pipeline, it enabled the design of control pulses achieving fidelities
above 99% for the lowest  and above 90% for the highest, with minor gate-dependent variations.
OU Case. The model exhibited similar performance, with low and stable MSE values across all ,
confirming robustness to diferent noise types. Optimal control results mirrored those of the RTN case,
with fidelities exceeding 99% at low  and remaining above 90% even at stronger coupling. While fidelity
declines under higher noise, the model continues to support efective pulse design; future improvements
may benefit from larger datasets or more advanced strategies.</p>
      <p>Our result validates the graybox approach, showing that the optimization framework we have chosen
is very efective in suppressing efects of low-frequency noise ( / &gt; 1), but less efective for noise
yielding Markovian maps (/ &lt; 1). Apparently, Gaussianity does not have an impact in this case, but
we expect that the picture may change when considering 1/ noise [9, 4, 6, 10] resulting from a set of
stochastic processes with diferent  .</p>
      <p>A natural development of this work is applying the method to two-qubit gates, addressing the efect of
both time- and space-correlated noise [11, 12, 13]. Two major issues to be investigated are the scalability
of the approach to larger quantum architectures and the ability to reproduce asymptotic results known
from the theory of dynamical decoupling [14, 4, 12].</p>
    </sec>
    <sec id="sec-5">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used ChatGPT in order to: perform grammar and
spelling checks, paraphrase and reword. After using this tool, the authors reviewed and edited the
content as needed and take full responsibility for the content of the publication.</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>