<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Review of Methods of Resolution Estimation for 3D Reconstructions of Nanoscale Biological Objects from Experiments Data on Super-Bright X-Ray Free Electron Lasers (XFELs)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ikonnikov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>National Research Centre "Kurchatov Institute"</institution>
          ,
          <addr-line>1 Akademika Kurchatova pl., Moscow, 123182</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Nowadays the Fourier shell correlation (FSC) is the most common method for estimating the resolution of 3D structures obtained in Single Particle Imaging (SPI) experiments on X-ray free electron lasers (XFELs). In FSC, the resolution is defined as the spatial frequency at which the correlation between two independently reconstructed structures is equal to some given threshold value. There are multiple methods to define the threshold value. In addition, this approach cannot account for the fact that the quality of reconstruction can be non-uniform for different areas of the biomolecule. Thus, the issue of effective resolution estimation methods remains open. This paper considers multiple alternative approaches to the resolution estimation from adjacent scientific field cryogenic electron microscopy (cryo-EM) and analyzes the applicability of these approaches to the resolution estimation in SPI experiments on XFELs.</p>
      </abstract>
      <kwd-group>
        <kwd>X-ray Free Electron Laser</kwd>
        <kwd>Single Particle Imaging</kwd>
        <kwd>Space Resolution</kwd>
        <kwd>Fourier Shell Correlation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The viral worldwide pandemic caused by SARS-CoV-2 has become a serious
challenge for the entire scientific community, and the search of effective treatment
methods and drugs against COVID-19 is ongoing. Determination of the high-resolution 3D
structure of single viruses’ particles is one of the key and important points for
understanding how viral infection occurs and how we can fight it. Cryogenic electron
microscopy (cryo-EM), which has recently been able to obtain a true view of the atomic
resolution of a biomolecule (1.2 Å) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], has consolidated its position as the leading
method for imaging biomolecular particles. However, in cryo-EM, the samples are
plunge-frozen down to −269 °C and so they are imaged at unphysiological conditions,
which prevents the study of biomolecules in their natural state and limits the ability to
track conformational changes and dynamic events (for example, how the initial event
of cellular recognition occurs between the viral spike (S) protein and the ACE2
receptor [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]).
      </p>
      <p>Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons
License Attribution 4.0 International (CC BY 4.0).</p>
      <p>
        With the invention of super-bright X-ray free electron lasers (e.g. Linac Coherent
Light Source (LCLS) and European XFEL)) the Single Particle Imaging (SPI)
approach allowed researchers to reconstruct 3D structures from many 2D diffraction
images produced in the experiments by X-rays scattered on the single particle
exposed in different orientations [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Thus, SPI experiments opened new opportunities
to study biomolecules in their nature state without previous crystallization or being
frozen. Unfortunately, there are still many challenging problems in SPI experiments
(weak signal, scattered on single particle, low number of diffraction images), which
limit the quality of the obtained 3D structures. Nevertheless, in order to assess the
experimental quality and confidence for the interpretation of the obtained 3D
structures and to compare results with other structural biology methods, we need to use the
resolution estimation.
      </p>
      <p>
        Nowadays, the standard method for estimating resolution of the obtained 3D
structures both in cryo-EM and SPI experiments is the Fourier shell correlation (FSC)
method [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In the FSC method, the resolution is defined as the spatial frequency at
which the correlation between two independently reconstructed structures becomes
equal to some given threshold value. There are several criteria to choose threshold
value for the resolution estimation, the most popular of which are fixed thresholds of
0.5 and 0.143 [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] (they rely on statistical assumptions on SNR [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]) and also 1/2-bit
threshold [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] (based on informational entropy estimations [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]). Even though the FSC
is widely accepted by the scientific community, a discussion continues about a
threshold value at which the resolution should be defined [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. For a more detailed
description of the FSC method see [
        <xref ref-type="bibr" rid="ref4 ref5 ref6 ref7">4-7</xref>
        ].
      </p>
      <p>
        As an alternative method to select the FSC threshold value, Beckers and Sachse
[
        <xref ref-type="bibr" rid="ref8 ref9">8,9</xref>
        ] have suggested a new adaptive thresholding procedure for identifying the
highest resolution shell based on statistical methods of permutation sampling and false
discovery rate (FDR) control. Permutation sampling of the FSC for each resolution
shell is as follows: firstly, new samples are generated by changing the order of the
Fourier coefficients of the second half-map shell and a large series of FSCs are
computed [
        <xref ref-type="bibr" rid="ref8 ref9">8,9</xref>
        ]. Hence a sample of the noise distribution of the FSC for each resolution
shell is obtained. When applied to every resolution shell, the distributions together
with the original FSC-values can then be statistically tested and conveniently
transformed into p-values [
        <xref ref-type="bibr" rid="ref8 ref9">8,9</xref>
        ]. In order to reduce the risk of false positive errors, p-values
are then corrected by means of FDR control and thresholded at 1% [
        <xref ref-type="bibr" rid="ref8 ref9">8,9</xref>
        ]. The authors
demonstrated [
        <xref ref-type="bibr" rid="ref8 ref9">8,9</xref>
        ], that this method (named FDR-FSC) gives realistic resolution
estimates that are similar to most author-reported resolutions in the Electron
Microscopy Data Bank (EMDB) [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. However, the main advantage of this approach is that
it makes no assumption about the statistical properties of the signal and noise within
the half-maps, and it does not rely on any FSC threshold "criterion".
      </p>
      <p>
        The main drawback of the FSC method is that it estimates only the global
resolution for the whole structure. However, the electron density usually has uneven
resolution over the entire volume: to restore the structure, SPI needs to average the
diffraction images from a large number of individual biomolecules, thus the more individual
biomolecules differ in structure the stronger the heterogeneity of the reconstructed 3D
structure [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Thus, for a correct interpretation of quality of the reconstruction, it is
important to be able to determine the local resolution for each voxel of volume.
Currently, cryo-EM has proposed several approaches to determine local resolution [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
The first approach to determining the local resolution was blocres [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], where the
resolution is locally estimated by means of the FSC, calculated from two independent
reconstructions within a moving window. The most-used method to date for the local
resolution estimation is ResMap [14]. This approach determines the local resolution
by detecting the best 3D sinusoidal wave that fits each map point above the noise
level. MonoRes [15] is based on a similar principle of detecting energy at different
frequencies above noise. MonoRes has been recently expanded to account for
directionality (now named MonoDir) [16]. An important consequence of this work is the
introduction into the field of the concept that resolution is simultaneously local and
directional. The DeepRes method [17], based on deep learning from filtered atomic
models at different frequencies, has also recently been introduced [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. For a more
detailed overview of all local resolution methods, see [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Analysis workflow</title>
      <p>
        The aim of the present study was to verify which of the currently available alternative
approaches for estimating resolution can be successfully applied to evaluate
reconstructions in SPI experiments. To estimate the local resolution, we chose the Resmap
method (as the most popular method for the local resolution estimation in cryo-EM),
and for the global resolution estimation, we opted for the FDR-FSC method. This
work is founded on several major steps. In order to evaluate the accuracy of the
resolution estimation methods, we need to test the method performance on reconstructions
of different quality and resolution. As follows from [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], the resolution value depends
on the amount of diffraction images in the dataset. Also, it is important to understand
how noise affects the reconstruction result and resolution values. Thus, first we
simulated the single particle diffraction experiments with different levels of the Gaussian
noise and different number of diffraction images in dataset for structure of
hemocyanin of the marine mollusk fissurellia (Keyhole limpet hemocyanin type 1 - KLH1)
protein from PDB database [18,19]. For this purpose, we generated one pack of
datasets with different numbers of diffraction images (n = 200, 1000, 10000 and 20000)
without noise. Then, we generated another pack of datasets with different values of
noise (σ = 0, 0.5, 0.8, 0.9, 1.0) with 20000 images in each dataset. Then, we used the
workflow for SPI experiments data processing, which was described in detail in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
Finally, we estimated the global resolution with the FDR-FSC method and the local
resolution with the ResMap method for the obtained reconstructions and compared
these results with the FSC estimation.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <sec id="sec-3-1">
        <title>FDR-FSC method</title>
        <p>0,2
0,4
0,6
0,8
1</p>
        <p>1,2</p>
        <p>
          Added white noise, σ
Comparison of the 0.143 cutoff threshold values with the FDR-FSC values
demonstrated a good agreement between both estimations, but the FDR-FSC method shows
a slightly more optimistic estimation. It is worth noting that this result is consistent
with the results obtained for cryo-EM [
          <xref ref-type="bibr" rid="ref8 ref9">8,9</xref>
          ], which proves the universality of the
FDR-FSC approach. One main advantage of the FDR-FSC is that inference of
statistically significant signal in the resolution shells only requires the distribution of random
noise correlations determined by permutation. Thus, this method avoids the
consideration of complicated correlations between signal and noise [
          <xref ref-type="bibr" rid="ref4 ref5 ref6 ref7">4-7</xref>
          ] – one of the most
controversial issues that arise in determining any threshold "criterion"[
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. Thus, the
FDR-FSC method has a good chance to become a new "gold standard" for estimating
resolution in SPI.
3.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>ResMap Method</title>
        <p>It can be seen that the resolution over the entire structure of the biomolecule is, in
fact, not uniform, and as the parameters for reconstruction deteriorate (increased noise
or a decrease in the number of diffraction images), this unevenness only increases.
Additionally, we can observe that local resolution decreases near the edges of the
biomolecule, which may be due to errors of reconstruction algorithms [14].
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>
        In order to evaluate the quality of the experiment and reliably interpret the results of
reconstructing the spatial structure from diffraction data in SPI, it is important to have
effective methods for the resolution estimation of these structures. In this research, we
have demonstrated that the ResMap and FDR-FSC methods can be used to estimate
resolution in SPI experiments and show reasonable results for the model data of
KLH1 particle. However, more research is needed in this field: one should test more
particles with various spatial features as well as data from real experiments. Future
work implies testing other local resolution estimation methods [
        <xref ref-type="bibr" rid="ref12 ref13">12,13,15-17</xref>
        ] and
comparison of the obtained results with the results for ResMap.
5
      </p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This research was supported by the Helmholtz Association’s Initiative and
Networking Fund and the Russian Science Foundation (Project No. 18-41-06001). This work
was carried out using computing resources of the federal collective usage center
Complex for Simulation and Data Processing for Mega-Science Facilities at the NRC
“Kurchatov Institute”, http://ckp.nrcki.ru/.
14. Kucukelbir, A., Sigworth, F.J., Tagare, H.D.: Quantifying the local resolution of cryo-EM
density maps. Nat. Methods 11, 63–65 (2014).
15. Vilas, J.L., Gómez-Blanco, J., Conesa, P., et al.: MonoRes: Automatic and accurate
estimation of local resolution for electron microscopy maps. Structure 26, 337–344 (2018).
16. Vilas, J.L., Tagare, H.D., Vargas, J. et al.: Measuring local-directional resolution and local
anisotropy in cryo-EM maps. Nat Commun 11, 55 (2020).
17. Ramírez-Aportela, E., Mota, J., Conesa, P., Carazo, J. M., Sorzano, C.: DeepRes: a new
deep-learning- and aspect-based local resolution method for electron-microscopy maps.</p>
      <p>IUCrJ, 6(6), 1054–1063 (2019).
18. Gatsogiannis C, Markl J.: Keyhole limpet hemocyanin: 9-A CryoEM structure and
molecular model of the KLH1 didecamer reveal the interfaces and intricate topology of the 160
functional units. Journal of Molecular Biology 385(3), 963-983(2009).
19. The Protein Data Bank. https://www.rcsb.org/structure/4BED, last accessed 2020/07/12.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Nakane</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kotecha</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sente</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , et al:
          <article-title>Single-particle cryo-EM at atomic resolution. bioRxiv: the preprint server for biology (</article-title>
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Melero</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sorzano</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Foster</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , et al:
          <article-title>Continuous flexibility analysis of SARS-CoV-2 Spike prefusion structures. bioRxiv: the preprint server for biology (</article-title>
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Ikonnikova</surname>
            ,
            <given-names>K.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teslyuk</surname>
            ,
            <given-names>A.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bobkov</surname>
            ,
            <given-names>S.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zolotarev</surname>
            ,
            <given-names>S.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ilyin</surname>
            ,
            <given-names>V.A.</given-names>
          </string-name>
          :
          <article-title>Reconstruction of 3D structure for nanoscale biological objects from experiments data on super-bright X-ray free electron lasers (XFELs): dependence of the 3D resolution on the experiment parameters</article-title>
          ,
          <source>Procedia Computer Science</source>
          <volume>156</volume>
          ,
          <fpage>49</fpage>
          -
          <lpage>58</lpage>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Van</given-names>
            <surname>Heel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Schatz</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          :
          <article-title>Fourier shell correlation threshold criteria</article-title>
          .
          <source>Journal of Structural Biology</source>
          <volume>151</volume>
          (
          <issue>3</issue>
          ),
          <fpage>250</fpage>
          -
          <lpage>262</lpage>
          (
          <year>2005</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Rosenthal</surname>
            ,
            <given-names>P.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Henderson</surname>
          </string-name>
          , R.:
          <article-title>Optimal determination of particle orientation, absolute hand, and contrast loss in single-particle electron cryomicroscopy</article-title>
          .
          <source>Journal of molecular biology 333(4)</source>
          ,
          <fpage>721</fpage>
          -
          <lpage>745</lpage>
          (
          <year>2003</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Van</given-names>
            <surname>Heel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Schatz</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          :
          <article-title>Reassessing the revolutions resolutions. bioRxiv: the preprint server for biology (</article-title>
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Sorzano</surname>
          </string-name>
          , CO,
          <string-name>
            <surname>Vargas</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Otón</surname>
            <given-names>J</given-names>
          </string-name>
          , et al.:
          <article-title>A review of resolution measures and related aspects in 3D Electron Microscopy</article-title>
          .
          <source>Progress in Biophysics and Molecular Biology</source>
          <volume>124</volume>
          ,
          <fpage>1</fpage>
          -
          <lpage>30</lpage>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Beckers</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Statistical Inference of cryo-EM Maps</article-title>
          .
          <source>European Molecular Biology Laboratory (EMBL)</source>
          , Heidelberg (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Beckers</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sachse</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Permutation testing of Fourier shell correlation for resolution estimation of cryo-EM maps</article-title>
          .
          <source>Journal of Structural Biology</source>
          (
          <year>2020</year>
          ), doi: https://doi.org/10.1016/j.jsb.
          <year>2020</year>
          .107579
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <article-title>Protein Data Bank in Europe</article-title>
          . https://www.ebi.ac.uk/pdbe/emdb/,
          <source>last accessed</source>
          <year>2020</year>
          /07/12
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Mandl</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Östlin</surname>
          </string-name>
          , Ch.,
          <string-name>
            <surname>Dawod</surname>
            ,
            <given-names>I. E.</given-names>
          </string-name>
          , et al:
          <article-title>Structural Heterogeneity in Single Particle Imaging Using X-ray Lasers</article-title>
          .
          <source>J. Phys. Chem</source>
          . Lett.
          <volume>11</volume>
          ,
          <fpage>6077</fpage>
          -
          <lpage>6083</lpage>
          (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Vilas</surname>
            ,
            <given-names>J.L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heymann</surname>
            <given-names>J.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tagare</surname>
          </string-name>
          , H.D,
          <string-name>
            <surname>Ramirez-Aportela</surname>
            ,
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carazo</surname>
            <given-names>JM</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sorzano</surname>
            <given-names>COS</given-names>
          </string-name>
          :
          <article-title>Local resolution estimates of cryo-EM reconstructions</article-title>
          ,
          <source>Current Opinion in Structural Biology</source>
          <volume>64</volume>
          ,
          <fpage>74</fpage>
          -
          <lpage>78</lpage>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Cardone</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heymann</surname>
            ,
            <given-names>J. B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Steven</surname>
            ,
            <given-names>A. C.</given-names>
          </string-name>
          :
          <article-title>One number does not fit all: mapping local variations in resolution in cryo-EM reconstructions</article-title>
          .
          <source>Journal of structural biology</source>
          ,
          <volume>184</volume>
          (
          <issue>2</issue>
          ),
          <fpage>226</fpage>
          -
          <lpage>236</lpage>
          (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>