<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>SISO: A Conceptual Model-based Method for Variant Interpretation Systematization</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>MireiaCosta</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>AlbertoGarcía S</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ana Leon</string-name>
          <email>aleon@vrain.upv.e</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>OscarPasto</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Variant Interpretation, Conceptual Modeling, SISO Method</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>PROS Research Group, VRAIN Research Institute - Universitat Politècnica de València</institution>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Topics</institution>
          ,
          <addr-line>Posters and Demos</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Variant interpretation is the process by which clinical experts determine if a DNA variant has a significant impact on a patient's health. Current practices in variant interpretation sufer from a lack of traceability and reproducibility due to the chaotic nature of genomic data and the imprecision of existing variant interpretation guidelines. These issues pose substantial barriers to the routine clinical application of variant interpretation. This paper introduces SISO, a conceptual model-based method designed to translate the inherent imprecision of variant interpretation into a concise and well-defined set of steps. The practical utility of the SISO method is demonstrated through a use case involving a variant identified in a patient with suspected familial breast-ovarian cancer syndrome. The SISO method lays the foundations for variant interpretation systematization by guiding decision-making and ensuring reproducibility. As a result, variant interpretation will be a more reliable and consistent process in clinical practice.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR
ceur-ws.org</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>Our DNA sequence is regarded as one of our most distinguishing characteristics as individuals.
We humans share more than 99% our DNA sequence. However, these small diferences in our
DNA contribute to natural human diversity and influence susceptibility to certain diseases or
variations in response to typical treatments. These diferences are called DNA v1a].riants [
Given their significance to human health, a key objective of medicine is to understand how
DNA variants impact an individual’s health, a process known as variant interpretation.</p>
      <p>
        Despite its importance, variant interpretation sufers from several issues that have yet to be
resolved. The interpretation process involves weighing data about variants, such as the variant’s
frequency among the population, whether it has previously been linked to a disease, etc. This
data is scattered across thousands of data sources with unique and contradictory content as well
as diferent terminology2[]. This data chaos makes the data used for interpretation dificult
to trace and potentially causes the same data to be interpreted diferently by diferent experts
nEvelop-O
CEUR
Workshop
Proceedings
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Additionally, variant interpretation typically follows specialized guidelines that provide
a series of recommendations that guide the interpretation by determining whether or not a
variant meets specific criteria. However, despite their intentions, these guidelines are often
criticized for providing vague definitions that result in subjectivity in their interpretation and
inconsistency in their applicati4o,n5,[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>These factors lead to a lack of clarity regarding both the criteria each expert uses for
interpretation and the evidence they rely on. As a result, variant interpretation sufers from a complete
lack of traceability and reproducibility, posing a significant barrier to its application in routine
clinical practic7e].[</p>
      <p>Transforming variant interpretation from an abstract process into a well-defined, repeatable,
and reliable procedure is a challenge that requires both breaking down the process into its
fundamental elements (i.e., unpacking) and organizing these elements into a coherent, eficient,
and standardized framework (i.e., systematization). Unpacking is vital for disentangling the
intricate details of variant interpretation by clarifying aspects with implicit or ambiguous
definitions and defining a common framework for representation. Conversely, systematization
is critical to making the variant interpretation process explicit, guiding decision-making, and
ensuring reproducibility.</p>
      <p>Previous works have explored the unpacking of variant interpretation through a meta-model
called VarClaMM8][, which represents all relevant elements involved in the interpretation
process, as detailed in Secti2o. nBuilding upon this foundation, the objective of this work is to
take the first steps towards variant interpretation systematization. We present SISO, a method
conceptually grounded in VarClaMM, that aims to translate the inherent imprecision of the
interpretation process into a concise and well-defined set of steps. The primary aim of this
research is to present a novel approach that applies the principles of conceptual modeling to the
domain of variant interpretation, a non-traditional application area. The proposed SISO method
ofers a robust framework for addressing the complexities inherent in variant interpretation.
Preliminary results from the application of this method in a real use case indicate its potential
to enable more standardized, reproducible, and reliable interpretations in clinical practice.</p>
      <p>The remaining is organized as follows: Sect2iopnrovides a summary of the VarClaMM
model. Section3 introduces the SISO method. Secti4odnemonstrates the utility of the method
through a practical use case. Finally, Sec5tcioncludes the paper.</p>
    </sec>
    <sec id="sec-3">
      <title>2. Background: The VarClaMM Meta-model</title>
      <p>The VarClaMM meta-model (Figur1)erepresents the most important elements of the variant
interpretation process: i) the variant itself (depicted in green), ii) the constructs that conform to
the variant interpretation guidelines (depicted in pink), iii) the results of evaluating a guideline
over a variant to obtain its interpretation (depicted in orange), and iv) the variant related data
required to perform the interpretation (depicted in blue).</p>
      <p>Below, we provide a high-level description of VarClaMM, focusing on the most important
elements for understanding the logic behind the SISO method. A more in depth description can
be found in 8[].</p>
      <p>DataModel
-name : string
-url : string</p>
      <p>*
DataElementInDataSource
-path_in_data_source : string *
--nuDaralmt:aesSt:orisuntrgrcineg 1 stored_in_a * -data_value : string
defined_in_a DataElement 1
represents_a 1 --dneasmcerip:tsiotrnin:gstring
-data_type : string
-value_constraints : string[]</p>
      <p>evaluates_a_type_of</p>
      <sec id="sec-3-1">
        <title>Guideline</title>
        <p>Each variant interpretation guideline is representedGuinidtehleine class, which is
characterized by itsapplicability, which refers to the specific disease, gene, or variant type context in Class4
which the guideline is applicable. An example is the ACMG-AMP 2015 guidel9in],ewsh[ich
only apply to mendelian diseases.</p>
      </sec>
      <sec id="sec-3-2">
        <title>Criteria and Metrics</title>
        <p>Each Guideline evaluates diferentCriterion to obtain the classification of a variant. For
instance the ACMG-AMP 2015 guidelines defines 28 criteria, one of which is theCPrViSt1erion.
This criterion evaluatetshiefvariant is null and if it is in a gene where null variants are to cause
disease. Guidelines consider criteria of one of two tByopoelse:anCriterion, which evaluate
to true or false, aSncdoreCriterion, whose evaluation returns a numerical value.</p>
        <p>Both types of criteria define specific conditions whose fulfillment determines whether the
criterion is met. In this model, we call these condiMtieotnrsics. Consider the example
of the PVS1 criterion mentioned above. In its definition, this criterion establis hPoweeresdByVisualParoadigmwCommunityEdition
tw
elldiferentiated, independent conditions: i) the variant must be null, and ii) the gene afected
must cause disease through null variants. Our model represents these two conditions as metrics
associated with the PVS1 criterion.</p>
      </sec>
      <sec id="sec-3-3">
        <title>Variant related data</title>
        <p>To determine whether the condition establishedMbeytaric is fulfilled, we need to evaluate
data about the variant under study. Each kind of data to be evaluated (e.g., an allele frequency)
is represented in thDeataElement class. VarClaMM forces the difereDnattaElements to be
structured according tDoaataModel in order to have shared and standard definition of the
data used.</p>
        <p>The value of aDataElement for a given variant comes from exterDnaatlaSources. This
concrete value is represented inDtahteaElementInDataSource class. For example, the
concept of allele frequency (e.Dga.taaElement) takes the value 1.647e-05 in ExADCataSource
and 1.36e-06 in the GnomADDataSource for the variant FANCI:c.669+1G&gt;T.</p>
      </sec>
      <sec id="sec-3-4">
        <title>Evaluation results</title>
        <p>The Guideline, Criterion, andMetric constructs are evaluated over a speVcairficiant to
obtain its interpretation. The interpretation of a variant based on aGpuairdteilcinuleairs
represented in thIneterpretationResult class. This interpretation is calculated by applying a
set ofrules over eachCriterionResult. EachCriterionResult is calculated by applying the
pass_rule (which takes the valuAeND when all the metrics must be fulfilled aOnRdotherwise)
over the results of each criteriMonet’sric.</p>
        <p>Finally, eachMetricResult is calculated by evaluating alDlatthaeElementInDataSource
for the variant corresponding toDtahteaElement evaluated by thMe etric. The model
also represents the results of evaluating each indDivaitdauEallementInDataSource in the
MetricDataEvaluationResult class.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>3. SISO Method</title>
      <p>The SISO method builds on the unpacking achieved with VarClaMM and establishes the
foundation for systematizing variant interpretation by transforming it into a series of concrete
and well-defined steps. Our method guides the use of the VarClaMM meta-model to define an
interpretation framework that can be consistently applied to any given variant. Consequently,
SISO supports users in applying variant interpretation guidelines, ensuring that accurate and
reproducible results are obtained. This is accomplished through four stages, which are detailed
below.</p>
      <sec id="sec-4-1">
        <title>1. (S)elect an interpretation guideline.</title>
        <p>In this step, the expert must select the most suitable guideline for the interpretation. Here it is
crucial to consider the context in which the interpretation is being performed, as interpretation
guidelines have specific applicabilities (see Sect2io).nThere are three key aspects to consider
when selecting a guideline:
• Variant type: Certain guidelines are tailored to specific types of variants. For instance,
the ACMG-ClinGen 2019 guidelines addrecospsy number variants, while the ACMG 2020
guidelines focus on variants locatmeditionchondrial DNA.</p>
        <p>InterpretationResult
- / interpretation : string
-date : Date
classifies_a</p>
        <p>DataSource
-name : string
-url : string
1</p>
        <p>DataModel DataElement
-name : string defined_in_a -name : string
-url : string -description : string
represents_a 1 --vdaaltuae__tycpoens:trsatrinintgs : string[]
stored_in_a DataElemen*tInDataSource
* -path_in_data_source : string
-data_value : string</p>
        <p>*
is_d*ata_about_a</p>
        <p>1
-nameV:asritarinntg MetricResult
-altGenome : string - / metric_result : boolean
* --aalatTArltan:sst:risntgring *</p>
        <p>&lt;&lt;enumeration&gt;&gt;</p>
        <p>RuleValues
&lt;&lt;datatype&gt;&gt;
-cClalasssisfiicfiactai otinon: Rsturlieng AORND &lt;&lt;Cdoantadtiytipoen&gt;&gt;
•-paGtteernn e:sltroincgation: Some guidelines consider unique aspects o-cfonvdaitiorni_aonpetrastiwoni:tshtriinng specific genes.
-condition_value : string
Examples include the ClinGen Guidelines for the FBN1 gene and the CanVIG guidelines
for the CHEK2 gene.
• Disease relevance: Certain diseases have distinct characteristics that must be taken into Class4
Powered ByVisual Paradigm Community Edition
account when determining if a variant is pathogenic. Examples of such guidelines are the
ACGS guidelines for rare diseases, the Ambry Genetics guidelines for autosomal dominant
and X-linked diseases, and the ACMG-AMP 2015 guidelines for Mendelian diseases.</p>
        <p>It could be the case that no existing guideline covers the specific needs of the interpretation
context. In that case, the method supports the creation of a new guideline. In both cases, the
Guideline must be completely characterized following the VarClaMM meta-model. The
selection of the guideline will completely condition the results of the interpretation and, consequently,
is vital for ensuring the quality and reliability of the interpretation results.</p>
      </sec>
      <sec id="sec-4-2">
        <title>2. (I)dentify the criteria and define the metrics of the guideline.</title>
        <p>Once the interpretation guideline has been selected, the next step involves identifying the
criteria and the metrics needed for its evaluation. This would be straightforward if the guidelines
provided precise definitions. However, as stated in Sec1t,iotnhis is often not the case.</p>
        <p>For instance, consider the PVS1 criterion of the ACMG-AMP 2015 guidelines mentioned
above. For this criterion, the metrics are clear: i) the variant must be null, and ii) the gene
afected must cause disease through null variants. Now, let’s examine the BS1 criterion of the
same guidelines, which assesses whethtehre variant’s allele frequency is greater than expected for
that specific disease . In this case, the definition makes the frequency cut-of entirely dependent
on the expert performing the interpreta5t]i.oAns[a result, one expert might consider a cut-of
of 0.5%, while another, stricter expert might set a cut-of of 1%. Consequently, even though both
experts claim to apply the same criterion, they are not truly evaluating the same thing.</p>
        <p>On the other hand, there are cases where certain criteria from the selected guideline cannot
be applied due to a lack of resources. For instance, the PS3 criterion of the ACMG-AMP
2015 guidelines evaluates whether thereinarveitro or in vivo functional studies supporting
damaging efects . However, only 36% of clinical experts are able to obtain this kind of evidence,
as functional studies require significant monetary and time invest1m0e].nCtson[sequently,
claiming that a certain guideline is used not always implies that all its criteria are applied.</p>
        <p>This highlights that merely stating the guideline used is insuficient to fully understand how
a variant has been interpreted. To properly define the interpretation process, it is fundamental
to specify which criteria of the guideline are actually applied, how each criterion is evaluated
through the definition of metrics, and how these metrics are combined to obtain a criterion’s
result. This can be achieved thanks to the composition relationships bGeutiwdeeelnine and
Criterion andCriterion andMetric defined in the VarClaMM meta-model.</p>
      </sec>
      <sec id="sec-4-3">
        <title>3. (S)elect the required data elements and data sources.</title>
        <p>Once the criteria and their respective metrics are defined, the next step is to identify the specific
data (e.g., thDeataElement) that each identified metric needs to evaluate. For example, within
the PVS1 criterion, the mettrhiecvariant must be null evaluates the consequence of the variant,
whereas the metritche gene afected must cause disease through null variants evaluates the gene’s
disease mechanism. In the context of the BS1 criterion, the selected metric will evaluate the
variant’s frequency. According to VarClaMM, the representationDoafttahEelseements is
dependent on the choseDnataModel.</p>
        <p>With theDataElements identified, the next step is to determine which data sources provide
the required data. The primary challenge here is that this knowledge is scattered across
thousands of data sources. In a recent study, Costa1e1t] paelr.f[ormed a comparative analysis
of several genomic data sources and concluded that none of them provide complete data about
variants and that, in some cases, the data they provide is not concordant.</p>
        <p>
          For example, consider the PM2 criterion of the ACMG-AMP 2015 guidelines, which states
thatthe variant must be absent from control populations. In practice, this criterion is evaluated
by a metric that checks whether the variant has an allele frequency of 0 in population databases.
Two of the most important population databases are E1x2]AaCn[d GnomAD [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. For the
specific case of MITF:c.1A&gt;G, the variant is absent in ExAC, thus meeting the PM2 criterion.
However, in the GnomAD database, this variant has a frequency of 7e-07, meaning the PM2
criterion would not be met. This example illustrates the discrepancies that can arise when
using diferent DataSources, underscoring the importance of carefully selecting and
crossreferencing genomic data sources to ensure traceability of the interpretation results, as even
with the same criteria and metrics the results may difer depending oDnatahSeources used.
        </p>
      </sec>
      <sec id="sec-4-4">
        <title>4. (O)btain the classification results.</title>
        <p>The initial three steps of the SISO method have enabled us to develop a comprehensive framework
for variant interpretation. This framework thoroughly details the guidelines, criteria, and
metrics to be evaluated, the necessary data for evaluation, and the sources from which this data
can be obtained. Such detailed documentation will significantly enhance the traceability and
reproducibility of the interpretation process.</p>
        <p>The final step is to ”execute” this framework to interpret a given variant. To do this, we first
need to obtain the specific value of eaDcahtaElement for the variant in question (e.g., the
DataElementInDataSource). Next, we calculate all MtheetricDataEvaluationResults.
From these results, we can determine the outcomes of each metric and criterion, ultimately
leading to the finaCllassificationResult of the analyzed variant.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>4. Use case</title>
      <p>Here we present a practical application of the SISO method to illustrate its utility in
realworld scenarios. Specifically, we employed the SISO method to guide the interpretation of the
c.191G&gt;A variant in the BRCA1 gene. This variant has been identified in a Chilean patient
suspected of having a familial breast-ovarian cancer syndrome. Below, a description of how
each stage has been performed is provided.</p>
      <sec id="sec-5-1">
        <title>1. Select an interpretation guideline.</title>
        <p>As detailed in Sectio3n,selecting the optimal interpretation guideline requires considering the
variant type, the gene, and the disease under study. The variant under study is of type deletion,
which means it removes a portion of the gene where is located, in this case, the BRCA1 gene. To
our knowledge, there is no interpretation guidelines focusing exclusively on deletion variants.
However, the ClinGen institution has developed a guideline that focus on BRCA1 variants
associated with familial breast-ovarian cancer synd14r]o.mCeo[nsequently, we selected this
guideline as the most appropriate for our use case.</p>
      </sec>
      <sec id="sec-5-2">
        <title>2. Identify the criteria and define the metrics of the guideline.</title>
        <p>The selected guideline comprises B4o0oleanCriterion that must be evaluated to obtain the
InterpretationResult of the variant. Detailing with all 40 criteria is beyond the scope of this
paper. Thus, we focus on a single criterion (i.e., PM2). It is important to underscore that while
we focus on this single criterion, the method has been applied in its entirety to calculate the
InterpretationResult.</p>
        <p>
          The PM2 criterion evaluatetshiefvariant is absent from controls in an outbred population,
from gnomAD v2.1 (non-cancer, exome only subset) and gnomAD v3.1 (non-cancer). Region around
the variant must have an average read depth &gt; 25. From the description of this criterion, two
metrics can be identified: i) the variant must be absent from the population, and ii) the average
read depth must be greater than 25. In this guideline, even though its not common practice, the
criterion also specifies the data sources to be used: the non-cancer exome dataset of gnomAD
2.1 and the non-cancer dataset of gnomAD 3.1. Therefore, in this definition it is crucial to
distinguish betweeFrneqtuehnecy d:DeafintaiEtleimoenntof the metrics and the Rsepaedc_dieficptdha:DtaataEsloemuerntces in which this
condition dnmaemsucersip=ttiFobrneeq=ueFervnecaqyuleuncaytofeadv.ariant in a population dneasmcerip=tiroenad=_Ndeupmtbher of times the variant is read during sequencing
Figure3 idlaltua_stytprea=tfleoast the instantiation of the dCaltai_ntyGpee=ningtuideline, specifically highlighting the PM2
value_constraints = [
          <xref ref-type="bibr" rid="ref1">0,1</xref>
          ]
criterion and its associated metrics,VaarsDdaMefin:DeadtaMboydetl he VarClaMM meta-model. It is important
to note that the evaluation of the vnaamrei=aVnatrD’asMabsence in the population will be determined by
evaluating whether the variant’s frequency is equal to zero.
        </p>
        <p>Class PoweredByVisualParadigmCommunityEdition</p>
      </sec>
      <sec id="sec-5-3">
        <title>3. Select the required data elements and data sources.</title>
        <p>The first task in this step is to determine whDicahtaElement each metric will evaluate. As
specified in Section3, the selection oDfataElement depends on theDataModel chosen as the
framework for data representation. For our purposes, we have selected theDVaatraDMaoMdel,
which has been specifically designed for variant interpretation data represe1n5t].ation [</p>
        <p>The first metric evaluates the frequency of a variant in a population, while the second metric
assesses read_dept1h. Both concepts are represented as attributes of a clasAslclaelllee-d
Frequency in the VarDaM model. Figur4eillustrates the instantiation ofDaeatacEhlement
within the context of VarDaM.</p>
        <p>Once we have identified the requireDdataElements, the next step is to determine the
DataSources from which to obtain them. In this example, the task is straightforward because
the criterion definition specifies the necessary data sources: the non-cancer exome dataset of
gnomAD 2.1 and the non-cancer dataset of gnomAD 3.1.</p>
      </sec>
      <sec id="sec-5-4">
        <title>4. Obtain the classification results.</title>
        <p>The previous steps have allowed us to define the interpretation framework. Now, we need to
execute it for the selected variant. To do so, first, we need to collect the values of the selected
1The number of times a specific variant in the DNA is read during the sequencing process.</p>
        <p>DataElements from the specifiedDataSources (e.g., obtain thDeataElementInDataSource).
For the c.191G&gt;A variant, we found a frequency of 0 (i.e., it is absent) and a mean read_depth
of 31 in the non-cancer dataset of gnomAD 3.1. Similarly, the variant has an approximate
read_depth of 40 and a frequency of 0 in the non-cancer exome dataset of gnomAD 2.1.</p>
        <p>Consequently, as the conditions established by the two metrics are meDtaftoarEalel-l
mentsInDataSource, allMetricDataEvaluationResult are sCelatss to truePow.eredBByVaisuaslPearaddigmCoommnunityEditiohnis, we
t
can determine thMeetricResult and theCriterionResult. The MetricResult for both
defined metrics is true, as alMletricDataEvaluationResult are true. Therefore, since both
MetricResult are true, thCeriterionResult is also true.</p>
        <p>Finally, thIenterpretationResult is derived from all tCheriterionResult. After applying
all criteria and metrics defined by the ClinGen guideline —not just the one displayed here— the
variant is classified aPsathogenic. This indicates that the variant is responsible for the familial
breast-ovarian cancer syndrome exhibited by the patient. The final model instantiation for this
variant is shown in Figu5r.e</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>5. Conclusions</title>
      <p>In conclusion, the SISO method has set the basis for variant interpretation systematization
by providing a structured framework grounded in the VarClaMM meta-model. This method
addresses the inherent imprecision and variability in current variant interpretation practices
by defining clear steps for carrying out variant interpretation. By applying this method to the
c.191G&gt;A variant in the BRCA1 gene, we have demonstrated its practical utility and the ability
to achieve consistent and reproducible results. This case study underscores the potential of
the SISO method to enhance the accuracy and reliability of variant interpretation in clinical</p>
      <p>ClinGen_Guideline : Guideline PM2 : BooleanCriterion
title = ClinGen Variant Interpretation Guidelines for BRCA1 name = PM2
author(s) = ClinGen - ENIGMA BRCA1 and BRCA2 VCEP description = Absent from controls in an outbred population
applicability = BRCA1 gene, familial breast-ovarian cancer pass_rule = AND
settings. Powered ByVisual Paradigm Community Edition</p>
      <p>Future work will focus on developing technological support for each stage of the SISO
method. This advancement will enable the automated application of each step, achieving full
systematization of variant interpretation. By automating these processes, we aim to enhance
the precision and eficiency of variant interpretation, ultimately contributing to the broader goal
of advancing medicine. This will ensure that patients receive the most accurate and efective
clinical care tailored to their unique genetic characteristics.</p>
      <p>Class</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>This work was supported by the Generalitat Valenciana through the CoMoDiD project (CIPROM/2021/023)
and a pre-doctoral Grant (ACIF/2021/117), the Spanish State Research Agency through the SREC
(PID2021-123824OB-I00) project, and the European Union’s Horizon Europe research and
innovation programme through the BETTER project (No. 101136262)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>National</given-names>
            <surname>Cancer</surname>
          </string-name>
          <string-name>
            <surname>Institute</surname>
          </string-name>
          , Variant,
          <year>2024</year>
          . hUtRtLp:s://www.cancer.gov/publications/ dictionaries/genetics-dictionary/def/va.riant
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>D.</given-names>
            <surname>Rigden</surname>
          </string-name>
          , et al.,
          <article-title>The 2023 nucleic acids research database issue and the online molecular biology database collection</article-title>
          ,
          <source>Nucleic Acids Research</source>
          <volume>51</volume>
          (
          <year>2023</year>
          )
          <fpage>D1</fpage>
          -
          <lpage>D8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Furqan</surname>
          </string-name>
          , et al.,
          <article-title>Care in specialized centers and data sharing increase agreement in hypertrophic cardiomyopathy genetic test interpretation</article-title>
          ,
          <source>Circulation: Cardiovascular Genetics</source>
          <volume>10</volume>
          (
          <year>2017</year>
          )
          <article-title>e001700</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Lebo</surname>
          </string-name>
          , et al.,
          <article-title>Data sharing as a national quality improvement program: reporting on BRCA1 and BRCA2 variant-interpretation comparisons through the Canadian Open Genetics Repository (COGR)</article-title>
          ,
          <source>Genetics in Medicine</source>
          <volume>20</volume>
          (
          <year>2018</year>
          )
          <fpage>294</fpage>
          -
          <lpage>302</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Y.-E.</given-names>
            <surname>Kim</surname>
          </string-name>
          , et al.,
          <article-title>Challenges and considerations in sequence variant interpretation for mendelian disorders</article-title>
          ,
          <source>Annals of Laboratory Medicine</source>
          <volume>39</volume>
          (
          <year>2019</year>
          )
          <fpage>421</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Harrison</surname>
          </string-name>
          , et al.,
          <article-title>Clinical laboratories collaborate to resolve diferences in variant interpretations submitted to ClinVar, Genetics in Medicine 19 (</article-title>
          <year>2017</year>
          )
          <fpage>1096</fpage>
          -
          <lpage>1104</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>N.</given-names>
            <surname>Agaoglu</surname>
          </string-name>
          , et al.,
          <article-title>Consistency of variant interpretations among bioinformaticians and clinical geneticists in hereditary cancer panels</article-title>
          ,
          <source>European Journal of Human Genetics</source>
          <volume>30</volume>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Costa</surname>
          </string-name>
          , et al.,
          <article-title>A reference meta-model to understand DNA variant interpretation guidelines</article-title>
          , in: J. P. A.
          <string-name>
            <surname>Almeida</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Borbinha</surname>
            , G. Guizzardi,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Link</surname>
          </string-name>
          , J. Zdravkovic (Eds.),
          <source>Conceptual Modeling - 42nd International Conference, ER 2023</source>
          , Lisbon, Portugal, November 6-
          <issue>9</issue>
          ,
          <year>2023</year>
          , Proceedings, volume
          <volume>14320</volume>
          oLfecture Notes in Computer Science, Springer,
          <year>2023</year>
          , pp.
          <fpage>375</fpage>
          -
          <lpage>393</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>031</fpage>
          -47262-6\_
          <fpage>20</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S.</given-names>
            <surname>Richards</surname>
          </string-name>
          , et al.,
          <article-title>Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the american college of medical genetics and genomics and the association for molecular pathology</article-title>
          ,
          <source>Genetics in medicine 17</source>
          (
          <year>2015</year>
          )
          <fpage>405</fpage>
          -
          <lpage>423</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>E.</given-names>
            <surname>Zirkelbach</surname>
          </string-name>
          , et al.,
          <article-title>Managing variant interpretation discrepancies in hereditary cancer: Clinical practice, concerns, and desired resources</article-title>
          ,
          <source>Journal of Genetic Counseling</source>
          <volume>27</volume>
          (
          <year>2018</year>
          )
          <fpage>761</fpage>
          -
          <lpage>769</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Costa</surname>
          </string-name>
          , et al.,
          <article-title>The consequences of data dispersion in genomics: a comparative analysis of data sources for precision medicine</article-title>
          ,
          <source>BMC Medical Informatics and Decision Making</source>
          <volume>23</volume>
          (
          <year>2023</year>
          )
          <fpage>256</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>K.</given-names>
            <surname>Karczewski</surname>
          </string-name>
          , et al.,
          <article-title>The exac browser: Displaying reference data information from over 60 000 exomes</article-title>
          ,
          <source>Nucleic acids research</source>
          <volume>45</volume>
          (
          <year>2016</year>
          ).
          <year>d1o0i</year>
          .: 1093/nar/gkw971.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>K.</given-names>
            <surname>Karczewski</surname>
          </string-name>
          , et al.,
          <article-title>The mutational constraint spectrum quantified from variation in 141,456 humans</article-title>
          ,
          <source>Nature</source>
          <volume>581</volume>
          (
          <year>2020</year>
          )
          <fpage>434</fpage>
          -
          <lpage>443</lpage>
          .
          <year>do1i0</year>
          :.
          <volume>1038</volume>
          /s41586-020-2308-7.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>ClinGen - ENIGMA BRCA1 and BRCA2</surname>
            <given-names>VCEP</given-names>
          </string-name>
          ,
          <article-title>Clingen enigma brca1 and brca2 expert panel specifications to the acmg/amp variant interpretation guidelines for brca1 version 1</article-title>
          .1.
          <issue>0</issue>
          ,
          <year>2024</year>
          . URL: https://cspec.genome.network/cspec/ui/svi/doc/GN.092
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>M.</given-names>
            <surname>Costa</surname>
          </string-name>
          , et al.,
          <article-title>Comprehensive representation of variation interpretation data via conceptual modeling</article-title>
          ,
          <year>2023</year>
          , pp.
          <fpage>25</fpage>
          -
          <lpage>34</lpage>
          .
          <year>do1i0</year>
          : .
          <volume>1007</volume>
          /978-3-
          <fpage>031</fpage>
          -47112-
          <issue>4</issue>
          _
          <fpage>3</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>