<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Quantum Contextual Information Access and Retrieval</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Emanuele Di Buccio</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giorgio Di Nunzio</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Massimo Melucci</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nicola Orio?</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Padua, Department of Information Engineering</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper illustrates the research work being in progress at the University of Padua within the VII FP Marie Curie International Research Sta Exchange Scheme project QONTEXT, which shows a new vision of information access and retrieval based on Quantum Theory. In particular, this paper describes the basics of QT, the use of the formalism for relevance feedback, music retrieval and visual cluster-based data mining.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Notion Classical Quantum
Event space Hilbert vector space H
Random event Set Orthonormal basis fjBi; jBig</p>
      <p>Probability Measure Set measure State vector j'i
Table 1. The correspondence between classical probability and quantum probability
$"
!"
!
#"</p>
      <p>P( A) =
6
15
!
!
!</p>
      <p>A</p>
      <p>B
B
"
In the classical probabilistic model, events (e.g., word occurrences, category
memberships, relevance, location, task, genre) are represented as sets and the
probability measure is based on a set measure, e.g., set cardinality. In contrast,
in quantum probability, events are represented as orthonormal vectors and the
probability measure is the trace of the product between a density matrix and
the matrix representing an event as summarized in Table 1. The simple example
in Figure 1 depicts that when vectors are used to implement both events and
densities the probability in the vector space is the squared inner product between
the vectors, that is, the squared size of the projection of jAi onto j'i.
3</p>
    </sec>
    <sec id="sec-2">
      <title>Relevance Feedback</title>
      <p>Diverse are the sources of evidence can be adopted, e.g. the behavior of the user
when interacting with the results or judgments explicitly provided by the user
for Relevance Feedback (RF) purpose. The combination of diverse sources of
evidence is complex because of the feature heterogeneity (e.g. term frequency
for the documents content and display-time for the user interaction behavior)
and the lack of knowledge of the factors a ecting relevance assessment. QT can
cope with this complexity because it provides a uniform formalism to model the
diverse factors. In RF, a document can be represented as a density matrix. The
factors may be extracted from the training documents using local co-occurrence
data of terms to obtain a term correlation matrix, thus applying SVD to the
matrix and obtaining a vector basis. Thus, the probability of relevance can be
computed by the trace function. Factors can also model user location or di erent
behavioral patterns extracted from the behavior of the user when interacting
with the results. The main issue is to obtain a vector basis starting from the
evidence gathered from each source of evidence and the mechanism for the basis
computation should be able to unveil the most meaningful factors from the data.
The main advantage is, from the one hand, the uniform modeling of diverse
sources of evidence and, on the other hand, the greater generality of quantum
probability than that of classical probability.</p>
      <p>Our aim within QONTEXT is to employ QT for modeling RF within a
complex and heterogenous feature space.
4</p>
    </sec>
    <sec id="sec-3">
      <title>Music Retrieval and Processing</title>
      <p>Music Information Retrieval (MIR) is an emerging research area that focus on
providing new access methodologies and interaction paradigms to very large
music collections. The wide availability of portable music players, paired by the
increasing amount of digital music available to the end user, makes music access
particularly related to context: music can be used as a background at work and
study, as a distinctive for social groups, as a way to promote personal's image
(especially within young generations). Moreover, it is generally acknowledge that
neither textual metadata nor content-based descriptors alone can completely
describe the music content and the user information need.</p>
      <p>
        For these reasons, the application of QT to music access and retrieval is
particularly promising. On the one hand, it will allow to model in a uni ed
framework di erent sources of evidence, related both to the social role of music
{ i.e., genre, usage, user provided tags { and to the pure acoustic features { i.e.,
melody, rhythm, timbre. Some initial e orts to represent this complex
characteristics has been presented in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. On the other hand, music (and video) o ers
new ways to measure the context of interaction and the implicit user feedback.
Besides the classical implicit feedback evidences, in the case of music access users
can replay or skip part of the whole song they like in a particular moment, can
adjust the volume, can interrupt other activities or take information about the
song or the artist.
      </p>
      <p>Our goal within QONTEXT is to investigate a methodology for measuring
the relevance of music items to a particular interaction context by merging
different sources of evidence.
5</p>
    </sec>
    <sec id="sec-4">
      <title>Data Mining and Visualization</title>
      <p>Data Mining is a wide area of research which involves di erent tasks such as,
for example, clustering, categorization, and regression. The main idea behind all
these di erent facets of data mining is the extraction of useful information from
data, information which is often implicit or previously unknown. The question
is how to model data and how to integrate the outcome of the analyses with
visualization components which may help researchers to validate their models.</p>
      <p>
        During the last decade, the research area of Quantum Clustering (QC) has
given a signi cant contribution in terms of \non-classical" approaches with
efcient clustering algorithms which take advantage of the Quantum Mechanics
principles. The work presented by [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] shows how to nd the center of the clusters
of the dataset calculating the minimum of the potential function V (x) (where
x is a data point) which is based on the de nition the Schrodinger equation. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]
discuss how to speed up a selection of \classical" clustering algorithms by
quantizing some of their parts, and they also suggest that the same paradigm could
also be applied to other problems such as dimensionality reduction and training
a classi er. Whereas the above cited contributions concentrates on the
computational issues of clustering, our research will focus on IAR issues of clustering
because there are no results in this area to our knowledge.
      </p>
      <p>Therefore, within QONTEXT, the aim is to explore a more user oriented
version of QC. In particular, we want to study the interaction between the contexts
of the search process and the clusters, and how the selection of one (or more)
cluster by the user can generate a new spaces by means of visual representations
of the clusters.
6</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>The research leading to these results has received funding from the European
Union Seventh Framework Programme (FP7/2007-2013) under grant agreement
N. 247590.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. van Rijsbergen,
          <string-name>
            <surname>C.J.:</surname>
          </string-name>
          <article-title>The geometry of information retrieval</article-title>
          . Cambridge University Press, UK (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Song</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lalmas</surname>
          </string-name>
          , M.,
          <string-name>
            <surname>van Rijsbergen</surname>
            ,
            <given-names>C.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frommholz</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Piwowarski</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Zhang,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Zuccon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Arafat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.B.S.</given-names>
            ,
            <surname>Azzopardi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Di Buccio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>HuertasRosero</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Hou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            ,
            <surname>Melucci</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          , Ruger, S.:
          <article-title>How quantum theory is developing the eld of information retrieval</article-title>
          .
          <source>In: QI'10: Proceedings of the 4th Quantum Interaction Symposium</source>
          . (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , Zhang,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <surname>X.</surname>
          </string-name>
          :
          <article-title>Music recommendation by uni ed hypergraph: combining social media information and music content</article-title>
          .
          <source>In: Proceedings of ACM Multimedia</source>
          .
          <article-title>(</article-title>
          <year>2010</year>
          )
          <volume>391</volume>
          {
          <fpage>400</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Horn</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gottlieb</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Algorithm for data clustering in pattern recognition problems based on quantum mechanics</article-title>
          .
          <source>Phys. Rev. Lett</source>
          .
          <volume>88</volume>
          (
          <year>2001</year>
          )
          <fpage>018702</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. Ameur, E.,
          <string-name>
            <surname>Brassard</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gambs</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Quantum clustering algorithms</article-title>
          .
          <source>In: Proceedings of the 24th international conference on Machine learning. ICML '07</source>
          , New York, NY, USA, ACM (
          <year>2007</year>
          ) 1{
          <fpage>8</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Dahabiah</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Puentes</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Solaiman</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Possibilistic similarity estimation and visualization</article-title>
          .
          <source>In: Proceedings of the 2nd International Conference on Theory of Information Retrieval: Advances in Information Retrieval Theory. ICTIR '09</source>
          , Berlin, Heidelberg, Springer-Verlag (
          <year>2009</year>
          )
          <volume>273</volume>
          {
          <fpage>280</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Weinstein</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horn</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Dynamic quantum clustering: A method for visual exploration of structures in data</article-title>
          .
          <source>Phys. Rev. E</source>
          <volume>80</volume>
          (
          <year>2009</year>
          )
          <fpage>066117</fpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>