=Paper=
{{Paper
|id=None
|storemode=property
|title=Augmented Collaborative Spaces for Collective Sense Making: The Dicode Approach
|pdfUrl=https://ceur-ws.org/Vol-743/ASTC2011_Paper2.pdf
|volume=Vol-743
}}
==Augmented Collaborative Spaces for Collective Sense Making: The Dicode Approach==
<pdf width="1500px">https://ceur-ws.org/Vol-743/ASTC2011_Paper2.pdf</pdf>
<pre>
       Augmented Collaborative Spaces for Collective Sense
               Making: The Dicode Approach

      Ahmad Ammari1, Vania Dimitrova1, Lydia Lau1, Manolis Tzagarakis2, and Nikos
                                  Karacapilidis2
                        1
                        School of Computing University of Leeds, Leeds, UK.
                 2
                  Research Academic Computer Technology Institute, Patras, Greece.
             { A.Ammari, V.G.Dimitrova, L.M.S.Lau, }@leeds.ac.uk, tzagara@upatras.gr,
                                    nikos@mech.upatras.gr.


          Abstract. Sense making is at the heart of cognitively complex and data
          intensive decision making processes. It is often conducted in collective spaces
          through exchange of ideas, discussions, analysing situations, and exploring
          alternatives. This position paper proposes a novel approach to facilitate
          collective sense making via a collaboration platform which (a) offers multiple
          views to collaboration (including forums, mind maps, and argumentation
          structure), and (b) provides intelligent support to understand sense making
          behaviour by employing user and community modelling techniques. The work
          is conducted in the framework of the EU funded Dicode project, developing
          intelligent services for data-intensive collaboration and decision making.
          Keywords: Collective sense making, Collaborative workspaces, Intelligent
          support, User and community modelling


1      Introduction

This paper proposes a novel platform to augment the synergy between human and
machine intelligence in complex decision making situations. Many collaborative
decision making problems have to be solved through dialoguing and argumentation
among a group of people [1, 2]. In such contexts, discussions for making sense of the
issues, constraints, and options are usually conducted in an unstructured manner.
Sense making is a “motivated, continuous effort to understand connections (which
can be among people, places, and events) in order to anticipate their trajectories and
act effectively” [3]. Therefore, sense making is an inevitable path in cognitively
complex and data intensive decision making processes.
   Dicode1 (Data-intensive collaboration and decision making), an EU Framework 7
project, sets out to tackle the above challenges for three use cases. The first use case
concerns a team of scientists in clinico-genomic research. The second use case
involves a group of radiographers, radiologists and clinicians in a trial of rheumatoid
arthritis treatment. The third use case involves public opinion monitoring on the
internet for a team of brand consultants to design a campaign.

1
    Dicode website is http://dicode-project.eu/
4      A. Ammari et al.

   Argumentation, as seen in Dicode, is a common activity in collective sense making
process. It is valuable in shaping a common understanding of the problem and can
provide the means to decide which parts of the information brought up by the decision
makers will finally be the input to the solution used. Argumentation may also
stimulate the participation of decision makers and encourage constructive criticism.
However, discovering the connections is mainly by using tacit knowledge and the
value of this activity has been largely unacknowledged. Dicode aims to address the
above by user-friendly multi-view collaboration workspaces, which facilitate the
exchange and sharing of ideas, opinions, comments and resources between
participants. While each collaborative workspace enables an individual or a team to
visualise the connections between concepts and artefacts, keeping track of the
rationale behind the decision points and redeploying the accumulated knowledge in
new situations is itself potentially a cognitively complex process. Hence, intelligent
support will be provided by exploiting the behaviour data captured in the usage logs
and by adding semantics to the content shared.
   This position paper outlines a multi-faceted approach to combine human and
machine intelligence for collective sense making. Specifically, we will present a novel
approach to design collaborative workspaces that facilitate sense making by
combining multiple views – ranging from informal (unstructured) to formal
(structured). Each view facilitates different sense making aspects. Furthermore, we
present a proposal how collaborative workspaces can be augmented with intelligent
support utilising adaptation techniques, namely user and community modelling.


2       The Dicode Project

The goal of the Dicode project is to facilitate and augment collaboration and decision
making in data-intensive and cognitively-complex settings. It will exploit and build
on the most prominent high-performance computing paradigms and large data
processing technologies - such as cloud computing, MapReduce [4], Hadoop2,
Mahout3, and column databases – to meaningfully search, analyze and aggregate data
existing in diverse, extremely large, and rapidly evolving sources. Building on current
advancements, the solution foreseen in the Dicode project will bring together the
reasoning capabilities of both the machine and the humans. It can be viewed as an
innovative workbench incorporating and orchestrating a set of interoperable services
that reduce the data-intensiveness and complexity overload at critical decision points
to a manageable level, thus permitting stakeholders to be more productive and
concentrate on creative activities. Services to be developed are: (i) scalable data
mining services (including services for text mining and opinion mining), (ii)
collaboration support services, and (iii) decision making support services.
   In this paper, the focus is on the collaboration support services which are realised
via multi-view collaborative workspaces augmented with intelligent support for
collective sense making.


2
    Apache Hadoop Project http://hadoop.apache.org/
3
    Apache Mahout Project http://mahout.apache.org/
    Augmented Collaborative Spaces for Collective Sense Making: The Dicode Approach        5


3     Multi-View Collaborative Workspace

In Dicode, three different views of collaboration workspaces (CW) are supported.
These are summarised below:
• Discussion-forum view: In this view, the CW is displayed as a traditional web-
   based forum, where posts are displayed in an ascending chronological order. Users
   are able to post new messages to the collaboration workspace, which appear at the
   end of the list of messages. Posts may also have attachments to enable the
   uploading of files. Discussion-forum exhibits a very low level of formality and are
   mainly suitable to support ideas sharing, exchange and collection.
• Mind-map view: In this view, the CW is displayed as a mind map where users can
   interact with the items on the collaboration workspace. This view deploys a spatial
   metaphor permitting the easy movement and arrangement of items on the
   collaboration workspace (Fig. 1). Messages posted on the collaboration workspace
   in mind-map view can be one of the following types: idea, comment, note and
   generic. Files of any content type (e.g. pdf, jpg) can be uploaded to the CW. The
   set of available types can be configured and participating users will be able to
   define new ones. The mind-map view also provides a set of mechanisms through
   which: (a) items on the collaboration workspace can be related, and (b) new
   abstractions can be created. In particular, creation of relationships between items is
   facilitated by drawing directed arrows between items on the collaboration
   workspace. Visual cues can be used to convey semantics (e.g. red colour can
   indicate opposition, while green can indicate “in favour”; labels can be associated
   to arrows elucidating semantic relationships). Items on the CW can be aggregated,
   to allow a group of items to be treated as a single entity, and transformed into a
   single item creating new, composite items. The mind-map view aims at supporting
   sense-making during data intensive and cognitive complex tasks.


Fig. 1: Mind-map view of a collaboration workspace. Explicit relations can be created between
      collaboration items (arrows) or juxtaposed to express implicit/transient relationship.

• Formal/Argumentation view: The formal/argumentation view of the CW permits
  only a limited set of discourse moves for a limited set of message types whose
  semantics is predefined and fixed. Formal views of the collaboration workspaces
6     A. Ammari et al.

    exhibit a high level of formality. In particular, the formal view (Fig. 2) enables the
    posting of messages which can be of type issue (to indicate the decisions to be
    made) alternative (to represent potential solutions to the issues discussed) or
    position (to comment on alternatives or on other positions). Positions either support
    or are against alternatives and positions and their relationship are explicitly
    specified when users post them to the collaboration workspace. Files can be
    attached to positions to further support their validity. The formal view supports
    also the notion of preferences, used to weigh the importance of two positions and
    reflect the importance of one position over another. Decision making support
    algorithms (e.g. a voting or a multiple criteria decision making), which are
    associated with the CW, can take into consideration the relationships of positions
    as well as existing preferences and calculate which alternative is currently
    prevailing or which position has been defeated. The aim of the formal view is to
    make the CW machine understandable and to further support decision making.


             Fig. 2: A formal view of the collaboration workspace shown in Fig. 1.

   Every CW can be transformed from one view into another at any point in time
by anyone participating in the collaboration. Such transformations are rule-based; a
set of rules specifies how items in the source view are transformed into items of the
destination view. All discourse moves and contributions that users create during
their interaction in the CW are logged within Dicode in order to enable their further
analysis by a variety of services. For each view, log data contains information related
to the event that happened on the workspace and which includes:
• the collaboration workspace’s ID and view where the event took place;
• the user’s operation and the associated content (e.g. adding/updating/deleting an
   item, moving an item, creating relationships between items etc);
• the user who executed the operation;
• the date and time when the event occurred.
   The log data in the CW will be used as an input for intelligent support algorithms.
      Augmented Collaborative Spaces for Collective Sense Making: The Dicode Approach    7


4       Intelligent Support

Intelligent support will augment the multi-view CWs with machine intelligence to
understand and facilitate collective sense making. Intelligent support will be provided
at two levels:
• Understanding collective sense making. This will include user/community
    profiling, e.g. identifying user characteristics, discovering links between
    individuals, identifying common topics; discovering patterns of behaviour such as
    silos or dominance, extracting situations parameters.
• Facilitating collective sense making. This will include interface augmentation (e.g.
    adding visual signals to help establish situational awareness) or suggestions in the
    form of messages (e.g. to facilitate the exchange of ideas, point at useful patterns,
    highlight important situation aspects).

The following subsections propose our approach to implementing the first level of
intelligent support, i.e. understanding collective sense making behaviour. This will be
achieved by three functions (section 4.3) which employ descriptive machine learning
and data mining algorithms and meet the key objectives as stated in section 4.2. The
following section outlines how the CW log data will be enriched with semantics for
user and community modelling.


4.1      Input: Augmented CW Log Data

Intelligent support will be based on the log data from the CWs which include mind
mapping graphs, discussions, arguments and comments. In addition, the users’ meta-
data, including the users’ navigational behaviour as recorded in the usage logs, as well
as the searching behaviour of the users in the collaborative workspace, will be used to
characterise the users and derive a user profile for each user in the community.
Semantic enrichment of the user profiles is achieved by considering semantic data
sources, such as domain ontologies (to identify the domain topics discussed), as well a
collaboration and decision making ontology developed in Dicode (to take into account
the user roles and to link sense making to decision making steps).


4.2      User and Community Modelling

Intelligent support in Dicode is underpinned by a mechanism for user and community
modelling which will be outlined here. It is envisaged to be used by intelligent
services which augment the CW in Dicode. For instance, a recommendation
mechanism in Dicode will be able to use the output of the community modelling
functions to direct to ‘items’ in the CW, e.g. a data set, a set of relevant discussions, a
topic of interest to search for. Furthermore, the users of the CW can be pointed to a
set of discussions that occurred in different times but belong to a certain topic of
interest.
   Objectives. The following four main objectives can be perceived for the
community modelling and user profiling functions:
8     A. Ammari et al.

• O1: Detect topics of community discussions in the collaborative workspace.
• O2: Identify key characteristics of the users in the community from available data
  about the users, i.e. unstructured data, semantic annotations, meta-data, and use
  these characteristics to shape the user profile for each user within the community.
• O3: Quantify the strength of each characteristic for discovery of connections.
• O4: Discover clusters of users and interesting patterns in user behaviour by
  applying descriptive data mining functions, i.e. cluster analysis and association
  mining on the derived user profiles.


4.3    Outline of the Main Algorithms

This section will outline how descriptive machine learning and data mining, such
as cluster analysis and association rule mining, can be applied for user and
community modelling. We will group them into three main functions.

Function 1: Clustering Unstructured Data for Topic Detection
   Purpose (O1). The main purpose of this function is to discover the main topics of
the unstructured data, i.e. community discussions, arguments, using descriptive data
mining methods, i.e. cluster analysis.
   Input. Unstructured data that community users create within the collaborative
workbench, as part of their collaboration activities. These include the discussion and
arguments that occurred between the community users in the workbench. All the
available parts of the discussions can be utilized by the function, i.e. the title of the
discussion thread, main discussion body, replies by other users, tags that collaborating
users attach to the discussion.
   Processing. The input data will be processed as follows:
• Pre-process the input unstructured data and transform it into a term weight
   document matrix to be used as input for cluster analysis.
• Using the pre-processed matrix, build and train a clustering model that segments
   the discussions into distinct groups (clusters) based on the similarities and
   distances between the discussions.
• Using the profiles of the discovered clusters, detect the topic of each cluster of
   discussions based on the frequency of occurrence by considering the most
   occurring terms that occur in each cluster.
   Output. There are two types of output produced by this function:
• Clusters of discussions, where each discussion instance will be assigned a cluster
   id to identify to which discovered cluster of discussions it belongs to.
• Cluster profiles, including the number of discussions that belong to each cluster
   and the most significant terms that belong to each cluster based on the frequency of
   occurrence.

Function 2: Deriving Key User Characteristics and Generating User Profiles
  Purpose (O2 & O3). The purpose of this function is to derive the key
characteristics that describe each user within the community, and weight these
   Augmented Collaborative Spaces for Collective Sense Making: The Dicode Approach     9

characteristics for every user to reflect the significance of each characteristic. These
weighted user profiles will be accumulated in a community model.
   Input. Data input to this function include: (a) Discussion topics that are detected
using the first function described above; (b) User meta-data available from the logs
and meta-data derived from the other components of the collaborative workbench,
including the discussions, arguments, i.e. the author of the main body of the
discussion and the authors of the replies to the main body, the mind mapping graphs,
and the meta-data available from the searching behaviour in the workspace. (3) The
characteristics derived from the unstructured data, i.e. topics, and the meta-data can
be semantically enriched by the collaboration and decision support ontology, relevant
domain ontologies, and open lexical resources, i.e. Wordnet.
   Processing. This function will process the input data as follows:
• Identify user characteristics within the community from the available input data.
• Compute weighted interests in the identified topics - for each identified
   characteristic, the function will compute a numerical weight for each user profile
   that represents the significance (importance) of this characteristic to that user
   within the community.
• Build a user–characteristic matrix that could be input to further descriptive data
   mining functions (cluster analysis and association mining).
   Output. The output of this function is a community model that includes a user
profile for each user. Each user profile represents the weights of the identified
characteristics for each user within the community.

Function 3: Discovering Patterns in the User Profiles
   Purpose (O4). The purpose of this function is to discover hidden patterns in the
user profiles for further support to collaboration and decision making, using
descriptive data mining techniques.
   Inputs. The input to this function is mainly the community model (user profiles)
derived by the second function
   Processing. This function will process the input data as follows:
• Apply cluster analysis methods on the derived user profiles within the community
   model to discover the user clusters and the user cluster profiles.
• Apply association mining methods on the derived user profiles within the
   community model to discover association hidden patterns within the user
   characteristics.
   Output. This function mainly produces three outputs: (a) Clusters of user profiles,
where each user profile instance will be assigned a cluster id to identify to which
discovered cluster of user profiles each user belongs to. (b) Cluster profiles, including
the number of user profiles that belong to each cluster and the characteristics’ values
for the average user profile, i.e. cluster centroid, for each discovered cluster. (c)
Discovered hidden association patterns, including frequent characteristic-sets that list
those significant characteristics that are obtained frequently by the same users, and the
hidden association rules underlying these sets.
10    A. Ammari et al.


5    Related Work

The approach proposed in this paper has two main innovative aspects: (a) a new way
to facilitate sense making using multiple linked views of collaborative workspaces;
and (b) a novel application of user and community modelling to get an understanding
of collective sense making behaviour.
    Over the years, a number of systems have been developed aiming to support the
process of sense making which include Debatepedia [5], Parmenides [6], ClaiMaker
[7], TruthMapper [8] and Cohere [9]. Despite their powerful features, each of these
systems provides only a fixed level of formality lacking the ability to adapt their
environment to the needs of the collaboration. In Dicode, collaborative workspaces
build on and extend the notion of spatial hypertext, which has been proposed as an
alternative to navigational and semantic organisation of resources [10]. Spatial
hypertext employs a spatial metaphor to organize information aiming at taking
advantage of the user’s visual memory and pattern recognition. Due to its ability to
express ambiguity as well as transient and implicit relationships between information,
it is an effective way to support information triage, i.e. the process of sorting through
relevant materials and organizing them to meet the needs at hand[11]. While most
existing hypertext systems permit only a single user to organize the information (e.g.
VIKI [12], WARP [13]), approaches to bring spatial hypertext into the collaborative
realm have only recently started to emerge [14]. Dicode will make a contribution to
this stream by exploiting spatial hypertext for collective sensemaking in cases when
humans need to process large volumes of heterogeneous data.
    Recent research trends look at intelligent ways to support the effective functioning
of close-knit communities through personalization and adaptation techniques.
Modelling users within a community provides the grounds for generating group
recommendations [15]. One method to support that is through detecting the topics that
the collaborating users show interests in. In [16] Cheng and Vassileva derived topics
of users’ interests based on the resources shared by them within the community,
where a reward factor is calculated to measure the relevance of each contributed
resource to the topics derived. In [17], Bretzke and Vassileva modelled users’
interests based on how frequently and recently users have searched for a specific area
from a particular taxonomy. User relationships are then determined based on the
resource downloading behaviour. A more recent approach by Kleanthous and
Dimitrova [18][19] employs the metadata of the shared resources along with an
ontology representing the community context and derives a semantically relevant list
of interests for every user.
    In Dicode, we aim to further enhance the existing topic detection approaches by
exploiting a hybrid machine learning, text data mining, and semantic enrichment
approach. Using as input community discussions, mind-mapping activities, and
relevant ontologies, we aim to discover topics of interests that are buried within the
diversity of unstructured and semi-structured contents produced by the collaborating
members in the multi-view collaborative workspaces. Detected topics will then be
exploited to facilitate collective sense making within the community members.
    A community model can be analysed to automatically detect patterns which can be
used to decide when and how interventions to the community can be done [20]. It has
been shown that community patterns based on these processes can be derived from
    Augmented Collaborative Spaces for Collective Sense Making: The Dicode Approach   11

the community graph. For example, [19] have identified community patterns related
to processes linked to effective knowledge sharing, such as transactive memory (how
members’ knowledge is related), shared mental models (shared understanding of the
common goal), and cognitive centrality (influential members).
   Similarly to Kleanthous and Dimitrova’s work on semantically-enriched
relationship detection, we will exploit semantics and ontologies to enhance the log
data from CWs and get richer input about what is happening in the community.
However, the community modelling approach in Dicode will take the modelling
further by exploiting descriptive data mining approaches, including output from (i)
statistical member segmentation, i.e. group profiles, where members assigned to the
same group share a similar behavioural profile, as well as output from (ii) association
rule mining, i.e. lists of the frequently co-occurring behavioural activities of the
community members, in order to further improve the community pattern discovery
tasks. Discovered patterns will also be used to further augment the multi-view CW for
enhanced collective sense-making, knowledge sharing, and group recommendations.


6      Conclusions

We have set out an ambitious goal to exploit the synergy of machines and humans in
complex cognitive situations that require making decisions involving large volumes
of data. We are starting to unravel the aspects of this synergy. While data mining
techniques (i.e. machine intelligence) can be exploited to process data and discover
trends and patterns, human intelligence is needed to make sense of the data and take
decisions. The process of sense making involves discovering connections, deriving
patterns, generating alternatives, weighting possibilities. People perform these tasks in
an intuitive manner using tacit knowledge. Our ultimate goal is to capture, preserve,
and reuse this tacit knowledge by providing collaborative workspaces for collective
sense making. In turn, we will exploit machine intelligence to analyse the human
behaviour in the collaborative spaces in order to get a better understanding of the
collective sensemaking process, facilitate important aspects, and support future
human sense making (e.g. exploiting patterns applied earlier).
   Currently, we are developing the CWs following a generic approach, which will
enable the same approach to be applied to diverse use cases. The illustrations in this
paper were from the exemplification of the multi-view space for a Breast Cancer
research group embarking on an analysis to discover any common characteristics or
trends that could be deducted from recent studies which used high-throughput
technologies such as microarrays and next-generation sequencing. We plan to apply
the approach presented here to support sense making in a clinical trial of Rheumatoid
Arthritis treatment where a team of medical practitioners examines large data sets and
analyses the effectiveness of the treatment on patients. In addition, the log data from
the CWs is being analysed in line with the functions presented in here to augment
CWs with intelligent support.
12    A. Ammari et al.

Acknowledgements
This publication has been produced in the context of the EU Collaborative Project
"DICODE - Mastering Data-Intensive Collaboration and Decision" which is co-
funded by the European Commission under the contract FP7-ICT-257184. This
publication reflects only the author’s views and the Community is not liable for any
use that may be made of the information contained therein.


References

     1.  van Eemeren, F. H., Grootendorst, R., Snoeck Henkemans.F.: Fundamentals of
         Argumentation Theory. Erblaum, Mahwah, NJ (1996)
     2. Provis, C.: Negotiation, Persuasion and Argument. Journal of Argumentation, 18(1),
         pp. 95-112 (2004)
     3. Klein, G., Moon, B., Hoffman, R.: Making Sense of Sensemaking 1: Alternative
         Perspectives. IEEE Intelligent Systems, 21(4), July/August (2006)
     4. Dean, J., Ghemawat, S.: Mapreduce: simplified data processing on large clusters. In
         OSDI’04: Proceedings of the 6th conference on Symposium on Opearting Systems
         Design & Implementation, USENIX Association, pp. 10–10 (2004)
     5. Debatepedia, http://wiki.idebate.org/
     6. Atkinson, K., Bench-Capon, T.,McBurney, P.: PARMENIDES: Facilitating
         deliberation in democracies. Artificial Intelligence and Law, 14(4), pp. 261-275
         (2006)
     7. Buckingham Shum, S.J., Uren, V., Li, G., Sereno, B., Mancini, C.: Modelling
         Naturalistic Argumentation in Research Literatures: Representation and Interaction
         Design Issues. International Journal of Intelligent Systems, 22 (1), pp.17-47 (2007)
     8. TruthMapping, http://truthmapping.com
     9. Buckingham Shum, S.,: Cohere: Towards Web 2.0 Argumentation, Proceeding of the
         2008 conference on Computational Models of Argument, p.97-108, June 21 (2008)
     10. Shipman, F. M., Marshall, C. C.: Spatial Hypertext: An Alternative to Navigational
         and Semantic Links, ACM Computing Surveys 31(4), December (1999)
     11. Marshall, C.C., Shipman, F. M. III: Spatial hypertext and the practice of information
         triage, Proceedings of the eighth ACM conference on Hypertext, p.124-133, April 06-
         11, Southampton, United Kingdom (1997)
     12. Marshall, C.C., Shipman, F. M., Coombs, J. H.: VIKI: spatial hypertext supporting
         emergent structure, Proceedings of the 1994 ACM European conference on
         Hypermedia technology, p.13-23, September 19-23, Edinburgh, Scotland (1994)
     13. Francisco-Revilla, L., Shipman, F. M.: WARP: a web-based dynamic spatial
         hypertext, Proceedings of the fifteenth ACM conference on Hypertext and
         hypermedia, August 09-13, Santa Cruz, CA, USA (2004).
     14. Solís, C., Ali, N.: ShyWiki - A spatial hypertext wiki, Proceedings of the 4th
         International Symposium on Wikis, September 08-10, Porto, Portugal (2008)
     15. Masthoff, J.: Group Modeling: Selecting a Sequence of Television Items to Suit a
         Group of Viewers, User Modeling and User-Adapted Interaction, 14(1), pp.37-85
         (2004)
     16. Cheng, R., Vassileva, J.: Design and evaluation of an adaptive incentive mechanism
         for sustained educational online communities, Journal of User Modeling and User
         Adaptive Interaction, vol. V16, no. 3, pp.321 – 348 (2006)
     17. Bretzke, H., Vassileva, J.: Motivating Cooperation on Peer to Peer Networks, 9th Int.
         Conf. on User Modelling, Springer (2003)
Augmented Collaborative Spaces for Collective Sense Making: The Dicode Approach     13

  18. Kleanthous, S., Dimitrova, V.: Modelling Semantic Relationships and Centrality to
      Facilitate Community Knowledge Sharing, Proc. of the 5th Int. Conf. on Adaptive
      Hypermedia and Adaptive Web-Based Systems (AH'08) Springer (2008).
  19. Kleanthous, S., Dimitrova, V.: Analyzing Community Knowledge Sharing Behavior,
      UMAP 2010, Springer, pp.231-242 (2010)
  20. Kleanthous, S., Dimitrova, V.: Detecting Changes over Time in a Knowledge Sharing
      Community, Proc. of the 2009 IEEE/WIC/ACM Int. Joint Conf. on WI and IAT,
      IEEE Computer Society Washington, DC, USA Milan, Italy (2009).

</pre>