Workshop on Machine Learning for
Trend and Weak Signal Detection in
 Social Networks and Social Media


                   https://www.irit.fr/twsdetection
1       Objectives of the workshop


The objective of this workshop was to provide an opportunity for researchers and prac-
titioners from different disciplines to meet, exchange and discuss issues, research re-
sults and practical applications regarding trend and weak signal detection in social net-
works. This workshop was an opportunity for participants to develop their network and
to cooperate with each other.

  This workshop took place on February 27-28, 2020, Toulouse, France at INSPE, 56
avenue de l’URSS, 31400 Toulouse.

    This workshop invited submissions covering the following topics, but not limited to:

• Trend detection, analysis and tracking
• Topic identification and event detection
• Weak signal detection
• Information / opinion / knowledge spread and modelling
• Misinformation and misbehavior analysis and detection
• Information quality in social network
• Community detection, expertise and authority discovery
• Social influence, recommendation and media
• Behavior analysis in social networks
• Sentiment analysis
• Crime detection and investigation
• Network visualization and modeling
• Data mining and machine learning
• Real-world case studies
• Ongoing projects based on social media and/or social networks.

We called for contributions that could be either unpublished research results as well as
project presentations and ongoing research. All the papers that were submitted have
been peer reviewed by three reviewers. Since the main idea of the workshop was to
share science, ongoing research without strong result yet also could be accepted. Out
of the scope papers have been rejected.
Some papers have been accepted as long papers, some other are short papers. Finally,
we invited some researchers to give a talk. For those, we publish here the abstract of
their talk as well as a short bio.
2         Committees

    2.1    Organisation Committee

The organisation committee was a multi-disciplinary, implying researchers in computer
science, sociologists.

      Thi Bich Ngoc Hoang, Computer scientist
      Institut de Recherche en Informatique de Toulouse (IRIT), UMR5505 CNRS,
      Toulouse, France, chair

      Pascal Marchand, Information and communication sciences
      Laboratoire d’Études et de Recherches Appliquées en Sciences Sociales
      (LERASS), Université Paul Sabatier,
      Toulouse, France

      Béatrice Milard, Sociologist
      Interdisciplinaire Solidarités, Sociétés, Territoires (LISST), UMR 5193 CNRS
      Toulouse, France

      Josiane Mothe, Computer scientist
      Institut de Recherche Informatique de Toulouse (IRIT), UMR5505 CNRS,
      INSPE-UT2, Université de Toulouse,
      Toulouse, France

      Adrienne Gaultier,
      Institut de Recherche en Informatique de Toulouse (IRIT), UMR5505 CNRS,
      Toulouse, France

      Md Zia Ullah, Computer scientist
      Institut de Recherche en Informatique de Toulouse (IRIT), UMR5505 CNRS,
      Toulouse France


    2.2    Program Committee

The international program committee was composed of:

     •    Nadine Baptiste-Jessel
          IRIT, UMR5505 CNRS,
          France

     •    Adrian-Gabriel Chifu,
          University of Aix-Marseille,
          France
   •    Michał Choraś,
        University of Science and Technology,
        Bydgoszcz, Poland

   •    Fabio Crestani,
        University of Italian Switzerland,
        Switzerland

   •    Liana Ermakova,
        University of Bretagne Occidentale,
        France

   •    Tomas Krilavičius,
        Baltic Institute of Advanced Technology,
        Vilnius, Lithuania

   •    Marion Maisonobe,
        UMR Géographie-cités,
        Paris, France

   •    Véronique Moriceau,
        IRIT, UMR5505 CNRS,
        France

   •    Quốc-Vinh Nguyen-Tran,
        Danang University of Science and Education,
        Vietnam

   •    Francisco-José Pérez-Carrasco,
        Technical University of Valencia,
        Spain

   •    Eric SanJuan,
        University of Avignon,
        France

We would like to thank them all for their work and their reviews.
3       Overview of the proceeding content

The current proceedings follow the workshop program and is organized into sessions
as the workshop was. Each session corresponds to a topic that was discussed during the
workshop.
    The first session focuses on “Risk and Trend Detection”.
   Social media eases rapid connection of people and quick and huge diffusion of con-
tents. It is also known to be a mean for radical group to communicate.
This first session focuses on the associated challenges. It consists in four contribu-
tions. The first contribution comes from an invited speaker, Mirian Fernandez, Senior
Research Fellow at the Knowledge Media Institute, Open University. In her talk untit-
led “Targeting Societal Challenges by analysing social media data”, she presented
different parts of her and collegues’ research results and targeted mainly radicaliza-
tion detection in social media. The abstract of her talk and bio are included in the pro-
ceedings. Tomas Krilavičius, Baltic Institute of Advanced Technology, and Vytautas
Magnus University, Lithuania presented the second communication untitled “What’s
in the News? Identification of Trending Topics in Alternative and Mainstream Lithua-
nian Media” where a deep analysis was developed. A long paper is associated to this
presentation. The third contribution is from Thi Bich Ngoc Hoang from IRIT-CNRS
and is untitled “Topical Community Detection: an Embedding User and Content Simi-
larity Method” where she presents a model to detect communities that not only con-
siders the interactions between actors but also the content they exchange. The final
contribution is this first session has been presented by Konstantinos Demestichas,
member of the Institute of Communication and Computer Systems, Greece where he
presented a platform developed within the PREVISION European project; the short
related paper is untitled “Prediction and Visual Intelligence Platform for Detection of
Irregularities and Abnormal Behaviour”.

    The second session focuses on “Detection of Aggressive Content”.
   Social media allow people to hide their identity and this has led to the proliferation
of abusive language and increase of aggressive content on social media. Automatically
identifying specific content as for example aggressive one is thus a hot research topic.
   The four communications in this session are as follows. Thomas Pellegrini, re-
searcher at the IRIT CNRS lab, France was an invited speaker. He presented some of
his research work in his presentation untitled “Deep learning with weakly-annotated
data: a sound event detection use case (and hate speech detection here and there)”. He
also presented a very accurate state of the art of deep learning methods and above all
on transparency, soft et self-attention -based models, multiple instances and semi-su-
pervised learning. The abstract of his talk and bio are included in the proceedings. The
two next communications were both tackling the same challenge of hate speech or con-
tent detection. Kurt Englmeier from Hochschule Schmalkalden Fakultät Informatik
Blechhammer, Germany presented the his contribution untitled “The Role of Storylines
in Hate Speech Detection. Faneva Ramiandrisoa, from IRIT CNRS lab, presented his
PhD work on “Aggression Identification in Posts - two machine learning approaches”.
The two corresponding long paper are included in the proceeding. The final contribu-
tion was from Mario Laurent, MSHS-T, Université de Toulouse where he presented the
“Hatemeter Project: Analysis of hate speech on twitter at the crossroads of computer
science, humanities and social sciences”. This is a H2020 funded project where the idea
is to go a step further after hate speech detection by helping the development and
spreading of counter speeches. A long paper is also associated to this communication.


  The third session is on Fake news detection.
   Social media makes information spreading easier and quicker, but in many cases
regardless of the information quality. Automate fact-checking has recently become a
hot topic.
   Two communications have been presented within this session. Md Zia Ullah from
IRIT CNRS lab presented a communication untitled “An ML Model for Predicting In-
formation Check-Worthiness using a Variety of Features” associated with a short paper
in these proceedings. Charles Huot and Sonia Collada Pérez from Expert System pre-
sented a communication on the same topic untitled “Detecting fake news in social me-
dia content”


  The fourth and final session focused on Ethics and privacy issue.
   Social data analysis raises ethical and privacy issues. Two communications were
presented on this topic. As an invited speaker, Zuzanna Warso, from Trilateral Re-
search, London gave a clear and completed overview on this issue, untitled “Ethical
and legal challenges of machine learning for trend and weak signal detection in social
networks: an overview”. Included in these proceedings is an abstract related to her
presentation and her bio. Finally, Kostas Davarakis & Eva Blomqvist, from Sigular
Logic, Linkoping university presented the latest communication that is included in this
proceeding under the form of a abstract untitled “Privacy preserving intelligence anal-
ysis for resolving identities”


                                                     Published in ceur-ws, vol n°2606
                                                          http://ceur-ws.org/Vol-2606