Preface This volume contains supplementary proceedings of the third conference on Analysis of Images, Social Networks, and Texts (AIST’2014). The first two con- ferences in 2012 and 2013 attracted a significant number of students, researchers, academics and engineers working on data analysis of images, texts and social networks. The broad scope of AIST makes it an event where researchers from different domains, such as image and text processing, exploiting various data analysis techniques, can meet and exchange ideas. We strongly believe that this may lead to cross-fertilisation of ideas between researchers relying on modern data analysis machinery. Therefore, AIST brings together all kinds of applications of data mining and machine learning techniques. The conference allows specialists from different fields meet each other, present their work and discuss both theo- retical and practical aspects of their data analysis problems. Another important aim of the conference is to stimulate scientists and people from the industry to benefit from the knowledge exchange and identify possible grounds for fruitful collaboration. Following an already established tradition, the conference was held in Yeka- terinburg, capital of Urals region in Russia on 10-12 April 2014. The key topics of AIST are analysis of images and videos; natural language processing and compu- tational linguistics; social network analysis; machine learning and data mining; recommender systems and collaborative technologies; semantic web, ontologies and their applications; analysis of socio-economical data. The Programme Committee and reviewers of the conference featured well- known experts in data mining, natural language processing, image processing and related areas from Russia and all over the world. We have received 74 high quality submissions mostly from Russia and also from France, Germany, India, Poland, Spain, Ukraine and USA, among which only 21 papers have been accepted as regular oral papers (11 long and 10 short) in Communications in Computer and Information Science series (Springer), vol- ume 436. Thus, the acceptance rate of this volume is roughly 28%. In order to stimulate young practitioners and researchers we have also included 3 short industry papers in the main volume and 33 submissions as posters in this supple- mentary proceedings. Each submission has been reviewed by at least 3 reviewers, who are experts in their field, in order to supply detailed and helpful comments. The conference also featured several invited talks and tutorials, as well as an industry session describing current trends and challenges. Invited talks: – Boris Mirkin (Higher School of Economics, Moscow), Data Clustering: Some Topics of Current Interest – Jaume Baixeries (Universitat Politècnica de Catalunya, Barcelona), Charac- terization of Database Dependencies with FCA and Pattern Structures – Dmitriy Kolesov (NextGIS, Moscow), GIS as an Environment for Integration and Analysis of Spatial Data – Natalia Konstantinova (University of Wolverhampton, UK), Relation Ex- traction – Let’s Find More Knowledge Automatically Tutorials: – Jaume Baixeries (Universitat Politècnica de Catalunya, Barcelona), Intro- duction to Formal Concept Analysis and Attribute Dependencies (in 2 parts) – Konstantin Voronstov (CCAS of RAS and Yandex, Moscow), Tutorial on Probabilistic Topic Modeling: Additive Regularization for Stochastic Matrix Factorization – Natalia Konstantinova (University of Wolverhampton, UK), Introduction to Dialogue Systems, Personal Assistants are Becoming a Reality – Natalia Konstantinova (University of Wolverhampton, UK), Academic Writ- ing – Getting Published in International Journals and Conferences The industry speakers covered rich variety of topics: – Iosif Itkin (Exactpro Systems), Network Models for Exchange Trade Analysis – Leonid I. Levkovitch-Maslyuk (EMC), Big Data for business – Alexander Semenov (http://jarens.ru/), Recent Advances in Social Network Analysis – Irina Radchenko (http://iRadche.ru), Current Trends in Open Data and Data Journalism – Oleg Lavrov (KM Alliance Russia), Knowledge Management as a Link be- tween Business & IT – Yury Kupriyanov (WikiVote!), IT Trends and Challenges in Knowledge Man- agement We would like to mention the best conference paper, according to the PC decision, written by V.B. Surya Prasath and Radhakrishnan Delhibabu and en- titled “Automatic Contrast Parameter Estimation in Anisotropic Diffusion for Image Restoration”. We would like to thank the authors for submitting their papers and the members of the Programme Committee for their efforts to provide exhaustive reviews. We would also like to express special gratitude to all the invited speakers and industry representatives. We deeply thank all the partners and sponsors, and owe our gratitude to the Scientific Fund of Higher School of Economics for providing five AIST’s participants with travel grants. Our special thanks goes to Springer editors who helped us starting from the first conference call until the final version of the proceedings. Last but not least, we are grateful to all organisers, especially to Eugeniya Vlasova and Dmitry Ustalov, whose endless energy saved us in the most critical stages of the conference preparation. In Russian AIST is even more than abbreviation, namely it means a stork. So we believe this young and rapidly growing conference still be bringing inspiration for data scientists. May, 2014 Dmitry I. Ignatov Mikhail Yu. Khachay Alexander Panchenko Natalia Konstantinova Rostislav Yavorsky Dmitry Ustalov Organisation The conference was organized by joint team from the National Research Uni- versity Higher School of Economics (HSE), Russia and Krasovsky Institute of Mathematics and Mechanics of UB RAS, Russia. It was supported by the Rus- sian Foundation for Basic Research grant no. 14-07-06803. Program Committee Chairs Dmitry I. Ignatov National Research University Higher School of Eco- nomics, Russia Mikhail Yu. Khachay Krasovsky Institute of Mathematics and Mechanics of UB RAS, Russia Alexander Panchenko Université catholique de Louvain, Belgium & Digital Society Laboratory, Russia Organising Chair Rostislav Yavorsky National Research University Higher School of Eco- nomics, Russia Proceedings Chair Natalia Konstantinova University of Wolverhampton, UK Poster Chair Dmitry Ustalov Krasovsky Institute of Mathematics and Mechanics of UB RAS & Ural Federal University, Russia Organising Committe Eugeniya Vlasova Higher School of Economics, Moscow Dmitry Ustalov Krasovsky Institute of Mathematics and Mechanics of UB RAS & Ural Federal University, Yekaterinburg Olga Fazylova Higher School of Economics, Moscow Liliya Galimziyanova Ural Federal University, Yekaterinburg Alexandra Kaminskaya Yandex, Moscow Maria Khakimova Perm State University, Perm Natalia Korepanova Higher School of Economics, Moscow Elena Nikulina Higher School of Economics, Moscow Andrey Shelomentsev APIO, Yekaterinburg and Moscow Anna Voronova Yandex, Moscow Program Committee Tinku Acharya Videonetics Technology Pvt. Ltd. Simon Andrews Sheffield Hallam University, UK Olga Barinova Moscow State University & Yandex, Russia Malay Bhattacharyya University of Kalyani, India Vladimir Bobrikov Imhonet Research, Russia Elena Bolshakova Moscow State University & National Research Uni- versity Higher School of Economics, Russia Jean-Leon Bouraoui Prometil SARL, France Pavel Braslavski Ural Federal University & Kontur Labs, Russia Ekaterina Chernyak National Research University Higher School of Eco- nomics, Russia Ilia Chetviorkin Moscow State University, Russia Marina Chicheva Image Processing Systems Institute of RAS, Russia Florent Domenach University of Nicosia, Cyprus Alexey Drutsa Moscow State University & Yandex, Russia Maxim Dubinin NextGIS, Russia Victor Erukhimov Itseez, Russia Rashid Faizullin Omsk State Technical University, Russia Thomas Francois Université catholique de Louvain, Belgium Boris Galitsky Become.com, USA Dmitry Ilvovsky Fors & Higher School of Economics, Russia Vladimir Ivanov Kazan Federal University, Russia Vadim Kantorov INRIA, France Mehdi Kaytoue LIRIS - INSA de Lyon, France Vladimir Khoroshevsky Dorodnicyn Computing Centre of RAS and National Research University Higher School of Economics, Russia Sergei Koltsov National Research University Higher School of Eco- nomics, Russia Olessia Koltsova National Research University Higher School of Eco- nomics, Russia Natalia Konstantinova University of Wolverhampton, UK Anton Konushin Lomonosov Moscow State University, Russia Yuri Kudryavcev PMSquare, Australia Sergei Kuznetsov National Research University Higher School of Eco- nomics, Russia Victor Lempitsky Skolkovo Institute of Science and Technology, Russia Alexander Lepskiy National Research University Higher School of Eco- nomics, Russia Natalia Loukachevitch Research Computing Center of Moscow State Uni- versity, Russia Julian Mcauley Stanford University, USA Vlado Menkovski Eindhoven University of Tehnology, The Nether- lands Olga Mitrofanova St. Petersburg State University, Russia Dmitry Mouromtsev National Research University of Information Tech- nologies, Mechanics and Optics, Russia Xenia Naidenova Kirov Military Medical Academy, Russia Sergey Nikolenko Steklov Mathematical Institute of RAS & National Research University Higher School of Economics, Russia Sergey Objedkov Digital Society Lab & National Research University Higher School of Economics, Russia Evgeniy Perevodchikov Tomsk State University of Control Systems and Ra- dioelectronics, Russia Andrey Philippovich Bauman Moscow State Technical University, Russia Jonas Poelmans Katholieke Universiteit Leuven, Belgium Simon Polovina Sheffield Hallam University, UK Alexander Porshnev National Research University Higher School of Eco- nomics, Russia Irina Radchenko National Research University Higher School of Eco- nomics, Russia Delhibabu Radhakrishnan RWTH Aachen, Germany Konstantin Rudakov Dorodnicyn Computing Centre of RAS, Russia Andrey Savchenko National Research University Higher School of Eco- nomics, Russia Alexandra Savelieva Microsoft & National Research University Higher School of Economics, Russia Yuri Smetanin Russian Foundation for Basic Research, Russia Leonid Sokolinsky South Ural State University, Russia Rustam Tagiew Qlaym Gmbh, Germany Irina Temnikova University of Wolverhampton, UK Alexander Ulanov HP Labs, Russia Dmitry Vinogradov All-Russian Institute for Scientific and Technical In- formation of RAS, Russia Alexander Vodyaho Saint Petersburg Electrotechnical University "LETI", Russia Konstantin Vorontsov Dorodnicyn Computing Centre of RAS, Russia Leonind Zhukov Ancestry.com, USA & National Research University Higher School of Economics, Russia Nataly Zhukova St.Petersburg institute for Infromatics and Automa- tion of RAS, Russia Dominik Śle˛zak University of Warsaw, Poland & Infobright Inc. Invited Reviewers Aleksei Buzmakov INRIA, France Yuri Katkov Blue Brain Project, Switzerland Lidia Pivovarova University of Helsinki, Finland Alexander Semenov Higher School of Economics Alumni, Russia Nikita Spirin University of Illinois at Urbana-Champaign, USA Dmitry Ustalov Krasovsky Institute of Mathematics and Mechanics of UB RAS & Ural Federal University, Russia Andrey Bronevich Higher School of Economics Alumni, Russia Sujoy Chatterjee University of Kalyani, India Surya Prasath University of Missouri-Columbia, USA Kirill Shileev Digital Society Laboratory, Russia Natalia Vassilieva HP Labs, Russia Victoria Yaneva University of Wolverhampton, UK Partners and Sponsoring Institutions Central & Eastern European Software Engineering Conference in Russia CLAIM Data Mining Labs Digital Society Laboratory EMC GraphiCon Innovative Trading Systems Krasovsky Institute of Mathematics and Mechanics NextGIS NLPub penxy Russian Foundation for Basic Research SKB Kontur Ural Federal University Ural IT Cluster WikiVote Yandex Table of Contents Posters and Short Papers in English Terminology Extraction from the Baidu Encyclopedia . . . . . . . . . . . . . . . . . 1 Bulat Fatkulin Chatbot for IT Security Training: Using Motivational Interviewing to Improve Security Behaviour . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Iwan Gulenko Conceptual Scheme for Text Classification System . . . . . . . . . . . . . . . . . . . . . 17 Nicolay Lyfenko Toward Network Information Navigation Algorithms . . . . . . . . . . . . . . . . . . 22 Sergei Bel’kov and Sergei Goldstein Semiotic System of Musical Texts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 Andrew Philippovich, Irina Golubeva and Marina Danshina Automatic Extraction of Hypernyms and Hyponyms from Russian Texts . 35 Kristina Sabirova and Artem Lukanin Non-Linear Filtering of Images on the Basis of Generalized Method of Least Absolute Values . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 Alexander A. Tyrsin and Vladimir A. Surin Comparison of Some Image Quality Approaches . . . . . . . . . . . . . . . . . . . . . . 48 Boris B. Parfenenkov and Maksim A. Panachev Zipf’s Law for LiveJournal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54 Nikita N. Trifonov Moving Object Detection in Video Streams Received from a Moving Camera . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59 Sergey Starkov and Maksim Lukyanchenko Automatic Selection of Verbs-Markers for Segmentation Task of Process Descriptions in Natural Language Texts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64 Varvara A. Krayvanova Visual Analytics in FCA-based Clustering . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69 Yury Kashnitsky Analysis System of Scientific Publications Based on the Ontology Approach . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81 Viacheslav Lanin and Svetlana Strinyuk Automatic Defect Recognition in Corrosion Logging Using Magnetic Imaging Defectoscopy Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86 Rita Gaibadullina, Bulat Zagidullin and Vladimir Bochkarev Automated Generation of Assessment Test Items from Text: Some Quality Aspects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91 Andrey Kurtasov GPS Navigation Algorithm Based on OSM Data . . . . . . . . . . . . . . . . . . . . . . 96 Daniel Khachay System of Ontologies for Data Processing Applications Based on Implementation of Data Mining Techniques . . . . . . . . . . . . . . . . . . . . . . . . . . 102 Alexander Vodyaho and Nataly Zhukova Logic-Mathematical Apparatus of Data Processing Used in Information Technology of Web-Portal Development . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117 Svitlana Bevz Semantic Methods of Structuring Mathematical Content and Open Scientific E-Journals Management Systems . . . . . . . . . . . . . . . . . . . . . . . . . . . 130 Alexander Elizarov, Evgeny Lipachev and Denis Zuev Posters and Papers in Russian The Method of Constructing the Membership Function to Classify Images Based on Histograms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133 Ivan Posokhov and Oksana S. Logunova A VAR Analysis of Electricity Consumption . . . . . . . . . . . . . . . . . . . . . . . . . . 146 Nurgul Mamatova Using 3D Animated Hand Gestures to Create a New Type of CAPTCHA 151 Artem Shumilov and Andrey Philippovich Automatic Music Rating Based on Implicit Assessments . . . . . . . . . . . . . . . 156 Sergey Smagin Semantic Search Algorithms in Large Text Collections . . . . . . . . . . . . . . . . . 161 Vitaly Savchenko Using CAPTCHA in a Massive Free Association Experiment on the Internet . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167 Dmitry Lakhvich Determining Which Cities’ Features Affect the Opinions’ Sentiments on Twitter . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 172 Alexander Zyryanov and Nikita Putintsev Vox Populi Online: The Comparison of Posts’ Structure and Topics Among the “Regular” and “Popular” Bloggers on LiveJournal . . . . . . . . . . . 177 Svetlana Alexeeva, Olessia Koltsova and Sergei Koltsov Parameter Estimation of Chaotic Process Using UKF and Time Series Forecasting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 182 Elena Malyutina and Vladimir I. Shiryaev Searching for Experts Using the Semantic Analysis of Texts . . . . . . . . . . . . 187 Igor Zahlebin Automatic Natural Language Generation Using an OWL Model, Semantics and Pragmatics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 192 Polina Sazonova Multi-Target Pedestrian Tracking Algorithm . . . . . . . . . . . . . . . . . . . . . . . . . . 197 Roman Zakharov Toward a Method of Representing the Semantics of the Text . . . . . . . . . . . 202 Irina N. Efremova and Vladislav V. Efremov Toward the Representation of Continuous Optical Images in a Digital Computer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 205 Vladislav V. Efremov and Irina N. Efremova