Preface This volume contains proceedings of the Fifth International conference on Anal- ysis of Images, Social Networks, and Texts (AIST 2016)1 . The previous con- ferences in 2012–2015 attracted a significant number of students, researchers, academics and engineers working on interdisciplinary data analysis of images, texts, and social networks. The broad scope of AIST makes it an event where researchers from differ- ent domains, such as image and text processing, exploiting various data analysis techniques, can meet and exchange ideas. We strongly believe that this may lead to crossfertilisation of ideas between researchers relying on modern data analy- sis machinery. Therefore, AIST brings together all kinds of applications of data mining and machine learning techniques. The conference allows specialists from different fields to meet each other, present their work, and discuss both theo- retical and practical aspects of their data analysis problems. Another important aim of the conference is to stimulate scientists and people from the industry to benefit from the knowledge exchange and identify possible grounds for fruitful collaboration. The conference was held during April 6–8, 2016. Following an already estab- lished tradition, the conference was organised in Yekaterinburg, a cross-roads between European and Asian parts of Russia, the capital of Urals region.The key topics of AIST are analysis of images and videos; natural language process- ing and computational linguistics; social network analysis; pattern recognition, machine learning and data mining; recommender systems and collaborative tech- nologies; semantic web, ontologies and their applications. The Program Committee and the reviewers of the conference included well- known experts in data mining and machine learning, natural language process- ing, image processing, social network analysis, and related areas from leading institutions of 27 countries including Australia, Bangladesh, Belgium, Brazil, Croatia, Cyprus, Egypt, Estonia, Finland, France, Germany, Greece, India, Ire- land, Israel, Italy, Japan, Lithuania, Norway, Portugal, Qatar, Russia, Spain, Switzerland, The Netherlands, UK, Ukraine, and USA. This year we have received 142 submissions mostly from Russia but also from Australia, Austria, France, Hungary, India, Italy, Mexico, Norway, Sweden, UK, USA and Vietnam. Out of 142 only 30 papers were accepted as regular oral papers. Thus, the acceptance rate of this volume was around 21%. In order to encourage young practitioners and researchers we included 3 papers of industry speakers to the main volume and 38 papers to the supplementary proceedings. Each submission was reviewed by at least three reviewers, experts in their fields, in order to supply detailed and helpful comments. 1 http://aistconf.org/ The conference featured several invited talks and industry session dedicated to current trends and challenges. Invited talks: – Mehdi Kaytoue (Université de Lyon, CNRS, INSA-Lyon, France), Finding Duplicate Labels in Behavioral Data: an Application for E-Sport Analytics. – Attila Kertesz-Farkas (Higher School of Economics, Russia), False Discovery Rate Control for Database Search Methods over Heterogeneous Biological Data – Radhakrishnan Delhibabu (Kazan Federal University, Russia), Analysis of Anisotropic Diffusion in Image Processing The business speakers also covered a wide variety of topics2 . We have included three of those invited talks into the main volume: – Denis Nikiforov, Alexander Korchagin and Ruslan Sivakov (Centre of Infor- mation Technology, Ekaterinburg, Russia) An Ontology-Driven Approach to Electronic Document Structure Design – Tatyana Prisyach (Speech Technology Center, St. Petersburg, Russia), Valentin Mendelev and Dmitry Ubskiy (ITMO-University, St. Petersburg, Russia), Data Augmentation for Training of Noise Robust Acoustic Models – Alexander Semenov (Higher School of Economics, Russia) and Peter Romov (Yandex, Russia), Performance of Machine Learning Algorithms in Predict- ing Game Outcome from Drafts in Dota We would also like to mention the best conference papers selected by the Program Committee within each section: – On Complexity of Searching a Subset of Vectors with Shortest Average un- der a Cardinality Restriction by Anton Eremeev, Alexander Kelmanov, and Artem Pyatkin (Machine Learning and Data Mining); – The Problem of The Optimal Packing of The Equal Radius Circles for Non- Euclidean Metric by Alexander Kazakov, Anna Lempert and Huy Liem Nguyen (Data Analysis, Social Networks & Complex Data); – Parallel Non-blocking Deterministic Algorithm for Online Topic Modeling by Oleksandr Frei and Murat Apishev (Natural Language Processing); – Image Processing Algorithms with Structure Transferring Properties on the Basis of Gamma-normal Model by Inessa Gracheva and Andrey Kopylov (Analysis of Images and Video). The final round of SNA Hackathon 20163 co-organised by OK.Ru and AIST team has taken place in conjunction with the main conference on April 6–8. We would like to thank the authors for submitting their papers and the members of the Program Committee for their efforts in providing exhaustive 2 the detailed program of AIST 2016 Business Day can be found at a separate website: http://dataconf.org 3 http://snahackathon.org/ vi reviews. We would also like to express special gratitude to all the invited speakers and industry representatives. We deeply thank all the partners and sponsors. Our golden sponsors are Ex- actpro4 and OK.Ru5 . Exactpro, a fully owned subsidiary of London Stock Ex- change Group, specialises in quality assurance for exchanges, investment banks, brokers and other financial sector organisations. OK.Ru, a part of Mail.Ru Group, is one of the two largest Russian social networking services. Our bronze sponsor is Centre of Information Technologies (Centre IT)6 . This is a Russian engineering company that specializes in creating exclusive IT solutions and ser- vices for the interdepartmental electronic interaction both on the national and international levels. We would like to acknowledge the Scientific Fund of Higher School of Eco- nomics for providing AIST participants with travel grants. Our special thanks goes to Springer editors who helped us, starting from the first conference call to the final version of the proceedings. Last but not least, we are grateful to all organisers, especially to Eugeniya Vlasova, and the volunteers, whose endless energy saved us at the most critical stages of the conference preparation. Traditionally, we would like to mention the Russian word “aist” is more than just a simple abbreviation (in Cyrillic), it means a “stork”. Since it is a wonderful free bird, a symbol of happiness and peace, this stork brought us the inspiration to organise the AIST conference. So we believe that this young and rapidly growing conference will be bringing inspiration to data scientists around the World! October, 2016 Dmitry Ignatov Michael Khachay Valery Labunets Natalia Loukachevitch Sergey Nikolenko Alexander Panchenko Andrey Savchenko Konstantin Vorontsov 4 exactprosystems.com 5 OK.ru 6 centre-it.com vii Organisation Program Committee Chairs Dmitry I. Ignatov National Research University Higher School of Eco- nomics, Moscow, Russia Mikhail Khachay Krasovsky Institute of Mathematics and Mechanics of Ural Branch of Russian Academy of Sciences, Russia Valery G. Labunets Ural Federal University, Russia Natalia Loukachevitch Computing Centre of Lomonosov Moscow State Univer- sity, Russia Sergey Nikolenko National Research University Higher School of Eco- nomics & Steklov Institute of Mathematics of Russian Academy of Sciences, St. Petersburg, Russia Alexander Panchenko Technische Universität Darmstadt, Germany & Univer- sité catholique de Louvain, Belgium Andrey Savchenko National Research University Higher School of Eco- nomics, Nizhny Novgorod, Russia Konstantin Vorontsov Dorodnicyn Computing Centre of Russian Academy of Sciences, Russia Proceedings Chair Dmitry I. Ignatov National Research University Higher School of Eco- nomics, Russia Organising Committee Anna Golubtsova National Research University Higher School of Eco- nomics, Moscow Irina Dolgaleva National Research University Higher School of Eco- nomics, Moscow Eugeniya Vlasova National Research University Higher School of Eco- nomics, Moscow Natalia Papulovskaya Ural Federal University, Yekaterinburg Ekaterina Borovitina Chelyabinsk State University, Russia Volunteers Maxim Pasynkov Krasovsky Institute of Mathematics and Mechanics of Ural Branch of Russian Academy of Sciences, Rus- sia, Yekaterinburg Anna Voronova Yandex, Moscow Eugene Tsymbalov Webgames & National Research University Higher School of Economics, Moscow Daria Baranetskaya National Research University Higher School of Eco- nomics, Moscow Valeria Bubnova National Research University Higher School of Eco- nomics, Moscow Business Day Chair Evgenia Vlasova National Research University Higher School of Eco- nomics, Moscow Program Committee Mikhail Ageev Lomonosov Moscow State University, Russia Nickolay Arefyev Lomonosov Moscow State University & Digital So- ciety Lab, Russia Jaume Baixeries Universitat Politècnica de Catalunya, Catalonia, Spain Artem Baklanov Krasovskii Institute of Mathematics and Mechanics of Ural Branch of the Russian Academy of Sciences, Russia & International Institute for Applied Systems Analysis, Austria Pedro Balage University of São Paulo, Brazilia Sergey Bartunov National Research University Higher School of Eco- nomics, Russia & Deep Mind, UK Malay Bhattacharyya Indian Institute of Engineering Science and Technol- ogy, India Elena Bolshakova Moscow State Lomonosov University, Russia Anastasia Bonch- National Research University Higher School of Eco- Osmolovskaya nomics, Russia Aurélien Bossard Université Paris 8, France Jean-Leon Bouraoui Université Catholique de Louvain, Belgium Leonid Boytsov Carnegie Mellon University, USA Pavel Braslavski Ural Federal University / Kontur Labs, Russia Evgeny Burnaev Institute for Information Transmission Problems of Russian Academy of Sciences, Russia Aleksey Buzmakov INRIA-LORIA(CNRS-Université de Lorraine), France ix Artem Chernodub Institute of Mathematical Machines and Systems Problems of Ukraine National Academy of Science, Ukraine Vladimir Chernov Institute for Image Processing of Russian Academy of Science, Russia Ekaterina Chernyak National Research University Higher School of Eco- nomics, Russia Marina Chicheva Samara National Research University, Russia Bonaventura Coppola IBM Research, USA Hernani Costa University of Malaga, Spain Boris Dobrov Lomonosov Moscow State University, Russia Sofia Dokuka National Research University Higher School of Eco- nomics, Russia Florent Domenach Akita International University, Japan Alexey Drutsa Lomonosov Moscow State University & Yandex, Russia Julia Efremova Eindhoven University of Technology, The Nether- lands Natalia Efremova Moscow State Lomonosov University, Russia Maria Eskevich Radboud University Nijmegen, the Netherlands Myasnikov Evgeny Samara National Research University, Russia Stefano Faralli University of Mannheim, Germany Victor Fedoseev Image Processing Systems Institute of Russian Academy of Sciences & Samara National Research University, Russia Michael Figurnov Skolkovo Institute of Science and Technology, Russia Elena Filatova New York City College of Technology, USA Mark Fishel University of Tartu, Estonia Thomas Francois Université catholique de Louvain, Belgium Oleksandr Frei Schlumberger, Norway Binyam Gebrekidan Gebre Max Planck Institute for Psycholinguistics, The Netherlands Natalia Grabar STL CNRS Université Lille 3, France Dmitry Granovsky Yandex, Russia Mena Habib Maastricht University, The Netherlands Dmitry Ilvovsky National Research University Higher School of Eco- nomics, Russia Vladimir Ivanov Kazan Federal University, Russia Dmitry Kan SemanticAnalyzer, Finland Nikolay Karpov National Research University Higher School of Eco- nomics, Russia Egor Kashkin Vinogradov Russian Language Institute of Russian Academy of Sciences, Russia Yury Katkov Ecole Polytechnique Fédérale de Lausanne, Switzer- land x Mehdi Kaytoue LIRIS - INSA de Lyon, France Alexander Kelmanov Sobolev Institute of Mathematics of Siberian Branch of Russian Academy of Sciences, Russia Andrey I. Kibzun Moscow Aviation Institute, Russia Victor Kitov Lomonosov Moscow State University, Russia Eduard Klyshinskii Moscow State Institute of Electronics and Mathe- matics, Russia Ekaterina Kochmar University of Cambridge, UK Sergei Koltcov National Research University Higher School of Eco- nomics, Russia Olessia Koltsova National Research University Higher School of Eco- nomics, Russia Alex Konduforov AltexSoft, Ukraine Natalia Konstantinova University of Wolverhampton, UK Andrey Kopylov Tula State University, Russia Kirill Kornyakov Itseez & Lobachevsky State University of Nizhni Novgorod, Russia Mikhail Korobov ScrapingHub, Russia Anton Korshunov Institute for System Programming of Russian Academy of Sciences, Russia Evgeny Kotelnikov Vyatka State University, Russia Olga Krasotkina Tula State University, Russia Tomas Krilavicius Vytautas Magnus University, Lithuania Valentina Kuskova National Research University Higher School of Eco- nomics, Russia Andrey Kutuzov University of Oslo, Norway Andrey Kuznetsov Samara National Research University, Russia Alexander Lepskiy National Research University Higher School of Eco- nomics, Russia Vadim Levit Ariel University, Israel Benjamin Lind National Research University Higher School of Eco- nomics, Russia Natalia Loukachevitch Lomonosov Moscow State University, Russia Olga Lyashevskaya National Research University Higher School of Eco- nomics & Vinogradov Russian Language Institute of Russian Academy of Sciences, Russia Ilya Markov University of Amsterdam, The Netherlands Luis Marujo Carnegie Mellon University, USA and(University of Lisbon, Portugal Sérgio Matos University of Aveiro, Portugal Yelena Mejova Qatar Computing Research Institute, Qatar Benjamin Milde Technische Universität Darmstadt, Germany Olga Mitrofanova St. Petersburg State University, Russia Andrea Moro Sapienza, Università di Roma, Italy Hubert Naets Université Catholique de Louvain, Belgium xi Vassilina Nikoulina Xerox Research Center Europe, France Damien Nouvel Institut National des Langues et Civilisations Ori- entales, France Dimitri Nowicki Institute of Cybernetics of Ukraine National Academy of Science, Ukraine Alexander Panchenko Universite catholique de Louvain, Belgium Georgios Petasis National Center for Scientific Research Demokritos, Greece Andrey Philippovich Bauman Moscow State Technical University, Moscow Lidia Pivovarova University of Helsinki, Finland Vladimir Pleshko RCO, Russia Jonas Poelmans Bigdatarobots, Cyprus Alexander Porshnev National Research University Higher School of Eco- nomics, Russia Surya Prasath University of Missouri-Columbia, USA Delhibabu Radhakrishnan Kazan Federal University, Russia Carlos Ramisch Aix Marseille University, France Evgeniy Riabenko Moscow Institute of Physics and Technology, Russia Martin Riedl Technische Universität Darmstadt, Germany Alexandra Roshchina Institute of Technology Tallaght, Ireland Eugen Ruppert Technische Universität Darmstadt, Germany Mohammed Abdel-Mgeed Ain Shams University, Egypt M. Salem Grigory Sapunov Stepic, Russia Sheikh Muhammad Sarwar University of Dhaka, Bangladesh & University of Massachusetts Amherst, USA Andrey Savchenko National Research University Higher School of Eco- nomics, Russia Carsten Schnober Deutsches Institut für Internationale Pädagogische Forschung, Germany Marijn Schraagen Utrecht University, The Netherlands Alexander Semenov National Research University Higher School of Eco- nomics, Russia Oleg Seredin Tula State University, Russia Andrey Shcherbakov Intel, Russia Jan Snajder University of Zagreb, Croatia Evgeny Sokolov Lomonosov Moscow State University, Germany Christian Stab Technische Universität Darmstadt, Germany Vadim Strijov Dorodnicyn Computing Centre of Russian Academy of Sciences, Russia Maria Sukhareva Goethe University Frankfurt, Germany Marina Suvorova Lomonosov Moscow State University, Russia Irina Temnikova Qatar Computing Research Institute, Qatar Christos Tryfonopoulos University of Peloponnese, Greece xii Denis Turdakov Institute for System Programming of Russian Academy of Sciences, Russia Alexander Ulanov HP Labs, Russia Natalia Vassilieva HP Labs, Russia Dmitry Vetrov Lomonosov Moscow State University, Russia Ekaterina Vylomova University of Melbourne, Australia Patrick Watrin Université catholique de Louvain, Belgium Rostislav Yavorskiy National Research University Higher School of Eco- nomics, Russia Seid Muhie Yimam Technische Universität Darmstadt, Germany Marcos Zampieri Saarland University, Germany Olga Zvereva Ural Federal University, Russia Invited Reviewers Guillaume Bosc Sujoy Chatterjee Vladimir M. Chernov Silvio Ricardo Cordeiro Samuel Daylis Anna Denisova Svyatoslav Elizarov Denis Fedyanin Yuri Kan Benjamin Milde Alexander Minkin Andrey Naumov Paraskevi Raftopoulou Oleg Slavin Ivan Sterligov Dmitry Ustalov Natali Vaganova Diliara Valeeva Lefteris Zervakis Sponsors Golden sponsors: Exactpro OK.Ru (Mail.Ru Group) Bronze sponsor: IT Centre xiii Table of Contents Structural Optimization of the Travel Time Prediction Model Based on Hierarchical Regression . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Anton Agafonov and Vladislav Myasnikov Linear Variation and Optimization of Algorithms of Connected- Component Labeling in Binary Images . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 Fedor Alekseev, Mikhail Alekseev and Artyom Makovetskii The Hybrid Approach to Part-of-Speech Disambiguation . . . . . . . . . . . . . . . 21 Elena Bruches, Dmitrii Karpenko and Varvara Krayvanova The Methodology of Automated Decryption of Znamenny Chants . . . . . . . 27 Marina Danshina, Andrey Philippovich and Irina Golubeva The Structure of an Investment Portfolio in Two-step Problem of Optimal Investment with One Risky Asset Via the Probability Criterion . 42 Alexei Ignatov Evolution of Advanced Technology Studies: Searching for a Communication Core . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51 Alina Kadyrova and Konstantin Fursov Evolvable Semantic Platform for Facilitating Knowledge Exchange . . . . . . 62 Nikolay Karpov, Eduard Babkin and Alexander Demidovskij Lazy Learning of Classification Rules for Complex Structure Data . . . . . . . 73 Yury Kashnitsky The Corpus of Syntactic Co-occurences: the First Glance . . . . . . . . . . . . . . 85 Edward Klyshinsky, Petr Ermakov, Natalia Lukashevich and Olesya Karpik Determination of an Unmanned Mobile Object Orientation by Natural Landmarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91 Anton Korsakov, Ivan Fomin, Dmitry Gromoshinsky, Aleksandr Bakhshiev, Dmitrii Stepanov and Ekaterina Smirnova Using Ontology-Driven Methods to Develop Frameworks for Tackling NLP Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 Taisiya Kostareva, Svetlana Chuprina and Alexandr Nam Detection and Tracking of the Objects in Real Time Applied to the Problem of the Log Volume Estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114 Artem Kruglov and Yurii Chiryshev Checking the Topological Consistency of Maps of Different Scales . . . . . . . 124 Kirill Kuptsov, Sergey Eremeev and Dmitry Andrianov Fast Multiparametric Wavelet Transforms and Packets for Image Processing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 134 Valeri Labunets, Denis Komarov and Ekaterina Ostheimer The Bichromatic Excitable Schrodinger Metamedium . . . . . . . . . . . . . . . . . . 146 Valery Labunets, Ivan Artemov and Ekaterina Ostheimer Organizations, State Interactions, and Field Stability: A Network Analysis of the Tea Party and Occupy Wall Street Movements . . . . . . . . . . 156 Benjamin Lind, B. Remy Cross and Georgy Mkrtchyan The Results of Sulfur Print Image Classification of Section Images . . . . . . 170 Oxana S. Logunova, Ivan A. Posokhov, Anatoliy Y. Mikov, Elena A. Ilyina, Natalya V. Dyorina and Anatoliy B. Belyavskiy The 360 Around View System for Large Vehicles, the Methods of Calibration and Removal of Barrel Distortion for Omnidirectional Cameras 182 Aleksey S. Makarov and Marina V. Bolsunovskaya Smoothing Voronoi-Based Path with Minimized Length and Visibility Using Composite Bezier Curves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 191 Ilya Makarov and Pavel Polyakov Multi-class Learning in Big Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 203 Anton Malenichev, Olga Krasotkina, Vadim Mottl and Oleg Seredin Automatic Detection of Bone Contours in X-Ray Images . . . . . . . . . . . . . . . 212 Alexey Mikhaylichenko, Yana Demyanenko and Elena Grushko Program Tools for Dynamic Investigation of Social Networks . . . . . . . . . . . 224 Alexander Mikov, Elena Zamyatina and Daria Germanova Classification of E-commerce Websites by Product Categories . . . . . . . . . . . 237 George Moiseev The Study of Applicability of the Decision Tree Method for Contouring of the Left Ventricle Area in Echographic Video Data . . . . . . . . . . . . . . . . . . 248 Andrey Mukhtarov, Sergey Porshnev, Vasiliy Zuzin, Anastasia Bobkova and Vladimir Bobkov Extracting Functional Job Roles from Professional Social Networking Site Profiles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 258 Anastasiia Nesterenko On Road Defects Detection and Classification . . . . . . . . . . . . . . . . . . . . . . . . 264 Thu Huong Nguyen, Aleksei Zhukov and The Long Nguyen xv ARTM vs. LDA: an SVD Extension Case Study . . . . . . . . . . . . . . . . . . . . . . 276 Sergey Nikolenko The Use Convolutional Neural Networks for Recognition of Semiographic Chants . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 283 Andrey Philippovich, Maxim Boynov and Marina Danshina An Elective Multibiometric Authentication . . . . . . . . . . . . . . . . . . . . . . . . . . . 292 Alexey Ruchay A Project Teams Creation Based on Communities Detection . . . . . . . . . . . 303 Mikhail Semenov, Elena Koroleva, Dilmurat Tursunov and Lev Bulygin A Branching Alignment-Based Synthesis of Regular Expressions . . . . . . . . 315 Andrei Shcherbakov Topic Modeling for Answers Detection in Online Game Chats . . . . . . . . . . 329 Alexander Sirotkin, Ilya Musabirov, Paul Okopny, Denis Bulygin and Vladimir Ivanov Detector of Interest Point within Region of Interest on NBI Endoscopy Images . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 334 Dmitry M. Stepanov, Vyacheslav V. Mizgulin, Vsevolod V. Kosulnikov, Radi M. Kadushnikov, Evgeny D. Fedorov and Olga A. Buntseva Research of Properties of Digital Noise in Contrast Images . . . . . . . . . . . . . 340 Vladimir Surin and Alexander Tyrsin The Iterative Closest Points Algorithm and Affine Transformations . . . . . . 349 Dmitrii Tihonkih, Artyom Makovetskii and Vladislav Kuznetsov Conceptual Model for Routine Measurements Analyses in Semantic Web Applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 357 Nataly Zhukova, Alexander Vodyaho and Maksim Lapaev The Task of Compiling the Project Execution Plan in the Multi-Agent Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 369 Alexey Zraenko Triad Census Usage for Communication Network Analysis . . . . . . . . . . . . . . 378 Olga M. Zvereva xvi