=Paper= {{Paper |id=Vol-1625/abs2 |storemode=property |title=None |pdfUrl=https://ceur-ws.org/Vol-1625/abs2.pdf |volume=Vol-1625 }} ==None== https://ceur-ws.org/Vol-1625/abs2.pdf
Topic Modeling without Generative Probabilistic
   Model: An Approach and its Validation ?

                        Mikhail G. Kreines, Elena M. Kreines

         BaseTech LLC, Raketny boulevard, 11-7-164, Moscow, 129366, Russia
                                mkrf@yandex.ru



        Abstract. We propose a novel approach to compute a model of seman-
        tic structure of natural language texts corpora based on the models of
        the texts. This approach differs from LSI and standard ways of topic
        modeling (D. Blei, 2012). We do not use matrix factorization and prob-
        abilistic distributions of the words given a priory. As the base for the
        corpora model construction we use the set of the original vector models
        of the corpora texts. The ways of empirical validation and applications
        of the models are considered.

        Keywords: natural languages, texts, text collections, models, Informa-
        tion Retrieval




?
    The work is partially financially supported by the Ministry of Education and Science,
    Russian Federation (Contract N. 14.579.21.0090 from 27.11.2014, project identifier
    RFMEFI57914X0090).