12
paź
Autor: Marcin kategoria: Note about IT, Scientific research diary | Tagi :Formal Concept Analysis, information extraction, information extraction system | Brak komentarzy
This paper proposes application of Formal Concept Analysis (FCA) in creating character-level information extraction patterns and presents BigGrams: a prototype of a languageindependent information extraction system. The main goal of the system is to recognise and to extract of named entities belonging to some semantic classes (e.g. cars, actors, pop-stars, etc.) from semi structured text (web page documents).
12
paź
Autor: Marcin kategoria: About everything-anything | Tagi :exploratory analysis of text data, Keywords: text data mining, methods of analysis of textual data, text analyzing | Brak komentarzy
This article describes the author’s classification of the methods and techniques of textual data mining. In this article also describes the currently available methods and sauces representation of textual data and their processing techniques. Also conducted a discussion on the processing of text documents using the presented methods. This paper also discussed the possibilities and limitations of individual methods to process the presented text documents.