Author Archives: Marcin

A Diversified Classification Committee for Recognition of Innovative Internet Domains

Abstract

The objective of this paper was to propose a classification method of innovative domains on the Internet. The proposed approach helped to estimate whether companies are innovative or not through analyzing their web pages. A Naïve Bayes classification committee was used as the classification system of the domains. The classifiers in the committee were based concurrently on Bernoulli and Multinomial feature distribution models, which were selected depending on the diversity of input data. Moreover, the information retrieval procedures were applied to find such documents in domains that most likely indicate innovativeness. The proposed methods have been verified experimentally. The results have shown that the diversified classification committee combined with the information retrieval approach in the preprocessing phase boosts the classification quality of domains that may represent innovative companies. This approach may be applied to other classification tasks.

The hybrid decision support system for Fire Service – chosen project’s problems

Abstract

This article presents the design process of a hybrid decision support system (HSWD) for the State Fire Service (PSP). The Design for Trustworthy Software (DFTS) methodology was chosen to ensure system reliability. The paper focuses particularly on the requirements planning stage and the overall platform design. The study identifies key challenges in the early project stages, primarily stemming from methodology, environment, and user-related factors. These elements play a crucial role at the start of the design process, whereas aspects such as software, hardware, and measurement have a lesser initial impact. The authors analyze the causes of these challenges and propose solutions to address them. By outlining the lack of specific information solutions in the current State Fire Service infrastructure, this research highlights the importance of a structured approach in decision support system development. The findings contribute to the design of a robust and reliable platform that enhances decision-making in emergency response scenarios.

The Cascading Knowledge Discovery: A Smarter Way to Design Information Systems

Abstract

This article describes a proposal of information system project method. This method based on author’s cascading knowledge discovery in databases process. In this article, the author also to presented use case of this process. All analysis presented in this article based on text reports from the rescue fire service.

A Method for Designing a Knowledge Base and Rules for Text Segmentation Using Formal Concept Analysis

Abstract

Objective: Presentation of a specialist text segmentation technique. The text was derived from reports (a form “Information about theevent”, field “Information about the event – descriptive data”) prepared by rescue units of the State Fire Service after firefighting andrescue operations.

Methods: In order to perform the task the author has proposed a method of designing the knowledge base and rules for a textsegmentation tool. The proposed method is based on formal concept analysis (FCA). The knowledge base and rules designed by theproposed method allow performing the segmentation process of the available documentation. The correctness and effectiveness of theproposed method was verified by comparing its results with the other two solutions used for text segmentation.

Results: During the research and analysis rules and abbreviations that were present in the studied specialist texts were grouped anddescribed. Thanks to the formal concepts analysis a hierarchy of detected rules and abbreviations was created. The extracted hierarchyconstituted both a knowledge and rules base of tools for segmentation of the text. Numerical and comparative experiments on theauthor’s solution with two other methods showed significantly better performance of the former. For example, the F-measure resultsobtained from the proposed method are 95.5% and are 7-8% better than the other two solutions.

Conclusions: The proposed method of design knowledge and rules base text segmentation tool enables the design and implementationof software with a small error divide the text into segments. The basic rule to detect the end of a sentence by the interpretation of thedots and additional characters as the end of the segment, in fact, especially in case of specialist texts, must be packaged with additionalrules. These actions will significantly improve the quality of segmentation and reduce the error. For the construction and representationof such rules is suitable presented in the article, the formal concepts analysis. Knowledge engineering and additional experiments canenrich the created hierarchy by the new rules. The newly inserted knowledge can be easily applied to the currently established hierarchythereby contributing to improving the segmentation of the text. Moreover, within the numerical experiment is made unique: a set ofrules and abbreviations used in reports and set properly separated and labeled segments

Article – Language-Independent Information Extraction Based on Formal Concept Analysis

This paper proposes application of Formal Concept Analysis (FCA) in creating character-level information extraction patterns and presents BigGrams: a prototype of a languageindependent information extraction system. The main goal of the system is to recognise and to extract of named entities belonging to some semantic classes (e.g. cars, actors, pop-stars, etc.) from semi structured text (web page documents).

Article – Review of methods and text data mining techniques

This article describes the author’s classification of the methods and techniques of textual data mining. In this article also describes the currently available methods and sauces representation of textual data and their processing techniques. Also conducted a discussion on the processing of text documents using the presented methods. This paper also discussed the possibilities and limitations of individual methods to process the presented text documents.

Designing Information Systems Through Text Mining: A Case Study of Fire Service Documentation

On September 25, 2013, at 12:15 PM in room WA-130 of the Rectorate building at Białystok University of Technology, I successfully defended my doctoral dissertation titled “Text Data Analysis in Designing a Selected Information System: A Case Study of National Fire Service Incident Documentation.” A detailed description of the proposed method can be found in the Publications – Seminars section or downloaded directly here. Below is a simplified overview of my research.

Article – Proposition of hybrid process model semi structured description of event from fire services rescues operation

The article “Proposition of hybrid process model semi structured description of event from fire services rescues operation” describes a review of actual developed knowledge representation and case representation for fire services cases based reasoning system. The article also describes a method of processing the cases of events. This processing method based on classification and information retrieval.

Article – Crowdsourcing in rescue fire service – proposed application

Few days ago a SIMIS magazine publicated my article Crowdsourcing in rescue fire service – proposed application. In this article I describes the proposal to apply crowdsourcing in Polish rescue fire service. This article also describes basic principles for implementing an crowdsourcing information platform in rescue fire service as well as the scheme of its implementation. Of this paper also to I describes the genesis of this proposal related to the evaluation of research conducted by the author on text mining analysis and extraction of information in the design of information systems.

Questionnaire for fire service

I finished implementation  a questionnaire for fire service on the previous day. This questionnaire was implemented for quantitative/qualitative research. The destination of this research is a creation a hybrid decision support system for Polish fire service.  It’s a complicate problem which require many of different  researches. This research integrate solution from logistic, transport, game theory,  artificial intelligence, linguistics, retrieval of text information  like a text mining and especially text reprezentation and his processing.  All project was described by process of design for trustworthy software (DFTS). The results of  investigations will be published after the completion questionnaire by respondenst.

Questionnaire was implemented using web technologies like a JAVA+JSF+Hibernate+MySQL.