Tag Archives: systematic review

Document Classification Pattern Recognition via Information Fusion: A systematic review of multimodal and multiview representation approaches

Abstract

Information fusion is widely used to improve document classification by integrating multiple data sources (multimodal) or multiple representations of the same data (multiview). Yet the literature has been fragmented: there has been no unified framework, no quantitative synthesis of “how much fusion helps,” and limited practitioner-oriented guidance. In this systematic review, we analyse 139 primary studies, propose a formal framework to structure the field, summarise key qualitative trends, and perform a random-effects meta-analysis (to our knowledge, the first focused specifically on document classification). The results show that multimodal fusion significantly improves accuracy (mean gain +5.28 percentage points, p=0.0016), while multiview fusion yields consistent but modest improvements in accuracy (+4.67%), F1-score (+3.08%), and recall (all p<0.05). We also highlight a reproducibility gap: only 11.8% of multimodal studies and 23.3% of multiview studies report statistical tests. Overall, the key lesson is practical: success depends less on algorithmic complexity and more on aligning the fusion strategy with the task context and committing to rigorous validation.
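To make the phrase “random-effects meta-analysis” concrete, here is a minimal sketch of one common estimator, DerSimonian-Laird pooling. The abstract does not say which estimator the review used, so the choice of method, the function name `random_effects_pool`, and the example numbers are all assumptions made for illustration; they are not data from the 139 studies.

```python
import math


def random_effects_pool(effects, variances):
    """Pool per-study effects with the DerSimonian-Laird estimator.

    effects   -- per-study effect sizes (e.g. accuracy gain in points)
    variances -- per-study sampling variances of those effects
    Returns (pooled effect, standard error, tau^2, two-sided p-value).
    """
    k = len(effects)
    # Fixed-effect (inverse-variance) weights and pooled estimate.
    w = [1.0 / v for v in variances]
    sw = sum(w)
    fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sw
    # Cochran's Q measures heterogeneity around the fixed-effect estimate.
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, effects))
    c = sw - sum(wi ** 2 for wi in w) / sw
    # Between-study variance, truncated at zero.
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0
    # Random-effects weights add tau^2 to every study's variance.
    w_star = [1.0 / (v + tau2) for v in variances]
    pooled = sum(wi * yi for wi, yi in zip(w_star, effects)) / sum(w_star)
    se = math.sqrt(1.0 / sum(w_star))
    # Two-sided p-value under a normal approximation.
    p = math.erfc(abs(pooled / se) / math.sqrt(2))
    return pooled, se, tau2, p


if __name__ == "__main__":
    # Hypothetical accuracy gains and variances for four made-up studies.
    gains = [6.1, 3.2, 7.4, 4.0]
    var = [1.5, 0.8, 2.0, 1.1]
    pooled, se, tau2, p = random_effects_pool(gains, var)
    print(f"pooled gain = {pooled:.2f} (SE {se:.2f}), tau^2 = {tau2:.2f}, p = {p:.4f}")
```

Unlike a fixed-effect model, this approach lets the true effect differ from study to study, which suits a literature where datasets, modalities, and baselines vary widely.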

A Researcher’s Roadmap: A Practical Framework for Rigorous Science

After many years spent in research, the scientific process—from idea to publication—becomes second nature. However, this intuition, though invaluable, deserves to be structured. The desire to describe this workflow stems not only from a need to better understand my own work but also from the desire to create a map that can help others navigate this complex terrain.

One inspiration was a humorous but accurate list from the book “We Have No Idea: A Guide to the Unknown Universe” by Jorge Cham and Daniel Whiteson:

  1. Organize what you know
  2. Look for patterns
  3. Ask questions
  4. Buy a tweed jacket with elbow patches

However, scientific work is, above all, the art of asking the right questions. It’s not about “beating the baseline” but about understanding a phenomenon. The question “why?” is a researcher’s compass. In turn, understanding often means being able to reconstruct a mechanism (e.g., by implementing it in code or writing a formal proof), although in some areas of mathematics, a complete, verifiable line of reasoning is sufficient.

I have noticed that whether I am writing an empirical paper in Natural Language Processing (NLP) or a systematic review with a meta-analysis, a common skeleton lies beneath the surface. The result of these observations is the working framework below, which attempts to visualize this skeleton.


The Outcomes and Publication Standards of Research Descriptions in Document Classification: A Systematic Review

Abstract

Document classification, a critical area of research, employs machine and deep learning methods to solve real-world problems. This study highlights the qualitative and quantitative outcomes of a literature review covering a broad scope, including machine and deep learning methods as well as nature-, biology-, and quantum-physics-inspired methods. A rigorous synthesis was conducted through a systematic literature review of 102 papers published between 2003 and 2023. The 20 Newsgroups dataset (bydate version) was used as the reference benchmark to ensure fair comparison of methods. Qualitative analysis revealed that recent studies combine Graph Neural Networks (GNNs) with transformer-based models and propose end-to-end solutions. Quantitative analysis identified state-of-the-art results, with accuracy, micro F1-score, and macro F1-score of 90.38%, 88.28%, and 89.38%, respectively. However, the reproducibility of many studies may need to be improved. The resulting overview covers a wide range of document classification methods and can contribute to a better understanding of this field. Additionally, the systematic review approach reduces systematic error, making it useful for researchers in the document classification community.
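Since the abstract reports accuracy, micro F1, and macro F1 side by side, a small sketch of how the three differ may help. The function name `f1_scores` and the toy labels are hypothetical and are not taken from the review or from 20 Newsgroups.

```python
from collections import Counter


def f1_scores(y_true, y_pred):
    """Return (accuracy, micro-F1, macro-F1) for single-label predictions."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class gains a false positive
            fn[t] += 1  # true class gains a false negative

    def f1(tp_, fp_, fn_):
        denom = 2 * tp_ + fp_ + fn_
        return 2 * tp_ / denom if denom else 0.0

    # Macro: average the per-class F1 scores, so every class counts equally.
    macro = sum(f1(tp[c], fp[c], fn[c]) for c in labels) / len(labels)
    # Micro: pool the counts over all classes, then compute a single F1.
    micro = f1(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    accuracy = sum(tp.values()) / len(y_true)
    return accuracy, micro, macro


if __name__ == "__main__":
    # Toy, imbalanced example with made-up labels.
    y_true = ["sci", "sci", "sci", "rec"]
    y_pred = ["sci", "sci", "rec", "rec"]
    acc, micro, macro = f1_scores(y_true, y_pred)
    print(f"accuracy={acc:.4f} micro-F1={micro:.4f} macro-F1={macro:.4f}")
```

Macro averaging weights every class equally, so it is more sensitive to performance on rare classes than micro averaging, which is dominated by frequent classes.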