A metadata approach to classify domain-specific documents for Event-based Surveillance Systems

Digital news sources are the primary source of information for health officials and stakeholders to stay informed about potential health risks. However, with the abundance of news sources available, it can be challenging to distinguish relevant news articles from irrelevant ones. To address this issue, we propose a metadata-based approach for classifying news articles containing information on health events. The first step involves extracting metadata from each news article in the dataset. We then use a machine learning model to classify news articles as relevant or irrelevant. The proposed approach was validated using two different datasets with varying combinations of relevant and irrelevant news articles. The experiments were conducted using a 70%-30% train-test split. The results of the experiments show that the proposed approach is highly effective in classifying relevant news articles for Event-based Surveillance System (EBS). Additionally, several metadata features were identified as being important for the classification task.

Saved in:
Bibliographic Details
Main Authors: Syed, Mehtab Alam, Arsevska, Elena, Roche, Mathieu, Teissere, Maguelonne
Format: conference_item biblioteca
Language:eng
Published: IEEE
Online Access:http://agritrop.cirad.fr/607316/
http://agritrop.cirad.fr/607316/1/Metadata_classification_camera.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!