Air pollutants affect both human health and the environment. For this reason, environmental managers and urban planners focus their efforts in monitoring air pollution. In this context, complete information is required to support the decision-making process to improve the quality of life in urban zones. Hence, it is important to extract knowledge not only on concentration levels but associations between air pollutants. Based on the Cross-industry standard process for data mining, this paper presents an approach which leads to identify correlations and incidence between the most harmful pollutants in the Andean Region: Ozone, Carbon monoxide, Sulfur dioxide, Nitrogen dioxide and, Particulate material. This paper describes an experiment using a real dataset from a monitoring station in Cuenca, Ecuador located in the Andean region. The results show that the proposed approach is effective to extract knowledge useful to support the evaluation of air quality in urban zones. In addition, this approach provides a starting point for future data mining applications for the analysis of air pollution in the context of the Andean region.
Named Entity Recognition problem’s objective is to automatically identify and classify entities like persons, places,organizations, and so forth. That is an area in Natural Language Processing and Information Extraction. NamedEntity Recognition is important because it is a fundamental step of preprocessing for several applications like relationextraction. However, it is a hard problem to solve as several categories of named entities are written similarly andthey appear in similar contexts. To accomplish it, we can use some hybrid approaches. Nevertheless, in this presentstudy, we use linguistic flavor by applying Local Grammar and Cascade of Transducers. Local Grammars are used torepresent the rules of a particular linguistic structure. They are often built manually to describe the entities we aimto recognize. In our study, we adapted a Local Grammar to improve the Recognition of Named Entities. The resultsshow an improvement of up to 7% on the F-measure metric in relation to the previous Local Grammar. Also, we builtanother Local Grammar to recognize family relationships from the improved Local Grammar. We present a practicalapplication for the extracted relationships using Prolog.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.