Recently, Indonesian Health Ministry has problems with interpreting health surveillance chart, because they need experts for interpreting the chart. Since it is difficult to provide experts especially in rural area, intelligent system for interpreting the health surveillance chart is necessary. This paper proposed a method to generate sentences in Indonesian language which is extracted from the chart image (e.g. health surveillance chart). The sentences is used to interpret the chart in Indonesian Language. In order to generate sentences, lexicalization based on image features data is proposed to provide some "seed words". These words are used by the NLG system to generate sentences. After obtaining the seed words, the lexical selection is conducted and the main topic of the paragraph is determined. Finally, the main topic was determined and furthermore the sentence in Indonesian Language is ready to be generated. There were some methods for interpreting data such as [1][2][3], but this method could not be used in Indonesian Health Ministry case because most of the chart which should be interpreted are in the form of chart image. Therefore, the proposed method uses NLG based on image extraction and pattern abstraction, such that it can be used for interpreting the chart image.
Graph Feature Extraction is the task to collect any important feature on the graph image so the extracted data can be further processed. In this paper, it is assumed that the graph consists only one single curve. The extracted features here are graph title, axis title, axis values, legend, scales, and curve values.To find the Region Of Interest (ROI) of the graph image, pixel projection is adopted. This is necessary to find the possible information location in the graph image. After the location has been located, then the alphanumeric data such as graph title, axis texts, and the legend of the graph are processed by the OCR (Optical Character Recognition).The values of the graph line is extracted by performing the scale calculation, and then, using the scale, the line pixel position is calculated to find the numeric data of the graph.The extracted data can be used for further processing such as graph interpretation system, pattern recognition, pattern abstraction, automatic graph reader system, etc. The data also can be used to analyze the graph when the graph came in the image form and is needed to be converted into numeric form.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.