Optical character recognition (OCR) is a technology to digitize a paper-based document to digital form. This research studies the extraction of the characters from a Thai vehicle registration certificate via a Google Cloud Vision API and a Tesseract OCR. The recognition performance of both OCR APIs is also examined. The 84 color image files comprised three image sizes/resolutions and five image characteristics. For suitable image type comparison, the greyscale and binary image are converted from color images. Furthermore, the three pre-processing techniques, sharpening, contrast adjustment, and brightness adjustment, are also applied to enhance the quality of image before applying the two OCR APIs. The recognition performance was evaluated in terms of accuracy and readability. The results showed that the Google Cloud Vision API works well for the Thai vehicle registration certificate with an accuracy of 84.43%, whereas the Tesseract OCR showed an accuracy of 47.02%. The highest accuracy came from the color image with 1024×768 px, 300dpi, and using sharpening and brightness adjustment as pre-processing techniques. In terms of readability, the Google Cloud Vision API has more readability than the Tesseract. The proposed conditions facilitate the possibility of the implementation for Thai vehicle registration certificate recognition system.
This paper aims to develop web and mobile application and open data platform to facilitate the community researchers use for monitoring the water quality management in Thailand. The paper developed a mobile and web application to collect and represent WQI data. Besides that, this paper elaborated the open data platform for sharing the WQI data to the other public sectors with two formats are people-readable format and machine-readable format. The population of this research is the community researchers who live in the Pakpanang river basin, including in local authorities, volunteers, and academic researchers. This paper also pays attention to the third parties are living in the outside area, which uses the WQI data. The experiment found that the overall user satisfaction whos participate in this project is in a good rank. The top three ranks of the function usage are: the first rank is a monthly report of the WQI data function. The second rank is a sending the WQI data by the volunteer function, and a calculating the WQI data by the academic researcher function. Finally, The third rank is open data usage function. Also, the measurement station increased from twenty to twenty-two stations. The volunteers expanded from the local authorities, volunteers, and academic researchers to students and teachers. For the future, this paper tries to apply a minimisation method for optimising The WQI parameters and also classify the proper parameters for each measurement station.
This study aims to develop an automated data digitization system for the Thai vehicle registration certificate. It is the first system developed as a web service Application Programming Interface (API), which is essential for any enterprise to increase its business value. Currently, this system is available on “www.carjaidee.com”. The system involves four steps: 1) an embedded frame aligns a document to be correctly recognised in the image acquisition step; 2) sharpening and brightness filtering techniques to enhance image quality are applied in the pre-processing step; 3) the Google Cloud Vision API receives a prompt to proceed in the recognition step; 4) a specific domain dictionary to improve accuracy rate is developed for the post-processing step. This study defines 92 images for the experiment by counting the correct words and terms from the output. The findings suggest that the proposed method, which had an average accuracy of 93.28%, was significantly more accurate than the original method using only the Google Cloud Vision API. However, the system is limited because the dictionaries cannot automatically recognise a new word. In the future, we will explore solutions to this problem using natural language processing techniques. Doi: 10.28991/CEJ-2022-08-07-09 Full Text: PDF
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.