Email is one of the most powerful tools for communication. Many businesses use email as the main channel for communication, so it is possible that substantial data are included in email content. In order to help businesses grow faster, a workflow management system may be required. The data gathered from email content might be a robust source for a workflow management system. This research proposes an email extraction system to extract data from any incoming emails into suitable database fields. The database, which is created by the program, has been planned for the implementation of a workflow management system. The research is presented in three phases: (1) define suitable criteria to extract data; (2) implement a program to extract data, and store them in a database; and (3) implement a program for validating data in a database. Four criteria are applied for an email extraction system. The first criterion is to select contact information at the end of the email content; the second criterion is to select specified keywords, such as tel, email, and mobile; the third criterion is to select unique names, which start with a capital letter, such as the names of people, places, and corporates; the fourth criterion is to select special texts, such as Co. Ltd, .com, and www. The empirical results suggest that when all four criteria are considered, the accuracy of a program and percentage of blank fields are at an acceptable level compared with the results from other criteria. When four criteria are applied to extract 7,340 emails in English, the accuracy of this experiment is approximately 68.66%, while the percentage of blank fields in a database is approximately 68.05. The database created by the experiment can be applied in a workflow management system.
KeywordsBusiness operations, startup business, import/export industry, email, business data, workflow management system, business transactions, migrating, email extraction system. Revised: May 2017 * The authors would like to express their gratitude to the Executive Vice-President of Finish International Freight Co. Ltd, as well as another two anonymous companies which cannot be mentioned because of confidentiality. The companies provided very useful information and insights to conduct this research. Thanks also to Khun Natthicha Phonjan and Khun Sariporn Plipon, who assisted greatly to edit and verify the accuracy of the program. It is appreciated that the business data provided by the three selected businesses are sensitive, and will not be disclosed or used for any purpose other than the research for the paper. The authors are also grateful for the helpful comments and suggestions of Chia-Lin Chang. Corresponding author: Takorn Prexawanprasut (takorn.pre@dpu.ac.th) 1
JEL Classification
AbstractEmail is one of the most powerful tools for communication. Many businesses use email as the main channel for communication, so it is possible that substantial data are included in email content. In order to help businesses grow faster, a workflow management system ma...