2023
DOI: 10.1109/access.2023.3240898
|View full text |Cite
|
Sign up to set email alerts
|

Systematic Literature Review of Information Extraction From Textual Data: Recent Methods, Applications, Trends, and Challenges

Abstract: Information extraction (IE) is a challenging task, particularly when dealing with highly heterogeneous data. State-of-the-art data mining technologies struggle to process information from textual data. Therefore, various IE techniques have been developed to enable the use of IE for textual data. However, each technique differs from one another because it is designed for different data types and has different target information to be extracted. This study investigated and described the most contemporary methods… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 178 publications
0
1
0
Order By: Relevance
“…In [33,34], they focus mainly on methods, applications, trends and challenges in extracting information from textual data. The research discusses the applications of information extraction in different domains and the challenges faced.…”
Section: Application Of Knowledge Graphsmentioning
confidence: 99%
See 1 more Smart Citation
“…In [33,34], they focus mainly on methods, applications, trends and challenges in extracting information from textual data. The research discusses the applications of information extraction in different domains and the challenges faced.…”
Section: Application Of Knowledge Graphsmentioning
confidence: 99%
“…The traditional edit distance cannot determine the degree of similarity at the semantic level, which is particularly important for the semantics of domain KGs, as it can only determine the degree of match of string literal d by measuring the distance of the string. For this reason, we have introduced Word2vec [34] for unsupervised learning in the traditional editing distance, in which word vectors are trained for specialized domains.…”
Section: Calculation Of Similarity Based On Improved Edit Distancementioning
confidence: 99%
“…These include languages with small speaking populations and minimal written data, a language widely used but rarely discussed in NLP research, and domains with limited training data [14]. VOLUME XX, 2017 Previous systematic studies [1], [3], [7], [15] explained NER, RE, and EE for information extraction on text data but did not include SRL. We also found only a few systematic studies that addressed SRL for IE processes, where Wang et al [16] described SRL specifically for Chinese, and Ariyanto et al [2] discussed SRL specifically for Indonesian.…”
Section: Introductionmentioning
confidence: 99%
“…By harnessing advanced analytical techniques and machine learning algorithms [2], predictive modeling can tap into this wealth of information to foresee future trends, consumer behaviors, and emerging patterns [3]. Businesses can adapt their strategies based on predictive insights gleaned from social media data, enabling them to make informed decisions [4], anticipate market shifts, and tailor offerings to customer preferences [5]. Navigating the data-rich realm of social media, its significance in predictive modeling goes beyond its sheer volume.…”
Section: Introductionmentioning
confidence: 99%