2019
DOI: 10.1186/s40537-019-0254-8
|View full text |Cite
|
Sign up to set email alerts
|

An analytical study of information extraction from unstructured and multidimensional big data

Abstract: IntroductionInformation extraction (IE) process extracts useful structured information from the unstructured data in the form of entities, relations, objects, events and many other types. The extracted information from unstructured data is used to prepare data for analysis. Therefore, the efficient and accurate transformation of unstructured data in the IE process improves the data analysis. Numerous techniques have been introduced for different data types i.e. text, image, audio, and video.The advancement in … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
73
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
3
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 117 publications
(75 citation statements)
references
References 142 publications
(215 reference statements)
0
73
0
Order By: Relevance
“…Recent developments in NLP techniques have helped the automation of this process through machine learning and, in particular, deep learning algorithms [ 82 , 83 ]. Symptoms, patient demographics, clinical data, algorithms, performance, and limitations are identifiable in the texts by properly trained models, which can obtain comparable accuracy to humans at a much faster rate, making it finally possible to monitor the enormous volume of the literature produced [ 84 ]. The resulting structured data can be exploited to enrich knowledge graphs (KGs) [ 85 - 87 ], which provide a means to represent and formalize information [ 85 , 88 ], analytical, relational, and inferential investigations and fill the knowledge gaps in the community.…”
Section: Mining Of the Medical Literaturementioning
confidence: 99%
“…Recent developments in NLP techniques have helped the automation of this process through machine learning and, in particular, deep learning algorithms [ 82 , 83 ]. Symptoms, patient demographics, clinical data, algorithms, performance, and limitations are identifiable in the texts by properly trained models, which can obtain comparable accuracy to humans at a much faster rate, making it finally possible to monitor the enormous volume of the literature produced [ 84 ]. The resulting structured data can be exploited to enrich knowledge graphs (KGs) [ 85 - 87 ], which provide a means to represent and formalize information [ 85 , 88 ], analytical, relational, and inferential investigations and fill the knowledge gaps in the community.…”
Section: Mining Of the Medical Literaturementioning
confidence: 99%
“…The actual source of that data could be internal or external. Besides, these data can be created passively or actively by humans, systems or sensors, and, can follow heterogeneous formats such as structured, semi-structured or unstructured [34,35]. Once data generated, it will eventually power other systems through acquisition processes.…”
Section: Data Generationmentioning
confidence: 99%
“…Moreover, information extraction (IE) techniques required with the rapid growth of multifaceted also called as multidimensional unstructured data which are explored in a survey by [51]. Task-dependent and task-independent are the limitations of IE covering all data types.…”
Section: B Assessment Of Rq2: Which Challenges Have Been Faced Durinmentioning
confidence: 99%