2021
DOI: 10.1007/s13278-021-00827-y
|View full text |Cite
|
Sign up to set email alerts
|

Profile generation from web sources: an information extraction system

Abstract: The Internet space has a vast collection of information which is not always structured. These sources of information such as social media, news articles, blogs, speeches and videos often contain information that could be utilized to generate decision making tools such as reports about events and individuals. Using this information is a long and tedious process if done manually. Over the years, a lot of research has been done in data mining and natural language processing techniques to facilitate the consumptio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 22 publications
(7 reference statements)
0
2
0
Order By: Relevance
“…In response to the current composite implementation crises, high mistake rate, and slow extraction speed of web information extraction expertise, the paper [15] suggested a revolutionary web extraction technique based on the uniqueness of web page construction. To improve the efficiency of information extraction, we need to look at the automatic web information extraction technique [16]. Page clustering is used to find high comparison clusters by supporting the DOM extraction technique, and the accuracy of clustering findings is improved by using the similarity measure [17].…”
Section: Related Workmentioning
confidence: 99%
“…In response to the current composite implementation crises, high mistake rate, and slow extraction speed of web information extraction expertise, the paper [15] suggested a revolutionary web extraction technique based on the uniqueness of web page construction. To improve the efficiency of information extraction, we need to look at the automatic web information extraction technique [16]. Page clustering is used to find high comparison clusters by supporting the DOM extraction technique, and the accuracy of clustering findings is improved by using the similarity measure [17].…”
Section: Related Workmentioning
confidence: 99%
“…The contribution of the MUC warehouse in the field of IE even leads researchers to classify IE approaches into two classes. The first class of MUC IE approaches: AutoSolg [6], LIEP [7], PALKA [8], etc. And the second class of IS approaches after MUC: WIEN [9], SoftMealy [10], WHISK [11], etc.…”
Section: Related Workmentioning
confidence: 99%