2022
DOI: 10.1155/2022/9220661

Automatic Extraction of Web Page Text Information Based on Network Topology Coincidence Degree

Abstract: To solve the above problems effectively, an automatic extraction method for web text information based on network topology coincidence degree is proposed. A search engine, a web crawler, and hypertext tags are used to classify web text information, and dimensionality reduction is then carried out. After processing, the similarity of different features of web page text information is calculated, the similarities are sorted, and similar text information is extracted according to the correlation based on seg…
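The abstract names a sequence of steps (reduce dimensionality, compute feature similarity, sort, extract) without giving details. The following is only a minimal sketch of that general idea, not the authors' method: it assumes bag-of-words term counts as features, a top-k vocabulary cut as a stand-in for dimensionality reduction, and cosine similarity as the similarity measure. All function names (`tokenize`, `top_k_vocabulary`, `vectorize`, `cosine`, `rank_blocks`) are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def top_k_vocabulary(texts, k):
    """Keep the k most frequent terms (assumed stand-in for dimensionality reduction)."""
    counts = Counter()
    for text in texts:
        counts.update(tokenize(text))
    # Sort by descending count, then alphabetically, so the result is deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


def vectorize(text, vocabulary):
    """Map a text to a term-count vector over the reduced vocabulary."""
    counts = Counter(tokenize(text))
    return [counts[term] for term in vocabulary]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_blocks(query, blocks, k=50):
    """Score each text block against the query and sort by descending similarity."""
    vocabulary = top_k_vocabulary(blocks + [query], k)
    query_vec = vectorize(query, vocabulary)
    scored = [(cosine(query_vec, vectorize(b, vocabulary)), b) for b in blocks]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


if __name__ == "__main__":
    blocks = [
        "web page text extraction from html",
        "cooking recipes with tomato",
        "text extraction for web crawler",
    ]
    for score, block in rank_blocks("web text extraction", blocks):
        print(f"{score:.3f}  {block}")
```

Blocks that share no terms with the query score 0.0 and sort last; an extraction step could then keep only blocks above a chosen threshold, which is likewise an assumption rather than something the abstract specifies.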

Citations: cited by 6 publications (8 citation statements)
References: 37 publications (47 reference statements)
“…According to Diffusion-based algorithms, their results are more trustworthy and less vulnerable to the type of graphs used. Reference [15] An automatic extraction approach based on network topology coincidence degree is proposed to successfully overcome the above concerns. To classify web text content, a search engine, a web crawler, and a hypertext tag are utilized, followed by dimensionality reduction.…”
Section: Literature Survey
Citation type: mentioning
confidence: 99%
“…Zhinian Shu & Xiaorong Li [18] implemented an automatic extraction method of web text information based on network topology coincidence degree. Web crawler, hypertext tag, and search engine were utilized for web text information classification, and the reduction of dimensionality was carried out.…”
Section: Literature Survey
Citation type: mentioning
confidence: 99%
“…And in the similar way, proposed method also synthesized dataset for the comparison. Table 4 shows that the comparative analysis with the existing methods of WCPK [17], Automatic extraction method [18], Malicious website detection technique [19], and BERT, SoftMax [20].…”
Section: Comparative Analysis
Citation type: mentioning
confidence: 99%
“…Using this method to retrieve the web portal information can avoid the limitation of slow convergence caused by too much complex data; thus, improving the retrieval speed of a computer processing cloud data. The reason is that this method can construct the decision tree of information retrieval quickly and reduce the retrieval time to a certain extent by using the dynamic information as the node of the decision tree [35,36].…”
Section: Experimental Analysis
Citation type: mentioning
confidence: 99%