2021
DOI: 10.1016/j.cose.2021.102312

Website categorization via design attribute learning

Cited by 8 publications (5 citation statements)
References 35 publications
“…This paper extended the study of Cohen et al [24] by developing an algorithm that automatically extracts websites' features, including full screenshots and image analysis capabilities, in a large-scale operation and enriches each website record with third-party data regarding its operation and metadata. Advanced machine learning (ML) classification models were then applied to determine whether each website is malicious.…”
Section: Experimental Settings; citation type: mentioning
confidence: 94%
“…In terms of design attributes, we refer to all visual and nonvisual elements that a web page consists of [24]. Among these attributes, we can find HTML code and hierarchies, JavaScript, CSS, color tables, styles, font types, objects, etc.…”
Section: Methods; citation type: mentioning
confidence: 99%
“…Content categorization falls into use cases of machine learning or probabilistic models as they can be trained to automatically categorize or classify content into predefined categories based on the content's characteristics, features, or patterns. Techniques needed for achieving a success in these fields have been developed by researchers generally in purpose of cracks and malicious websites detection [2], web navigation prediction [5], fake news detection [1], Search Engine Optimization [11]. All the purposes of websites or generally content classification utilized different methods of data acquisition and preprocessing, feature extraction and machine learning algorithms.…”
Section: Introduction; citation type: mentioning
confidence: 99%