2014
DOI: 10.14257/ijmue.2014.9.12.09
|View full text |Cite
|
Sign up to set email alerts
|

On Objective Keywords Extraction: Tf-Idf based Forward Words Pruning Algorithm for Keywords Extraction on YouTube

Abstract: Discovery and subsequent effective retrieval of useful user generated content depends on proper meta-data annotation implemented on an object such as a title and Keywords. In this study, a simpler unsupervised non graph-based algorithm for extracting keywords is proposed. A novel key phrases chunking approach was adopted; this utilizes words sequences as they appear in the original document. The simple but effective Term frequency-inverse document frequency (tf-idf) weighting scheme was exploited to rank the n… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2015
2015
2017
2017

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 13 publications
0
1
0
Order By: Relevance
“…In addition, the video metadata as described in figure 1 were extracted together with the hidden official keywords using a Python based API. A novel keyphrasing algorithm [17] was used on the three textual sources to extend the number of textual representations for each video content to nine; namely the three originals, their three keyphrases representation and the official keyphrases from YouTube resource. The extra two sources were obtained by taking unions of the ASR keywords with metadata and the official keywords for comparison purposes.…”
Section: Figure 2 Conceptual Frameworkmentioning
confidence: 99%
“…In addition, the video metadata as described in figure 1 were extracted together with the hidden official keywords using a Python based API. A novel keyphrasing algorithm [17] was used on the three textual sources to extend the number of textual representations for each video content to nine; namely the three originals, their three keyphrases representation and the official keyphrases from YouTube resource. The extra two sources were obtained by taking unions of the ASR keywords with metadata and the official keywords for comparison purposes.…”
Section: Figure 2 Conceptual Frameworkmentioning
confidence: 99%