Query by document

Yang, Yin; Bansal, Nilesh; Dakka, Wisam; Ipeirotis, Panagiotis G.; Koudas, Nick; Papadias, Dimitris

doi:10.1145/1498759.1498806

Cited by 82 publications

(72 citation statements)

References 22 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…They first extract semantic keywords from the paragraph and then do the search in an annotated image database. In addition, querying by documents is studied in [24].…”

Section: B Cross Domain Searchmentioning

confidence: 99%

Cross Domain Search by Exploiting Wikipedia

Liu

Jiang

et al. 2012

2012 IEEE 28th International Conference on Data Engineering

View full text Add to dashboard Cite

Abstract-The abundance of Web 2.0 resources in various media formats calls for better resource integration to enrich user experience. This naturally leads to a new cross domain resource search requirement, in which a query is a resource in one modal and the results are closely related resources in other modalities. With cross domain search, we can better exploit existing resources.Intuitively, tags associated with Web 2.0 resources are a straightforward medium to link resources with different modality together. However, tagging is by nature an ad hoc activity. They often contain noises and are affected by the subjective inclination of the tagger. Consequently, linking resources simply by tags will not be reliable. In this paper, we propose an approach for linking tagged resources to concepts extracted from Wikipedia, which has become a fairly reliable reference over the last few years. Compared to the tags, the concepts are therefore of higher quality. We develop effective methods for cross-modal search based on the concepts associated with resources. Extensive experiments were conducted, and the results show that our solution achieves good performance.

show abstract

“…They first extract semantic keywords from the paragraph and then do the search in an annotated image database. In addition, querying by documents is studied in [24].…”

Section: B Cross Domain Searchmentioning

confidence: 99%

Cross Domain Search by Exploiting Wikipedia

Liu

Jiang

et al. 2012

2012 IEEE 28th International Conference on Data Engineering

View full text Add to dashboard Cite

show abstract

“…We chose YTE over frequency based techniques since we did not want to be limited by counts from a 12000 post corpus for tf.idf calculations. Also, a recent work comparing YTE, tf.idf and mutual information based techniques for word and phrase identification concluded that YTE did better than tf.idf when identifying top k < 4 keywords in a document and all three were similar in characterizing document content for larger values of k [6].…”

Section: Abstract For Advertisingmentioning

confidence: 99%

“…As every keyword k i is added from C2 to C1, the change in Information Content of C1 is measured as IC(C1,ki ) δ =IC(C1,ki )−IC(C1) (6) where IC(C1, k i ) is the information content of C1 after adding keyword k i from C2. IC(C1, k i ) δ is positive when k i is strongly associated with words in C1 and negative when k i is unrelated to words in C1.…”

Section: Identifying Contextual Abstractmentioning

confidence: 99%

Monetizing User Activity on Social Networks - Challenges and Experiences

Muthialu

Baid

Sheth

et al. 2009

2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology

View full text Add to dashboard Cite

Abstract-This work summarizes challenges and experiences in monetizing user activity on public forums on social network sites. We present a approach that identifies the monetization potential of user posts and eliminates off-topic content to identify the most relevant and monetizable keywords for advertising. Preliminary studies using data from MySpace and Facebook show that 52% of ad impressions generated using keywords from our system were more targeted compared to the 30% relevant impressions generated without using our system.

show abstract

“…Relational data can be observed in many predictive modeling tasks, such as forecasting the winner in two-player computer games [1], predicting proteins that interact with other proteins in bioinformatics [2], retrieving documents that are similar to a target document in text mining [3], investigating the persons that are friends of each other on social network sites [4], etc. All these examples represent fields of applica-tion in which specific machine learning and data mining algorithms are successfully developed to infer relations from data; pairwise relations, to be more specific.…”

Section: Introductionmentioning

confidence: 99%

Learning Valued Relations from Data

Waegeman

Pahikkala

Airola

et al. 2011

Advances in Intelligent and Soft Computing

View full text Add to dashboard Cite

Driven by a large number of potential applications in areas like bioinformatics, information retrieval and social network analysis, the problem setting of inferring relations between pairs of data objects has recently been investigated quite intensively in the machine learning community. To this end, current approaches typically consider datasets containing crisp relations, so that standard classification methods can be adopted. However, relations between objects like similarities and preferences are in many real-world applications often expressed in a graded manner. A general kernel-based framework for learning relations from data is introduced here. It extends existing approaches because both crisp and valued relations are considered, and it unifies existing approaches because different types of valued relations can be modeled, including symmetric and reciprocal relations. This framework establishes in this way important links between recent developments in fuzzy set theory and machine learning. Its usefulness is demonstrated on a case study in document retrieval.

show abstract

Query by document

Cited by 82 publications

References 22 publications

Cross Domain Search by Exploiting Wikipedia

Cross Domain Search by Exploiting Wikipedia

Monetizing User Activity on Social Networks - Challenges and Experiences

Learning Valued Relations from Data

Contact Info

Product

Resources

About