Learning Fast Matching Models from Weak Annotations

Li, Xue; Luo, Zhipeng; Sun, Hao; Zhang, Jianjin; Han, Weihao; Chu, Xianqi; Zhang, Liang-Jie; Zhang, Qi

doi:10.1145/3308558.3313466

Cited by 9 publications

(3 citation statements)

References 27 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Parameters in baselines are carefully tuned on the validation set to select the most desirable parameter setting. Considering the high imbalance distribution of the annotations, following the previous work [20] we select ROC-AUC score as the measurement, which represents the area under the Receiver Operating Characteristic curve. We release our code to facilitate future research (https:// github.com/ qwe35/ AdsGNN ).…”

Section: Baseline Methodsmentioning

confidence: 99%

“…This strategy confuses the relevance correlations with the click relations and thus may introduce ambiguities from two aspects. Firstly, the arbitrariness and subjectivity of user behavior lead to the misalignment between user clicks and true relevance annotations [20], which may introduce noises into the ground truth and further pollute the training set. Secondly, negative pairs sampled by data synthesizing usually share no common tokens for queries and ads, which may mislead the relevance model to view common terms as critical evidence of relevance.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

AdsGNN: Behavior-Graph Augmented Relevance Modeling in Sponsored Search

Pang

Liu

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

Sponsored search ads appear next to search results when people look for products and services on search engines. In recent years, they have become one of the most lucrative channels for marketing. As the fundamental basis of search ads, relevance modeling has attracted increasing attention due to the significant research challenges and tremendous practical value. Most existing approaches solely rely on the semantic information in the input query-ad pair, while the pure semantic information in the short ads data is not sufficient to fully identify user's search intents. Our motivation lies in incorporating the tremendous amount of unsupervised user behavior data from the historical search logs as the complementary graph to facilitate relevance modeling. In this paper, we extensively investigate how to naturally fuse the semantic textual information with the user behavior graph, and further propose three novel AdsGNN models to aggregate topological neighborhood from the perspectives of nodes, edges and tokens. Furthermore, two critical but rarely investigated problems, domain-specific pre-training and long-tail ads matching, are studied thoroughly. Empirically, we evaluate the AdsGNN models over the large industry dataset, and the experimental results of online/offline tests consistently demonstrate the superiority of our proposal.

show abstract

Section: Baseline Methodsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

AdsGNN: Behavior-Graph Augmented Relevance Modeling in Sponsored Search

Pang

Liu

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…Some other works [3,12] learn embeddings of queries and ads in a shared vector space from search session, ad click, and search link click data using word2vec [28] like algorithms. A few recent works [23] exploit advances in neural information retrieval models [29] such as Deep Crossing [41] in sponsored search. An important line of work [11,13,15,27,37] that is based on the idea of performing query to query transformations, also known as query rewriting.…”

Section: Introductionmentioning

confidence: 99%

Diversity driven Query Rewriting in Search Advertising

Mohankumar

Begwani

Singh

2021

Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery &Amp; Data Mining

View full text Add to dashboard Cite

Retrieving keywords (bidwords) with the same intent as query, referred to as close variant keywords, is of prime importance for effective targeted search advertising. For head and torso search queries, sponsored search engines use a huge repository of same intent queries and keywords, mined ahead of time. Online, this repository is used to rewrite the query and then lookup the rewrite in a repository of bid keywords contributing to significant revenue. Recently generative retrieval models have been shown to be effective at the task of generating such query rewrites. We observe two main limitations of such generative models. First, rewrites generated by these models exhibit low lexical diversity, and hence the rewrites fail to retrieve relevant keywords that have diverse linguistic variations. Second, there is a misalignment between the training objective -the likelihood of training data, v/s what we desire -improved quality and coverage of rewrites. In this work, we introduce CLOVER, a framework to generate both high-quality and diverse rewrites by optimizing for human assessment of rewrite quality using our diversity-driven reinforcement learning algorithm. We use an evaluation model, trained to predict human judgments, as the reward function to finetune the generation policy. We empirically show the effectiveness of our proposed approach through offline experiments on search queries across geographies spanning three major languages. We also perform online A/B experiments on Bing, a large commercial search engine, which shows (i) better user engagement with an average increase in clicks by 12.83% accompanied with an average defect reduction by 13.97%, and (ii) improved revenue by 21.29%. CCS CONCEPTS• Computing methodologies → Natural language generation; • Information systems → Sponsored search advertising.

show abstract

Enhanced DSSM (deep semantic structure modelling) technique for job recommendation

Mishra

Rathi

2022

Journal of King Saud University - Computer and Information Scie

View full text Add to dashboard Cite

Learning Fast Matching Models from Weak Annotations

Cited by 9 publications

References 27 publications

AdsGNN: Behavior-Graph Augmented Relevance Modeling in Sponsored Search

AdsGNN: Behavior-Graph Augmented Relevance Modeling in Sponsored Search

Diversity driven Query Rewriting in Search Advertising

Enhanced DSSM (deep semantic structure modelling) technique for job recommendation

Contact Info

Product

Resources

About