Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2016
DOI: 10.1145/2939672.2939700
|View full text |Cite
|
Sign up to set email alerts
|

Data-Driven Metric Development for Online Controlled Experiments

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
32
0
1

Year Published

2017
2017
2024
2024

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 66 publications
(33 citation statements)
references
References 28 publications
0
32
0
1
Order By: Relevance
“…While o ine metrics are especially valuable when evaluating a system in prior to its deployment [13,34], online metrics have been widely adopted for modern search engines because such metrics are calculated based on the interactions between practical users and systems. Inspired by previous research on metrics meta-evaluation [9,11,15,19], we compare the evaluation performance of some most widely-used online metrics, including:…”
Section: Comparison Across O Line Metricsmentioning
confidence: 99%
See 1 more Smart Citation
“…While o ine metrics are especially valuable when evaluating a system in prior to its deployment [13,34], online metrics have been widely adopted for modern search engines because such metrics are calculated based on the interactions between practical users and systems. Inspired by previous research on metrics meta-evaluation [9,11,15,19], we compare the evaluation performance of some most widely-used online metrics, including:…”
Section: Comparison Across O Line Metricsmentioning
confidence: 99%
“…Recent studies show that assessors' judgments may signi cantly di er from users' assessments [31]. e second problem is that the evaluation results based on o ine metrics can be biased because they are usually generated with a small and incomplete dataset [13].…”
mentioning
confidence: 99%
“…Measuring change is a common theme in applied data science. In online A/B testing [15,17,36,37,44], we estimate the average treatment effect (ATE) by the difference of the same metric measured from treatment and control groups, respectively. In time series analyses and longitudinal studies, we often track a metric over time and monitor changes between different time points.…”
Section: Inferring Percent Changes 21 Percent Change and Fieller Intmentioning
confidence: 99%
“…Several previous papers discuss the adjustment of metric variance in A/B testing. The delta method [16] and Bootstrap method [17] are two variance estimation approaches that can be applied to correct the variance without the assumption of independence. These two methods work well in theory, however, they require storing raw data (e.g.…”
Section: B Variance Estimationmentioning
confidence: 99%