OpinionRank: Extracting Ground Truth Labels from Unreliable Expert Opinions with Graph-Based Spectral Ranking

Dawson, G. W.; Polikar, Robi

doi:10.48550/arxiv.2102.05884

Cited by 2 publications

(2 citation statements)

References 23 publications

(32 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For the MNIST dataset, we use auxiliary deep generative models as our SSL algorithm due to its small parameter footprint [50]; for the SVHN dataset, we chose the FixMatch algorithm as representative of the current state-of-the-art for semi-supervised learning [75]. For both datasets, we use OpinionRank as our learning from crowds algorithm due to its nonparametric nature and fast performance [18], and selected DivideMix for learning from noisy labels due to its state-of-the-art performance on a wide variety of noisy labels tasks [44].…”

Section: Comparisons To State-of-the-art Under Adversarial Label Noisementioning

confidence: 99%

Rethinking Noisy Label Models: Labeler-Dependent Noise with Adversarial Awareness

Dawson,

Polikar

2021

Preprint

Self Cite

View full text Add to dashboard Cite

Most studies on learning from noisy labels rely on unrealistic models of i.i.d. label noise, such as class-conditional transition matrices. More recent work on instancedependent noise models are more realistic, but assume a single generative process for label noise across the entire dataset. We propose a more principled model of label noise that generalizes instance-dependent noise to multiple labelers, based on the observation that modern datasets are typically annotated using distributed crowdsourcing methods. Under our labeler-dependent model, label noise manifests itself under two modalities: natural error of good-faith labelers, and adversarial labels provided by malicious actors. We present two adversarial attack vectors that more accurately reflect the label noise that may be encountered in real-world settings, and demonstrate that under our multimodal noisy labels model, state-ofthe-art approaches for learning from noisy labels are defeated by adversarial label attacks. Finally, we propose a multi-stage, labeler-aware, model-agnostic framework that reliably filters noisy labels by leveraging knowledge about which data partitions were labeled by which labeler, and show that our proposed framework remains robust even in the presence of extreme adversarial label noise.Preprint. Under review.

show abstract

Section: Comparisons To State-of-the-art Under Adversarial Label Noisementioning

confidence: 99%

Rethinking Noisy Label Models: Labeler-Dependent Noise with Adversarial Awareness

Dawson,

Polikar

2021

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…For many of these applications, hardware accelerators such as Graphics Processing Units (GPUs) and Field Programmable Gate Arrays (FPGAs) are a highly-effective solution, especially when mixed-precision and reduced-precision arithmetic come into play [1]- [6]. As spectral methods become ubiquitous in the large scale graph pipelines of Spectral Clustering [7], Information Retrieval (IR) [8] and ranking [9], such techniques require algorithms that can compute only a subset of the most relevant eigenvalues (i.e. the largest in modulo) and their associated eigenvectors while taking advantage of the sparsity of real-world graphs.…”

Section: Introductionmentioning

confidence: 99%

A Mixed Precision, Multi-GPU Design for Large-scale Top-K Sparse Eigenproblems

Sgherzi¹,

Parravicini²,

Santambrogio³

2022

Preprint

View full text Add to dashboard Cite

Graph analytics techniques based on spectral methods process extremely large sparse matrices with millions or even billions of non-zero values. Behind these algorithms lies the Top-K sparse eigenproblem, the computation of the largest eigenvalues and their associated eigenvectors. In this work, we leverage GPUs to scale the Top-K sparse eigenproblem to bigger matrices than previously achieved while also providing state-of-the-art execution times. We can transparently partition the computation across multiple GPUs, process out-of-core matrices, and tune precision and execution time using mixed-precision floating-point arithmetic. Overall, we are 67× faster than the highly optimized ARPACK library running on a 104-thread CPU and 1.9× than a recent FPGA hardware design. We also determine how mixedprecision floating-point arithmetic improves execution time by 50 % over double-precision, and is 12× more accurate than single-precision floating-point arithmetic.

show abstract

OpinionRank: Extracting Ground Truth Labels from Unreliable Expert Opinions with Graph-Based Spectral Ranking

Cited by 2 publications

References 23 publications

Rethinking Noisy Label Models: Labeler-Dependent Noise with Adversarial Awareness

Rethinking Noisy Label Models: Labeler-Dependent Noise with Adversarial Awareness

A Mixed Precision, Multi-GPU Design for Large-scale Top-K Sparse Eigenproblems

Contact Info

Product

Resources

About