Taylan Cemgil scite author profile

Reliable detection of out-of-distribution (OOD) inputs is increasingly understood to be a precondition for deployment of machine learning systems. This paper proposes and investigates the use of contrastive training to boost OOD detection performance. Unlike leading methods for OOD detection, our approach does not require access to examples labeled explicitly as OOD, which can be difficult to collect in practice. We show in extensive experiments that contrastive training significantly helps OOD detection performance on a number of common benchmarks. By introducing and employing the Confusion Log Probability (CLP) score, which quantifies the difficulty of the OOD detection task by capturing the similarity of inlier and outlier datasets, we show that our method especially improves performance in the 'near OOD' classes -a particularly challenging setting for previous methods.Preprint. Under review.

show abstract

Morphology of the mandibular fossa and inclination of the articular eminence in patients with internal derangement and in symptom-free volunteers

Sülün

Cemgil²,

Duc³

et al. 2001

Oral Surgery, Oral Medicine, Oral Pathology, Oral Radiology, an

View full text Add to dashboard Cite

Achieving Robustness in the Wild via Adversarial Mixing With Disentangled Representations

Gowal

Qin

Huang

et al. 2020

View full text Add to dashboard Cite

EndoSensorFusion: Particle Filtering-Based Multi-Sensory Data Fusion with Switching State-Space Model for Endoscopic Capsule Robots

Turan

Almalioglu

Gilbert

et al. 2018

View full text Add to dashboard Cite

A reliable, real time multi-sensor fusion functionality is crucial for localization of actively controlled capsule endoscopy robots, which are an emerging, minimally invasive diagnostic and therapeutic technology for the gastrointestinal (GI) tract. In this study, we propose a novel multi-sensor fusion approach based on a particle filter that incorporates an online estimation of sensor reliability and a non-linear kinematic model learned by a recurrent neural network. Our method sequentially estimates the true robot pose from noisy pose observations delivered by multiple sensors. We experimentally test the method using 5 degree-of-freedom (5-DoF) absolute pose measurement by a magnetic localization system and a 6-DoF relative pose measurement by visual odometry. In addition, the proposed method is capable of detecting and handling sensor failures by ignoring corrupted data, providing the robustness expected of a medical device. Detailed analyses and evaluations are presented using ex-vivo experiments on a porcine stomach model prove that our system achieves high translational and rotational accuracies for different types of endoscopic capsule robot trajectories.

show abstract

Does your dermatology classifier know what it doesn’t know? Detecting the long-tail of unseen conditions

Roy

Ren

Azizi

et al. 2022

Medical Image Analysis

View full text Add to dashboard Cite

A Fine-Grained Analysis on Distribution Shift

Wiles¹,

Gowal²,

Stimberg³

et al. 2021

Preprint

View full text Add to dashboard Cite

Robustness to distribution shifts is critical for deploying machine learning models in the real world. Despite this necessity, there has been little work in defining the underlying mechanisms that cause these shifts and evaluating the robustness of algorithms across multiple, different distribution shifts. To this end, we introduce a framework that enables fine-grained analysis of various distribution shifts. We provide a holistic analysis of current state-of-the-art methods by evaluating 19 distinct methods grouped into five categories across both synthetic and real-world datasets. Overall, we train more than 85K models. Our experimental framework can be easily extended to include new methods, shifts, and datasets. We find, unlike previous work (Gulrajani & Lopez-Paz, 2021), that progress has been made over a standard ERM baseline; in particular, pretraining and augmentations (learned or heuristic) offer large gains in many cases. However, the best methods are not consistent over different datasets and shifts.

show abstract

Achieving Robustness in the Wild via Adversarial Mixing with Disentangled Representations

Gowal¹,

Qin²,

Huang³

et al. 2019

Preprint

View full text Add to dashboard Cite

Recent research has made the surprising finding that state-of-the-art deep learning models sometimes fail to generalize to small variations of the input. Adversarial training has been shown to be an effective approach to overcome this problem. However, its application has been limited to enforcing invariance to analytically defined transformations like p -norm bounded perturbations. Such perturbations do not necessarily cover plausible real-world variations that preserve the semantics of the input (such as a change in lighting conditions). In this paper, we propose a novel approach to express and formalize robustness to these kinds of real-world transformations of the input. The two key ideas underlying our formulation are (1) leveraging disentangled representations of the input to define different factors of variations, and (2) generating new input images by adversarially composing the representations of different images. We use a StyleGAN model to demonstrate the efficacy of this framework. Specifically, we leverage the disentangled latent representations computed by a StyleGAN model to generate perturbations of an image that are similar to real-world variations (like adding make-up, or changing the skin-tone of a person) and train models to be invariant to these perturbations. Extensive experiments show that our method improves generalization and reduces the effect of spurious correlations.

show abstract

Evaluating the Adversarial Robustness of Adaptive Test-time Defenses

Croce¹,

Gowal²,

Brunner³

et al. 2022

Preprint

View full text Add to dashboard Cite

Adaptive defenses that use test-time optimization promise to improve robustness to adversarial examples. We categorize such adaptive testtime defenses and explain their potential benefits and drawbacks. In the process, we evaluate some of the latest proposed adaptive defenses (most of them published at peer-reviewed conferences). Unfortunately, none significantly improve upon static models when evaluated appropriately. Some even weaken the underlying static model while simultaneously increasing inference cost. While these results are disappointing, we still believe that adaptive test-time defenses are a promising avenue of research and, as such, we provide recommendations on evaluating such defenses. We go beyond the checklist provided by Carlini et al. ( 2019) by providing concrete steps that are specific to this type of defense.

show abstract

12 3

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Taylan Cemgil

Contrastive Training for Improved Out-of-Distribution Detection

Morphology of the mandibular fossa and inclination of the articular eminence in patients with internal derangement and in symptom-free volunteers

Achieving Robustness in the Wild via Adversarial Mixing With Disentangled Representations

EndoSensorFusion: Particle Filtering-Based Multi-Sensory Data Fusion with Switching State-Space Model for Endoscopic Capsule Robots

Does your dermatology classifier know what it doesn’t know? Detecting the long-tail of unseen conditions

A Fine-Grained Analysis on Distribution Shift

Achieving Robustness in the Wild via Adversarial Mixing with Disentangled Representations

Evaluating the Adversarial Robustness of Adaptive Test-time Defenses

Contact Info

Product

Resources

About