2023
DOI: 10.1038/s41598-023-39169-4
|View full text |Cite
|
Sign up to set email alerts
|

The promise of zero-shot learning for alcohol image detection: comparison with a task-specific deep learning algorithm

Abstract: Exposure to alcohol content in media increases alcohol consumption and related harm. With exponential growth of media content, it is important to use algorithms to automatically detect and quantify alcohol exposure. Foundation models such as Contrastive Language-Image Pretraining (CLIP) can detect alcohol exposure through Zero-Shot Learning (ZSL) without any additional training. In this paper, we evaluated the ZSL performance of CLIP against a supervised algorithm called Alcoholic Beverage Identification Deep … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(7 citation statements)
references
References 10 publications
(18 reference statements)
0
6
0
Order By: Relevance
“…The preliminary evidence to date suggests that, using ZSL, large language models produced relatively accurate results relative to human coders and task‐specific deep‐learning algorithms. In a recent study concerning identifying alcoholic beverages in images [30], we compared the ZSL performance of CLIP (contrastive language image pre‐training; Radford et al . [31]) to ABIDLA2 (alcoholic beverage identification deep‐learning algorithm, version 2), which is a task‐specific algorithm trained to detect alcoholic beverages in images [32].…”
Section: Zero‐shot Learning (Zsl)—a Brief Overviewmentioning
confidence: 99%
See 4 more Smart Citations
“…The preliminary evidence to date suggests that, using ZSL, large language models produced relatively accurate results relative to human coders and task‐specific deep‐learning algorithms. In a recent study concerning identifying alcoholic beverages in images [30], we compared the ZSL performance of CLIP (contrastive language image pre‐training; Radford et al . [31]) to ABIDLA2 (alcoholic beverage identification deep‐learning algorithm, version 2), which is a task‐specific algorithm trained to detect alcoholic beverages in images [32].…”
Section: Zero‐shot Learning (Zsl)—a Brief Overviewmentioning
confidence: 99%
“…ABIDLA2 was trained on more than 160 000 alcohol images and took months for a team of annotators to annotate the data and a large amount of computer resources to train and refine the algorithm. Despite this, we found that CLIP achieved comparable performance in identifying alcoholic beverages in images, and in some tasks performed better than ABIDLA2 [30]. Considering text analysis applications, ZSL large language models have achieved high accuracy for non‐substance use areas in tasks such as processing clinical text [33], classifying the topics of Tweets or news stories [34] and cross‐lingual sentiment classification [35].…”
Section: Zero‐shot Learning (Zsl)—a Brief Overviewmentioning
confidence: 99%
See 3 more Smart Citations