2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.00276
|View full text |Cite
|
Sign up to set email alerts
|

How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
28
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 72 publications
(37 citation statements)
references
References 20 publications
0
28
0
Order By: Relevance
“…We chose to work with a German sign language since that is the only dataset with gloss annotation that could help us study our hypotheses. The How2Sign dataset (Duarte et al, 2021) is a feasible dataset for ASL, but it does not allow any model to extract facial landmarks, facial action units or facial expression from the original video frames since the faces are blurred. In the future, we hope to see new datasets with better and more diverse annotations…”
Section: Discussionmentioning
confidence: 99%
“…We chose to work with a German sign language since that is the only dataset with gloss annotation that could help us study our hypotheses. The How2Sign dataset (Duarte et al, 2021) is a feasible dataset for ASL, but it does not allow any model to extract facial landmarks, facial action units or facial expression from the original video frames since the faces are blurred. In the future, we hope to see new datasets with better and more diverse annotations…”
Section: Discussionmentioning
confidence: 99%
“…The collection and annotation of sign language data is an expensive task that needs the collaboration of linguistic experts and native speakers. While there are some publicly available datasets for SLP [4,8,13,14,21,44,50], they suffer from weakly annotated data for sign language. Furthermore, most of the available datasets in SLP contain a restricted domain of the vocabularies/sentences.…”
Section: Discussionmentioning
confidence: 99%
“…In such dataset, a paired form of the continuous sign language sentence and the corresponding spoken language sentence needs to be included. Just a few datasets meet these criteria [11,21,44,84] . The point is that most of the aforementioned datasets cannot be used for end-to-end translation [11,44,84].…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…While there are some large-scale and annotated datasets available for sign language recognition [20], there are only a few publicly available large-scale datasets for SLP. Two public datasets, RWTH-Phoenix-2014T [44] and How2Sign [45] are the most used datasets in sign language translation. The former includes German sign language sentences that can be used for text-to-sign language translation.…”
Section: Datasetsmentioning
confidence: 99%