2020
DOI: 10.1109/access.2020.3028072

AUTSL: A Large Scale Multi-Modal Turkish Sign Language Dataset and Baseline Methods

Abstract: Sign language recognition is a challenging problem where signs are identified by simultaneous local and global articulations of multiple sources, i.e., hand shape and orientation, hand movements, body posture, and facial expressions. Solving this problem computationally for a large vocabulary of signs in real-life settings is still a challenge, even with state-of-the-art models. In this study, we present a new large-scale multi-modal Turkish Sign Language dataset (AUTSL) with a benchmark and provide baseline…

Cited by 118 publications (75 citation statements)
References 48 publications

“…Then, they built a temporal attention-based model for classification. In our previous work [5], we proposed a baseline model for a new large-scale isolated Turkish Sign Language (AUTSL) dataset. We integrated a Feature Pooling Module and a temporal attention model to focus on more relevant spatio-temporal parts of the videos.…”
Section: Related Work
confidence: 99%
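The statement above describes temporal attention over per-frame features. A minimal sketch of that idea, assuming PyTorch; the class name, feature dimension, and single-layer scoring head are illustrative and not the exact Feature Pooling Module of [5]:

import torch
import torch.nn as nn

class TemporalAttentionPooling(nn.Module):
    # Illustrative sketch, not the module from [5]: scores each frame's
    # feature vector, softmax-normalizes the scores over the time axis,
    # and returns the weighted sum as a single clip descriptor.
    def __init__(self, feat_dim):
        super().__init__()
        self.score = nn.Linear(feat_dim, 1)  # one scalar score per frame

    def forward(self, feats):
        # feats: (batch, time, feat_dim) frame-level CNN features
        weights = torch.softmax(self.score(feats), dim=1)  # (batch, time, 1)
        return (weights * feats).sum(dim=1)                # (batch, feat_dim)

For example, TemporalAttentionPooling(512)(torch.randn(2, 16, 512)) returns a (2, 512) tensor that a linear classifier can then map to sign classes.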
“…We evaluate our proposed models on two very recently shared large-scale isolated sign language datasets: AUTSL [5] and BosphorusSign22k [6].…”
Section: A. Datasets and Preprocessing
confidence: 99%
“…Contrary to the previous methods that use a single Kinect sensor, this work additionally employs a machine vision camera, along with a television screen, for sign demonstration. Sincan et al. in [16] captured isolated Turkish sign language glosses using Kinect sensors with a large variety of indoor and outdoor backgrounds, revealing the importance of capturing videos with various backgrounds. Adaloglou et al. in [17] created a large sign language dataset with a RealSense D435 sensor that records both RGB and depth information.…”
Section: Sign Language Capturing
confidence: 99%
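As an illustration of the RGB-plus-depth capture the statement mentions, here is a minimal recording loop assuming the pyrealsense2 Python bindings for a RealSense D435; the stream resolutions and frame rate are arbitrary choices, not the settings of the cited dataset:

import pyrealsense2 as rs

pipeline = rs.pipeline()
config = rs.config()
# Request synchronized color and depth streams (arbitrary 640x480 @ 30 fps).
config.enable_stream(rs.stream.color, 640, 480, rs.format.bgr8, 30)
config.enable_stream(rs.stream.depth, 640, 480, rs.format.z16, 30)
pipeline.start(config)
try:
    frames = pipeline.wait_for_frames()  # blocks until a color/depth pair arrives
    color_frame = frames.get_color_frame()
    depth_frame = frames.get_depth_frame()
finally:
    pipeline.stop()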
“…Note: ЗР - recognition task; ПО - number of subject domains; Р - annotation; УЗ - capture device (camera, sensor); ЦК - color camera; н/д - no data.…”
unclassified