Lakshmish Kaushik scite author profile

Lakshmish Kaushik

5Publications

99Citation Statements Received

60Citation Statements Given

How they've been cited

128

How they cite others

Affiliations

The University of Texas at Dallas, Nuance Communications (Austria)

Publications

Order By: Most citations

Fearless Steps: Apollo-11 Corpus Advancements for Speech Technologies from Earth to the Moon

Hansen

Sangwan

Joglekar

et al. 2018

View full text Add to dashboard Cite

The Apollo Program is one of the most significant benchmarks for technology and innovation in human history. The previously introduced UTD-CRSS Fearless Steps initiative resulted in the digitization of the original analog audio tapes recorded during the Apollo Space Missions. The entire speech data for the Apollo 11 Mission is now being made publicly available with the release of the Fearless Steps Corpus. This corpus consists of a cumulative 19,000 hours of conversational speech spanning over thirty time-synchronized channels. With over six hundred speakers, the corpus has a rich collection of information which can be beneficial for research and advancement in the speech and language community. Recent efforts on this data have led to the generation of pipeline diarization transcripts for the entire speech corpus. Research has also been done to address speech and natural language tasks such as speech activity detection, speech recognition, and sentiment analysis. This paper provides an overview of the Fearless Steps Corpus and highlights the factors that make the processing of this data a challenging problem. To promote further development of algorithms for naturalistic data, five challenge tasks are also organized. We also describe the challenge tasks with details on a fully transcribed subset of the corpus, and initial results achieved by our systems.

show abstract

Sentiment extraction from natural audio streams

Kaushik

Sangwan

Hansen

2013

View full text Add to dashboard Cite

The 2019 Inaugural Fearless Steps Challenge: A Giant Leap for Naturalistic Audio

Hansen¹,

Joglekar²,

Shekhar³

et al. 2019

View full text Add to dashboard Cite

The 2019 FEARLESS STEPS (FS-1) Challenge is an initial step to motivate a streamlined and collaborative effort from the speech and language community towards addressing massive naturalistic audio, the first of its kind. The Fearless Steps Corpus is a collection of 19,000 hours of multi-channel recordings of spontaneous speech from over 450 speakers under multiple noise conditions. A majority of the Apollo Missions original analog data is unlabeled and has thus far motivated the development of both unsupervised and semi-supervised strategies. This edition of the challenge encourages the development of core speech and language technology systems for data with limited groundtruth/low resource availability and is intended to serve as the "First Step" towards extracting high-level information from such massive unlabeled corpora. In conjunction with the Challenge, 11,000 hours of synchronized 30-channel Apollo-11 audio data has also been released to the public by CRSS-UTDallas. We describe in this paper the Fearless Steps Corpus, Challenge Tasks, their associated baseline systems, and results. In conclusion, we also provide insights gained by the CRSS-UTDallas team during the inaugural Fearless Steps Challenge.

show abstract

A Speaker Diarization System for Studying Peer-Led Team Learning Groups

Dubey¹,

Kaushik²,

Sangwan³

et al. 2016

View full text Add to dashboard Cite

Peer-led team learning (PLTL) is a model for teaching STEM courses where small student groups meet periodically to collaboratively discuss coursework. Automatic analysis of PLTL sessions would help education researchers to get insight into how learning outcomes are impacted by individual participation, group behavior, team dynamics, etc.. Towards this, speech and language technology can help, and speaker diarization technology will lay the foundation for analysis. In this study, a new corpus is established called CRSS-PLTL, that contains speech data from 5 PLTL teams over a semester (10 sessions per team with 5-to-8 participants in each team). In CRSS-PLTL, every participant wears a LENA device (portable audio recorder) that provides multiple audio recordings of the event. Our proposed solution is unsupervised and contains a new online speaker change detection algorithm, termed G 3 algorithm in conjunction with Hausdorff-distance based clustering to provide improved detection accuracy. Additionally, we also exploit cross channel information to refine our diarization hypothesis. The proposed system provides good improvements in diarization error rate (DER) over the baseline LIUM system. We also present higher level analysis such as the number of conversational turns taken in a session, and speakingtime duration (participation) for each speaker.

show abstract

Speech activity detection for NASA apollo space missions: challenges and solutions

Ziaei¹,

Kaushik²,

Sangwan³

et al. 2014

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.