The pattern of clinical breast cancer metastasis correlates with a single nucleotide polymorphism in the C1qA component of complement

The main objective of the work presented in this paper was to develop a complete system that would accomplish the original visions of the MALACH project. Those goals were to employ automatic speech recognition and information retrieval techniques to provide improved access to the large video archive containing recorded testimonies of the Holocaust survivors. The system has been so far developed for the Czech part of the archive only. It takes advantage of the state-of-the-art speech recognition system tailored to the challenging properties of the recordings in the archive (elderly speakers, spontaneous speech and emotionally loaded content) and its close coupling with the actual search engine. The design of the algorithm adopting the spoken term detection approach is focused on the speed of the retrieval. The resulting system is able to search through the 1,000 h of video constituting the Czech portion of the archive and find query word occurrences in the matter of seconds. The phonetic search implemented alongside the search based on the lexicon words allows to find even the words outside the ASR system lexicon such as names, geographic locations or Jewish slang.

show abstract

A Relevance Score Estimation for Spoken Term Detection Based on RNN-Generated Pronunciation Embeddings

Švec

Psutka

Šmídl

et al. 2017

View full text Add to dashboard Cite

In this paper, we present a novel method for term score estimation. The method is primarily designed for scoring the out-of-vocabulary terms, however it could also estimate scores for in-vocabulary results. The term score is computed as a cosine distance of two pronunciation embeddings. The first one is generated from the grapheme representation of the searched term, while the second one is computed from the recognized phoneme confusion network. The embeddings are generated by specifically trained recurrent neural network built on the idea of Siamese neural networks. The RNN is trained from recognition results on word-and phone-level in an unsupervised fashion without need of any hand-labeled data. The method is evaluated on the MALACH data in two languages, English and Czech. The results are compared with two baseline methods for OOV term detection.

show abstract

Air traffic control communication (ATCC) speech corpora and their use for ASR and TTS development

Šmídl

Švec

Tihelka

et al. 2019

Lang Resources & Evaluation

View full text Add to dashboard Cite

Spoken Term Detection and Relevance Score Estimation Using Dot-Product of Pronunciation Embeddings

Švec¹,

Šmídl²,

Psutka³

et al. 2021

View full text Add to dashboard Cite

The paper describes a novel approach to Spoken Term Detection (STD) in large spoken archives using deep LSTM networks. The work is based on the previous approach of using Siamese neural networks for STD and naturally extends it to directly localize a spoken term and estimate its relevance score. The phoneme confusion network generated by a phoneme recognizer is processed by the deep LSTM network which projects each segment of the confusion network into an embedding space. The searched term is projected into the same embedding space using another deep LSTM network. The relevance score is then computed using a simple dot-product in the embedding space and calibrated using a sigmoid function to predict the probability of occurrence. The location of the searched term is then estimated from the sequence of output probabilities. The deep LSTM networks are trained in a self-supervised manner from paired recognition hypotheses on word and phoneme levels. The method is experimentally evaluated on MALACH data in English and Czech languages.

show abstract

Adjusting BERT’s Pooling Layer for Large-Scale Multi-Label Text Classification

Lehečka

Švec

Ircing

et al. 2020

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Luboš Šmídl

System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive

A Relevance Score Estimation for Spoken Term Detection Based on RNN-Generated Pronunciation Embeddings

Air traffic control communication (ATCC) speech corpora and their use for ASR and TTS development

Spoken Term Detection and Relevance Score Estimation Using Dot-Product of Pronunciation Embeddings

Adjusting BERT’s Pooling Layer for Large-Scale Multi-Label Text Classification

Contact Info

Product

Resources

About