Our system is currently under heavy load due to increased usage. We're actively working on upgrades to improve performance. Thank you for your patience.
2012
DOI: 10.1109/tasl.2011.2159594
|View full text |Cite
|
Sign up to set email alerts
|

Bilingual Experiments on Automatic Recovery of Capitalization and Punctuation of Automatic Speech Transcripts

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
42
0
10

Year Published

2015
2015
2020
2020

Publication Types

Select...
7

Relationship

4
3

Authors

Journals

citations
Cited by 52 publications
(54 citation statements)
references
References 26 publications
0
42
0
10
Order By: Relevance
“…Other features involve pitch (f0), energy, jitter and shimmer, including pitch and energy average, median, standard deviation, dynamics, range, and slopes, both within and between IPUs [1]. Pitch related features were calculated based on semitones rather than frequency.…”
Section: Recent Progress In Parameterizations For Paralinguisticsmentioning
confidence: 99%
“…Other features involve pitch (f0), energy, jitter and shimmer, including pitch and energy average, median, standard deviation, dynamics, range, and slopes, both within and between IPUs [1]. Pitch related features were calculated based on semitones rather than frequency.…”
Section: Recent Progress In Parameterizations For Paralinguisticsmentioning
confidence: 99%
“…Prosodic and textual cues are also combined in [22] and implemented in a decision tree classifier with the goal to detect sentence boundaries. A combination of lexical-, prosodic-, and speaker-based features is also found in [4] for the detection of full stops, commas, and question marks in a bilingual English-Portuguese broadcast news data, while [19] focuses on Czech broadcast news speech to detect commas and sentence boundaries by using a prosodic model in a decision trees and a multi-layer perceptron and N-gram models for language modeling. Similar works deal with the punctuation generation problem by using statistical models of prosodic features [9], the combination of both textual and prosodic features based on adaptive boosting [18], and a cross-linguistic study of prosodic features through two different approaches for feature selection: a forward search wrapper and feature filtering [14].…”
Section: Related Workmentioning
confidence: 99%
“…O enriquecimento de transcrições automáticas depende de: i) transcrições manuais, ii) transcrições produzidas pelo reconhecedor automático e iii) análise do sinal acústico (para mais informações sobre este processo, veja-se Batista et al 2012aBatista et al e 2012bMoniz 2013;Moniz et al 2014b;Cabarrão et al 2015). A anotação manual é complexa sobretudo no caso da anotação de corpora de fala espontânea, uma vez que implica pontuar fala, por um lado, e identificar disfluências e marcadores discursivos, por outro -tarefas que, como se sabe, afetam a concordância inter-anotadores em distintos corpora do português europeu (cf.…”
Section: Prosódia E Processamento Automáticounclassified