Jakub Lachowicz scite author profile

Jakub Lachowicz

5Publications

40Citation Statements Received

24Citation Statements Given

How they've been cited

How they cite others

Affiliations

Polish-Japanese Academy of Information Technology, Amazon (Germany)

Publications

Order By: Most citations

Effect of Data Reduction on Sequence-to-sequence Neural TTS

Latorre

Lachowicz

Lorenzo-Trueba

et al. 2019

View full text Add to dashboard Cite

Recent speech synthesis systems based on sampling from autoregressive neural networks models can generate speech almost undistinguishable from human recordings. However, these models require large amounts of data. This paper shows that the lack of data from one speaker can be compensated with data from other speakers. The naturalness of Tacotron2-like models trained on a blend of 5k utterances from 7 speakers is better than that of speaker dependent models trained on 15k utterances, but in terms of stability multi-speaker models are always more stable. We also demonstrate that models mixing only 1250 utterances from a target speaker with 5k utterances from another 6 speakers can produce significantly better quality than state-ofthe-art DNN-guided unit selection systems trained on more than 10 times the data from the target speaker.

show abstract

Enhancing audio quality for expressive Neural Text-to-Speech

Ezzerg¹,

Gabryś²,

Putrycz³

et al. 2021

View full text Add to dashboard Cite

Effect of data reduction on sequence-to-sequence neural TTS

Latorre¹,

Lachowicz²,

Lorenzo-Trueba³

et al. 2018

Preprint

View full text Add to dashboard Cite

Implementing a Statistical Parametric Speech Synthesis System for a Patient with Laryngeal Cancer

Szklanny

Lachowicz

2022

Sensors

View full text Add to dashboard Cite

Total laryngectomy, i.e., the surgical removal of the larynx, has a profound influence on a patient’s quality of life. The procedure results in a loss of natural voice, which in effect constitutes a significant socio-psychological problem for the patient. The main aim of the study was to develop a statistical parametric speech synthesis system for a patient with laryngeal cancer, on the basis of the patient’s speech samples recorded shortly before the surgery and to check if it was possible to generate speech quality close to that of the original recordings. The recording made use of a representative corpus of the Polish language, consisting of 2150 sentences. The recorded voice proved to indicate dysphonia, which was confirmed by the auditory-perceptual RBH scale (roughness, breathiness, hoarseness) and by acoustical analysis using AVQI (The Acoustic Voice Quality Index). The speech synthesis model was trained using the Merlin repository. Twenty-five experts participated in the MUSHRA listening tests, rating the synthetic voice at 69.4 in terms of the professional voice-over talent recording, on a 0–100 scale, which is a very good result. The authors compared the quality of the synthetic voice to another model of synthetic speech trained with the same corpus, but where a voice-over talent provided the recorded speech samples. The same experts rated the voice at 63.63, which means the patient’s synthetic voice with laryngeal cancer obtained a higher score than that of the talent-voice recordings. As such, the method enabled for the creation of a statistical parametric speech synthesizer for patients awaiting total laryngectomy. As a result, the solution would improve the quality of life as well as better mental wellbeing of the patient.

show abstract

Reduction Products of Benzalpinacoline

Hill¹,

Spear²,

Lachowicz³

1923

J. Am. Chem. Soc.

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Jakub Lachowicz

Effect of Data Reduction on Sequence-to-sequence Neural TTS

Enhancing audio quality for expressive Neural Text-to-Speech

Effect of data reduction on sequence-to-sequence neural TTS

Implementing a Statistical Parametric Speech Synthesis System for a Patient with Laryngeal Cancer

Reduction Products of Benzalpinacoline

Contact Info

Product

Resources

About