“…To synthesize dysarthric speech, there is a need to build a system controlling different characteristics of dysarthric speech for generating variant dysarthric speech. As will be discussed later in this paper, and according to a number of studies (Rudzicz, Namasivayam et al 2012, Zhang, Dang et al 2014, Bigi, Klessa et al 2015, Kuo and Tjaden 2016, Yunusova, Graham et al 2016), such a system should have the following capabilities in order to support generation of authentic and diverse speech: 1) ability to control the speaking rate (duration), pitch, energy for a variety of dysarthria severity levels, 2) ability to learn and model pause behavior of dysarthric speakers (e.g., duration of pause and pause occurrence) and control pause insertion locations and durations 3) ability to learn and model individual voice characteristics of speakers and use these to generate new speaking styles 4) ability to learn and model these characteristics from a small amount of dysarthric speech data.…”