Abstract. We study the problem of learning sequential top-down tree-toword transducers (stws). First, we present a Myhill-Nerode characterization of the corresponding class of sequential tree-to-word transformations (ST W). Next, we investigate what learning of stws means, identify fundamental obstacles, and propose a learning model with abstain. Finally, we present a polynomial learning algorithm.