We develop the basic building blocks of a frequency domain framework for
drawing statistical inferences on the second-order structure of a stationary
sequence of functional data. The key element in such a context is the spectral
density operator, which generalises the notion of a spectral density matrix to
the functional setting, and characterises the second-order dynamics of the
process. Our main tool is the functional Discrete Fourier Transform (fDFT). We
derive an asymptotic Gaussian representation of the fDFT, thus allowing the
transformation of the original collection of dependent random functions into a
collection of approximately independent complex-valued Gaussian random
functions. Our results are then employed in order to construct estimators of
the spectral density operator based on smoothed versions of the periodogram
kernel, the functional generalisation of the periodogram matrix. The
consistency and asymptotic law of these estimators are studied in detail. As
immediate consequences, we obtain central limit theorems for the mean and the
long-run covariance operator of a stationary functional time series. Our
results do not depend on structural modelling assumptions, but only functional
versions of classical cumulant mixing conditions, and are shown to be stable
under discrete observation of the individual curves.Comment: Published in at http://dx.doi.org/10.1214/13-AOS1086 the Annals of
Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical
Statistics (http://www.imstat.org
Abstract. The assumption of separability of the covariance operator for a random image or hypersurface can be of substantial use in applications, especially in situations where the accurate estimation of the full covariance structure is unfeasible, either for computational reasons, or due to a small sample size. However, inferential tools to verify this assumption are somewhat lacking in high-dimensional or functional data analysis settings, where this assumption is most relevant. We propose here to test separability by focusing on K-dimensional projections of the difference between the covariance operator and a nonparametric separable approximation. The subspace we project onto is one generated by the eigenfunctions of the covariance operator estimated under the separability hypothesis, negating the need to ever estimate the full non-separable covariance. We show that the rescaled difference of the sample covariance operator with its separable approximation is asymptotically Gaussian. As a by-product of this result, we derive asymptotically pivotal tests under Gaussian assumptions, and propose bootstrap methods for approximating the distribution of the test statistics. We probe the finite sample performance through simulations studies, and present an application to log-spectrogram images from a phonetic linguistics dataset.
Dialect variation is of considerable interest in linguistics and other social sciences. However, traditionally it has been studied using proxies (transcriptions) rather than acoustic recordings directly. We introduce novel statistical techniques to analyse geolocalised speech recordings and to explore the spatial variation of pronunciations continuously over the region of interest, as opposed to traditional isoglosses, which provide a discrete partition of the region. Data of this type require an explicit modeling of the variation in the mean and the covariance. Usual Euclidean metrics are not appropriate, and we therefore introduce the concept of d-covariance, which allows consistent estimation both in space and at individual locations. We then propose spatial smoothing for these objects which accounts for the possibly non convex geometry of the domain of interest. We apply the proposed method to data from the spoken part of the British National Corpus, deposited at the British Library, London, and we produce maps of the dialect variation over Great Britain. In addition, the methods allow for acoustic reconstruction across the domain of interest, allowing researchers to listen to the statistical analysis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.