2022
DOI: 10.48550/arxiv.2205.13504
Preprint

Are Transformers Effective for Time Series Forecasting?

Abstract: Recently, there has been a surge of Transformer-based solutions for the time series forecasting (TSF) task, especially for the challenging long-term TSF problem. The Transformer architecture relies on self-attention mechanisms to effectively extract the semantic correlations between paired elements in a long sequence, a mechanism that is permutation-invariant and "anti-order" to some extent. However, in time series modeling, we are to extract the temporal relations in an ordered set of continuous points. Consequently,…
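The paper's answer to its title question is that a remarkably simple baseline, a single linear layer mapping the look-back window directly to the forecast horizon (the LTSF-Linear family, whose NLinear variant is cited in the statements below), can match or beat Transformer-based models on long-term forecasting benchmarks. The following is a minimal sketch of the NLinear idea, assuming a univariate series and least-squares fitting in place of gradient descent; the class and method names are illustrative, not the authors' code.

```python
import numpy as np

class NLinearSketch:
    """Minimal sketch of an NLinear-style forecaster: subtract the last
    observed value of each window, apply one linear map from look-back
    window to horizon, then add the value back. Weights are fit by
    ordinary least squares here purely for self-containedness."""

    def __init__(self, lookback: int, horizon: int):
        self.lookback = lookback
        self.horizon = horizon
        self.W = None  # (lookback, horizon) weight matrix

    def fit(self, series: np.ndarray) -> None:
        # Build (window, target) training pairs from a 1-D series.
        X, Y = [], []
        for t in range(len(series) - self.lookback - self.horizon + 1):
            x = series[t : t + self.lookback]
            y = series[t + self.lookback : t + self.lookback + self.horizon]
            last = x[-1]
            X.append(x - last)   # normalize by the last observed value
            Y.append(y - last)
        X, Y = np.asarray(X), np.asarray(Y)
        # A single linear layer trained with MSE == least squares on windows.
        self.W, *_ = np.linalg.lstsq(X, Y, rcond=None)

    def predict(self, window: np.ndarray) -> np.ndarray:
        last = window[-1]
        return (window - last) @ self.W + last
```

The DLinear variant from the same paper instead decomposes the input into trend and remainder components before applying the linear maps; the normalization trick above is what makes NLinear robust to distribution shift between train and test windows.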

Cited by 30 publications (54 citation statements)
References 13 publications
“…In addition to linear regression (LR), seven deep learning methods, namely hierarchical interpolation for time series forecasting (N-HiTS) [66], temporal convolutional network (TCN) [67], Transformer (TF) [68], NLinear [69], long short-term memory (LSTM) [70], gated recurrent unit (GRU) [71], and temporal fusion transformer (TFT) [53], were investigated to develop predictive models and compare their performance. N-HiTS and NLinear were improved to support producing probabilistic forecasts based on quantile regression.…”
Section: Studied Machine Learning Methods
confidence: 99%
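The statement above notes that N-HiTS and NLinear were extended to produce probabilistic forecasts via quantile regression. The standard tool for that is the pinball (quantile) loss; the sketch below gives the generic formulation, not those authors' implementation.

```python
import numpy as np

def pinball_loss(y_true: np.ndarray, y_pred: np.ndarray, q: float) -> float:
    """Quantile (pinball) loss for quantile level q in (0, 1).
    Minimizing it drives y_pred toward the q-th conditional quantile,
    which is the usual way a point forecaster is made probabilistic."""
    diff = y_true - y_pred
    return float(np.mean(np.maximum(q * diff, (q - 1.0) * diff)))

# Typical use: train one output head per quantile, e.g. q in {0.1, 0.5, 0.9},
# and report the spread between heads as a prediction interval.
```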
“…These models show that sequence dependencies can be modeled for long-term forecasting [27], but rely heavily on periodicity. In general, our findings suggest that the correlation between subsequences is crucial.…”
Section: Related Work
confidence: 99%
“…Both modules are used to replace self-attention and cross-attention modules, with a time complexity of O(L). These Transformer-based models have demonstrated outstanding performance in learning long-term sequence dependencies [38]. CNN-based methods are commonly used to extract local temporal features by leveraging convolution kernels.…”
Section: TSF Models
confidence: 99%
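The quote mentions CNN-based methods that extract local temporal features with convolution kernels. As a generic illustration (not any specific cited model), a single 1-D kernel slid across the series already computes such local features:

```python
import numpy as np

def local_features(series: np.ndarray, kernel: np.ndarray) -> np.ndarray:
    """Slide a small kernel over the series to extract local temporal
    patterns (the 1-D convolution at the core of CNN forecasters).
    The kernel is reversed so np.convolve computes a sliding dot
    product (cross-correlation), matching CNN semantics; 'valid' mode
    keeps only positions where the kernel fully overlaps the series."""
    return np.convolve(series, kernel[::-1], mode="valid")

# e.g. a width-3 difference kernel highlights local trends:
# local_features(x, np.array([-1.0, 0.0, 1.0]))
```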
“…The overall structure of MLGN is shown in figure 2. Inspired by traditional time series decomposition algorithms [46] and deep learning models [24,30,37,38], we design a multi-scale sequence decomposition (MSDecomp) block to separate complex patterns of input series. Then we utilize seasonal component prediction module to predict seasonal information and trend component prediction module to predict trend information respectively.…”
Section: MLGN Framework
confidence: 99%
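The MSDecomp block described above separates an input series into seasonal and trend components; its exact multi-scale design is not given here, but such blocks typically build on the classical moving-average decomposition sketched below (the window size and edge-padding scheme are assumptions):

```python
import numpy as np

def ma_decompose(series: np.ndarray, window: int):
    """Classical moving-average decomposition: the trend is a centered
    moving average, and the seasonal/remainder part is the residual.
    Edges are padded by repeating boundary values so the trend keeps
    the same length as the input series."""
    pad = window // 2
    padded = np.concatenate([np.full(pad, series[0]),
                             series,
                             np.full(window - 1 - pad, series[-1])])
    kernel = np.ones(window) / window
    trend = np.convolve(padded, kernel, mode="valid")  # length == len(series)
    seasonal = series - trend
    return seasonal, trend
```

Predicting the two components separately, as the quoted paper does with its seasonal and trend prediction modules, lets each predictor specialize in a simpler pattern than the raw series.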