2012
DOI: 10.1007/978-3-642-24797-2

Supervised Sequence Labelling with Recurrent Neural Networks

Abstract: Recurrent neural networks are powerful sequence learners. They are able to incorporate context information in a flexible way, and are robust to localised distortions of the input data. These properties make them well suited to sequence labelling, where input sequences are transcribed with streams of labels. Long short-term memory is an especially promising recurrent architecture, able to bridge long time delays between relevant input and output events, and thereby access long range context. The aim of this the…

Cited by 2,616 publications (2,262 citation statements)
References 121 publications

“…Parameters of the LSTM are trained using BPTT. The core structure of the LSTM is illustrated as follows (Graves, 2012): Figure 3. Structure of the LSTM memory block…”
Section: RNN and LSTM (mentioning)
confidence: 99%
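As a point of reference for the statement above, here is a minimal NumPy sketch of one forward step of an LSTM memory block in the standard form described in Graves (2012). The weight names, dict layout, and shapes are illustrative assumptions rather than the citing paper's implementation, and peephole connections are omitted; in practice the parameters would be trained by backpropagation through time (BPTT) on a loss summed over the sequence.

```python
# Minimal sketch of one LSTM memory-block step (standard form, no peepholes).
# Weight names and the dict layout are illustrative assumptions.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, U, b):
    """One forward step of an LSTM cell.

    W, U, b are dicts holding the input-to-gate, hidden-to-gate, and bias
    parameters for the input (i), forget (f), output (o) gates and the
    cell-input transform (g).
    """
    i = sigmoid(W["i"] @ x_t + U["i"] @ h_prev + b["i"])  # input gate
    f = sigmoid(W["f"] @ x_t + U["f"] @ h_prev + b["f"])  # forget gate
    o = sigmoid(W["o"] @ x_t + U["o"] @ h_prev + b["o"])  # output gate
    g = np.tanh(W["g"] @ x_t + U["g"] @ h_prev + b["g"])  # cell input
    c_t = f * c_prev + i * g          # cell state: gated write plus gated keep
    h_t = o * np.tanh(c_t)            # block output: gated read of the cell
    return h_t, c_t
```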
“…RNNs have been widely used for sequence generation tasks (Graves, 2012a; Schuster and Paliwal, 1997). An RNN accepts a sequence of inputs X = {x_1, x_2, x_3, ..., x_{|X|}} and computes h_t at time t according to Equation (2).…”
Section: RNN (mentioning)
confidence: 99%
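The citing paper's Equation (2) is not reproduced on this page; as a reference point, the sketch below shows the standard RNN hidden-state recurrence h_t = tanh(W_xh x_t + W_hh h_{t-1} + b_h) from Graves (2012). Array names and shapes are illustrative assumptions.

```python
# Minimal sketch of the standard RNN recurrence over a sequence X.
import numpy as np

def rnn_forward(X, W_xh, W_hh, b_h):
    """X has shape (T, input_size); returns hidden states of shape (T, hidden_size)."""
    h = np.zeros(W_hh.shape[0])
    hidden_states = []
    for x_t in X:
        h = np.tanh(W_xh @ x_t + W_hh @ h + b_h)  # h_t depends on x_t and h_{t-1}
        hidden_states.append(h)
    return np.stack(hidden_states)
```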
“…Since these gates allow for write, read, and reset operations within a memory block, an LSTM block can be interpreted as a (differentiable) memory chip in a digital computer. The overall effect of the gate units is that the LSTM memory cells can store and access information over long periods of time and thus avoid the vanishing gradient problem (for details see [6]).…”
Section: Bidirectional Long Short-Term Memory (mentioning)
confidence: 99%
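For reference, the write/read/reset behaviour of the gates described above can be written in the usual LSTM equations; the notation below (σ for the logistic sigmoid, ⊙ for the elementwise product) follows common convention rather than the citing paper, and the peephole terms of the full Graves (2012) architecture are omitted for brevity.

```latex
\begin{aligned}
i_t &= \sigma(W_{xi} x_t + W_{hi} h_{t-1} + b_i) && \text{input gate (write)}\\
f_t &= \sigma(W_{xf} x_t + W_{hf} h_{t-1} + b_f) && \text{forget gate (reset)}\\
o_t &= \sigma(W_{xo} x_t + W_{ho} h_{t-1} + b_o) && \text{output gate (read)}\\
c_t &= f_t \odot c_{t-1} + i_t \odot \tanh(W_{xc} x_t + W_{hc} h_{t-1} + b_c)\\
h_t &= o_t \odot \tanh(c_t)
\end{aligned}
```

Because the cell state c_t is carried forward through the near-linear path f_t ⊙ c_{t-1}, error signals can propagate across many timesteps without being repeatedly squashed, which is how the architecture mitigates the vanishing gradient problem.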
“…To extend the Jacobian to recurrent neural networks, we have to specify the timesteps (representing utterances) at which the input and output variables are measured. Thus, we calculate a four-dimensional matrix called the sequential Jacobian [6] to determine the sensitivity of the network outputs at time t to the inputs at time t′:…”
Section: Sequential Jacobian Analysis (mentioning)
confidence: 99%
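As a rough illustration of the sequential Jacobian mentioned above, the sketch below uses PyTorch autograd to measure how a single network output at timestep t responds to the inputs at every timestep t′. The toy network, sizes, and variable names are assumptions made for the example, not the citing paper's model.

```python
# Minimal sketch: one slice of the sequential Jacobian via automatic differentiation.
import torch
import torch.nn as nn

torch.manual_seed(0)
T, n_in, n_hidden, n_out = 10, 3, 8, 2

rnn = nn.RNN(n_in, n_hidden, batch_first=True)   # toy recurrent network
readout = nn.Linear(n_hidden, n_out)

x = torch.randn(1, T, n_in, requires_grad=True)  # one input sequence
h, _ = rnn(x)
y = readout(h)                                   # outputs at every timestep

t, k = 6, 0                                      # probe output unit k at timestep t
jac, = torch.autograd.grad(y[0, t, k], x)        # d y[t, k] / d x[t', i] for all t', i
sensitivity = jac[0].abs().sum(dim=1)            # aggregate over input units per timestep
print(sensitivity)                               # how strongly y_t depends on each x_{t'}
```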