2023
DOI: 10.1007/978-1-0716-3195-9_4

Recurrent Neural Networks (RNNs): Architectures, Training Tricks, and Introduction to Influential Research

Susmita Das,
Amara Tariq,
Thiago Santos
et al.

Abstract: Recurrent neural networks (RNNs) are neural network architectures with hidden state that use feedback loops to process a sequence of data, which ultimately informs the final output. RNN models can therefore recognize sequential characteristics in data and help predict the next likely point in a sequence. Leveraging this power of sequential data processing, RNN use cases tend to be connected to either language models or time-series data analysis. However, multiple popular RNN architectures…

Cited by 18 publications (4 citation statements)
References 21 publications
“…RNNs characterize a specialized class of ANNs engineered to manage sequential data [ 49 , 50 ]. They achieve this by incorporating connections that create directed cycles within the network graph, facilitating dynamic temporal behavior and the processing of sequences of variable lengths.…”
Section: Types of PNNs
confidence: 99%
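
To make the feedback-loop idea in this statement concrete, here is a minimal vanilla-RNN sketch in NumPy. The weight names (W_xh, W_hh, b_h) and dimensions are illustrative assumptions, not taken from the cited papers; the point is only that the same hidden state h is fed back at every step, which is the "directed cycle" that lets one cell handle sequences of variable length.

```python
# Minimal vanilla-RNN sketch (NumPy only). All names and sizes are
# illustrative conventions, not taken from the cited papers.
import numpy as np

def rnn_forward(xs, W_xh, W_hh, b_h):
    """Run a vanilla RNN over a variable-length sequence xs.

    xs: list of input vectors, one per time step.
    The hidden state h feeds back into the next step -- the
    directed cycle that gives the network temporal memory.
    """
    h = np.zeros(W_hh.shape[0])
    states = []
    for x in xs:
        h = np.tanh(W_xh @ x + W_hh @ h + b_h)
        states.append(h)
    return states

rng = np.random.default_rng(0)
input_dim, hidden_dim = 4, 8
W_xh = rng.normal(scale=0.1, size=(hidden_dim, input_dim))
W_hh = rng.normal(scale=0.1, size=(hidden_dim, hidden_dim))
b_h = np.zeros(hidden_dim)

# Sequences of different lengths run through the same loop.
for T in (3, 7):
    seq = [rng.normal(size=input_dim) for _ in range(T)]
    print(len(rnn_forward(seq, W_xh, W_hh, b_h)))  # -> T hidden states
```

Note that this plain tanh recurrence is exactly where the vanishing-gradient problem raised in the next citation statement arises: gradients are repeatedly squashed through the same recurrent multiplication.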
“…Traditional RNNs suffer from the vanishing gradient problem, which makes it difficult for the model to capture long-range dependencies in a sequence. LSTM is an advancement over traditional RNNs, designed to overcome the vanishing gradient problem by introducing memory cells, input gates, forget gates and output gates to control the flow of information into and out of the cells [91]. The RNN model can transfer information more efficiently by using skip connections.…”
Section: Recurrent Neural Network (RNN)
confidence: 99%
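
As a sketch of the gating mechanism this statement describes, below is a single LSTM step in NumPy. It follows the standard LSTM formulation; the stacked-weight layout and all names are implementation choices of this sketch, not details from [91].

```python
# Minimal LSTM-cell sketch (NumPy only), standard formulation.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h_prev, c_prev, W, b):
    """One LSTM step: input (i), forget (f), and output (o) gates
    regulate what enters, stays in, and leaves the memory cell c."""
    H = h_prev.shape[0]
    z = W @ np.concatenate([x, h_prev]) + b   # all four gate blocks at once
    i = sigmoid(z[0*H:1*H])      # input gate: admit new information
    f = sigmoid(z[1*H:2*H])      # forget gate: decay old cell content
    o = sigmoid(z[2*H:3*H])      # output gate: expose the cell state
    g = np.tanh(z[3*H:4*H])      # candidate cell update
    c = f * c_prev + i * g       # additive path eases gradient flow
    h = o * np.tanh(c)
    return h, c

rng = np.random.default_rng(0)
D, H = 4, 8
W = rng.normal(scale=0.1, size=(4 * H, D + H))
b = np.zeros(4 * H)
h = c = np.zeros(H)
for x in rng.normal(size=(5, D)):  # a length-5 sequence
    h, c = lstm_step(x, h, c, W, b)
print(h.shape, c.shape)  # (8,) (8,)
```

The additive update of c (rather than repeated multiplication through a squashing nonlinearity) is what lets gradients survive over long spans, which is the vanishing-gradient remedy the statement describes.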
“…The reset gate regulates the extent to which past information must be forgotten to update the relevant memory, while the update gate regulates the retention or abandonment of information and the integration of new information from the input into existing memory. With these two gates, the GRU provides adaptive control over the flow of information in recurrent networks [91]. Marzinotto et al [56] use four layers of bidirectional GRUs equipped with various more complex features for SRL tasks, including morphology, surface characteristics, and syntax.…”
Section: Recurrent Neural Network (RNN)
confidence: 99%
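
The two-gate mechanism in this statement can be sketched in the same style. This follows the common GRU formulation; the weight names and dimensions are illustrative assumptions, not a setup taken from [91] or [56].

```python
# Minimal GRU-cell sketch (NumPy only), common formulation.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gru_step(x, h_prev, Wz, Wr, Wh, bz, br, bh):
    """One GRU step: the update gate z blends old and new memory;
    the reset gate r decides how much past state to forget when
    forming the candidate."""
    xh = np.concatenate([x, h_prev])
    z = sigmoid(Wz @ xh + bz)                      # update gate
    r = sigmoid(Wr @ xh + br)                      # reset gate
    h_cand = np.tanh(Wh @ np.concatenate([x, r * h_prev]) + bh)
    return (1 - z) * h_prev + z * h_cand           # adaptive blend

rng = np.random.default_rng(0)
D, H = 4, 8
Wz, Wr, Wh = (rng.normal(scale=0.1, size=(H, D + H)) for _ in range(3))
bz = br = bh = np.zeros(H)
h = np.zeros(H)
for x in rng.normal(size=(6, D)):  # a length-6 sequence
    h = gru_step(x, h, Wz, Wr, Wh, bz, br, bh)
print(h.shape)  # (8,)
```

Compared with the LSTM sketch above, the GRU merges cell state and hidden state and uses two gates instead of three, which is why it is often described as a lighter-weight alternative.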
“…These crucial state transitions are meticulously orchestrated by a set of adaptive gating units, which include the input, forget, and output gates, each performing a specific regulatory function to ensure the fidelity of information flow across the temporal expanse of the sequence. There are three kinds of gates in the LSTM layer: the input gate, forget gate, and output gate (35). Figure 1 illustrates the flow of data at time step t and shows how the gates forget, update, and output the cell and hidden states.…”
Section: Constructing the LSTM Model, 2.4.1 Cell Structure of LSTM Network
confidence: 99%
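
For reference, the per-step data flow that this statement attributes to Figure 1 can be written in the standard LSTM form below. This is the conventional textbook formulation; the weight symbols W, U, and b are the usual generic names, not notation taken from (35).

```latex
\begin{aligned}
i_t &= \sigma(W_i x_t + U_i h_{t-1} + b_i) && \text{(input gate)} \\
f_t &= \sigma(W_f x_t + U_f h_{t-1} + b_f) && \text{(forget gate)} \\
o_t &= \sigma(W_o x_t + U_o h_{t-1} + b_o) && \text{(output gate)} \\
\tilde{c}_t &= \tanh(W_c x_t + U_c h_{t-1} + b_c) && \text{(candidate cell)} \\
c_t &= f_t \odot c_{t-1} + i_t \odot \tilde{c}_t && \text{(cell-state update)} \\
h_t &= o_t \odot \tanh(c_t) && \text{(hidden-state output)}
\end{aligned}
```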