Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 2020
DOI: 10.18653/v1/2020.acl-main.303

Representations of Syntax [MASK] Useful: Effects of Constituency and Dependency Structure in Recursive LSTMs

Abstract: Sequence-based neural networks show significant sensitivity to syntactic structure, but they still perform less well on syntactic tasks than tree-based networks. Such tree-based networks can be provided with a constituency parse, a dependency parse, or both. We evaluate which of these two representational schemes more effectively introduces biases for syntactic structure that increase performance on the subject-verb agreement prediction task. We find that a constituency-based network generalizes more robustly …
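The abstract frames the comparison around the subject-verb agreement prediction task. As a point of reference (not the authors' code), the minimal Python sketch below shows what an agreement-prediction instance looks like and why a purely lexical heuristic fails when an "attractor" noun intervenes between the subject and the verb; the example sentences and the baseline heuristic are illustrative assumptions.

```python
# Minimal sketch (not the authors' implementation) of the subject-verb
# agreement prediction task: given a sentence prefix, predict whether the
# upcoming verb should be singular or plural. Examples are illustrative only.

EXAMPLES = [
    # (prefix, correct verb number)
    ("the dog near the parked cars", "singular"),   # agreement with "dog"
    ("the keys to the old cabinet", "plural"),      # agreement with "keys"
]

def lexical_baseline(prefix: str) -> str:
    """Naive heuristic: guess 'plural' if any plural-looking noun appears in
    the prefix. It is fooled by attractor nouns; tracking the actual subject
    is what syntactic (tree-based) biases are meant to support."""
    plural_markers = {"cars", "keys", "cabinets", "dogs"}
    tokens = prefix.lower().split()
    return "plural" if any(t in plural_markers for t in tokens) else "singular"

if __name__ == "__main__":
    for prefix, gold in EXAMPLES:
        pred = lexical_baseline(prefix)
        print(f"{prefix!r}: predicted {pred}, gold {gold}")
```

The point of the sketch is only that the correct answer depends on which noun is the syntactic subject, not on which nouns happen to occur in the prefix; this is the structural generalization on which the paper compares constituency-based and dependency-based networks.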


Cited by 7 publications (5 citation statements) | References: 28 publications
“…Although Reference 26 argues that compositional parsing is more important than dependency parsing for the word‐to‐word hierarchy in a sentence, given that the sentence‐to‐sentence relationships in a paragraph cannot be defined in terms of compositions and that our experimental results (Figure 7) suggest that particular sentence‐initial verbs are more effective for parsing the sentence‐to‐sentence hierarchy, we reckon that our unsupervised model is more biased toward dependency parsing. Therefore, a larger dataset of business process descriptions would be very effective for our work.…”
Section: Discussion (mentioning)
confidence: 84%
“…Lepori et al (2020) experiment with an artificially constructed set of simple transitive sentences (Subject-Verb-Object), containing optional adjectival or prepositional modifiers in a controlled, probabilistic setting. They show that when a BiLSTM is fine-tuned on a distribution which explicitly requires moving beyond lexical co-occurrences and creating more abstract representations, performance dramatically improves: this suggests that a simple sequential mechanism can be enough if the linguistic signal is structured in a way that abstraction is encouraged.…”
Section: The Role Of Input (mentioning)
confidence: 99%
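The statement above describes an artificially constructed dataset of simple Subject-Verb-Object sentences with optional adjectival or prepositional modifiers, generated probabilistically. A hedged sketch of such a generator follows; the vocabulary, probabilities, and grammar are invented for illustration and are not the original materials from Lepori et al. (2020).

```python
# Hedged sketch of a probabilistic SVO sentence generator in the spirit of
# the artificial dataset described above (simple transitive sentences with
# optional adjectival/prepositional modifiers). All lexical items and
# probabilities are illustrative assumptions.
import random

NOUNS = {"singular": ["dog", "cat", "teacher"], "plural": ["dogs", "cats", "teachers"]}
VERBS = {"singular": ["sees", "likes"], "plural": ["see", "like"]}
ADJS = ["old", "small", "happy"]
PREPS = ["near", "behind"]

def noun_phrase(number: str, p_adj: float = 0.5, p_pp: float = 0.3) -> str:
    words = ["the"]
    if random.random() < p_adj:
        words.append(random.choice(ADJS))
    words.append(random.choice(NOUNS[number]))
    if random.random() < p_pp:
        # The PP noun's number is sampled independently, which creates
        # potential "attractors" between the subject and the verb.
        words += [random.choice(PREPS), "the",
                  random.choice(NOUNS[random.choice(["singular", "plural"])])]
    return " ".join(words)

def sample_sentence() -> str:
    subj_number = random.choice(["singular", "plural"])
    obj_number = random.choice(["singular", "plural"])
    return " ".join([noun_phrase(subj_number),
                     random.choice(VERBS[subj_number]),
                     noun_phrase(obj_number)])

if __name__ == "__main__":
    random.seed(0)
    for _ in range(3):
        print(sample_sentence())
```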
“…We also conduct an experiment that has been proposed in a previous study (Lepori et al, 2020) to impart hierarchical bias to the models, and found out that it did not help in our scenario ( §A.7).…”
Section: How Confident Are Language Models? (mentioning)
confidence: 99%
“…Note that, since the projection is in 2 dimensions, to measure the spread we evaluate the ℓ2 norm of a vector of standard deviations across individual components. A.7 Fine-Tuning Lepori et al (2020) showed that the syntactic robustness of RNNs could be improved by fine-tuning the trained models on a small amount of syntactically challenging data. We consider a similar exercise for our trained language models (Selective sampling), where we further fine-tuned the model with the challenging artificially generated sentences.…”
Section: A.6 Analysis Of Variance (mentioning)
confidence: 99%
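The excerpt above defines the spread measure as the ℓ2 norm of the per-component standard deviations of points in a 2-dimensional projection. A minimal sketch of that computation is below; the sample data are made up for illustration.

```python
# Hedged sketch of the spread measure described in the excerpt: take the
# standard deviation of each of the two projected components and report the
# l2 norm of that 2-vector. The sample points are synthetic.
import numpy as np

def spread(points_2d: np.ndarray) -> float:
    """points_2d: array of shape (n_points, 2)."""
    per_component_std = points_2d.std(axis=0)          # shape (2,)
    return float(np.linalg.norm(per_component_std))    # l2 norm of the std vector

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    projected = rng.normal(loc=0.0, scale=[1.0, 0.5], size=(100, 2))
    print(f"spread = {spread(projected):.3f}")
```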