2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE) 2019
DOI: 10.1109/ase.2019.00047
|View full text |Cite
|
Sign up to set email alerts
|

Regexes are Hard: Decision-Making, Difficulties, and Risks in Programming Regular Expressions

Abstract: Regular expressions (regexes) are a powerful mechanism for solving string-matching problems. They are supported by all modern programming languages, and have been estimated to appear in more than a third of Python and JavaScript projects. Yet existing studies have focused mostly on one aspect of regex programming: readability. We know little about how developers perceive and program regexes, nor the difficulties that they face.In this paper, we provide the first study of the regex development cycle, with a foc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
15
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 41 publications
(18 citation statements)
references
References 42 publications
(65 reference statements)
2
15
0
Order By: Relevance
“…Rejecting valid strings represents 61.8% of the incorrect behavior bugs. This reinforces the observation that developers prefer to compose a conservative regex to an overly liberal one [39] and tend to expand the scope of regular expressions as software evolves [62].…”
Section: Regex: Incorrect Behaviorsupporting
confidence: 82%
See 2 more Smart Citations
“…Rejecting valid strings represents 61.8% of the incorrect behavior bugs. This reinforces the observation that developers prefer to compose a conservative regex to an overly liberal one [39] and tend to expand the scope of regular expressions as software evolves [62].…”
Section: Regex: Incorrect Behaviorsupporting
confidence: 82%
“…(mozilla/fxa-auth-server#1743). This problem is regarded to be one of the difficulties of regex programming [39]. Further research efforts are needed to better understand when to use and when not to use regexes.…”
Section: Discussion and Future Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In the future, we plan to identify variations of CALDERA commands to further identify threats. Finally, we are considering the value of the GPE adding perturbations that are like regular expressions [12], directing the hunt to a family of variants to which a threat may evolve.…”
Section: Discussionmentioning
confidence: 99%
“…The most obvious limitation is the lack of recursion. Statements can quickly turn out messy and hard to maintain [6] [7]. Thus, regular expressions are not quite enough.…”
Section: A Describing a Language Using A Grammarmentioning
confidence: 99%