Proceedings of the 35th ACM SIGPLAN Conference on Programming Language Design and Implementation 2014
DOI: 10.1145/2594291.2594333

FlashExtract

Abstract: Various document types that combine model and view (e.g., text files, webpages, spreadsheets) make it easy to organize (possibly hierarchical) data, but make it difficult to extract raw data for any further manipulation or querying. We present a general framework FlashExtract to extract relevant data from semi-structured documents using examples. It includes: (a) an interaction model that allows end-users to give examples to extract various fields and to relate them in a hierarchical organization using structu…

Cited by 129 publications (18 citation statements)
References 22 publications
“…This ranking method has been widely adopted in the area of programming by examples [Gulwani 2011; Le and Gulwani 2014; Rolim et al 2017]. For instance, Gulwani [2011] specifies a string processing domain-specific language.…”
Section: Resolution Ranking (mentioning)
confidence: 99%
“…Programming by Example (PBE) is a leading inductive synthesis technique which generates programs from input-output examples. It has the potential to revolutionize end-user programming such as string processing [Gulwani 2011] and format normalization [Kini and Gulwani 2015] in spreadsheet, data extraction by highlighting texts on web pages [Le and Gulwani 2014], web automation by clicking a few buttons [Barman et al 2016], and hierarchically structured data transformation [Yaghmazadeh et al 2016].…”
Section: VSA-based Program Synthesis (mentioning)
confidence: 99%
“…The EUSES corpus collects spreadsheets used as databases, and for financial, grading, homework, inventory, and modeling purposes. EUSES is frequently used by researchers building spreadsheet tools [Alawini et al 2015; Barowy et al 2014, 2015; Cheung et al 2016; Grigoreanu et al 2010; Hermans and Dig 2014; Hermans et al 2012a, 2010, 2013; Hofer et al 2013; Joharizadeh 2015; Le and Gulwani 2014; Muşlu et al 2015; Singh et al 2017]. All of the categories present in EUSES are represented in the CUSTODES suite.…”
Section: About the Benchmarks (mentioning)
confidence: 99%
“…FlashProfile's pattern learner uses the PROSE library (Microsoft 2017d), which implements the FlashMeta framework for inductive program synthesis, specifically programming-by-examples (PBE) (Lieberman 2001). PBE has been leveraged by recent works on automating repetitive text-processing tasks, such as string transformation (Gulwani 2011; Singh 2016), extraction (Le and Gulwani 2014), and format normalization (Kini and Gulwani 2015). However, unlike these applications, data profiling does not solicit any (output) examples from the user.…”
Section: Related Work (mentioning)
confidence: 99%