Proceedings of the Web Conference 2021
DOI: 10.1145/3442381.3450090

TCN: Table Convolutional Network for Web Table Interpretation

Abstract: Information extraction from semi-structured webpages provides valuable long-tailed facts for augmenting knowledge graphs. Relational Web tables are a critical component, containing additional entities and attributes of rich and diverse knowledge. However, extracting knowledge from relational tables is challenging because of sparse contextual information. Existing work linearizes table cells and relies heavily on modifying deep language models such as BERT, which only captures information from related cells in the same table.
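
To make the abstract's criticism concrete, below is a minimal sketch of what "linearizing" a relational Web table means: the 2-D grid is flattened into a single token sequence for a BERT-style encoder. The " | " and "[SEP]" separators are illustrative choices, not the actual preprocessing of TCN or any cited model.

```python
# Minimal sketch of table linearization for a BERT-style encoder.
# Separator tokens here are illustrative assumptions only.

def linearize_table(header, rows):
    """Flatten a table into one string: header row first, then data rows."""
    parts = [" | ".join(header)]
    for row in rows:
        parts.append(" | ".join(row))
    return " [SEP] ".join(parts)

header = ["City", "Country", "Population"]
rows = [
    ["Oslo", "Norway", "709,037"],
    ["Bergen", "Norway", "291,940"],
]
print(linearize_table(header, rows))
# City | Country | Population [SEP] Oslo | Norway | 709,037 [SEP] Bergen | Norway | 291,940
```

Once flattened this way, the encoder only attends over cells within the one serialized table, which is the limited-context problem the abstract points at.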

Citations: cited by 22 publications (11 citation statements).
References: 42 publications (36 reference statements).

“…The Cell Value Recovery (CVR) objective used in TaBERT [101] applies a span-based prediction objective to deal with multiple value tokens. In TCN [93], each token represents a cell, so 10% of table cells are randomly masked beforehand and recovered from the set of all cell values. • Cloze: the cell-level cloze used in TUTA [92] samples cell strings based on the bi-tree structure as candidate choices and, at each blanked position, encourages the model to retrieve the corresponding cell string.…”
Section: Token-level (Most Pre-training Models Use Token MLM)
confidence: 99%
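
The statement above describes a cell-level masking objective. A minimal sketch of that setup is shown below, where 10% of cells are replaced by a mask symbol and recorded as recovery targets; the data layout, mask token, and helper name are hypothetical, not TCN's actual implementation.

```python
import random

MASK = "[CELL_MASK]"  # hypothetical mask symbol

def mask_cells(cells, ratio=0.10, seed=0):
    """Randomly mask `ratio` of the cells; return the masked table plus
    (position, original value) recovery targets for pre-training."""
    rng = random.Random(seed)
    n_mask = max(1, int(len(cells) * ratio))
    positions = rng.sample(range(len(cells)), n_mask)
    masked = list(cells)
    targets = []
    for pos in positions:
        targets.append((pos, masked[pos]))
        masked[pos] = MASK
    return masked, targets

cells = ["Oslo", "Norway", "709,037", "Bergen", "Norway", "291,940"]
masked, targets = mask_cells(cells)
print(masked)   # one cell replaced by [CELL_MASK]
print(targets)  # e.g. [(pos, original_value)]
```

Per the statement, recovery is then framed as retrieving the original value from the set of all cell values, rather than token-by-token masked language modeling.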
“…For now, they show much more promising performance than RNN/LSTM-based models. However, except for a few models, the feature extractors of the others are still limited by the old stereotype that the input sequence length must be kept invariant throughout the whole network [2], [5], [7]. Even those that do shorten the input sequence length make some 'compensations' for this behavior [6].…”
Section: Invariance of Input Sequence Length as Features?
confidence: 99%
“…The baselines we select contain end-to-end models, contrastive-learning-based models (CoST [18], TS2Vec [22], TNC [24], MoCo [51], Triplet [52], CPC [53], TST [54], TCC [55]) and a feature-engineered model (the TSFresh package). End-to-end models include traditional time series forecasting models (ARIMA [20], [21], Prophet [56], N-BEATS [17]), CNNs (SCINet [1], TCN [2]), RNNs (LSTNet [5], DeepAR [4], LSTMa [57]), Transformers (LogTrans [7], Informer [6], Reformer [39]) and GNNs (StemGNN [9]). Most of the results are taken from other papers [6], [18], [22], and the rest were obtained by us using unified settings for a fair comparison.…”
Section: Comparison Experiments
confidence: 99%
“…Chen et al. [9] formulate the web information extraction problem as structural reading comprehension and build a BERT [15]-based model to extract structured fields from web documents. It is worth mentioning that there are also methods that work on multimodal information extraction [44, 45, 48, 55], which focus on extracting field information from the visual layout or the rendered HTML of web documents.…”
Section: Related Work 2.1 Information Extraction
confidence: 99%
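
To illustrate the reading-comprehension formulation described in that statement, here is a hedged sketch in which each target field becomes a question over the serialized page text, answered by an off-the-shelf extractive QA model. The checkpoint, page text, and question templates are assumptions for illustration, not Chen et al.'s actual model or data.

```python
# Field extraction posed as reading comprehension: each field is a
# question over the serialized web document. Checkpoint and templates
# below are illustrative assumptions only.
from transformers import pipeline

qa = pipeline("question-answering", model="deepset/roberta-base-squad2")

# Hypothetical text serialized from a product page.
context = ("Acme UltraWidget 3000. Price: $49.99. "
           "Ships from Oslo, Norway. In stock: 12 units.")

questions = {
    "price": "What is the price of the product?",
    "origin": "Where does the product ship from?",
}
for field, question in questions.items():
    result = qa(question=question, context=context)
    print(f"{field}: {result['answer']} (score={result['score']:.2f})")
```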