Proceedings of the 25th ACM International on Conference on Information and Knowledge Management 2016
DOI: 10.1145/2983323.2983350
|View full text |Cite
|
Sign up to set email alerts
|

Structural Clustering of Machine-Generated Mail

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2017
2017
2019
2019

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(4 citation statements)
references
References 37 publications
0
4
0
Order By: Relevance
“…It has been widely used in information extraction over structured web pages [10,25]. For emails, multiple algorithms for template induction have been described [5,11,21] along with applications in information extraction [3,20,31], email threading [5], and hierarchical classification [37]. This paper uses techniques previously described in the literature [31], but is the first description of an online template induction system.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…It has been widely used in information extraction over structured web pages [10,25]. For emails, multiple algorithms for template induction have been described [5,11,21] along with applications in information extraction [3,20,31], email threading [5], and hierarchical classification [37]. This paper uses techniques previously described in the literature [31], but is the first description of an online template induction system.…”
Section: Related Workmentioning
confidence: 99%
“…Rather, 90% of email traffic consists of machine-generated business-to-consumer (B2C) emails instantiated from a collection of a few million email templates, as illustrated in Figure 1. Existing email mining systems take advantage of this templatic nature of emails to extract information from emails using a two-step process [11,20,21,31].…”
Section: Introductionmentioning
confidence: 99%
“…It has been widely used in information extraction over structured web pages [3,23]. For emails, multiple algorithms for template induction have been described [2,4] along with applications like email threading [2] and hierarchical classification [44]. A technique has also been suggested for plain text emails [35] where data is not explicitly structured.…”
Section: Related Workmentioning
confidence: 99%
“…Unlike the general web, which is predominantly public, emails are private. Thus, to date, relatively little research has been presented on information extraction over this domain [4,46]. Learning and iterating is a much more difficult task when privacy must be preserved [13].…”
mentioning
confidence: 99%