Proceedings of the 2015 ACM Symposium on Document Engineering 2015
DOI: 10.1145/2682571.2797068
|View full text |Cite
|
Sign up to set email alerts
|

Similarity-Based Support for Text Reuse in Technical Writing

Abstract: Technical writing in professional environments, such as user manual authoring for new products, is a task that relies heavily on reuse of content. Therefore, technical content is typically created following a strategy where modular units of text have references to each other. One of the main challenges faced by technical authors is to avoid duplicating existing content, as this adds unnecessary effort, generates undesirable inconsistencies, and dramatically increases maintenance and translation costs. However,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
14
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 13 publications
(16 citation statements)
references
References 21 publications
0
14
0
Order By: Relevance
“…The bag of words is the data representation technique used in most of the consulted literature [13,6,14,15,1,16]. It consists on representing each text document as a vector of frequencies [13].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…The bag of words is the data representation technique used in most of the consulted literature [13,6,14,15,1,16]. It consists on representing each text document as a vector of frequencies [13].…”
Section: Related Workmentioning
confidence: 99%
“…It consists on representing each text document as a vector of frequencies [13]. A variant of this representation uses Term Frequency-Inverse Document Frequency (TF-IDF) weighting [16,6] where each word in a document is assigned a weight depending on their frequencies within a specific document and throughout all the documents.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Large and complex documents used in technical communication are often composed of smaller building blocks called content components . This enables referenced reuse of components across and within different documents and cost‐efficient translation in cases where only a subset of a document is changed . Examples for these document types are not only any kind of technical information (manuals, reports, and educational material) but also standards documents, patents, and some specifications types.…”
Section: Introductionmentioning
confidence: 99%
“…1,2 and within different documents and cost-efficient translation in cases where only a subset of a document is changed. 4 Examples for these document types are not only any kind of technical information (manuals, reports, and educational material) but also standards documents, patents, and some specifications types. Content components can resemble, but are not limited to, subsections of a document and are, in most cases, conceptually self-contained.…”
Section: Introductionmentioning
confidence: 99%