2014
DOI: 10.1016/j.techfore.2013.12.019
|View full text |Cite
|
Sign up to set email alerts
|

“Term clumping” for technical intelligence: A case study on dye-sensitized solar cells

Abstract: Tech Mining seeks to extract intelligence from Science, Technology & Innovation information record sets on a subject of interest. A key set of Tech Mining interests concerns which R&D activities are addressed in the publication and patent abstract records under study. This paper presents six "term clumping" steps that can clean and consolidate topical content in such text sources. It examines how each step changes the content, potentially to facilitate extraction of usable intelligence as the end goal. We illu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
90
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
6
2

Relationship

5
3

Authors

Journals

citations
Cited by 142 publications
(93 citation statements)
references
References 28 publications
(29 reference statements)
1
90
0
Order By: Relevance
“…1) The clustering-based topic identification model -we group the term clumping process [20]-cleaned features of science data sources into "general feature" and "specific feature," and refine the data-oriented, but adaptive text clustering model [21] to identify the hot research topics and key technologies;…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…1) The clustering-based topic identification model -we group the term clumping process [20]-cleaned features of science data sources into "general feature" and "specific feature," and refine the data-oriented, but adaptive text clustering model [21] to identify the hot research topics and key technologies;…”
Section: Methodsmentioning
confidence: 99%
“…Therefore, our emphasis in this paper is science textual data, and text mining techniques will be used to handle the raw data for feature extraction. In [20], we developed a Term Clumping process to retrieve key terms from science textual data by term removal, consolidation, and clustering. However, the aim of the original Term Clumping is to remove common terms and select core terms; thus, we undertake modifications on this approach to meet our exact needs for feature extraction, to evolve it by adding a thesaurus for our specified technical domain, adjusting the existing Science and Academic Thesaurus, skipping the Term Frequency Inverse Document Frequency (TFIDF) approach for term removal, but applying it for feature weighting, and skipping the clustering-based term consolidation approaches (e.g., Term Clustering and Combine Term Network).…”
Section: Step 1: Clustering-based Topic Identificationmentioning
confidence: 99%
“…The researchers in our lab have built a thesaurus by the method of term clumping. This thesaurus can be used to process large amount of documents including treat the synonyms (Zhang et al, 2014a(Zhang et al, , 2014b. In addition, Choi et al (2011) also proposes such a method for this purpose.…”
Section: Constructing the Sao Structure Map And Mining The Significanmentioning
confidence: 99%
“…Keller and Gracht (2014) also used a Delphi survey for technology forecasting analysis. In recent years, many studies have been conducted on a hybrid technology analysis that combines the results of qualitative and quantitative analyses [13,14]. Huang et al (2014) combined bibliometrics (quantitative technology analysis) and road-mapping (qualitative technology analysis) to examine research and development (R&D) planning.…”
Section: Introductionmentioning
confidence: 99%
“…Huang et al (2014) combined bibliometrics (quantitative technology analysis) and road-mapping (qualitative technology analysis) to examine research and development (R&D) planning. Zhang et al (2014) also conducted a technology analysis combining multivariate analysis (clustering, factor analysis, and topic model) as a quantitative approach with purposive analysis TRIZ (theory of solving inventive problem) and technology road-mapping) as a qualitative approach. Using the extracted keywords from the patent title and abstract, they carried out term clumping for technological intelligence.…”
Section: Introductionmentioning
confidence: 99%