Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing 2010
DOI: 10.1145/1851476.1851501
|View full text |Cite
|
Sign up to set email alerts
|

Dimension reduction and visualization of large high-dimensional data via interpolation

Abstract: The recent explosion of publicly available biology gene sequences and chemical compounds offers an unprecedented opportunity for data mining. To make data analysis feasible for such vast volume and high-dimensional scientific data, we apply high performance dimension reduction algorithms. It facilitates the investigation of unknown structures in a three dimensional visualization. Among the known dimension reduction algorithms, we utilize the multidimensional scaling and generative topographic mapping algorithm… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
40
0
1

Year Published

2010
2010
2015
2015

Publication Types

Select...
4
3
1

Relationship

5
3

Authors

Journals

citations
Cited by 44 publications
(41 citation statements)
references
References 21 publications
(28 reference statements)
0
40
0
1
Order By: Relevance
“…Authors have been researching on developing high performance visualization algorithms, such as parallel MDS and GTM and their interpolation extensions [7,8], to visualize large PubChem dataset in 3D space by using our in-house 3D data point visualization tool and now we extend its functionality to access external data sources in a dynamic way.…”
Section: Data Visualization and Remote Data Accessmentioning
confidence: 99%
“…Authors have been researching on developing high performance visualization algorithms, such as parallel MDS and GTM and their interpolation extensions [7,8], to visualize large PubChem dataset in 3D space by using our in-house 3D data point visualization tool and now we extend its functionality to access external data sources in a dynamic way.…”
Section: Data Visualization and Remote Data Accessmentioning
confidence: 99%
“…Also, DAexp95 results are very similar to or even better than DA-exp99 results although DA-exp95 takes shorter time than DA-exp99 case. In future work, we will integrate these ideas with the interpolation technology described in [29] to give a robust approach to dimension reduction of large datasets that scales like O(N) rather O(N 2 ) of general MDS methods.…”
Section: Discussionmentioning
confidence: 99%
“…It has applied linear discriminant analysis to the labeled objects in the representation space. In contrast to them, [7] has proposed an EM-like optimization solution, called MI-MDS to solve the problem with STRESS criteria in (26), which found embedding of approximating to the distance rather than the inner product as in CMDS. In addition to that, [6] has proposed a heuristic method, called HE-MI, to lower the time cost of MI-MDS.…”
Section: Related Workmentioning
confidence: 99%
“…MI-MDS is an iterative majorization algorithm proposed by [7] to minimize the STRESS value in (32), where all weights are assumed to be 1. It will find nearest neighbors from insample points of a given out-of-sample point � at first, denoted as = { 1 , 2 , 3 , … , } .…”
Section: A Out-of-sample Problem and Mi-mdsmentioning
confidence: 99%
See 1 more Smart Citation