Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing 2010
DOI: 10.1145/1851476.1851549
|View full text |Cite
|
Sign up to set email alerts
|

Browsing large scale cheminformatics data with dimension reduction

Abstract: Visualization of large-scale high dimensional data tool is highly valuable for scientific discovery in many fields. We present PubChemBrowse, a customized visualization tool for cheminformatics research. It provides a novel 3D data point browser that displays complex properties of massive data on commodity clients. As in GIS browsers for Earth and Environment data, chemical compounds with similar properties are nearby in the browser. PubChemBrowse is built around in-house high performance parallel MDS (Multi-D… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

1
6
0

Year Published

2011
2011
2018
2018

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 22 publications
(39 reference statements)
1
6
0
Order By: Relevance
“…It is observed that MDS yields two distinct clusters of compounds, while the lower complexity compounds are scattered towards the edges. The visualization obtained here is comparable to those obtained in [8], [17].…”
Section: ) Pubchem Datasetsupporting
confidence: 87%
“…It is observed that MDS yields two distinct clusters of compounds, while the lower complexity compounds are scattered towards the edges. The visualization obtained here is comparable to those obtained in [8], [17].…”
Section: ) Pubchem Datasetsupporting
confidence: 87%
“…The main dilemma is how to represent high dimensional data in 2D display space without adding extra visual clutter. Several approaches have been introduced to visually represent high dimensional data, but only a few of them are scalable [5,18,33,55]. Additionally, challenges include identifying patterns with large a number of dimensions, and following trends across multiple dimensions.…”
Section: Challenges Of High Dimensional Heterogeneous Geotemporal Datmentioning
confidence: 99%
“…The goal is to maintain the relations between the high dimension data in the lower dimension space without adding any new relations that are not originally in the data. Techniques such as principle component analysis (PCA) [72,55], multidimensional scaling (MDS) [3,33], and Self Organizing Maps (SOM) [3,57] are commonly used to transform data to lower dimensions.…”
Section: Dimensional Reduction Approachesmentioning
confidence: 99%
See 1 more Smart Citation
“…This collection of papers was selected from those presented at the Emerging Computational Methods for the Life Sciences Workshop of the ACM HPDC 2010 conference , Chicago, Illinois June 22, 2010. The papers were all enhanced over the conference versions and separately reviewed.…”
mentioning
confidence: 99%