Building k edge-disjoint spanning trees of minimum total length for isometric data embedding

Li, Yang

doi:10.1109/tpami.2005.192

Cited by 55 publications

(34 citation statements)

References 8 publications

“…Besides the embedding results, the residual variance is also taken as an evaluation criterion (Tenenbaum et al, 2000; Geng et al, 2005; Yang, 2005, 2006). Residual variance is defined as $1 - R^2(D_Y, D_G)$, where $D_Y$ is a matrix of Euclidean distances between data points after embedding, $D_G$ is a matrix of estimated geodesic distances, and $R$ represents the correlation coefficient.…”

Section: Results (mentioning)

confidence: 99%
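
The residual variance defined in the statement above (one minus the squared correlation between embedded Euclidean distances and estimated geodesic distances) can be illustrated with a minimal sketch. The function name `residual_variance` and the nested-list matrix representation are assumptions made for this example; they do not come from any of the cited papers.

```python
import math

def residual_variance(d_embed, d_geo):
    """Return 1 - R^2, with R the Pearson correlation of the matrix entries.

    d_embed: pairwise Euclidean distances after embedding (nested lists).
    d_geo:   estimated geodesic distances, same shape (nested lists).
    """
    # Flatten both matrices so that every pairwise distance is one sample.
    y = [v for row in d_embed for v in row]
    g = [v for row in d_geo for v in row]
    if len(y) != len(g):
        raise ValueError("distance matrices must have the same shape")
    n = len(y)
    mean_y = sum(y) / n
    mean_g = sum(g) / n
    # Pearson correlation coefficient computed from its definition.
    cov = sum((a - mean_y) * (b - mean_g) for a, b in zip(y, g))
    var_y = sum((a - mean_y) ** 2 for a in y)
    var_g = sum((b - mean_g) ** 2 for b in g)
    r = cov / math.sqrt(var_y * var_g)
    return 1.0 - r * r
```

A value near 0 means the embedded distances are almost perfectly linearly related to the geodesic estimates; larger values indicate a poorer fit.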

“…The second method is the k-nearest neighbors approach, which has been extensively applied and improved in many ways: The transitive closure of neighbors of any data point is required to cover all data points, otherwise the information on relative positions of the connected components would be lost. This is why effective approaches have been proposed to build the connected neighborhood graph (Yang, 2005, 2006). Since the Euclidean distance cannot be applied to discover the neighborhood with any shape, the geodesic distance and path algebra have been applied to determine whether two points are neighbors (Varini et al, 2006).…”

Section: Related Work (mentioning)

confidence: 99%
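
The connectivity requirement in the statement above can be checked directly: build the k-nearest-neighbor graph and count its connected components. The sketch below is an illustration only; `knn_edges` and `count_components` are hypothetical helper names, and Euclidean distance on coordinate tuples is an assumption.

```python
import math

def knn_edges(points, k):
    """Undirected edge set of the k-nearest-neighbor graph (Euclidean)."""
    edges = set()
    for i, p in enumerate(points):
        # Rank all other points by distance to p and keep the k closest.
        others = sorted((j for j in range(len(points)) if j != i),
                        key=lambda j: math.dist(p, points[j]))
        for j in others[:k]:
            edges.add((min(i, j), max(i, j)))
    return edges

def count_components(n, edges):
    """Number of connected components, computed with union-find."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i, j in edges:
        parent[find(i)] = find(j)
    return len({find(x) for x in range(n)})

# Two well-separated clusters of three points each.
pts = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
print(count_components(len(pts), knn_edges(pts, 2)))
print(count_components(len(pts), knn_edges(pts, 3)))
```

With `k = 2` every point finds both neighbors inside its own cluster, so the graph splits into two components; with `k = 3` each point must reach across, and the graph becomes connected.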

“…In the relative space, $d(y_3, y_1) < d(y_3, y_4)$, the outlier or noise becomes further away from the normal points, making it easier to refrain the influence of noises on isometric embedding approaches. Although this could produce disconnected manifolds and makes most manifold learning approaches fail to finish the correct embedding, it can be easily solved by building k-edges connected neighborhood graph (Yang, 2005, 2006). Furthermore, in the relative space, the distances among points vary nonlinearly.…”

Section: Local Relative Transformation (mentioning)

confidence: 99%

“…However, if neighborhood size k is too small, disconnected manifolds could be produced. A neighborhood graph based on a smaller neighborhood size will lead to more errors in the estimation of geodesic distances that in turn leads to drastically incorrect embedding (Yang, 2005). This indicates that k should take the largest possible value when short-circuit edges do not appear.…”

Section: Enhanced Isometric Embedding Approach (mentioning)

confidence: 99%

Local relative transformation with application to isometric embedding

Wen

Jiang

Wen

2009

Pattern Recognition Letters

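“…(multi-dimensional scaling) method to obtain the globally optimal geometric structure, and it achieves good results. Many improved algorithms have since been developed, such as kernel-based ISOMAP, supervised ISOMAP [3], and incremental ISOMAP [4]. LLE keeps the local geometric structure unchanged during the dimensionality-reducing embedding, avoids local minima, and ultimately obtains a global low-dimensional embedding, also with good results. Current improved algorithms include HLLE (Hessian LLE) [5], which uses the Hessian transform; supervised LLE, which uses class information; incremental LLE [6]; and a Fisher-based improvement of LLE [7]. Fairly in-depth theoretical research and practical applications have also been carried out in China [8], for example a proof of the existence of an isometric mapping between a continuous manifold and its low-dimensional parameter space in ISOMAP [9], and a study of the relationship between high-dimensional observed data and its low-dimensional parameter-space data based on magnification factors and stretching directions [10]. ISOMAP's basic assumptions are a global isometric mapping and a convex parameter space, which are hard to satisfy in many cases; HLLE requires only a local isometric mapping and an open connected parameter space, so it has a wider range of application. However, like ISOMAP, it depends heavily on whether the local neighborhoods correctly reflect the intrinsic structure of the manifold. Existing k-nearest-neighbor methods for determining neighborhoods tend to produce distorted neighborhood structures on sparse and noisy data, which leads to short circuits [11]. A short circuit occurs when folds of the manifold lie very close together, so that the neighborhood of some points draws from different folds and its members are therefore not the nearest neighbors on the manifold. This often causes a significant loss of performance, so neighborhood optimization is needed. Neighborhood optimization methods include constructing a connected neighborhood graph by repeatedly extracting minimum spanning trees from the complete graph [12], which ensures that the relative positions of the data are not lost after dimensionality reduction. Another method redefines the distance using the class information of the data and then determines neighborhoods with the newly defined distance [3]; its drawback is that it does not apply to data without class information. There is also work that uses the residual and the linear reconstruction coefficients to select the best neighborhood size automatically [13-15], but once chosen, the neighborhood size is still the same for every data point. Another method selects a preliminary neighborhood for each point, uses PCA (principal component analysis) to construct the principal linear subspace of that neighborhood, and then deletes the neighbors that deviate from this subspace [16]. This method may not apply when the neighborhood is intrinsically nonlinear, and its many parameters make it difficult to use. … [17], optimizing the neighborhood with graph algebra [18], and so on, but the neighborhood size is still globally uniform. HLLE needs local regions to remain linear, and a globally uniform neighborhood size can hardly guarantee this when the data manifold is non-uniformly distributed: if the neighborhood parameter is too large, the small-scale structure of the manifold is easily eliminated and short circuits become unavoidable, whereas if it is too small, the manifold tends to split [19]. We therefore previously proposed a method that recursively decomposes the whole unevenly distributed manifold into approximately uniformly distributed sub-manifolds and automatically computes the neighborhood size of each sub-manifold, thereby improving LLE [20], but it requires computing, between all points, the … 2) Use the ISOMAP method to compute the local geodesic distance between any two points in the local data set $X_i$, which mainly involves two steps:…”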

unclassified
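
The statement above mentions building a connected neighborhood graph by repeatedly extracting minimum spanning trees from the complete graph. The following is a minimal sketch of that idea under stated assumptions: Euclidean edge lengths, Kruskal's algorithm for each extraction, and hypothetical function names (`kruskal`, `repeated_mst_trees`). It is a greedy illustration, not a reproduction of the cited paper's algorithm, and it makes no claim that the resulting total length is minimal.

```python
import math
from itertools import combinations

def kruskal(n, weighted_edges):
    """Minimum spanning tree over (weight, i, j) edges, or None if disconnected."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    tree = []
    for w, i, j in sorted(weighted_edges):
        ri, rj = find(i), find(j)
        if ri != rj:
            parent[ri] = rj
            tree.append((w, i, j))
    return tree if len(tree) == n - 1 else None

def repeated_mst_trees(points, k):
    """Extract k edge-disjoint spanning trees, one after another."""
    n = len(points)
    # Start from the complete graph on the data points.
    remaining = [(math.dist(points[i], points[j]), i, j)
                 for i, j in combinations(range(n), 2)]
    trees = []
    for _ in range(k):
        tree = kruskal(n, remaining)
        if tree is None:
            raise ValueError("remaining graph has no spanning tree")
        trees.append(tree)
        # Remove the edges just used so the next tree is edge-disjoint.
        used = {(i, j) for _, i, j in tree}
        remaining = [e for e in remaining if (e[1], e[2]) not in used]
    return trees
```

The union of the returned trees would serve as the neighborhood graph; since each tree spans all points, the graph is connected for any `k >= 1`.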

Dynamically Determining Neighborhood Parameter for Locally Linear Embedding

Wen¹

2008

Journal of Software

Abstract: Locally linear embedding is a very competitive nonlinear dimensionality reduction method with good representational capacity for a broad range of manifolds and high computational efficiency. However, it is based on the assumption that the whole data manifold is evenly distributed, so it determines the neighborhood of every point with the same neighborhood size. Accordingly, it fails to deal well with most real problems, in which the data are unevenly distributed. This paper presents a new approach that adopts the general conceptual framework of Hessian locally linear embedding, so as to guarantee its correctness in the setting of local isometry to an open connected subset, but dynamically determines the local neighborhood size for each point. The approach estimates the approximate geodesic distance between any two points by the shortest path in the local neighborhood graph, and then determines the neighborhood size for each point from the relationship between its local estimated geodesic distance matrix and its local Euclidean distance matrix. It has clear geometric intuition as well as better performance and stability on sparsely sampled or noise-contaminated data sets, which are often unevenly distributed. Experiments on benchmark data sets validate the proposed approach.
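
The abstract describes estimating geodesic distances by shortest paths in a neighborhood graph. A minimal sketch of that step follows, using the Floyd-Warshall all-pairs algorithm as one possible choice; the abstract does not say which shortest-path algorithm the authors use, and the function name `geodesic_distances` and the `(i, j, weight)` edge format are assumptions of this example.

```python
import math

def geodesic_distances(n, weighted_edges):
    """All-pairs shortest-path lengths in an undirected weighted graph.

    weighted_edges: iterable of (i, j, weight) with 0 <= i, j < n.
    Unreachable pairs keep the value math.inf.
    """
    dist = [[0.0 if i == j else math.inf for j in range(n)] for i in range(n)]
    for i, j, w in weighted_edges:
        # Keep the shortest edge if the same pair appears more than once.
        if w < dist[i][j]:
            dist[i][j] = w
            dist[j][i] = w
    # Floyd-Warshall: allow each vertex m in turn as an intermediate stop.
    for m in range(n):
        for i in range(n):
            for j in range(n):
                via = dist[i][m] + dist[m][j]
                if via < dist[i][j]:
                    dist[i][j] = via
    return dist
```

An infinite entry in the result signals that the neighborhood graph is disconnected, which is the failure case the citation statements on this page discuss.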

Building cost effective lower layer VPNs: The ILEC/CLEC dilemma

Meddeb

2010

Int J Communication

SUMMARY: Layer 2 and layer 1 Virtual Private Network (VPN) services, ranging from simple leased lines to extending private LANs across public networks, are commonplace today. With the continuously growing economic difficulties, capital meltdown, and telecommunication business turmoil, delivering those VPN services at the lowest cost or with the maximum revenue margin, while committing to Service Level Agreements (SLA), has become essential. We show that whether we tackle the optimal VPN design problem from an Incumbent Local Exchange Carrier (ILEC) standpoint or from a Competitive Local Exchange Carrier (CLEC) standpoint, we obtain contradictory rules. We show that by building Edge Disjoint VPN trees and spreading the traffic all over the network, the ILEC can achieve maximum throughput and enhanced network performance, while by concentrating all the VPN traffic over a single tree, the CLEC can minimize the cost of leased bandwidth. We then propose two simple algorithms that can help carriers and service providers leverage their networks and increase their revenue margins while meeting SLA requirements.
