The Internet is a complex network of interconnected routers and the existence of collective behavior such as congestion suggests that the correlations between different connections play a crucial role. It is thus critical to measure and quantify these correlations. We use methods of random matrix theory (RMT) to analyze the cross-correlation matrix C of information flow changes of 650 connections between 26 routers of the French scientific network 'Renater'. We find that C has the universal properties of the Gaussian orthogonal ensemble of random matrices: The distribution of eigenvalues-up to a rescaling which exhibits a typical correlation time of the order 10 minutes-and the spacing distribution follows the predictions of RMT. There are some deviations for large eigenvalues which contain network-specific information and which identify genuine correlations between connections. The study of the most correlated connections reveal the existence of 'active centers' which are exchanging information with a large number of routers thereby inducing correlations between the corresponding connections. These strong correlations could be a reason for the observed self-similarity in the WWW traffic.
The Internet infrastructure is not virtual: its distribution is dictated by social, geographical, economical, or political constraints. However, the infrastructure's design does not determine entirely the information traffic and different sources of complexity such as the intrinsic heterogeneity of the network or human practices have to be taken into account. In order to manage the Internet expansion, plan new connections or optimize the existing ones, it is thus critical to understand correlations between emergent global statistical patterns of Internet activity and human factors. We analyze data from the French national 'Renater' network which has about 2 millions users and which consists in about 30 interconnected routers located in different regions of France and we report the following results. The Internet flow is strongly localized: most of the traffic takes place on a 'spanning' network connecting a small number of routers which can be classified either as 'active centers' looking for information or 'databases' providing information. We also show that the Internet activity of a region increases with the number of published papers by laboratories of that region, demonstrating the positive impact of the Web on scientific activity and illustrating quantitatively the adage 'the more you read, the more you write'.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.