2009
DOI: 10.1016/j.ipm.2009.02.001
|View full text |Cite
|
Sign up to set email alerts
|

Discovering power laws in computer programs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

2
7
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 13 publications
(9 citation statements)
references
References 38 publications
2
7
0
Order By: Relevance
“…The occurrences of tags in online resources1920, keywords in scientific publications21 and words in web pages resulted from web searching22 also simultaneously display the Zipf's and Heaps' laws. Interestingly, even the identifiers in programs by Java, C++ and C languages exhibit the same scaling laws23.…”
mentioning
confidence: 94%
“…The occurrences of tags in online resources1920, keywords in scientific publications21 and words in web pages resulted from web searching22 also simultaneously display the Zipf's and Heaps' laws. Interestingly, even the identifiers in programs by Java, C++ and C languages exhibit the same scaling laws23.…”
mentioning
confidence: 94%
“…2)-4) As such, Zipf's law has attracted a great deal of attention, but the general mechanism of Zipf's law remains unclear. In quantitative linguistics, Zipf's exponent, α, is evaluated for various languages, such as English and Russian, 5) and for programming languages, such as Java and C, 6) and is known in many cases to be approximately 1. Heaps' law 7) states that the number of distinct words increases nonlinearly as the total number of words in a document increases.…”
Section: §1 Introductionmentioning
confidence: 99%
“…For all component systems whose realizations have this temporal ordering of components, it is possible to evaluate over a single realization how the number of different components h grows with the realization size m. More in general, the same scaling can be analyzed for component systems even without a natural ordering of components (for example for genomes as composed by genes or for LEGO toys) if the sizes M of the available realizations span a sufficiently large range. As discussed in the Introduction, in several empirical systems this quantity follows a sublinear and approximately power-law function h(m) ∝ m ν (with ν < 1), known as Heaps's law [5,17,[28][29][30][31][32]. Each run of the SSR process also generates an ordered sequence of components (or visited states), and the question is what is the predicted scaling of h(m) for this stochastic process.…”
Section: Resultsmentioning
confidence: 99%
“…This law describes the sublinear growth of the number of different components (i.e. the observed vocabulary) with the system size (i.e., the total number of components), and has been observed in several empirical systems from linguistics to genomics [5,[28][29][30][31][32]. In models based on equilibrium ensembles, such as the random-group-formation model [22], the vocabulary is typically a fixed parameter, thus this scaling cannot be addressed straightforwardly.…”
Section: Introductionmentioning
confidence: 99%