2016
DOI: 10.1371/journal.pcbi.1005110

Zipf’s Law Arises Naturally When There Are Underlying, Unobserved Variables

Abstract: Zipf’s law, which states that the probability of an observation is inversely proportional to its rank, has been observed in many domains. While there are models that explain Zipf’s law in each of them, those explanations are typically domain specific. Recently, methods from statistical physics were used to show that a fairly broad class of models does provide a general explanation of Zipf’s law. This explanation rests on the observation that real world data is often generated from underlying causes, known as l…
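
The statement that probability is inversely proportional to rank can be made concrete with a short numerical check. The following is a minimal sketch, not code from the paper: the function name `zipf_probabilities` and the choice of 1000 ranks are assumptions made purely for illustration. It builds the normalized distribution p(r) = (1/r) / H_n, where H_n is the n-th harmonic number, and computes rank times probability for every rank.

```python
def zipf_probabilities(n_items: int) -> list[float]:
    """Return p(r) = (1/r) / H_n for ranks r = 1..n_items.

    Hypothetical helper for illustration; H_n is the n-th harmonic
    number, used here only to make the probabilities sum to 1.
    """
    if n_items < 1:
        raise ValueError("n_items must be at least 1")
    weights = [1.0 / rank for rank in range(1, n_items + 1)]
    normalizer = sum(weights)  # harmonic number H_n
    return [w / normalizer for w in weights]


# Assumed size of the rank list, chosen arbitrarily for the example.
probs = zipf_probabilities(1000)

# Under an exact Zipf law, rank * probability is the same constant
# (1 / H_n) for every rank.
products = [rank * p for rank, p in enumerate(probs, start=1)]
```

In this idealized case every entry of `products` is equal up to floating-point error. Real data would be expected to follow the relation only approximately.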

Citations: cited by 79 publications (114 citation statements)
References: 30 publications
“…Consistent with Zipf's law, the power-law distribution of transcript abundance [162,163], the top 200 expressed transcripts in the bulk RNA-seq data contribute around 50% of the total detected transcripts in the scRNA-seq data and it is clear that these are the only transcripts detected reliably (Table S5). The abundant transcripts in bulk RNA-seq include many cell type-specific surface markers which explains the ability to use scRNA-seq to discover such markers.…”
Section: The Relationship Between Single Cell and Bulk RNA-seq Data
Citation type: supporting
confidence: 53%
“…It is curious that Zipf's Law (Zipf 1935), most commonly associated with linguistics, works well for exoplanet multiplicities. Zipfian laws are argued by Aitchison et al (2016) to be natural outcomes of systems involving a large number of latent variables, and this may represent another example. Extending our analysis to M-dwarfs, particularly from TESS, will provide a good test as to whether the Zipfian model can persist in the face of new data.…”
Section: Discussion
Citation type: mentioning
confidence: 99%
“…Of course, the knowledge that ŝ comes from a given distribution p(s|θ) changes considerably the picture [footnote 9]. In that case, the information that the sample ŝ contains on the generative … [Footnote 9: Even the presence of a structure in the data points provides useful information beyond what is assumed in our general setting. For example, if s = (σ_1, .…]”
Section: Relation With Parametric Models
Citation type: mentioning
confidence: 99%