2022
DOI: 10.22541/au.166998603.34117027/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Easy-to-use R functions to separate reduced-representation genomic datasets into sex-linked and autosomal loci, and conduct sex-assignment

Abstract: Identifying sex-linked markers in genomic datasets is important, because their analyses can reveal sex-specific biology, and their presence in supposedly neutral autosomal datasets can result in incorrect estimates of genetic diversity, population structure and parentage. But detecting sex-linked loci can be challenging, and available scripts neglect some categories of sex-linked variation. Here, we present new R functions to (1) identify and separate sex-linked loci in ZW and XY sex determination systems and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 45 publications
0
4
0
Order By: Relevance
“…Genotypes were filtered by removing individuals with >30% of missing data, loci with reproducibility <95%, loci missing in >20% of individuals and monomorphic loci. We further removed loci likely to represent incorrectly merged loci (78 loci with heterozygosity significantly ( p < 0.05) larger than the maximum expected value of 0.5 in either Cotter or Cataract populations) using R function filter.excess.het (Robledo‐Ruiz et al., 2023). We did not apply the function to the Cotter‐only dataset where multi‐age group/cohorts represented populations because excess of heterozygosity could result from admixture between previously isolated populations (Wahlund‐breaking), a suspected outcome of the 2013 Cotter Reservoir expansion.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Genotypes were filtered by removing individuals with >30% of missing data, loci with reproducibility <95%, loci missing in >20% of individuals and monomorphic loci. We further removed loci likely to represent incorrectly merged loci (78 loci with heterozygosity significantly ( p < 0.05) larger than the maximum expected value of 0.5 in either Cotter or Cataract populations) using R function filter.excess.het (Robledo‐Ruiz et al., 2023). We did not apply the function to the Cotter‐only dataset where multi‐age group/cohorts represented populations because excess of heterozygosity could result from admixture between previously isolated populations (Wahlund‐breaking), a suspected outcome of the 2013 Cotter Reservoir expansion.…”
Section: Methodsmentioning
confidence: 99%
“…Smaller per‐cohort Colony2 datasets were also created using the above filtering criteria. R function gl2colony (Robledo‐Ruiz et al., 2023) was used to create Colony2 input files within R.…”
Section: Methodsmentioning
confidence: 99%
“…While SeXY can perform sexing with a single individual at low depth and was tailored to also work with reference genomes of divergent taxa, its sexing is based on arbitrary thresholds. Other methods (Gautier, 2014; Robledo‐Ruiz et al., 2023) can also infer both sex and sex‐linked loci but are designed for SNP data sets (e.g. RAD sequencing) and classify each locus individually.…”
Section: Introductionmentioning
confidence: 99%
“…While SeXY can perform sexing with a single individual at low depth and was tailored to also work with reference genomes of divergent taxa, its sexing is based on arbitrary thresholds. Other methods (Robledo-Ruiz et al, 2023;Gautier, 2014) can also infer both sex and sex-linked loci but are designed for SNP data sets (e.g. RAD sequencing) and classify each locus individually.…”
Section: Introductionmentioning
confidence: 99%