We report improved whole-genome shotgun sequences for the genomes of indica and japonica rice, both with multimegabase contiguity, or almost 1,000-fold improvement over the drafts of 2002. Tested against a nonredundant collection of 19,079 full-length cDNAs, 97.7% of the genes are aligned, without fragmentation, to the mapped super-scaffolds of one or the other genome. We introduce a gene identification procedure for plants that does not rely on similarity to known genes to remove erroneous predictions resulting from transposable elements. Using the available EST data to adjust for residual errors in the predictions, the estimated gene count is at least 38,000–40,000. Only 2%–3% of the genes are unique to any one subspecies, comparable to the amount of sequence that might still be missing. Despite this lack of variation in gene content, there is enormous variation in the intergenic regions. At least a quarter of the two sequences could not be aligned, and where they could be aligned, single nucleotide polymorphism (SNP) rates varied from as little as 3.0 SNP/kb in the coding regions to 27.6 SNP/kb in the transposable elements. A more inclusive new approach for analyzing duplication history is introduced here. It reveals an ancient whole-genome duplication, a recent segmental duplication on Chromosomes 11 and 12, and massive ongoing individual gene duplications. We find 18 distinct pairs of duplicated segments that cover 65.7% of the genome; 17 of these pairs date back to a common time before the divergence of the grasses. More important, ongoing individual gene duplications provide a never-ending source of raw material for gene genesis and are major contributors to the differences between members of the grass family.
Microsatellite (MS) polymorphism is an important source of genetic diversity, providing support for map-based cloning and molecular breeding. We have developed a new database that contains 52 845 polymorphic MS loci between indica and japonica, composed of ample Class II MS markers, and integrated 18 828 MS loci from IRGSP and genetic markers from RGP. Based on genetic marker positions on the rice genome ( ), we determined the approximate genetic distances of these MS loci and validated 100 randomly selected markers experimentally with 90% success rate. In addition, we recorded polymorphic MS positions in indica cv. 9311 that is the most important paternal parent of the two-line hybrid rice in China. Our database will undoubtedly facilitate the application of MS markers in genetic researches and marker-assisted breeding. The data set is freely available from .
This study aimed to find potential diagnostic markers for osteoarthritis (OA) and analyze the role of immune cells infiltration in this pathology. We used OA datasets from the Gene Expression Omnibus database. First, R software was used to identify differentially expressed genes (DEGs) and perform functional correlation analysis. Then least absolute shrinkage and selection operator (LASSO) logistic regression and support vector machine-recursive feature elimination algorithms were used to screen and verify the diagnostic markers of OA. Finally, CIBERSORT was used to evaluate the infiltration of immune cells in OA tissues, and the correlation between diagnostic markers and infiltrating immune cells was analyzed. A total of 458 DEGs were screened in this study. GRB10 and E2F3 (AUC = 0.962) were identified as diagnostic markers of OA. Immune cell infiltration analysis found that resting mast cells, T regulatory cells, CD4 memory resting T cells, activated NK cells, and eosinophils may be involved in the OA process. In addition, GRB10 was correlated with NK resting cells, naive CD4 + T cells, and M1 macrophages, while E2F3 was correlated with resting mast cells. In conclusion, GRB10 and E2F3 can be used as diagnostic markers of osteoarthritis, and immune cell infiltration plays an important role in the occurrence and progression of OA.
The genome sequence of the Severe Acute Respiratory Syndrome (SARS)-associated virus provides essential information for the identification of pathogen(s), exploration of etiology and evolution, interpretation of transmission and pathogenesis, development of diagnostics, prevention by future vaccination, and treatment by developing new drugs. We report the complete genome sequence and comparative analysis of an isolate (RIOI ) of the coronavirus that has been recognized as a pathogen for SARS. The genome is 29725 nt in size and has 11 ORFs (Open Reading Frames). It is composed of a stable region encoding an RNA-dependent RNA polymerase (composed of 2 ORFs) and a variable region representing 4 CDSs (coding sequences) for viral structural genes (the S, E, M, N proteins) and 5 PUPs (putative uncharacterized proteins). Its gene order is identical to that of other known coronaviruses. The sequence alignment with all known RNA viruses places this virus as a member in the family of Coronaviridae. Thirty putative substitutions have been identified by comparative analysis of the 5 SARSassociated virus genome sequences in GenBank. Fifteen of them lead to possible amino acid changes (non-synonymous mutations) in the proteins. Three amino acid changes, with predicted alteration of physical and chemical features, have been detected in the S protein that is postulated to be involved in the immunoreactions between the virus and its host. Two amino acid changes have been detected in the M protein, which could be related to viral envelope formation. Phylogenetic analysis suggests the possibility of non-human origin of the SARS-associated viruses but provides no evidence that they are man-made. Further efforts should focus on identifying the etiology of the SARS-associated virus and ruling out conclusively the existence of other possible SARS-related pathogen(s).
This paper aimed to study the time course changes in taste compounds of Dezhou-braised chicken during the entire cooking process mainly consisting of deep-frying, high-temperature boiling, and low-temperature braising steps. For this purpose, meat samples at different processing stages were analyzed for 5'-nucleotides and free amino acids, and were also subjected to electronic tongue measurements. Results showed that IMP, Glu, Lys, and sodium chloride were the main compounds contributing to the taste attributes of the final product. IMP and Glu increased in the boiling step and remained unchanged in the following braising steps. Meanwhile, decrease in Lys content and increase in sodium chloride content were observed over time in both boiling and braising steps. Intensities for bitterness, saltiness, and Aftertaste-B obtained from the electronic tongue analysis were correlated with the concentrations of these above chemical compounds. Therefore, the electronic tongue system could be applied to evaluate the taste development of Dezhou-braised chicken during processing.
Human leukocyte antigen (HLA) system is the most polymorphic region known in the human genome. In the present study, we analyzed for the first time the HLA-A gene polymorphisms defined by the high-resolution typing methods—sequence-based typing (SBT) in 161 Northern Chinese Han people. A total of 74 different HLA-A gene types and 36 alleles were detected. The most frequent alleles were A*110101 (GF=0.2360), A*24020101 (GF=0.1646), and A*020101 (GF=0.1553); followed by A*3303 (GF=0.1180), A*3001 (GF=0.0590), and A*310102 (GF=0.0404). The frequencies of following alleles, A*0203, A*0205, A*0206, A*0207, A*030101, A*2423, A*2601, A*3201, and A*3301, are all higher than 0.0093. The homozygous alleles include A*020101, A*110101, A*24020101 and A*310102. Heterozygosity (H), polymorphism information content (PIC), discrimination power (DP) and probability of paternity exclusion (PPE) of HLA-A in the samples were calculated and their values were 0.8705, 0.8491, 0.6014, and 0.9475, respectively. These results by SBT analysis of HLA-A polymorphism in Northern Chinese Han population, especially the allele subtypes character, will be of great interest for clinical transplantation, disease-associated study and forensic identification. Implementation of high-resolution typing methods allows a significantly wider spectrum of HLA variation including rare alleles. This spectrum will further be extensively utilized in many fields.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.