Through a prospective clinical sequencing program for advanced cancers, four index cases were identified which harbor gene rearrangements of FGFR2 including patients with cholangiocarcinoma, breast cancer, and prostate cancer. After extending our assessment of FGFR rearrangements across multiple tumor cohorts, we identified additional FGFR gene fusions with intact kinase domains in lung squamous cell cancer, bladder cancer, thyroid cancer, oral cancer, glioblastoma, and head and neck squamous cell cancer. All FGFR fusion partners tested exhibit oligomerization capability, suggesting a shared mode of kinase activation. Overexpression of FGFR fusion proteins induced cell proliferation. Two bladder cancer cell lines that harbor FGFR3 fusion proteins exhibited enhanced susceptibility to pharmacologic inhibition in vitro and in vivo. Due to the combinatorial possibilities of FGFR family fusion to a variety of oligomerization partners, clinical sequencing efforts which incorporate transcriptome analysis for gene fusions are poised to identify rare, targetable FGFR fusions across diverse cancer types.
A major goal in drug design is the improvement of computational methods for docking and scoring. The Community Structure Activity Resource (CSAR) aims to collect available data from industry and academia which may be used for this purpose (). Also, CSAR is charged with organizing community-wide exercises based on the collected data. The first of these exercises was aimed to gauge the overall state of docking and scoring, using a large and diverse data set of protein–ligand complexes. Participants were asked to calculate the affinity of the complexes as provided and then recalculate with changes which may improve their specific method. This first data set was selected from existing PDB entries which had binding data (Kd or Ki) in Binding MOAD, augmented with entries from PDBbind. The final data set contains 343 diverse protein–ligand complexes and spans 14 pKd. Sixteen proteins have three or more complexes in the data set, from which a user could start an inspection of congeneric series. Inherent experimental error limits the possible correlation between scores and measured affinity; R2 is limited to ∼0.9 when fitting to the data set without over parametrizing. R2 is limited to ∼0.8 when scoring the data set with a method trained on outside data. The details of how the data set was initially selected, and the process by which it matured to better fit the needs of the community are presented. Many groups generously participated in improving the data set, and this underscores the value of a supportive, collaborative effort in moving our field forward.
Binding MOAD (Mother of All Databases) is a database of 9836 protein–ligand crystal structures. All biologically relevant ligands are annotated, and experimental binding-affinity data is reported when available. Binding MOAD has almost doubled in size since it was originally introduced in 2004, demonstrating steady growth with each annual update. Several technologies, such as natural language processing, help drive this constant expansion. Along with increasing data, Binding MOAD has improved usability. The website now showcases a faster, more featured viewer to examine the protein–ligand structures. Ligands have additional chemical data, allowing for cheminformatics mining. Lastly, logins are no longer necessary, and Binding MOAD is freely available to all at http://www.BindingMOAD.org.
The residue composition of a ligand binding site determines the interactions available for diffusion-mediated ligand binding, and understanding general composition of these sites is of great importance if we are to gain insight into the functional diversity of the proteome. Many structure-based drug design methods utilize such heuristic information for improving prediction or characterization of ligand-binding sites in proteins of unknown function. The Binding MOAD database if one of the largest curated sets of protein-ligand complexes, and provides a source of diverse, high-quality data for establishing general trends of residue composition from currently available protein structures. We present an analysis of 3,295 non-redundant proteins with 9,114 non-redundant binding sites to identify residues over-represented in binding regions versus the rest of the protein surface. The Binding MOAD database delineates biologically-relevant “valid” ligands from “invalid” small-molecule ligands bound to the protein. Invalids are present in the crystallization medium and serve no known biological function. Contacts are found to differ between these classes of ligands, indicating that residue composition of biologically relevant binding sites is distinct not only from the rest of the protein surface, but also from surface regions capable of opportunistic binding of non-functional small molecules. To confirm these trends, we perform a rigorous analysis of the variation of residue propensity with respect to the size of the dataset and the content bias inherent in structure sets obtained from a large protein structure database. The optimal size of the dataset for establishing general trends of residue propensities, as well as strategies for assessing the significance of such trends, are suggested for future studies of binding-site composition.
Coller BS. Impact of sex, age, race, ethnicity and aspirin use on bleeding symptoms in healthy adults. J Thromb Haemost 2011; 9: 100-8.Summary.Background: Comparing a patientÕs bleeding symptoms with those of healthy individuals is an important component of the diagnosis of bleeding disorders, but little is known about whether bleeding symptoms in healthy individuals vary by sex, race, ethnicity, age, or aspirin use. Objectives, Patients/Methods: We developed a comprehensive, ontologybacked, Web-based questionnaire to collect bleeding histories from 500 healthy adults. The mean age was 43 years (range 19-86 years), 63% were female, 19% were Hispanic, 37% were African-American, 43% were Caucasian, 8% were Asian, and 4% were multiracial. Results: 18 of the 36 symptoms captured occurred with < 5% frequency, and 26% of participants reported no bleeding symptoms (range 0-19 symptoms). Differences in sex, race, ethnicity, aspirin use and age accounted for only 6-13% of the variability in symptoms. Although men reported fewer symptoms than women (median 1 vs. 2, P < 0.01), there was no difference when sex-specific questions were excluded (median 1 for both men and women, P = 0.50). However, women reported more easy bruising (24% vs. 7%, P < 0.01) and venipuncture-related bruising (10% vs. 3%, P = 0.02). The number of symptoms did not vary by race or age, but epistaxis was reported more frequently by Caucasians than by African-Americans (29% vs. 18%, P = 0.02), and epistaxis frequency decreased with age (odds ratio 0.97 per year, P < 0.01). Paradoxically, infrequent aspirin users reported more bruising and heavy menses than frequent users (21% vs. 8%, P = 0.01, and 56% vs. 38%, P = 0.03, respectively). Conclusions: Our findings provide a contemporaneous and comprehensive description of bleeding symptoms in a diverse group of healthy individuals. Our Web-based system is freely available to other investigators.
Physical differences in small molecule binding between enzymes and non-enzymes were found through mining the protein-ligand database, Binding MOAD (Mother of All Databases). The data suggest that divergent approaches may be more productive for improving the affinity of ligands for the two classes of proteins. High-affinity ligands of enzymes are much larger than those with low affinity, indicating that the addition of complementary functional groups is likely to improve the affinity of an enzyme inhibitor. However, this process may not be as fruitful for ligands of nonenzymes. High-and low-affinity ligands of non-enzymes are nearly the same size, so modest modifications and isosteric replacement might be most productive. The inherent differences between enzymes and non-enzymes have significant ramifications for scoring functions and structure-based drug design. In particular, non-enzymes were found to have greater ligand efficiencies than enzymes. Ligand efficiencies are often used to indicate druggability of a target, and this finding supports the feasibility of non-enzymes as drug targets. The differences in ligand efficiencies do not appear to come from the ligands; instead, the pockets yield different amino acid compositions, despite very similar distributions of amino acids in the overall protein sequences.
Background Anti-PD-1 and PD-L1 (collectively PD-[L]1) therapies are approved for many advanced solid tumors. Biomarkers beyond PD-L1 immunohistochemistry, microsatellite instability, and tumor mutation burden (TMB) may improve benefit prediction. Methods Using treatment data and genomic and transcriptomic tumor tissue profiling from an observational trial (NCT03061305), we developed Immunotherapy Response Score (IRS), a pan-tumor predictive model of PD-(L)1 benefit. IRS real-world progression free survival (rwPFS) and overall survival (OS) prediction was validated in an independent cohort of trial patients. Results Here, by Cox modeling, we develop IRS—which combines TMB with CD274, PDCD1, ADAM12 and TOP2A quantitative expression—to predict pembrolizumab rwPFS (648 patients; 26 tumor types; IRS-High or -Low groups). In the 248 patient validation cohort (248 patients; 24 tumor types; non-pembrolizumab PD-[L]1 monotherapy treatment), median rwPFS and OS are significantly longer in IRS-High vs. IRS-Low patients (rwPFS adjusted hazard ratio [aHR] 0.52, p = 0.003; OS aHR 0.49, p = 0.005); TMB alone does not significantly predict PD-(L)1 rwPFS nor OS. In 146 patients treated with systemic therapy prior to pembrolizumab monotherapy, pembrolizumab rwPFS is only significantly longer than immediately preceding therapy rwPFS in IRS-High patients (interaction test p = 0.001). In propensity matched lung cancer patients treated with first-line pembrolizumab monotherapy or pembrolizumab+chemotherapy, monotherapy rwPFS is significantly shorter in IRS-Low patients, but is not significantly different in IRS-High patients. Across 24,463 molecularly-evaluable trial patients, 7.6% of patients outside of monotherapy PD-(L)1 approved tumor types are IRS-High/TMB-Low. Conclusions The validated, predictive, pan-tumor IRS model can expand PD-(L)1 monotherapy benefit outside currently approved indications.
The lack of standardized methods for human phenotyping is a major obstacle in translational science. We have developed a bleeding history phenotyping system comprising an ontology, a questionnaire, a Web-based phenotype recording instrument (PRI), and a database. The ontology facilitates transparency, collaboration, aggregation of data, and data analysis. The integrated system allows investigators worldwide to use the PRI, add their de-identified data to the database, and query the aggregated data. Thus, this system can increase the power to detect genotype-phenotype-environment relationships and help new investigators begin their studies. We anticipate that this approach may be applicable to other disorders.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.