Yuan Gong scite author profile

In the past decade, convolutional neural networks (CNNs) have been widely adopted as the main building block for endto-end audio classification models, which aim to learn a direct mapping from audio spectrograms to corresponding labels. To better capture long-range global context, a recent trend is to add a self-attention mechanism on top of the CNN, forming a CNN-attention hybrid model. However, it is unclear whether the reliance on a CNN is necessary, and if neural networks purely based on attention are sufficient to obtain good performance in audio classification. In this paper, we answer the question by introducing the Audio Spectrogram Transformer (AST), the first convolution-free, purely attention-based model for audio classification. We evaluate AST on various audio classification benchmarks, where it achieves new state-of-the-art results of 0.485 mAP on AudioSet, 95.6% accuracy on ESC-50, and 98.1% accuracy on Speech Commands V2.

show abstract

AST: Audio Spectrogram Transformer

Gong¹,

Chung²,

Glass³

2021

Preprint

View full text Add to dashboard Cite

Massively parallel detection of gene expression in single cells using subnanolitre wells

2010

View full text Add to dashboard Cite

The relationship between the expression of particular genes in cells and their impact on phenotypic characteristics is important for understanding how cells regulate responses to their environment. We have developed a microwell-based method to detect copies of mRNA transcripts directly from individual cells by one-step, single-cell, reverse transcription polymerase chain reaction (RT-PCR). Our approach permits the detection of mRNA transcripts of interest for more than 6000 single cells in parallel per assay with high sensitivity and specificity for constitutively active genes. This simple method was also combined with microengraving and image-based cytometry to examine the relationships between gene expression and cellular secretion of antibodies in a clonal population. We observed that most individual human B cell hybridomas transcribed a requisite gene for their antibodies, but only a subset of those cells secreted the antibody. The technique should also allow the detection of replicating intracellular pathogens such as retroviruses.

show abstract

Immuno-Hybridization Chain Reaction for Enhancing Detection of Individual Cytokine-Secreting Human Peripheral Mononuclear Cells

et al. 2011

View full text Add to dashboard Cite

We present here a new method to enhance the detection of secreted cytokines and chemokines from single human mononuclear cells. The technique uses a hybridization chain reaction (HCR) to amplify signals resulting from sandwich immunoassays. This immuno-HCR employs oligonucleotide-based initiators covalently linked to antibodies to propagate a chain reaction of hybridization events involving a pair of complementary hairpin oligomers bearing fluorescent labels. Integrating this strategy for signal amplification with microengraving—a soft lithographic method for printing arrays of secreted proteins from thousands of single cells—improves both the limits of detection and sensitivity for cytokines and chemokines captured from individual cells by an average of 200-fold relative to methods for direct detection by fluoresence. This approach should enhance the utility of microengraving for defining the immunological signatures of diseases and responses to interventional therapies based on multiplexed single-cell analysis.

show abstract

PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation

Gong¹,

Chung²,

Glass³

2021

IEEE/ACM Trans. Audio Speech Lang. Process.

View full text Add to dashboard Cite

Audio event classification is an active research area and has a wide range of applications. Since the release of AudioSet, great progress has been made in advancing the classification accuracy, which mostly comes from the development of novel model architectures and attention modules. However, we find that appropriate training techniques are equally important for building audio event classification models with AudioSet, but have not received the attention they deserve. To fill the gap, in this work, we present PSLA, a collection of training techniques that can noticeably boost the model accuracy including ImageNet pretraining, balanced sampling, data augmentation, label enhancement, model aggregation and their design choices.By training an EfficientNet with these techniques, we obtain a model that achieves a new state-of-the-art mean average precision (mAP) of 0.474 on AudioSet, outperforming the previous best system of 0.439.

show abstract

Topic Modeling Based Multi-modal Depression Detection

Gong

Poellabauer

2017

View full text Add to dashboard Cite

Major depressive disorder is a common mental disorder that affects almost 7% of the adult U.S. population. e 2017 Audio/Visual Emotion Challenge (AVEC) asks participants to build a model to predict depression levels based on the audio, video, and text of an interview ranging between 7-33 minutes. Since averaging features over the entire interview will lose most temporal information, how to discover, capture, and preserve useful temporal details for such a long interview are significant challenges. erefore, we propose a novel topic modeling based approach to perform context-aware analysis of the recording. Our experiments show that the proposed approach outperforms context-unaware methods and the challenge baselines for all metrics.

show abstract

A hepatocyte differentiation model reveals two subtypes of liver cancer with different oncofetal properties and therapeutic targets

Liu

Yan

Sun

et al. 2020

Proc. Natl. Acad. Sci. U.S.A.

View full text Add to dashboard Cite

Clinical observation of the association between cancer aggressiveness and embryonic development stage implies the importance of developmental signals in cancer initiation and therapeutic resistance. However, the dynamic gene expression during organogenesis and the master oncofetal drivers are still unclear, which impeded the efficient elimination of poor prognostic tumors, including human hepatocellular carcinoma (HCC). In this study, human embryonic stem cells were induced to differentiate into adult hepatocytes along hepatic lineages to mimic liver development in vitro. Combining transcriptomic data from liver cancer patients with the hepatocyte differentiation model, the active genes derived from different hepatic developmental stages and the tumor tissues were selected. Bioinformatic analysis followed by experimental assays was used to validate the tumor subtype-specific oncofetal signatures and potential therapeutic values. Hierarchical clustering analysis revealed the existence of two subtypes of liver cancer with different oncofetal properties. The gene signatures and their clinical significance were further validated in an independent clinical cohort and The Cancer Genome Atlas database. Upstream activator analysis and functional screening further identified E2F1 and SMAD3 as master transcriptional regulators. Small-molecule inhibitors specifically targeting the oncofetal drivers extensively down-regulated subtype-specific developmental signaling and inhibited tumorigenicity. Liver cancer cells and primary HCC tumors with different oncofetal properties also showed selective vulnerability to their specific inhibitors. Further precise targeting of the tumor initiating steps and driving events according to subtype-specific biomarkers might eliminate tumor progression and provide novel therapeutic strategy.

show abstract

Effects of methylprednisolone use on viral genomic nucleic acid negative conversion and CT imaging lesion absorption in COVID‐19 patients under 50 years old

Gong

Guan

Zhu

et al. 2020

Journal of Medical Virology

View full text Add to dashboard Cite

The use of corticosteroids has been controversial in viral pneumonia. In most cases, application of methylprednisolone in severe and critical viral pneumonia patients can quickly alleviate the symptoms of dyspnea and prevent disease progression. However, some scholars have confirmed that corticosteroids delayed the body's clearance of the virus. In our retrospective non-randomized study, 34 patients under 50 years old and diagnosed with coronavirus disease 2019 (COVID-19) were included. According to the given methylprednisolone treatment (n = 18) or not (n = 16), they were separated into two groups. By comparing the clinical data we concluded that corticosteroids therapy can effectively release COVID-19 symptoms such as persistent fever and difficult in breathing, improve oxygenation, and prevent disease progression. However, it can prolong the negative conversion of nucleic acids.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yuan Gong

AST: Audio Spectrogram Transformer

AST: Audio Spectrogram Transformer

Massively parallel detection of gene expression in single cells using subnanolitre wells

Immuno-Hybridization Chain Reaction for Enhancing Detection of Individual Cytokine-Secreting Human Peripheral Mononuclear Cells

PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation

Topic Modeling Based Multi-modal Depression Detection

A hepatocyte differentiation model reveals two subtypes of liver cancer with different oncofetal properties and therapeutic targets

Effects of methylprednisolone use on viral genomic nucleic acid negative conversion and CT imaging lesion absorption in COVID‐19 patients under 50 years old

Contact Info

Product

Resources

About