In the past decade, convolutional neural networks (CNNs) have been widely adopted as the main building block for end-to-end audio classification models, which aim to learn a direct mapping from audio spectrograms to corresponding labels. To better capture long-range global context, a recent trend is to add a self-attention mechanism on top of the CNN, forming a CNN-attention hybrid model. However, it is unclear whether the reliance on a CNN is necessary, and if neural networks purely based on attention are sufficient to obtain good performance in audio classification. In this paper, we answer the question by introducing the Audio Spectrogram Transformer (AST), the first convolution-free, purely attention-based model for audio classification. We evaluate AST on various audio classification benchmarks, where it achieves new state-of-the-art results of 0.485 mAP on AudioSet, 95.6% accuracy on ESC-50, and 98.1% accuracy on Speech Commands V2.
The relationship between the expression of particular genes in cells and their impact on phenotypic characteristics is important for understanding how cells regulate responses to their environment. We have developed a microwell-based method to detect copies of mRNA transcripts directly from individual cells by one-step, single-cell, reverse transcription polymerase chain reaction (RT-PCR). Our approach permits the detection of mRNA transcripts of interest for more than 6000 single cells in parallel per assay with high sensitivity and specificity for constitutively active genes. This simple method was also combined with microengraving and image-based cytometry to examine the relationships between gene expression and cellular secretion of antibodies in a clonal population. We observed that most individual human B cell hybridomas transcribed a requisite gene for their antibodies, but only a subset of those cells secreted the antibody. The technique should also allow the detection of replicating intracellular pathogens such as retroviruses.
We present here a new method to enhance the detection of secreted cytokines and chemokines from single human mononuclear cells. The technique uses a hybridization chain reaction (HCR) to amplify signals resulting from sandwich immunoassays. This immuno-HCR employs oligonucleotide-based initiators covalently linked to antibodies to propagate a chain reaction of hybridization events involving a pair of complementary hairpin oligomers bearing fluorescent labels. Integrating this strategy for signal amplification with microengraving (a soft lithographic method for printing arrays of secreted proteins from thousands of single cells) improves both the limits of detection and sensitivity for cytokines and chemokines captured from individual cells by an average of 200-fold relative to methods for direct detection by fluorescence. This approach should enhance the utility of microengraving for defining the immunological signatures of diseases and responses to interventional therapies based on multiplexed single-cell analysis.
Audio event classification is an active research area and has a wide range of applications. Since the release of AudioSet, great progress has been made in advancing the classification accuracy, which mostly comes from the development of novel model architectures and attention modules. However, we find that appropriate training techniques are equally important for building audio event classification models with AudioSet, but have not received the attention they deserve. To fill the gap, in this work, we present PSLA, a collection of training techniques that can noticeably boost model accuracy, including ImageNet pretraining, balanced sampling, data augmentation, label enhancement, model aggregation and their design choices. By training an EfficientNet with these techniques, we obtain a model that achieves a new state-of-the-art mean average precision (mAP) of 0.474 on AudioSet, outperforming the previous best system of 0.439.
Major depressive disorder is a common mental disorder that affects almost 7% of the adult U.S. population. The 2017 Audio/Visual Emotion Challenge (AVEC) asks participants to build a model to predict depression levels based on the audio, video, and text of an interview ranging between 7-33 minutes. Since averaging features over the entire interview will lose most temporal information, how to discover, capture, and preserve useful temporal details for such a long interview are significant challenges. Therefore, we propose a novel topic modeling based approach to perform context-aware analysis of the recording. Our experiments show that the proposed approach outperforms context-unaware methods and the challenge baselines for all metrics.
Clinical observation of the association between cancer aggressiveness and embryonic development stage implies the importance of developmental signals in cancer initiation and therapeutic resistance. However, the dynamic gene expression during organogenesis and the master oncofetal drivers are still unclear, which impedes the efficient elimination of poor prognostic tumors, including human hepatocellular carcinoma (HCC). In this study, human embryonic stem cells were induced to differentiate into adult hepatocytes along hepatic lineages to mimic liver development in vitro. Combining transcriptomic data from liver cancer patients with the hepatocyte differentiation model, the active genes derived from different hepatic developmental stages and the tumor tissues were selected. Bioinformatic analysis followed by experimental assays was used to validate the tumor subtype-specific oncofetal signatures and potential therapeutic values. Hierarchical clustering analysis revealed the existence of two subtypes of liver cancer with different oncofetal properties. The gene signatures and their clinical significance were further validated in an independent clinical cohort and The Cancer Genome Atlas database. Upstream activator analysis and functional screening further identified E2F1 and SMAD3 as master transcriptional regulators. Small-molecule inhibitors specifically targeting the oncofetal drivers extensively down-regulated subtype-specific developmental signaling and inhibited tumorigenicity. Liver cancer cells and primary HCC tumors with different oncofetal properties also showed selective vulnerability to their specific inhibitors. Further precise targeting of the tumor initiating steps and driving events according to subtype-specific biomarkers might eliminate tumor progression and provide a novel therapeutic strategy.
The use of corticosteroids has been controversial in viral pneumonia. In most cases, application of methylprednisolone in severe and critical viral pneumonia patients can quickly alleviate the symptoms of dyspnea and prevent disease progression. However, some scholars have confirmed that corticosteroids delay the body's clearance of the virus. In our retrospective non-randomized study, 34 patients under 50 years old and diagnosed with coronavirus disease 2019 (COVID-19) were included. They were separated into two groups according to whether they received methylprednisolone treatment (n = 18) or not (n = 16). By comparing the clinical data, we concluded that corticosteroid therapy can effectively relieve COVID-19 symptoms such as persistent fever and difficulty in breathing, improve oxygenation, and prevent disease progression. However, it can prolong the time to negative conversion of nucleic acids.