Contextualised word embeddings is a powerful tool to detect contextual synonyms. However, most of the current state-of-the-art (SOTA) deep learning concept extraction methods remain supervised and underexploit the potential of the context. In this paper, we propose a self-supervised pre-training approach which is able to detect contextual synonyms of concepts being training on the data created by shallow matching. We apply our methodology in the sparse multi-class setting (over 15,000 concepts) to extract phenotype information from electronic health records. We further investigate data augmentation techniques to address the problem of the class sparsity. Our approach achieves a new SOTA for the unsupervised phenotype concept annotation on clinical text on F1 and Recall outperforming the previous SOTA with a gain of up to 4.5 and 4.0 absolute points, respectively. After fine-tuning with as little as 20% of the labelled data, we also outperform BioBERT and ClinicalBERT. The extrinsic evaluation on three ICU benchmarks also shows the benefit of using the phenotypes annotated by our model as features.
Bayesian phylogenetic algorithms are computationally intensive. BEAST 1.10 inferences made use of the BEAGLE 3 high-performance library for efficient likelihood computations. The strategy allows phylogenetic inference and dating in current knowledge for SARS-CoV-2 transmission. Follow-up simulations on hybrid resources of Santos Dumont supercomputer using four phylogenomic data sets, we characterize the scaling performance behavior of BEAST 1.10. Our results provide insight into the species tree and MCMC chain length estimation, identifying preferable requirements to improve the use of high-performance computing resources. Ongoing steps involve analyzes of SARS-CoV-2 using BEAST 1.8 in multi-GPUs.
Este artigo trata do LP Todos os olhos, de Tom Zé, considerado pela crítica especializada um dos discos mais críticos e mais polêmicos da carreira do artista. Lançado em 1973, dentro de um contexto de recrudescimento da repressão e da censura pela ditadura militar brasileira, e da consolidação da indústria cultural no país, investigamos de que maneira o disco traz marcas de tal conjuntura social, política e econômica em seu projeto estético. Verificamos de que modo os diversos aspectos constitutivos do LP (capas, letras, arranjos, sonoridades) compõem um objeto-disco, com uma unidade temática centrada na figura do anti-herói, suas imperfeições e insuficiências. Observamos que no conjunto de fonogramas prevalecem as fraquezas e os defeitos em detrimento das virtudes e qualidades, expressando, de certa maneira, um contra discurso ideológico frente aos padrões culturais, estéticos e políticos difundidos no regime militar.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.