Haithem Afli scite author profile

BackgroundSARS-CoV-2 is an RNA virus causing COVID-19. The clinical characteristics and epidemiology of COVID-19 have been extensively investigated, however, only one study so far focused on the patient’s nasopharynx microbiota. In this study we investigated the nasopharynx microbial community of patients that developed different severity levels of COVID-19. We performed 16S ribosomal DNA sequencing from nasopharyngeal swab samples obtained from SARS-CoV-2 positive (56) and negative (18) patients in the province of Alicante (Spain) in their first visit to the hospital. Positive SARS-CoV-2 patients were observed and later categorized in mild (symptomatic without hospitalization), moderate (hospitalization), and severe (admission to ICU). We compared the microbiota diversity and OTU composition among severity groups and built bacterial co-abundance networks for each group.ResultsStatistical analysis indicated differences in the nasopharyngeal microbiome of COVID19 patients. 62 OTUs were found exclusively in SARS-CoV-2 positive patients, mostly classified as members of the phylum Bacteroidota (18) and Firmicutes (25). OTUs classified as Prevotella were found to be significantly more abundant in patients that developed more severe COVID-19. Furthermore, co-abundance analysis indicated a loss of network complexity among samples from patients that later developed more severe symptoms.ConclusionOur study shows that the nasopharyngeal microbiome of COVID-19 patients showed differences in the composition of specific OTUs and complexity of co-abundance networks. Taxa with differential abundances among groups could serve as biomarkers for COVID-19 severity. Nevertheless, further studies with larger sample sizes should be conducted to validate these results.

show abstract

Maintaining Sentiment Polarity in Translation of User-Generated Content

Lohar¹,

Afli²,

Way³

2017

View full text Add to dashboard Cite

The advent of social media has shaken the very foundations of how we share information, with Twitter, Facebook, and Linkedin among many well-known social networking platforms that facilitate information generation and distribution. However, the maximum 140-character restriction in Twitter encourages users to (sometimes deliberately) write somewhat informally in most cases. As a result, machine translation (MT) of user-generated content (UGC) becomes much more difficult for such noisy texts. In addition to translation quality being affected, this phenomenon may also negatively impact sentiment preservation in the translation process. That is, a sentence with positive sentiment in the source language may be translated into a sentence with negative or neutral sentiment in the target language. In this paper, we analyse both sentiment preservation and MT quality per se in the context of UGC, focusing especially on whether sentiment classification helps improve sentiment preservation in MT of UGC. We build four different experimental setups for tweet translation (i) using a single MT model trained on the whole Twitter parallel corpus, (ii) using multiple MT models based on sentiment classification, (iii) using MT models including additional out-of-domain data, and (iv) adding MT models based on the phrase-table fill-up method to accompany the sentiment translation models with an aim of improving MT quality and at the same time maintaining sentiment polarity preservation. Our empirical evaluation shows that despite a slight deterioration in MT quality, our system significantly outperforms the Baseline MT system (without using sentiment classification) in terms of sentiment preservation. We also demonstrate that using an MT engine that conveys a sentiment different from that of the UGC can even worsen both the translation quality and sentiment preservation.

show abstract

FaDA: Fast Document Aligner using Word Embedding

Lohar

Ganguly

Afli

et al. 2016

View full text Add to dashboard Cite

FaDA is a free/open-source tool for aligning multilingual documents. It employs a novel crosslingual information retrieval (CLIR)-based document-alignment algorithm involving the distances between embedded word vectors in combination with the word overlap between the source-language and the target-language documents. In this approach, we initially construct a pseudo-query from a source-language document. We then represent the target-language documents and the pseudo-query as word vectors to find the average similarity measure between them. This word vector-based similarity measure is then combined with the term overlap-based similarity. Our initial experiments show that s standard Statistical Machine Translation (SMT)- based approach is outperformed by our CLIR-based approach in finding the correct alignment pairs. In addition to this, subsequent experiments with the word vector-based method show further improvements in the performance of the system.

show abstract

Performance and Information Leakage in Splitfed Learning and Multi-Head Split Learning in Healthcare Data and Beyond

Joshi

Thapa

Camtepe

et al. 2022

MPs

View full text Add to dashboard Cite

Machine learning (ML) in healthcare data analytics is attracting much attention because of the unprecedented power of ML to extract knowledge that improves the decision-making process. At the same time, laws and ethics codes drafted by countries to govern healthcare data are becoming stringent. Although healthcare practitioners are struggling with an enforced governance framework, we see the emergence of distributed learning-based frameworks disrupting traditional-ML-model development. Splitfed learning (SFL) is one of the recent developments in distributed machine learning that empowers healthcare practitioners to preserve the privacy of input data and enables them to train ML models. However, SFL has some extra communication and computation overheads at the client side due to the requirement of client-side model synchronization. For a resource-constrained client side (hospitals with limited computational powers), removing such conditions is required to gain efficiency in the learning. In this regard, this paper studies SFL without client-side model synchronization. The resulting architecture is known as multi-head split learning (MHSL). At the same time, it is important to investigate information leakage, which indicates how much information is gained by the server related to the raw data directly out of the smashed data—the output of the client-side model portion—passed to it by the client. Our empirical studies examine the Resnet-18 and Conv1-D architecture model on the ECG and HAM-10000 datasets under IID data distribution. The results find that SFL provides 1.81% and 2.36% better accuracy than MHSL on the ECG and HAM-10000 datasets, respectively (for cut-layer value set to 1). Analysis of experimentation with various client-side model portions demonstrates that it has an impact on the overall performance. With an increase in layers in the client-side model portion, SFL performance improves while MHSL performance degrades. Experiment results also demonstrate that information leakage provided by mutual information score values in SFL is more than MHSL for ECG and HAM-10000 datasets by 2×10−5 and 4×10−3, respectively.

show abstract

Nasopharyngeal microbial communities of patients infected with SARS-COV-2 that developed COVID-19

Ventero

Cuadrat

Vidal

et al. 2020

Preprint

View full text Add to dashboard Cite

BackgroundSARS-CoV-2 is an RNA virus causing COVID-19. The clinical characteristics and epidemiology of COVID-19 have been extensively investigated, however studies focused on the patient’s microbiota are still lacking. In this study, we investigated the nasopharyngeal microbiome composition of patients who developed different severity levels of COVID-19. We performed Rdna-SSU (16S) sequencing from nasopharyngeal swab samples obtained from SARS-CoV-2 positive (56) and negative (18) patients in the province of Alicante (Spain) in their first visit to the hospital. Positive SARS-CoV-2 patients were observed and later categorized in mild (symptomatic without hospitalization), moderate (hospitalization) and severe (admission to ICU). We compared the microbiome diversity and OTU composition among severity groups using Similarity Percentage (SIMPER) analysis and Maaslin2. We also built bacterial co-abundance networks for each group using Fastpar.ResultsStatistical analysis indicated differences in the nasopharyngeal microbiome of COVID19 patients. 62 OTUs were found exclusively in SARS-CoV-2 positive patients, mostly classified as members of the phylum Bacteroidetes (18) and Firmicutes (25). OTUs classified as Prevotella were found to be significantly more abundant in patients that developed more severe COVID-19. Furthemore, co-abundance analysis indicated a loss of network complexity among samples from patients that later developed more severe symptoms.ConclusionsOur preliminary study shows that the nasopharyngeal microbiome of COVID-19 patients showed differences in the composition of specific OTUs and complexity of co-abundance networks. These microbes with differential abundances among groups could serve as biomarkers for COVID-19 severity. Nevertheless, further studies with larger sample sizes should be conducted to validate these results.IMPORTANCEThis work has studied the microbiota of the nasopharyngeal tract in COVID19 patients using advanced techniques of molecular microbiology. Diverse microorganisms, most of which are harmless or even beneficial to the host, colonize the nasopharyngeal tract. These microorganisms are the microbiota, and they are present in every people. However, changes in this microbiota could be related to different diseases as cancer, gastrointestinal pathologies or even COVID19. This study has been performed to investigate the microbiota from patients with COVID19, in order to determinate its implication in the pathology severity. The results obtained showed that it is possible that several specific microorganisms are present only in patients with severe COVID19. These data, could be used as a prognostic biomarker to early detect whose patients will develop a severe COVID19 and improve their clinical management.

show abstract

Building and using multimodal comparable corpora for machine translation

Afli¹,

Barrault²,

Schwenk³

2016

Nat. Lang. Eng.

View full text Add to dashboard Cite

In recent decades, statistical approaches have significantly advanced the development of machine translation systems. However, the applicability of these methods directly depends on the availability of very large quantities of parallel data. Recent works have demonstrated that a comparable corpus can compensate for the shortage of parallel corpora. In this paper, we propose an alternative to comparable corpora containing text documents as resources for extracting parallel data: a multimodal comparable corpus with audio documents in source language and text document in target language, built fromEuronewsandTEDweb sites. The audio is transcribed by an automatic speech recognition system, and translated with a baseline statistical machine translation system. We then use information retrieval in a large text corpus in the target language in order to extract parallel sentences/phrases. We evaluate the quality of the extracted data on an English to French translation task and show significant improvements over a state-of-the-art baseline.

show abstract

Stool and Ruminal Microbiome Components Associated With Methane Emission and Feed Efficiency in Nelore Beef Cattle

et al. 2022

View full text Add to dashboard Cite

Background: The impact of extreme changes in weather patterns on the economy and human welfare is one of the biggest challenges our civilization faces. From anthropogenic contributions to climate change, reducing the impact of farming activities is a priority since it is responsible for up to 18% of global greenhouse gas emissions. To this end, we tested whether ruminal and stool microbiome components could be used as biomarkers for methane emission and feed efficiency in bovine by studying 52 Brazilian Nelore bulls belonging to two feed intervention treatment groups, that is, conventional and by-product-based diets.Results: We identified a total of 5,693 amplicon sequence variants (ASVs) in the Nelore bulls’ microbiomes. A Differential abundance analysis with the ANCOM approach identified 30 bacterial and 15 archaeal ASVs as differentially abundant (DA) among treatment groups. An association analysis using Maaslin2 software and a linear mixed model indicated that bacterial ASVs are linked to the host’s residual methane emission (RCH4) and residual feed intake (RFI) phenotype variation, suggesting their potential as targets for interventions or biomarkers.Conclusion: The feed composition induced significant differences in both abundance and richness of ruminal and stool microbial populations in ruminants of the Nelore breed. The industrial by-product-based dietary treatment applied to our experimental groups influenced the microbiome diversity of bacteria and archaea but not of protozoa. ASVs were associated with RCH4 emission and RFI in ruminal and stool microbiomes. While ruminal ASVs were expected to influence CH4 emission and RFI, the relationship of stool taxa, such as Alistipes and Rikenellaceae (gut group RC9), with these traits was not reported before and might be associated with host health due to their link to anti-inflammatory compounds. Overall, the ASVs associated here have the potential to be used as biomarkers for these complex phenotypes.

show abstract

Putative mobilized colistin resistance genes in the human gut microbiome

et al. 2021

View full text Add to dashboard Cite

Background The high incidence of bacterial genes that confer resistance to last-resort antibiotics, such as colistin, caused by mobilized colistin resistance (mcr) genes, poses an unprecedented threat to human health. Understanding the spread, evolution, and distribution of such genes among human populations will help in the development of strategies to diminish their occurrence. To tackle this problem, we investigated the distribution and prevalence of potential mcr genes in the human gut microbiome using a set of bioinformatics tools to screen the Unified Human Gastrointestinal Genome (UHGG) collection for the presence, synteny and phylogeny of putative mcr genes, and co-located antibiotic resistance genes. Results A total of 2079 antibiotic resistance genes (ARGs) were classified as mcr genes in 2046 metagenome assembled genomes (MAGs), distributed across 1596 individuals from 41 countries, of which 215 were identified in plasmidial contigs. The genera that presented the largest number of mcr-like genes were Suterella and Parasuterella. Other potential pathogens carrying mcr genes belonged to the genus Vibrio, Escherichia and Campylobacter. Finally, we identified a total of 22,746 ARGs belonging to 21 different classes in the same 2046 MAGs, suggesting multi-resistance potential in the corresponding bacterial strains, increasing the concern of ARGs impact in the clinical settings. Conclusion This study uncovers the diversity of mcr-like genes in the human gut microbiome. We demonstrated the cosmopolitan distribution of these genes in individuals worldwide and the co-presence of other antibiotic resistance genes, including Extended-spectrum Beta-Lactamases (ESBL). Also, we described mcr-like genes fused to a PAP2-like domain in S. wadsworthensis. These novel sequences increase our knowledge about the diversity and evolution of mcr-like genes. Future research should focus on activity, genetic mobility and a potential colistin resistance in the corresponding strains to experimentally validate those findings.

show abstract

12 3 4

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Haithem Afli

Nasopharyngeal Microbial Communities of Patients Infected With SARS-CoV-2 That Developed COVID-19

Maintaining Sentiment Polarity in Translation of User-Generated Content

FaDA: Fast Document Aligner using Word Embedding

Performance and Information Leakage in Splitfed Learning and Multi-Head Split Learning in Healthcare Data and Beyond

Nasopharyngeal microbial communities of patients infected with SARS-COV-2 that developed COVID-19

Building and using multimodal comparable corpora for machine translation

Stool and Ruminal Microbiome Components Associated With Methane Emission and Feed Efficiency in Nelore Beef Cattle

Putative mobilized colistin resistance genes in the human gut microbiome

Contact Info

Product

Resources

About