Tor-Kristian Jenssen scite author profile

We have carried out automated extraction of explicit and implicit biomedical knowledge from publicly available gene and text databases to create a gene-to-gene co-citation network for 13,712 named human genes by automated analysis of titles and abstracts in over 10 million MEDLINE records. The associations between genes have been annotated by linking genes to terms from the medical subject heading (MeSH) index and terms from the gene ontology (GO) database. The extracted database and accompanying web tools for gene-expression analysis have collectively been named 'PubGene'. We validated the extracted networks by three large-scale experiments showing that co-occurrence reflects biologically meaningful relationships, thus providing an approach to extract and structure known biology. We validated the applicability of the tools by analyzing two publicly available microarray data sets.
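The co-occurrence idea in this abstract can be illustrated with a minimal sketch. It assumes that a gene counts as mentioned when its symbol appears as an exact token in a record's text. The abstract does not say how PubGene matches gene names, so the matching rule, the function name `cooccurrence_counts`, and the toy records below are all illustrative assumptions, not the authors' method.

```python
from collections import Counter
from itertools import combinations
import re


def cooccurrence_counts(records, gene_symbols):
    """Count, for each unordered gene pair, how many records mention both genes."""
    symbols = set(gene_symbols)
    counts = Counter()
    for text in records:
        # Assumption: exact, case-sensitive token match against the symbol list.
        tokens = set(re.findall(r"[A-Za-z0-9]+", text))
        present = sorted(symbols & tokens)
        for pair in combinations(present, 2):
            counts[pair] += 1
    return counts


# Invented toy records standing in for MEDLINE titles and abstracts.
records = [
    "TP53 and MDM2 interact in the DNA damage response.",
    "MDM2 regulates TP53 stability.",
    "BRCA1 mutations in breast cancer.",
    "BRCA1 and TP53 in tumour suppression.",
]
genes = ["TP53", "MDM2", "BRCA1"]
network = cooccurrence_counts(records, genes)
```

Each key of `network` is an edge of the gene-to-gene network and its value is the number of records supporting that edge. A real system would also need to handle gene synonyms and ambiguous symbols.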

A primer on gene expression and microarrays for machine learning researchers

Kuo

Kim

Trimarchi

et al. 2004

Journal of Biomedical Informatics

The Human Genomic Melting Map

et al. 2005

In a living cell, the antiparallel double-stranded helix of DNA is a dynamically changing structure. The structure relates to interactions between and within the DNA strands, and the array of other macromolecules that constitutes functional chromatin. It is only through its changing conformations that DNA can organize and structure a large number of cellular functions. In particular, DNA must locally uncoil, or melt, and become single-stranded for DNA replication, repair, recombination, and transcription to occur. It has previously been shown that this melting occurs cooperatively, whereby several base pairs act in concert to generate melting bubbles, and in this way constitute a domain that behaves as a unit with respect to local DNA single-strandedness. We have applied a melting map calculation to the complete human genome, which provides information about the propensities of forming local bubbles determined from the whole sequence, and present a first report on its basic features, the extent of cooperativity, and correlations to various physical and biological features of the human genome. Globally, the melting map covaries very strongly with GC content. Most importantly, however, cooperativity of DNA denaturation causes this correlation to be weaker at resolutions fewer than 500 bps. This is also the resolution level at which most structural and biological processes occur, signifying the importance of the informational content inherent in the genomic melting map. The human DNA melting map may be further explored at http://meltmap.uio.no.

Methods for Large-Scale Mining of Networks of Human Genes

Jenssen

Öberg

Andersson

et al. 2001

In molecular biology there is much interest in various types of relationships between genes. Due to the complexity and rapid development of this field, much of this knowledge exists only in free-text form. A database of relationships between genes may allow background knowledge to be used in computerised analyses. As far as we know, no comprehensive manually curated database of this kind exists, and constructing and maintaining such a database manually would be very labour-intensive. Efficient automated methods for extracting and structuring relationships between genes from free text would be valuable. A database named PubGene has previously been created; it contains a comprehensive network of human genes built by automated extraction of co-occurrences of gene terms in over 10 million MEDLINE records. Co-occurring genes were linked together under the hypothesis that two genes will co-occur only if they have some biological relationship. In this paper, we show that for the subset of human genes encoding enzymes, pairs of co-occurring enzyme genes are significantly more closely related biologically than randomly paired enzyme genes. Manual inspection, however, shows that some of the links in PubGene are not correct, and it also indicates how the noise can be reduced. We propose a complementary method for automated extraction of relationships between genes using information from the Science Citation Index (SCI) database. We relate two genes if they have been co-referred, that is, if their reference articles are co-cited in a third article. The alternative approach confirms relationships found in PubGene, and it also finds other relevant relationships.
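The co-reference rule described in this abstract can be sketched as follows. This is a minimal sketch that assumes each gene has a known set of reference articles and each citing article has a known reference list. The data layout, the function name `co_referred_pairs`, and the placeholder identifiers are hypothetical. The abstract does not describe how the SCI data are actually represented or queried.

```python
from collections import defaultdict
from itertools import combinations


def co_referred_pairs(gene_refs, citing_articles):
    """Map each gene pair to the citing articles that cite a reference for both genes."""
    # Invert gene -> reference articles into reference article -> genes.
    ref_to_genes = defaultdict(set)
    for gene, refs in gene_refs.items():
        for ref in refs:
            ref_to_genes[ref].add(gene)

    links = defaultdict(set)
    for article, cited in citing_articles.items():
        genes = set()
        for ref in cited:
            genes |= ref_to_genes.get(ref, set())
        # Every pair of genes reached through this article's reference list is linked.
        for pair in combinations(sorted(genes), 2):
            links[pair].add(article)
    return dict(links)


# Hypothetical toy data: each gene has one reference article of its own.
gene_refs = {"GENE_A": {"ref1"}, "GENE_B": {"ref2"}, "GENE_C": {"ref3"}}
citing_articles = {
    "paperX": ["ref1", "ref2"],
    "paperY": ["ref3"],
    "paperZ": ["ref2", "ref3", "ref9"],
}
links = co_referred_pairs(gene_refs, citing_articles)
```

In this toy data each reference article belongs to exactly one gene. If a single reference article were assigned to two genes, this sketch would link them through any article citing it, which is looser than requiring two distinct co-cited references.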

FigSearch: a figure legend indexing and classification system

Liu

Jenssen

Nygaard

et al. 2004
