Nello Cristianini scite author profile

Pattern analysis 3 1.1 Patterns in data 4 1.2 Pattern analysis algorithms 12 1.3 Exploiting patterns 17 1.4 Summary 22 1.5 Further reading and advanced topics 23 2 Kernel methods: an overview 25 2.1 The overall picture 26 2.2 Linear regression in a feature space 27 2.3 Other examples 36 2.4 The modularity of kernel methods 42 2.5 Roadmap of the book 43 2.6 Summary 44 2.7 Further reading and advanced topics 45 3 Properties of kernels 47 3.1 Inner products and positive semi-definite matrices 48 3.2 Characterisation of kernels 60 3.3 The kernel matrix 68 3.4 Kernel construction 74 3.5 Summary 82 3.6 Further reading and advanced topics 82 4 Detecting stable patterns 85 4.1 Concentration inequalities 86 4.2 Capacity and regularisation: Rademacher theory 93 v vi Contents

show abstract

Support vector machine classification and validation of cancer tissue samples using microarray expression data

Furey¹,

Cristianini²,

Duffy³

et al. 2000

2,070

1,046

View full text Add to dashboard Cite

show abstract

CAFE: a computational tool for the study of gene family evolution

et al. 2006

View full text Add to dashboard Cite

Knowledge-based analysis of microarray gene expression data by using support vector machines

Brown

Grundy

Lin

et al. 2000

Proc. Natl. Acad. Sci. U.S.A.

1,948

1,013

View full text Add to dashboard Cite

We introduce a method of functionally classifying genes by using gene expression data from DNA microarray hybridization experiments. The method is based on the theory of support vector machines (SVMs). SVMs are considered a supervised computer learning method because they exploit prior knowledge of gene function to identify unknown genes of similar function from expression data. SVMs avoid several problems associated with unsupervised clustering methods, such as hierarchical clustering and self-organizing maps. SVMs have many mathematical features that make them attractive for gene expression analysis, including their flexibility in choosing a similarity function, sparseness of solution when dealing with large data sets, the ability to handle large feature spaces, and the ability to identify outliers. We test several SVMs that use different similarity metrics, as well as some other supervised learning methods, and find that the SVMs best identify sets of genes with a common function using expression data. Finally, we use SVMs to predict functional roles for uncharacterized yeast ORFs based on their expression data.

show abstract

A statistical framework for genomic data fusion

Lanckriet¹,

Bie²,

Cristianini³

et al. 2004

583

431

View full text Add to dashboard Cite

show abstract

Estimating the tempo and mode of gene family evolution from comparative genomic data

Hahn¹,

Bie²,

Stajich³

et al. 2005

Genome Res.

271

334

View full text Add to dashboard Cite

Comparison of whole genomes has revealed that changes in the size of gene families among organisms is quite common. However, there are as yet no models of gene family evolution that make it possible to estimate ancestral states or to infer upon which lineages gene families have contracted or expanded. In addition, large differences in family size have generally been attributed to the effects of natural selection, without a strong statistical basis for these conclusions. Here we use a model of stochastic birth and death for gene family evolution and show that it can be efficiently applied to multispecies genome comparisons. This model takes into account the lengths of branches on phylogenetic trees, as well as duplication and deletion rates, and hence provides expectations for divergence in gene family size among lineages. The model offers both the opportunity to identify large-scale patterns in genome evolution and the ability to make stronger inferences regarding the role of natural selection in gene family expansion or contraction. We apply our method to data from the genomes of five yeast species to show its applicability.

show abstract

On Kernel Target Alignment

Cristianini¹,

Kandola²,

Elisseeff³

et al.

188

341

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.