Performance of genetic programming optimised Bowtie2 on genome comparison and analytic testing (GCAT) benchmarks

Langdon, William B.

doi:10.1186/s13040-014-0034-0

Cited by 403 publications

(308 citation statements)

References 19 publications

(15 reference statements)

Supporting

Mentioning

292

Contrasting

Unclassified

“…Subsequent bioinformatics analyses were performed with clean reads according to the following pipeline: clean reads were aligned to the A. thaliana reference genome by Tophat [56], the mapped reads were manipulated to BAM files by SAMtools [57], then calculated the gene expression level by HTseq [58]. Differentially expressed genes were acquired by DESeq2 [59]; the unmapped BAM files were converted to Fastq files via bedtools and aligned to virus reference genome by Bowtie 2 [60].…”

Section: Methods

mentioning

confidence: 99%

Analyses of RNA-Seq and sRNA-Seq data reveal a complex network of anti-viral defense in TCV-infected Arabidopsis thaliana

Guo et al. 2016, Sci Rep

In order to identify specific plant anti-viral genes related to the miRNA regulatory pathway, RNA-Seq and sRNA-Seq were performed using Arabidopsis WT and dcl1-9 mutant line. A total of 5,204 DEGs were identified in TCV-infected WT plants. In contrast, only 595 DEGs were obtained in the infected dcl1-9 mutant plants. GO enrichment analysis of the shared DEGs and dcl1-9 unique DEGs showed that a wide range of biological processes were affected in the infected WT plants. In addition, miRNAs displayed different patterns between mock and infected WT plants. This is the first global view of dcl1-9 transcriptome which provides TCV responsive miRNAs data. In conclusion, our results indicated the significance of DCL1 and suggested that PPR genes may play an important role in plant anti-viral defense.

“…If we do not encourage diversity, we may end up with a population where the majority of programs are very similar to the seed. New generations of individuals are

Table 1: Feature Comparison of Improvement Approaches.
Approach | Representation | Improvement | Fitness Metric
locoGP | Java (AST) | Performance | Bytecode Operations
Langdon [17], Petke [28] | C++ (Statement) | Performance, Specialisation | Line Count
Arcuri [2], White [41] | Java-like (AST) | Performance | Simulated CPU Cycle
Walsh & Ryan (Paragen) [30,31], Parallelisation Instructions Parallel Programs Functionality Chennupati (MCGE) [4]
Orlov (FINCH) [25] | Java (Bytecode) | Functionality | Error Count
Castle [3] | Java-like (AST) | Functionality | Error Count
O'Cinnéide [5], Simons [33] | Java (Refactoring Patterns) | Quality (e.g. elegance) | Software Metrics…”

Section: locoGP

mentioning

confidence: 99%

“…As Java is a widely used general purpose language, the ability to automatically improve existing Java programs is of wide interest. In this context, GP is a good approach for exploring the implicit effects of source code changes on performance [17,42].…”

Section: Introduction

mentioning

confidence: 99%

locoGP

Cody-Kenny, Galván, Barrett 2015, Proceedings of the Companion Publication of the 2015 Annual Conference on Genetic and Evolutionary Computation

We present locoGP, a Genetic Programming (GP) system written in Java for evolving Java source code. locoGP was designed to improve the performance of programs as measured in the number of operations executed. Variable test cases are used to maintain functional correctness during evolution. The operation of locoGP is demonstrated on a number of typically constructed "off-the-shelf" hand-written implementations of sort and prefix-code programs. locoGP was able to find improvement opportunities in all test problems.

“…However, optimizing attributes like execution time, memory consumption and power consumption is generally considered an improvement of a non-functional property which spans another big part of the GI literature. Of those attributes, execution time seems to be very popular, with Langdon's work on the 50k line DNA sequencing tool Bowtie [18,20] possibly the best known. Langdon has also reported 100 fold speed-up of another DNA sequencing tool BarraCUDA [17,19,21-23] and the GI improvements have now been included in the official release.…”

Section: Related Work

mentioning

confidence: 99%

Exploring Fitness and Edit Distance of Mutated Python Programs

Haraldsson, Woodward, Brownlee et al. 2017, Lecture Notes in Computer Science

Genetic Improvement (GI) is the process of using computational search techniques to improve existing software, e.g. in terms of execution time, power consumption or correctness. As in most heuristic search algorithms, the search is guided by fitness, with GI searching the space of program variants of the original software. The relationship between the program space and fitness is seldom simple and often quite difficult to analyse. This paper makes a preliminary analysis of GI's fitness distance measure on program repair with three small Python programs. Each program undergoes incremental mutations while the change in fitness, as measured by the proportion of tests passed, is monitored. We conclude that the fitness of these programs often does not change with single mutations, and we also confirm the inherent discreteness of bug-fixing fitness functions. Although our findings cannot be assumed to be general for other software, they provide us with interesting directions for further investigation.
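
The fitness measure this abstract describes (proportion of tests passed, observed under single mutations) can be illustrated with a toy sketch. It is not the authors' code or one of their three programs: the `max2` program, its four-case test suite, and the operator-swap mutation are all hypothetical, chosen only to show why such a fitness is discrete and why some single mutations leave it unchanged.

```python
import operator

# Hypothetical test suite for a "maximum of two numbers" program:
# each entry is (inputs, expected output).
TESTS = [((1, 2), 2), ((2, 1), 2), ((3, 3), 3), ((-1, -5), -1)]

# Hypothetical mutation space: the single comparison operator in the program.
OPERATORS = {
    ">": operator.gt, ">=": operator.ge,
    "<": operator.lt, "<=": operator.le,
    "==": operator.eq, "!=": operator.ne,
}


def make_program(op_name):
    """Build a variant of max2 that uses the comparison named by op_name."""
    compare = OPERATORS[op_name]

    def max2(a, b):
        return a if compare(a, b) else b

    return max2


def fitness(program, tests=TESTS):
    """Proportion of test cases passed, a value in [0, 1]."""
    passed = sum(1 for args, expected in tests if program(*args) == expected)
    return passed / len(tests)


def single_mutation_fitnesses(original_op=">"):
    """Fitness of every variant that is one operator swap from the original."""
    return {
        op: fitness(make_program(op))
        for op in OPERATORS
        if op != original_op
    }


if __name__ == "__main__":
    print("original:", fitness(make_program(">")))
    for op, score in single_mutation_fitnesses().items():
        print(f"mutant {op!r}: {score}")
```

With four test cases the fitness can only take the values 0, 0.25, 0.5, 0.75 and 1, and the `>=` mutant passes every test that the original passes, so that mutation is neutral with respect to this suite.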
