2004
DOI: 10.1002/0471250953.bia01bs05

Common File Formats

Abstract: This appendix discusses a few of the file formats frequently encountered in bioinformatics. Specifically, it reviews the rules for generating FASTA files and provides guidance for interpreting NCBI descriptor lines, commonly found in FASTA files. In addition, it reviews the construction of GenBank files.
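
As an illustration of the FASTA rules the abstract refers to, the sketch below reads records under the commonly used convention that each record starts with a description line beginning with ">" and is followed by one or more lines of sequence. It is a minimal sketch rather than the appendix's own method: the function name parse_fasta and the example records are hypothetical, and NCBI descriptor lines are returned as plain text without further interpretation.

```python
import io
from typing import Iterator, TextIO, Tuple


def parse_fasta(handle: TextIO) -> Iterator[Tuple[str, str]]:
    """Yield (description, sequence) pairs from FASTA-formatted text.

    Assumes the common convention: a line starting with '>' opens a record,
    and every following non-blank line up to the next '>' is sequence data.
    """
    description = None
    chunks = []
    for raw_line in handle:
        line = raw_line.strip()
        if not line:
            continue  # skip blank lines between or inside records
        if line.startswith(">"):
            if description is not None:
                yield description, "".join(chunks)
            description = line[1:].strip()  # drop the leading '>'
            chunks = []
        elif description is not None:
            chunks.append(line)  # sequence may be wrapped over several lines
    if description is not None:
        yield description, "".join(chunks)


# Hypothetical two-record example, not taken from the appendix.
example = ">seq1 hypothetical example record\nACGT\nACGT\n>seq2\nGGCC\n"
for description, sequence in parse_fasta(io.StringIO(example)):
    print(description, len(sequence))
```

The parser joins wrapped sequence lines into a single string, so downstream code does not depend on the line width used when the file was written.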

Cited by 4 publications
(8 citation statements)
References 0 publications
“…For example, in meta-analysis of sequences an established and unified file standard is crucial (Ten Hoopen et al, 2017). FASTA (Pearson & Lipman, 1988), FASTQ (Cock et al, 2010), and SAM/BAM (Li et al, 2009) are famous examples of file standards that have allowed the effective exchange of information between numerous groups involved in the earliest sequencing projects (Leonard & Littlejohn, 2004; Ondřej & Dvořák, 2012; Zhang, 2016). Any disparities in the sampling method also have to be taken into account when biological material is concerned, so it is essential they are recorded appropriately (Ten Hoopen et al, 2017).…”
Section: Comparison and Integration Of Datasets And Databases
mentioning
confidence: 99%
“…1. Run geneid on the first example (example1.fa) with default options: geneid -P param/human3iso.param samples/example1.fa geneid is a Unix command-line program that requires as input a file containing a DNA sequence in FASTA format (samples/example1.fa; see Leonard & Littlejohn, 2004, for discussion of FASTA format), and a parameter file. This is specified by using the option -P followed by the name of the parameter file.…”
Section: Geneid -H
mentioning
confidence: 99%
“…Users must input a DNA sequence in FASTA format (Leonard & Littlejohn, 2004) either from file or from the text area, while the external information in GFF format is optional. The process for building a graphical representation from the geneid output with the program Alioto et al…”
Section: Files
mentioning
confidence: 99%
“…GFF or PTT) (Leonard et al, 2007). It is also possible to open multiple FASTA files or raw sequence data, and there is an option for downloading sequence information from the NCBI web site http://www.ncbi.nlm.nih.gov/.…”
mentioning
confidence: 99%