The higher plant Arabidopsis thaliana (Arabidopsis) is an important model for identifying plant genes and determining their function. To assist biological investigations and to define chromosome structure, a coordinated effort to sequence the Arabidopsis genome was initiated in late 1996. Here we report one of the first milestones of this project, the sequence of chromosome 4. Analysis of 17.38 megabases of unique sequence, representing about 17% of the genome, reveals 3,744 protein coding genes, 81 transfer RNAs and numerous repeat elements. Heterochromatic regions surrounding the putative centromere, which has not yet been completely sequenced, are characterized by an increased frequency of a variety of repeats, new repeats, reduced recombination, lowered gene density and lowered gene expression. Roughly 60% of the predicted protein-coding genes have been functionally characterized on the basis of their homology to known genes. Many genes encode predicted proteins that are homologous to human and Caenorhabditis elegans proteins.
The taxonomic placement of the moth-butterfly,
Macrosoma conifera
(Warren 1897) (Lepidoptera: Hedylidae), has been controversial. The 15,344 bp complete
M. conifera
circular mitogenome, assembled by genome skimming, consists of 81.7% AT nucleotides, 22 tRNAs, 13 protein-coding genes, 2 rRNAs and a control region in the typical butterfly gene order.
Macrosoma conifera COX1
features an atypical CGA start codon while
ATP6, COX1, COX2,
and
ND5
exhibit incomplete stop codons completed by the post-transcriptional addition of 3′ A residues. Phylogenetic reconstruction places
M. conifera
as sister to the skippers (Hesperiidae), which is consistent with several recent phylogenetic analyses.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.