This project had three main objectives: (1) to establish a database of essays written by different language groups on a variety of topics for the Test of Written English (TWE) that can be used in future research; (2) to summarize, analyze, and compare the linguistic properties of those essays; and (3) to determine how the TWE performance of language groups relates to essay styles. As part of the first objective of this project we created a database of 1,737 essays, a data matrix of essay variables, files containing sorted phrases and vocabularies of different language groups, and files of the common and unique vocabulary items for each pair of language groups.The essay sample consisted of TWE essays from five language groups, including Arabic, Chinese, English (including native-English and nonnative-English), and Spanish speakers. Essays from English-speaking students in the United States were collected and scored to provide a baseline with which to compare essays by students who speak English as a second language (ESL).This report includes the analysis of 106 variables for each essay along with summary analyses, such as correlation, analyses of variance, discriminative analysis, and factor analysis. It also presents information on the accuracy and cost of data entry and on the accuracy of the text analysis programs, as well as extensive data on vocabulary. A program that assesses content was developed for this project.Current (1998-99) members of the TOEFL Committee of Examiners are: