In conjunction with the completion of the human genome sequence, microarray technology offers a complementary strategy to traditional methodologies used to search for genetic determinants involved in multifactorial diseases such as Alzheimer's disease. In order to gain benefits from this strategy, we have designed home-made microarrays to compare the expression of all ORFs located within loci of interest defined by genome scanning in Alzheimer family studies. Two approaches were selected using either probes amplified by PCR from a cDNA bank or specific oligonucleotides. Here, we report the challenging task of validating, prioritising and selecting the best ORFs derived from the genome sequence. The initial inventory from the NCBI website allowed us to select 5849 ORF's within nine loci. Half of them resulted from prediction models using the GenomeScan software. However, our data have shown that predicted ORFs may not be representative of exonic sequences, or even real genes. These observations have led us to exclude these ORFs from our study, decreasing their number from 5849 to 2748. Microarrays may be only 'snapshots' of our current knowledge of the human genome.