19Genome sequence of Tricholoma matsutake was revealed as the one of the large fungal genomes 20 published up to date at 189.0 Mbp with 15,305 predicted genes. The unusual size of this fungal 21 genome contained frequent colonization of transposable elements (TEs) occupying more than 22 half of the entire genome. We identified that 702 genes were surrounded by TEs and 83.2% of 23 those genes were never transcribed at any development stage. This observation corroborated that 24 the insertion of transposable elements alters the transcription of the genes neighboring TEs. 25 Repeat-induced point mutation such as C to T hypermutation with a bias over 'CpG' 26 dinucleotides was also recognized in this genome, representing a typical defense mechanism 27 against TEs during evolution. Many transcription factor genes were activated in both primordia 28 and fruiting body, which indicates that many regulatory processes are shared during 29 developmental stages.. Small secreted protein genes (<300 aa) were dominantly transcribed in 30 hyphae, where symbiotic interactions occur with hosts. Comparative analysis with 37 31 Agaricomycetes genomes revealed that IstB-like domain (PF01695) was conserved in 32 taxonomically diverse mycorrhizal genomes, where the T. matsutake genome contained four 33 copies of this domain. Three of the IstB-like genes were overexpressed in hyphae. In the 34 CAZyme analysis, reduced CAZyme genes were found as other ectomycorrhizal genomes 35 including a lot of loss of glycoside hydrolase genes. Also, auxiliary activity genes were 36 dominantly transcribed in primordia. The T. matsutake genome sequence provides insight into 37 the large genome size and clues to understand unusual fungal genome expansion. 38 95 7 been deposited and 1,618 had gene predictions. We used one genome per genus to draw the plot. 96 Tricholoma matsutake is indicated by the arrow. b. Species tree of Tricholoma matsutake and 37 97 Agaricomycetes genomes. Only bootstrap values of less than 100 are marked. The scale bar that 98 represents the mean number of amino acid substitutions per site is shown. Aspergillus nidulans 99 genome (GenBank: GCF_000149205.2) was used as an outgroup. The branch to the outgroup 100 was shortened for visualization purpose.101 102 Repeat elements in the T. matsutake genome 103The T. matsutake genome had a high content of repeat elements, which has been suggested as a 104 concurrent feature of ectomycorrhizal genomes [2,13,14]. The repeat elements were estimated to 105 have a total size of 92.4 Mbp, representing 48.9% of the entire 189.0 Mbp genome (Fig 2). The 106 major classes of repeat elements were LTR/Gypsy and LTR/Copia transposable elements, 107 corresponding to 41 Mbp (21.9%) and 7 Mbp (3.9%) of the genome, respectively. These two 108 frequent elements have also been observed in other basidiomycete genomes [14,15]. Within the 109 repeat regions, there were 15,014 complete coding sequences (containing start codon, stop 110 codon, and no internal stop codon), which was equivalent to a...