24Chromosome 4q deletion is one of the most frequently detected genomic imbalance events 25 in congenital heart disease (CHD) patients. However, a portion of CHD-associated 4q deletions 26 do not include known CHD genes. Alignment of those 4q deletions defined a minimal 27 overlapping region including only one gene-SORBS2. Histological analysis of Sorbs2 -/heart 28 revealed atrial septal hypoplasia/aplasia or double atrial septum. Mechanistically, SORBS2 had 29 a dual role in maintaining sarcomeric integrity of cardiomyocytes and specifying the fate of 30 second heart field (SHF) progenitors through c-ABL/NOTCH/SHH axis. In a targeted 31 sequencing of a panel of known and candidate CHD genes on 300 CHD cases, we found that 32 rare SORBS2 variants were significantly enriched in CHD patients. Our findings indicate that 33 SORBS2 is a regulator of cardiac development and its haploinsufficiency may contribute to 34 cardiac phenotype of 4q deletion syndrome. The presence of double atrial septum in Sorbs2 -/-35 hearts reveals the first molecular etiology of this rare anomaly linked to paradoxical 36 thromboembolism. 37 38 39 Keywords: 4q deletion syndrome, Congenital heart disease, SORBS2, second heart field, 40 Notch1 41 42 3 Introduction 43Congenital heart disease (CHD) is present in ~1% newborns (1). Although CHD is the most 44 common birth defect, the underlying causes of most CHDs remain elusive. Genetic 45 heterogeneity and gene-environment interaction limit the genotype-phenotype correlation and 46 complicate interpretation of CHD genetics. Anatomic rectification through interventional and 47 surgical treatments have greatly improved survival rate of CHD patients (2). However, the same 48 medical treatment often yields different outcomes in different CHD patients, suggesting the 49 underlying genetic lesion impacts CHD prognosis. Therefore, elucidation of CHD genetic basis 50 is becoming ever more important in both reducing short-term therapeutic complications and 51 improving long-term care of CHD patients. With the advances in genetic diagnosis techniques, 52 the discovery of genetic variants is no longer a limit (3). The interpretation of the association 53 of identified genetic variants with phenotype becomes the imperative need to address the 54 genetic basis of CHD.
55CNV is a chromosomal structural aberration comprising of deletion or duplication with a 56 size larger than 1kb. Due to the abundance of low-copy repeats and retrotransposons, CNVs 57 are quite common genetic variants in human genomes. Previous studies have estimated that 58 CNVs contribute to 10-15% CHD (4, 5). The most common chromosomal deletion is 59 Del22q11.2, which is the shared genetic cause of 3 clinical syndromes-DiGeorge syndrome, 60 velocardiofacial syndrome and conotruncal anomaly face syndrome (6). The common forms of 61 Del22q11.2 are ~3 or 1.5 Mbs deletion encompassing dozens of genes (6). The successful 62 identification of the major CHD gene TBX1 has greatly advanced our understanding of the 63 pathogenesis of 22q11.2 del...