Page 82 - Read Online
P. 82

Goodman et al. J Transl Genet Genom 2020;4:144-58  I  http://dx.doi.org/10.20517/jtgg.2020.23                                  Page 151

               Table 2. Top ranking GO terms for genes mapping to KS signature sites
                                             Hyper raw  Hyper FDR   Hyper fold                    Total genes
                Ontology         Term name                               CpG hits  Total CpGs  Gene hits
                                              P-value   Q-value    enrichment                    annotated
                GO cellular   Postsynaptic    2.46E-10  2.12E-07  5.82     21     4,685     8       224
                component    membrane
                             Synaptic membrane  6.69E-09  2.89E-06  4.59   22     6,218     9       294
                             Presynaptic membrane  7.97E-09  2.75E-06  9.66  12   1,612     5       71
                             Receptor complex  1.45E-06  4.18E-04  3.75    19     6,582     6       322
                             Postsynapse      4.00E-06  9.87E-04  3.13     22     9,125     9       422
                             Chloride channel   8.60E-05  1.86E-02  8.59   6      906       2       47
                             complex
                Human phenotype Hypoplastic heart  5.17E-06  1.72E-02  20.86  5   311       2       11

               transcription start site, revealing that the majority of signature sites (408) were intergenic. Enriched
               terms or pathways in cellular components and human phenotype ontology are listed in Table 2. The most
               prominent finding was the identification of pathways and processes involved in neuronal and synaptic
               function across all ontologies.


               The single enriched human phenotype was hypoplastic heart. Congenital heart defects are a core feature of
                                                                                             [35]
               KS, including one reported severe presentation of KS with hypoplastic left heart syndrome . Two genes,
               CHRND and DTNA, containing differentially methylated CpG sites were annotated to this GO term.

               Independent validation of Kleefstra syndrome DNA methylation signature
               Using the KS DNAm signature, we developed a machine learning classification model capable of
               categorizing individuals as positive or negative for KS on the basis of their DNAm levels at signature sites.
               We trained a SVM model on data from the KS discovery group (n = 10) and neurotypical controls (n =
               42) used to derive the signature. First, we classified a validation cohort of five unrelated individuals with a
               clinical KS diagnosis, KS11_V - KS15_V (EHMT1 nonsense variant n = 1 and 9q34.3 deletions n = 4). The
               SVM model classified all five KS individuals correctly, i.e., as positive for KS, demonstrating high sensitivity
               of the signature [Figure 3]. Furthermore, we classified an additional validation cohort of five unrelated KS
               individuals under the age of 1 year, KS16_I - KS20_I; their ages ranged from 2 days to 9 months. Despite
               training the classification model only on individuals over 1 year old, the model correctly classified all KS
               infants as positive for KS [Figure 3].

               To test the specificity of the KS DNAm signature, we assessed an additional 29 neurotypical controls
               (n = 14 female, age 1 month to 16 years), all of which classified as negative for KS (i.e., with controls)
               demonstrating 100% specificity of the signature [Figure 3]. Additionally, we assessed whether the KS
               DNAm signature could be used to classify patients (n = 8) with Nicolaides-Baraitser syndrome (NCBRS), a
               neurodevelopmental disorder with some clinical features that overlap KS, including ID, ASD and seizures
               but with distinct facial characteristics. NCBRS is caused by haploinsufficiency of SMARCA2, which
               encodes a protein that is part of another epigenetic regulator (SNF/SWI chromatin remodeling complex).
                                                                                                       [11]
               Pathogenic variants in SMARCA2 have been shown to be associated with a distinct DNAm signature .
               All eight samples were classified as negative for KS, with controls (n = 5 female, age 4-15 years), providing
               further evidence of the specificity of the KS DNAm signature.

               Finally, we tested two individuals carrying partial duplications of EHMT1, both of which mapped to the last
               two exons of the gene [26,27] . Duplications with similar boundaries in individuals with variable phenotypes,
                                                                                     [36]
               including ID and dysmorphic features, have been previously reported as benign . Both patients, Dup1
               and Dup2, were classified as negative for KS. Patient Dup1 had dysmorphic features and also carried an
               unbalanced 31.8-Mb complex rearrangement on chromosome 6p, as well as a microdeletion on 6q27
   77   78   79   80   81   82   83   84   85   86   87