Evidence for 28 genetic disorders discovered by combining healthcare and research data.
Kaplanis J., Samocha KE., Wiel L., Zhang Z., Arvai KJ., Eberhardt RY., Gallone G., Lelieveld SH., Martin HC., McRae JF., Short PJ., Torene RI., de Boer E., Danecek P., Gardner EJ., Huang N., Lord J., Martincorena I., Pfundt R., Reijnders MRF., Yeung A., Yntema HG., Deciphering Developmental Disorders Study None., Vissers LELM., Juusola J., Wright CF., Brunner HG., Firth HV., FitzPatrick DR., Barrett JC., Hurles ME., Gilissen C., Retterer K.
De novo mutations in protein-coding genes are a well-established cause of developmental disorders1. However, genes known to be associated with developmental disorders account for only a minority of the observed excess of such de novo mutations1,2. Here, to identify previously undescribed genes associated with developmental disorders, we integrate healthcare and research exome-sequence data from 31,058 parent-offspring trios of individuals with developmental disorders, and develop a simulation-based statistical test to identify gene-specific enrichment of de novo mutations. We identified 285 genes that were significantly associated with developmental disorders, including 28 that had not previously been robustly associated with developmental disorders. Although we detected more genes associated with developmental disorders, much of the excess of de novo mutations in protein-coding genes remains unaccounted for. Modelling suggests that more than 1,000 genes associated with developmental disorders have not yet been described, many of which are likely to be less penetrant than the currently known genes. Research access to clinical diagnostic datasets will be critical for completing the map of genes associated with developmental disorders.