Title: Evidence for 28 genetic disorders discovered by combining healthcare and research data
Author: Joanna Kaplanis, Kaitlin E. Samocha, Laurens Wiel, Zhancheng Zhang, Kevin J. Arvai, Ruth Y. Eberhardt, Giuseppe Gallone, Stefan H. Lelieveld, Hilary C. Martin, Jeremy F. McRae, Patrick J. Short, Rebecca I. Torene, Elke de Boer, Petr Danecek, Eugene J. Gardner, Ni Huang, Jenny Lord, Iigo Martincorena, Rolph Pfundt, Margot R. F. Reijnders, Alison Yeung, Helger G. Yntema, Lisenka E. L. M. Vissers, Jane Juusola, Caroline F. Wright, Han G. Brunner, Helen V. Firth, David R. FitzPatrick, Jeffrey C. Barrett, Matthew E. Hurles, Christian Gilissen, Kyle Retterer
Abstract: De novo mutations in protein-coding genes are a well-established cause of developmental disorders1. However, genes known to be associated with developmental disorders account for only a minority of the observed excess of such de novo mutations1,2. Here, to identify previously undescribed genes associated with developmental disorders, we integrate healthcare and research exome-sequence data from 31,058 parent–offspring trios of individuals with developmental disorders, and develop a simulation-based statistical test to identify gene-specific enrichment of de novo mutations. We identified 285 genes that were significantly associated with developmental disorders, including 28 that had not previously been robustly associated with developmental disorders. Although we detected more genes associated with developmental disorders, much of the excess of de novo mutations in protein-coding genes remains unaccounted for. Modelling suggests that more than 1,000 genes associated with developmental disorders have not yet been described, many of which are likely to be less penetrant than the currently known genes. Research access to clinical diagnostic datasets will be critical for completing the map of genes associated with developmental disorders. By integrating healthcare and exome-sequencing data from parent–offspring trios of patients with developmental disorders, 28 genes that had not previously been associated with developmental disorders were identified.