Need Help?

Reference Standards for Mosaic Variant Detection

After conception, postzygotic mutations continuously occur throughout life in humans, causing somatic mosaicism in an individual. The variant type, time of origination, and locations of the mosaic mutations result in unique mosaic patterns in a combinatorial manner and further affect phenotypes, including various noncancerous diseases. Several efforts have, thus, been made to identify the mutational landscape and mechanisms underlying the mosaic mutations. From a technical aspect, the accurate detection of mutations is at the core of the mosaicism research, and the construction of a standard reference is a critical first step to serve as the basis for analytical validation and benchmarks for germline and somatic mutation detection. Furthermore, a reference standard for mosaic mutations is urgently needed to enable more advanced research.

Herein, we generated robust, large-scale, and cell line mixture-based reference standards that contain abundant mosaic SNVs and INDELs along with wild-type sites and germ line variants. Moreover, the whole reference standard set mimics the cumulative aspect of mosaic variant acquisition in the early developmental stage through a progressive cell line mixing with confirmed genotypes. It can be used for optimizing the current use of mosaic variant detection strategies, as well as developing algorithms for future improvements.

We previously generated an NCBI BioProject (PRJNA758606) to release much of these data. Only the data that are subject to authorized access constraints have been included in this submission to dbGaP. The two data sources are thus, complementary to each other.