Affordable sequencing and genotyping methods are essential for large scale genome-wide association studies. While genotyping microarrays and reference panels for imputation are available for human subjects, non-human model systems often lack such options. Our lab previously demonstrated an efficient and cost-effective method to genotype heterogeneous stock rats using double-digest genotyping-by-sequencing. However, low-coverage whole-genome sequencing offers an alternative method that has several advantages. Here, we describe a cost-effective, high-throughput, high-accuracy genotyping method for N/NIH heterogeneous stock rats that can use a combination of sequencing data previously generated by double-digest genotyping-by-sequencing and more recently generated by low-coverage whole-genome-sequencing data. Using double-digest genotyping-by-sequencing data from 5,745 heterogeneous stock rats (mean 0.21x coverage) and low-coverage whole-genome-sequencing data from 8,760 heterogeneous stock rats (mean 0.27x coverage), we can impute 7.32 million bi-allelic single-nucleotide polymorphisms with a concordance rate >99.76% compared to high-coverage (mean 33.26x coverage) whole-genome sequencing data for a subset of the same individuals. Our results demonstrate the feasibility of using sequencing data from double-digest genotyping-by-sequencing or low-coverage whole-genome-sequencing for accurate genotyping, and demonstrate techniques that may also be useful for other genetic studies in non-human subjects.
This paper's license is marked as closed access or non-commercial and cannot be viewed on ResearchHub. Visit the paper's external site.