The South Asian genome.
Chambers JC., Abbott J., Zhang W., Turro E., Scott WR., Tan ST., Afzal U., Afaq S., Loh M., Lehne B., O'Reilly P., Gaulton KJ., Pearson RD., Li X., Lavery A., Vandrovcova J., Wass MN., Miller K., Sehmi J., Oozageer L., Kooner IK., Al-Hussaini A., Mills R., Grewal J., Panoulas V., Lewin AM., Northwood K., Wander GS., Geoghegan F., Li Y., Wang J., Aitman TJ., McCarthy MI., Scott J., Butcher S., Elliott P., Kooner JS.
The genetic sequence variation of people from the Indian subcontinent who comprise one-quarter of the world's population, is not well described. We carried out whole genome sequencing of 168 South Asians, along with whole-exome sequencing of 147 South Asians to provide deeper characterisation of coding regions. We identify 12,962,155 autosomal sequence variants, including 2,946,861 new SNPs and 312,738 novel indels. This catalogue of SNPs and indels amongst South Asians provides the first comprehensive map of genetic variation in this major human population, and reveals evidence for selective pressures on genes involved in skin biology, metabolism, infection and immunity. Our results will accelerate the search for the genetic variants underlying susceptibility to disorders such as type-2 diabetes and cardiovascular disease which are highly prevalent amongst South Asians.