What can exome sequencing do for you?

Jacek Majewski; Jeremy Schwartzentruber; Emilie Lalonde; Alexandre Montpetit; Nada Jabado

doi:10.1136/jmedgenet-2011-100223

Review

Jacek Majewski¹,²
Jeremy Schwartzentruber¹
Emilie Lalonde¹,²
Alexandre Montpetit¹
Nada Jabado²,³

¹McGill University and Genome Quebec Innovation Centre, Montreal, Canada
²Department of Human Genetics, McGill University, Montreal, Canada
³Department of Pediatrics, McGill University Health Center Research Institute, Montreal, Canada

Correspondence to Dr Nada Jabado, Montreal Children's Hospital Research Institute, 4060 Ste Catherine West, PT-239, Montreal, Quebec H3Z 2Z3, Canada; nada.jabado@mcgill.ca

Abstract

Recent advances in next-generation sequencing technologies have brought a paradigm shift in how medical researchers investigate both rare and common human disorders. The ability to generate genome-wide sequencing data cost-effectively, with deep coverage and in a short time frame, is replacing approaches that focus on specific regions for gene discovery and clinical testing. While whole genome sequencing remains prohibitively expensive for most applications, exome sequencing (a technique which focuses on only the protein-coding portion of the genome) places many advantages of the emerging technologies into researchers' hands. Recent successes using this technology have uncovered genetic defects with a limited number of probands regardless of shared genetic heritage, and are changing our approach to Mendelian disorders, for which all causative variants, genes and their relation to phenotype may soon be uncovered. The expectation is that, in the very near future, this technology will enable us to identify all the variants in an individual's personal genome and, in particular, clinically relevant alleles. Beyond this, whole genome sequencing is also expected to bring a major shift in clinical practice in terms of diagnosis and understanding of diseases, ultimately enabling personalised medicine based on one's genome. This paper provides an overview of the current and future use of next-generation sequencing as it relates to whole exome sequencing in human disease by focusing on the technical capabilities, limitations and ethical issues associated with this technology in the field of genetics and human disease.

Molecular genetics
cancer: CNS
paediatric oncology

https://doi.org/10.1136/jmedgenet-2011-100223

Introduction

Discoveries made in the 20th century have helped to completely reshape all fields of biomedical studies as we know them. The revolution we are currently witnessing was triggered by the discovery of the DNA double helix in 1953 [1, 2], which has enabled major advances in genetics and heredity. Advances in knowledge have often been driven by the advent of new technologies. PCR was invented in 1983 [3] and revolutionised our approach to the study of DNA, which in turn revolutionised the molecular analysis of mammalian genes. In 1977 two landmark articles describing methods for DNA sequencing were published [4, 5]. The approach reported by Sanger and colleagues was further refined and commercialised, leading to its dissemination throughout the research community and, ultimately, into clinical diagnostics. In an industrial high-throughput configuration, Sanger technology was then used to sequence the first human genome, published in draft form in 2001 through the Human Genome Project, a 13-year effort with an estimated cost of $2.7 billion [6–8]. In 2008, by comparison, a human genome was sequenced over a 5-month period for approximately $1.5 million [9] and now, in 2011, sequencing a whole genome takes a few days and is soon projected to cost less than $10 000. The latter accomplishment was made possible by the commercial launch of the first massively parallel pyrosequencing platform in 2005, which ushered in the era of high-throughput genomic analysis now referred to as ‘next-generation sequencing’ (NGS). The time and cost needed are expected to fall even further in the very near future, making these technologies available to researchers without large budgets. Vast amounts of genotype and phenotype data are now being generated by a growing number of research efforts. Interpreting all these new data and translating the findings to practical healthcare is a challenge.

This review focuses on what whole exome sequencing (WES) using NGS can teach us about human disease, starting with single gene disorders and moving on to more complex genetic disorders including complex traits and cancer. We review the current state of the technology, its practical current and future uses, the reasons to use WES instead of whole genome sequencing (WGS) and some of the ethical dilemmas arising from the impact of the results on society and on clinical practice.

Next-generation sequencing (NGS) technologies and experimental approaches for whole exome sequencing (WES)

NGS platforms share a common technological feature, namely massively parallel sequencing of clonally amplified or single DNA molecules that are spatially separated in a flow cell. This design is a paradigm shift from that of Sanger sequencing and has allowed scaling-up by orders of magnitude. In NGS, sequencing is performed by repeated cycles of polymerase-mediated nucleotide extensions or, in the case of ABI SOLiD, by iterative cycles of oligonucleotide ligation. As a massively parallel process, NGS generates hundreds of megabases to gigabases of nucleotide sequence from a single instrument run, depending on the platform. Targeted sequencing approaches have the general advantage of increased sequence coverage of regions of interest (such as coding exons of genes) at lower cost and higher throughput compared with random shotgun sequencing methods [9–12]. Most large-scale methods for targeted sequencing use a variation of a hybrid selection approach. Complementary nucleic acid ‘baits’ are used to ‘fish’ for regions of interest in the total pool of nucleic acids, which can be DNA [12–15] or RNA [16]. Any subset of the genome can be targeted, including exons, non-coding RNAs, highly conserved regions of the genome, disease-associated LD blocks or other regions of interest. Because the exome represents only approximately 1% of the genome or about 30 Mb, vastly higher sequence coverage can be readily achieved using second-generation sequencing platforms with considerably less raw sequence and cost than WGS. For example, whereas 90 Gb of sequence is required to obtain 30-fold average coverage of the genome, 75-fold average coverage is achieved for the exome with only 3 Gb of sequence using the current state-of-the-art platforms for targeting [17, 18]. However, there are inefficiencies in the targeting process. For example, uneven capture efficiency across exons can result in exons with low sequence coverage, and off-target hybridisation means that at least 20% of reads come from genomic DNA outside the exome. In addition, exome capture is not complete. Indeed, the probes in sequence capture methods are designed based on information from gene annotation databases such as the consensus coding sequence (CCDS) Database and RefSeq Database. Therefore, unknown or yet-to-be-annotated exons, evolutionarily conserved non-coding regions and regulatory sequences such as enhancers or promoters are not typically captured. Partly to address these issues of coverage, the latest commercially available capture kits provide nearly complete coverage of the well-annotated genes but also allow the user to add custom content by designing capture probes targeted to additional regions of interest such as promoters or highly conserved sequences. Newer kits have also expanded the regions captured to include micro-RNA sites and untranslated regions of genes, thus increasing the regions captured from ∼30 Mb to as high as 62 Mb (table 1).
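The coverage figures quoted above can be checked with simple arithmetic. The sketch below is only illustrative: the 3 Gb genome size and the 75% on-target fraction are assumptions chosen to be consistent with the numbers in the text (90 Gb for 30-fold genome coverage, 3 Gb for 75-fold exome coverage, at least 20% off-target reads), and the function name is hypothetical.

```python
def mean_coverage(sequenced_bases, target_size, on_target_fraction=1.0):
    """Average depth over a target = usable sequenced bases / target size."""
    return sequenced_bases * on_target_fraction / target_size


GENOME_SIZE = 3e9  # approximate human genome size in bases (assumption)
EXOME_SIZE = 30e6  # ~30 Mb exome, as quoted in the text

# 90 Gb of shotgun sequence spread over the whole genome
wgs_depth = mean_coverage(90e9, GENOME_SIZE)

# 3 Gb of sequence if every read landed on the exome
wes_ideal_depth = mean_coverage(3e9, EXOME_SIZE)

# Same 3 Gb with an assumed 75% of reads on target (25% lost off-target)
wes_depth = mean_coverage(3e9, EXOME_SIZE, on_target_fraction=0.75)

print(f"WGS mean depth: {wgs_depth:.0f}x")
print(f"WES mean depth, perfect capture: {wes_ideal_depth:.0f}x")
print(f"WES mean depth, 75% on target: {wes_depth:.0f}x")
```

The gap between the ideal and the realised exome depth is one way to see the cost of off-target hybridisation; uneven capture across exons means individual exons can fall well below the mean.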

Table 1

Comparison of commercially available technologies for human whole exome capture (numbers taken from their respective data sheets)

An important consideration is that NGS technologies have higher base calling error rates than Sanger sequencing, although this can be remedied to some extent by increasing the depth of sequencing coverage to ensure minimal ‘false calls’ [19]. The higher error rate makes the resequencing of mutant or variant genes using conventional sequencing techniques important for validation and increases the cost of the approach. All of these inefficiencies are likely to be ameliorated as sequencing and capture technology continue to improve. Importantly, the higher coverage of the exome that can be affordably achieved for a large number of samples makes exome sequencing highly suitable for mutation discovery and its use is becoming increasingly routine.

WES in human disease

“Why me?” “Why my child?” “What could I have done to avoid this?” “What can I do to be cured or to get better?” These questions routinely face clinicians caring for patients with a given disease, and they are even questions we ask ourselves as medical knowledge expands with the rapid advances in genome sequencing. They pertain to any human disease as all have a genetic component—major or minor—or, for some, they simply reflect the wish to know what genetic information they are born with. Until 2008 such questions usually remained unanswered because, even for some Mendelian disorders, we did not know how to identify causative mutations within our genome. Indeed, our individual genomes contain variants which may protect us against or increase our susceptibility to our environment and the multiple stressors we encounter during our lifespan. Knowing these variants can perhaps allow us to better prepare or to avoid the negative impacts they might have on our health, lifespan and offspring. In the short years since the first commercial platform became available, NGS has dramatically accelerated multiple areas of genomics research, enabling experiments that previously were not technically feasible or affordable. In this paper we describe the major ongoing applications of NGS as they pertain to WES.

Genetic variants identified using WES

Genetic variants induce a phenotype that can vary between individuals in penetrance or physiological effect and may depend on (1) environmental factors; (2) modifier genes and/or the epigenome; and (3) the additive/synergistic effect from another genetic variant (digenic inheritance) [20]. High-penetrance variants induce a strong physiological effect and thus these alleles have usually been identified as causative for Mendelian monogenic disorders using linkage studies in families (see below). Low-penetrance variants have a weak phenotype and causative alleles are typically identified in large case/control cohorts as part of the study of complex trait disorders. Common high-penetrance disease variants are rare as they would normally be eliminated from breeding populations, except in cases of balancing selection where heterozygotes have an advantage over homozygotes (eg, sickle cell disease and resistance to cerebral malaria). The distinction between monogenic and complex diseases is therefore operational as all genetic variants are de facto transmitted in a Mendelian fashion and thus amenable to discovery using WES, giving this technique far wider applicability than simply the study of rare monogenic disorders (figure 1).

Figure 1

Roadmap for the application of next generation sequencing technologies for the identification of disease-relevant genomic variations.

WES in characterising monogenic (Mendelian) disorders

Uncovering genetic defects underlying monogenic inherited disorders is one of the most obvious applications of WES. Single-gene disorders, while individually rare, are in aggregate numerous and have an enormous impact on the well-being of affected patients. To date, the causative gene has not yet been identified for more than 3000 Mendelian disorders. Although rates of spontaneous mutation in the human genome have been estimated in various ways, it is clear that worldwide the entire human gene repertoire is bombarded with new pathogenic alleles each year [21]. The number of known mutations in human nuclear genes that underlie or are associated with human inherited disease now exceeds 100 000 in more than 3700 different genes (Human Gene Mutation Database). However, for a variety of reasons this figure probably represents only a small fraction of the clinically relevant genetic variants in the human genome.

NGS brings new ways of addressing monogenic disorders. Classical strategies involved using linkage analysis in families with known shared genetic heritage, identifying candidate genomic regions enclosing the gene with the causative mutation, narrowing the interval whenever possible with additional families/probands and thereafter either implementing a candidate gene approach or systematically sequencing the genes located within the interval. The advent of single nucleotide polymorphism (SNP) arrays, which can identify regions of homozygosity within a genome, helped significantly to hasten the linkage analysis by narrowing the regions of interest for further directed sequencing. However, these approaches are costly and time-consuming and their success in identifying causative genetic variants has been variable, mainly due to the small numbers of available affected individuals for a given Mendelian disease and possibly also due to locus heterogeneity [22–25]. Deep resequencing of all human genes for discovery of allelic variants could potentially identify the gene underlying any given rare monogenic disease where a shared genetic heritage is not readily available [11].

Protein-coding genes constitute only approximately 1% of the human genome but harbour about 85% of the mutations with large effects on disease-related traits. Indeed, most Mendelian disorders are caused by exonic mutations or splice-site mutations that change the amino acid sequence of the affected gene. In contrast to the more laborious approach of SNP homozygosity mapping, the exome sequencing approach is faster, does not depend on shared allelic heritage and can be done in the presence of allelic heterogeneity. Instead, its success depends on the mutation being present in the captured portion of the genome and on our ability to identify it as a pathogenic variant among the many thousands of new variants detected in each exome (the ‘background noise’). Strategies to identify these mutations using exome sequencing generally rely on certain assumptions. First, homozygous or heterozygous mutations in only a single gene are required to cause the disease and these mutations will be extremely rare (eg, only present in affected individuals). Second, these mutations have a large effect size, are highly penetrant and, as such, are assumed to affect the protein sequence (ie, non-synonymous SNPs, insertions/deletions or splice-site mutations). The main strategy employed to identify causative mutations is therefore to find all variants in the exome and apply various filters based on the assumptions above. Additional filters are also required to remove false positives caused by systematic sequencing errors and read misalignment. However, the development of bioinformatics analysis tools and the availability and rapidly decreasing cost of NGS technology render exome sequencing simpler and faster than homozygosity mapping. The two approaches remain complementary, as homozygosity mapping can allow us to focus rapidly on the variants most likely to be causative, which can now be identified in record time and with a small number of affected individuals using WES, a cost-effective, reproducible and robust strategy.
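The filtering strategy described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not a description of any particular pipeline: the record fields (`gene`, `effect`, `pop_freq`, `zygosity`), the effect labels and the 1% population frequency cut-off are all hypothetical choices made for the example.

```python
from collections import defaultdict

# Effect classes assumed to alter the protein (labels are illustrative)
PROTEIN_ALTERING = {"missense", "nonsense", "frameshift", "splice_site", "inframe_indel"}


def passes_filters(variant, max_pop_freq=0.01):
    """Keep variants that are protein-altering and rare in reference populations."""
    return variant["effect"] in PROTEIN_ALTERING and variant["pop_freq"] <= max_pop_freq


def candidate_genes(exomes, recessive=True):
    """Return genes carrying a qualifying genotype in every affected exome.

    Recessive model: one homozygous variant or at least two heterozygous
    variants in the same gene (phase is not checked, so two heterozygous
    variants on the same chromosome would be a false candidate).
    Dominant model: at least one qualifying variant in the gene.
    """
    needed = 2 if recessive else 1
    per_sample = []
    for variants in exomes:
        alleles = defaultdict(int)
        for v in variants:
            if passes_filters(v):
                alleles[v["gene"]] += 2 if v["zygosity"] == "hom" else 1
        per_sample.append({gene for gene, n in alleles.items() if n >= needed})
    return set.intersection(*per_sample) if per_sample else set()


patient_1 = [
    {"gene": "GENE_A", "effect": "missense", "pop_freq": 0.0, "zygosity": "het"},
    {"gene": "GENE_A", "effect": "nonsense", "pop_freq": 0.0, "zygosity": "het"},
    {"gene": "GENE_B", "effect": "synonymous", "pop_freq": 0.0, "zygosity": "hom"},
    {"gene": "GENE_C", "effect": "missense", "pop_freq": 0.12, "zygosity": "hom"},
]
patient_2 = [
    {"gene": "GENE_A", "effect": "frameshift", "pop_freq": 0.0, "zygosity": "hom"},
    {"gene": "GENE_D", "effect": "missense", "pop_freq": 0.0, "zygosity": "het"},
]

candidates = candidate_genes([patient_1, patient_2], recessive=True)
print(candidates)
```

Intersecting at the gene level rather than the variant level is what lets the approach tolerate allelic heterogeneity: unrelated patients need not share the same mutation, only the same gene.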

The reported successes using NGS have increased exponentially since its first application in 2008 and have frequently been achieved using a limited number of patients [26–28]. Identification of genetic defects in autosomal recessive diseases was performed using either unrelated individuals [10, 29–34] and/or individuals from the same families [35–42] and was, in some instances, coupled to homozygosity mapping [43–45]. Similar success was achieved for autosomal dominant disorders [46, 47]. WES has even been able to identify the causative mutation in diseases with genetic and phenotypic heterogeneity [30, 34, 48, 49]. Such heterogeneity would make identification of the causative mutation very difficult, if not impossible, by traditional linkage-based approaches. In two recent publications WGS was used to identify the causative gene, but it should be noted that in both instances the investigators identified the genetic alterations in the exome [50, 51]. Thus, in total since 2009, more than 20 causative genes have been identified and this number is growing exponentially. With the availability of this technology, several initiatives which aim to characterise Mendelian disorders have been launched in North America, through the National Institutes of Health (NIH), Finding of Rare Disease Genes in Canada (FORGE) and the Rare Disease Consortium for Autosomal Loci (RaDiCAL). The challenge is now how to validate, among the multiple variants that will be identified, the causative alteration and link it to disease and function. Moreover, it is likely that we will identify novel phenotypes for a known gene, or the reverse, as different hypomorphic mutations can lead to distinct phenotypes [52–54] and identical mutations in a given gene have been shown to induce distinct phenotypes [55]. In addition, in highly consanguineous probands, the compounded effect of added genetic alterations can affect the phenotype and lead to what has been identified as a novel disease.

Paradigm shift brought by WES in the identification of de novo mutations

A remarkable demonstration of how powerful WES using NGS can be in teaching us about human disease comes from a recent study that provides evidence that de novo single nucleotide variants may contribute substantially to mental retardation [56]. Investigators explored a major paradox in evolutionary theory, namely that the per-generation mutation rate in humans is high despite the allelic loss due to reduced fertility. They postulated that these de novo mutations may compensate for allele loss in common neurodevelopmental and psychiatric diseases, and explain this paradox in evolutionary genetic theory. They used a family-based WES approach to test this de novo mutation hypothesis in 10 individuals with unexplained mental retardation. Following WES of their trios (parents and proband), they identified and validated unique non-synonymous de novo mutations in nine genes. While they further establish the power of NGS/WES in identifying the genetic basis of human diseases, their findings also provide strong experimental support for a de novo paradigm; de novo point mutations of large effect sizes together with de novo copy number variation could potentially explain the majority of all cases of mental retardation in the population [56]. In addition, in cases where the results from the analysis of WES are not conclusive in identifying a causative gene and/or in the scenario where only one affected individual within the family is available for studying a rare disorder, resequencing trios (parents and the patient) can help to pinpoint the causative genetic variant by excluding mutations shared with the parents.
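The trio logic described above, excluding variants shared with either parent, amounts to a set difference. The sketch below is a simplified illustration: representing variants as `(chrom, pos, ref, alt)` tuples, the `parent_depth` lookup and the minimum parental depth of 10 reads are assumptions added for the example, the last one to avoid calling a variant de novo merely because a parent was poorly covered at that site.

```python
def de_novo_candidates(proband, mother, father, parent_depth, min_depth=10):
    """Return proband variants absent from both parents at well-covered sites.

    proband, mother, father: sets of (chrom, pos, ref, alt) tuples.
    parent_depth: {(chrom, pos): (mother_depth, father_depth)}.
    """
    inherited = mother | father
    candidates = []
    for variant in sorted(proband - inherited):
        site = variant[:2]
        depths = parent_depth.get(site, (0, 0))
        # Require adequate coverage in both parents before trusting absence
        if min(depths) >= min_depth:
            candidates.append(variant)
    return candidates


proband = {("1", 1000, "A", "G"), ("2", 500, "C", "T"), ("7", 42, "G", "A")}
mother = {("1", 1000, "A", "G")}
father = set()
parent_depth = {("2", 500): (35, 40), ("7", 42): (3, 28)}

de_novo = de_novo_candidates(proband, mother, father, parent_depth)
print(de_novo)
```

In this toy data the chromosome 7 variant is dropped because the mother has only 3 reads at that position, so its absence from her call set is not informative; such sites would normally be followed up by targeted resequencing.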

WES in characterising complex trait disorders

Genome-wide association (GWA) studies of complex traits have been successful in identifying common variant associations but have failed to explain most of the heritability of these traits [57]. The field of complex trait genetics is shifting towards the study of low-frequency (minor allele frequency (MAF) 0.01–0.05) and rare (MAF <0.01) variants, some of which are hypothesised to have larger effects. Indeed, GWA studies, which so far have focused on very common SNPs, have been completed for most common human diseases and many related traits [58, 59]. These studies have been designed based on the knowledge of most of the very common gene variants (MAF >∼5%) in the human genome and have identified over 500 independent strong SNP associations (p<1×10⁻⁸) (see the National Human Genome Research Institute Catalog of Published GWA Studies). However, most of these associated SNPs have very small effect sizes and the proportion of heritability explained is at best modest for most traits [57, 60]. Furthermore, most GWA signals have yet to be tracked to causal polymorphisms. Fine mapping and functional evaluation of these loci is an ongoing process with several successful examples indicating that the causal variants must have subtle regulatory effects. Although the systematic identification of rare variants associated with common diseases has not yet been feasible, several rare variants have nevertheless been identified that confer a substantial risk of disease. For example, autism, mental retardation, epilepsy and schizophrenia have been shown to be influenced by rare structural variants that affect genes [61]. Additionally, it seems possible that some, perhaps even many, of the current GWA signals could reflect the effect of one or more rare variants that have been tagged by common variants [62]. Whatever underlies the GWA signals for common diseases, it is clear that GWA studies of common variants have limited value in disease prediction.
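The frequency classes used above can be written down explicitly. The following is a small sketch: the thresholds (rare below 0.01, low-frequency from 0.01 to 0.05, common above 0.05) come from the text, while the function names and the example counts are hypothetical.

```python
def minor_allele_frequency(alt_allele_count, total_alleles):
    """MAF = frequency of the less common of the two alleles at a biallelic site."""
    if total_alleles <= 0 or not 0 <= alt_allele_count <= total_alleles:
        raise ValueError("allele counts are inconsistent")
    alt_freq = alt_allele_count / total_alleles
    return min(alt_freq, 1 - alt_freq)


def frequency_class(maf):
    """Classify a variant using the MAF bands quoted in the text."""
    if maf < 0.01:
        return "rare"
    if maf <= 0.05:
        return "low-frequency"
    return "common"


# 200 diploid individuals give 400 alleles; 6 of them carry the alternative allele
maf = minor_allele_frequency(alt_allele_count=6, total_alleles=400)
print(maf, frequency_class(maf))
```

Note that with 200 individuals the smallest non-zero MAF that can be observed is 1/400, which illustrates why rare variants call for large, deeply sequenced cohorts.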

As discussed above, past results make a strong case for common diseases being more similar to Mendelian diseases than is postulated by the common disease–common variant model. It seems possible that much of the genetic cause of common diseases is due to rare and generally deleterious variants that have a strong impact on the risk of disease in individual patients, and which can now be identified thanks to NGS. Because the most obvious disease-influencing variants will be the clearly functional ones, WES has the potential to identify these rare variants and allow definitive connections to be rapidly established between specific genes and many important common diseases. However, there are drawbacks to this technique, the most important being that it almost entirely misses structural variation. WES, despite its name, also misses a certain set of exons: if causal variants lie within these exons that are not targeted (we found that 5–10% of RefSeq exons or ∼3% of RefSeq coding exons have <5× coverage in the latest commercial capture kits), they will not be identified, as is also the case for monogenic disorders. Additionally, the capture methods currently used require the sequencing of a far greater number of bases than expected based on the size of the exome, which makes WES prices comparable to those of low-coverage WGS. However, low-coverage sequencing will miss many of the variants present in only a single individual. For these reasons, high-coverage WES is the method of choice for complex traits as it is becoming more affordable, and there is a rapid increase in the sequencing capacity of existing platforms as well as the development of new less-expensive platforms.

An essential point to improve the chances of success is to carefully choose cases and controls for each study as costs restrict sample size. Selecting cases with a strong family history will increase the probability of finding pathogenic variants with large effects. The availability of large control cohorts who can be recalled and phenotypically evaluated will also be crucial. Unlike variants with weak influences which could be expected to appear in the population at large without much phenotypic effect, a variant of strong effect is less likely to appear in an individual without any phenotypic consequence. Confirmation of such potentially causal variants will therefore often require the careful evaluation of the phenotype in any controls who are carriers. Several complex diseases are currently being investigated using NGS and include mental diseases, diabetes and autoimmune disorders such as lupus and inflammatory bowel diseases. The results from these studies have the potential to revolutionise the screening of these disorders and our therapeutic approach. The low-frequency and rare variants are likely to be population-specific at a much finer scale than common variants, so careful geographical and molecular matching of cases and controls is that much more important than in GWA studies. An alternative strategy is to do gene-based rather than variant-based analysis, which will require sequencing rather than genotyping in the replication cohorts. The pros and cons of these alternatives are still being debated.
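To make the gene-based alternative concrete, the sketch below collapses all rare variants in a gene into a single carrier status per individual and compares cases with controls. The text does not specify a method; a one-sided Fisher exact test on carrier counts is used here only as one simple, commonly used option, and the data layout, the MAF cut-off of 0.01 and all names are assumptions for illustration.

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher exact P value for the table [[a, b], [c, d]].

    Probability of seeing at least `a` carriers among cases, given the margins.
    """
    cases, carriers, total = a + b, a + c, a + b + c + d
    upper = min(cases, carriers)
    favourable = sum(
        comb(cases, k) * comb(total - cases, carriers - k) for k in range(a, upper + 1)
    )
    return favourable / comb(total, carriers)


def burden_test(gene, cases, controls, variant_info, max_maf=0.01):
    """Compare carriers of any rare variant in `gene` between cases and controls.

    cases, controls: lists of sets of variant ids, one set per individual.
    variant_info: {variant_id: {"gene": str, "maf": float}}.
    """
    def is_carrier(sample_variants):
        return any(
            variant_info[v]["gene"] == gene and variant_info[v]["maf"] < max_maf
            for v in sample_variants
        )

    case_carriers = sum(is_carrier(s) for s in cases)
    control_carriers = sum(is_carrier(s) for s in controls)
    return fisher_exact_greater(
        case_carriers, len(cases) - case_carriers,
        control_carriers, len(controls) - control_carriers,
    )


variant_info = {
    "v1": {"gene": "G1", "maf": 0.002},
    "v2": {"gene": "G1", "maf": 0.0005},
    "v3": {"gene": "G1", "maf": 0.2},   # common, ignored by the burden count
    "v4": {"gene": "G2", "maf": 0.001},
}
cases = [{"v1"}, {"v2"}, {"v1", "v3"}, {"v3"}, {"v4"}, set()]
controls = [{"v3"}, set(), {"v4"}, set(), {"v3"}, set()]

p_value = burden_test("G1", cases, controls, variant_info)
print(f"G1 burden P = {p_value:.4f}")
```

Because different individuals may carry different rare variants in the same gene, this kind of analysis needs sequence data in the replication cohort rather than genotypes at a fixed list of sites, which is the trade-off mentioned above.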

WES in characterising cancer

The application of NGS technologies is allowing substantial advances in cancer genomics. Indeed, the development of massively parallel sequencing technologies makes it feasible to catalogue all classes of somatically-acquired mutations in a cancer [63–65]. It has become feasible to sequence all expressed genes (the transcriptomes) [66, 67], exomes and, more recently, complete genomes [64, 68–70] of cancer samples. WES with capillary sequencing allowed the analysis of all known coding genes in colorectal, breast and pancreatic carcinomas and glioblastoma [71–73]. These studies have led to the discovery of somatic mutations in isocitrate dehydrogenase 1 in glioblastoma [72] and of germline mutations in PALB2 (the gene encoding partner and localiser of BRCA2) in patients with pancreatic carcinoma [74], among other important findings. In addition, the hybrid selection approach will be particularly powerful for diagnostic analysis of the cancer genome; for diagnosis, there may be value in sequencing specific oncogenes and/or tumour suppressor genes at very high coverage in samples with a low percentage of tumour cells [75]. However, a major challenge of cancer genome analysis is to identify ‘driver’ mutations [65], and several recent genome studies of leukaemias, myelomas and solid tumours including breast, lung and pancreatic cancer have concentrated their analysis on coding regions (exomes) to increase the likelihood of identifying driver mutations [76–79] or used integrative genomics approaches (mapping of structural variation, whole genome methylation and gene expression analysis) in association with NGS techniques [72]. Whole exome analysis on these distinct subtypes provides a better understanding of mechanisms underlying specific cancers and also identifies new biomarkers and/or drug targets, as recently reported, for example, in individuals with acute myeloid leukaemia [68, 80, 81].

WES is thus opening new avenues towards understanding the molecular pathogenesis of cancers. For example, the discovery of DNMT3A (a gene involved in DNA methylation) mutations in acute myeloid leukaemia (AML) may imply that aberrant epigenetic regulation is critical for pathogenesis, but the exact link, whether it be altered gene expression or genome instability, has yet to be uncovered. Other key questions include what aspects of leukaemia biology can be attributed to mutations in this gene and why it is concentrated in specific AML subtypes and associated with a poor prognosis. Thus, additional genetic and/or epigenetic events can modulate this disease type and must be uncovered through further exploration of the genome and the epigenome. WGS is also being used to investigate cancers, with an initial focus on very specific subgroups in certain cancers to minimise the confounding effect of genetic heterogeneity [70].

Other innovative approaches are making use of NGS in targeted exome/genome sequencing for the design of cost-effective targeted sequencing methods to the benefit of personalised chemotherapy [82]. In another recently published study, targeted NGS detected point mutations, insertions, deletions and balanced chromosomal rearrangements and identified novel leukaemia-specific fusion genes in a single procedure combining 454 shotgun pyrosequencing with long oligonucleotide sequence capture arrays [83]. In yet another study, NGS was applied as a screening method to characterise a number of known genetic alterations in chronic myelomonocytic leukaemia and identified that a pattern of molecular mutations translated into distinct biological and prognostic categories [84].

There are unique methodological considerations in NGS analyses of cancer samples (reviewed by Meyerson et al [85]). Cancer samples and cancer genomes have general characteristics that are distinct from other tissue samples and from genomic sequences that are inherited through the germ line. Cancers themselves may be highly heterogeneous and composed of different clones that have different genomes [86]. Cancer genomes are enormously diverse and complex and have major structural variability. They vary substantially in their sequence and structure compared with normal genomes and among themselves. To identify somatic alterations in cancer, comparison with matched normal DNA from the same individual is essential. This is largely because of our incomplete knowledge of the variations in the normal human genome; to date, each ‘matched normal’ cancer genome sequence has identified large numbers of mutations and rearrangements in the germ line that had not previously been described.
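The role of the matched normal sample can be illustrated with a small classification sketch. It is a simplification under stated assumptions: variants are `(chrom, pos, ref, alt)` tuples, the `normal_depth` lookup and the minimum depth of 10 reads are hypothetical, and real somatic callers additionally model tumour purity, clonal heterogeneity and sequencing error, none of which is attempted here.

```python
def classify_tumour_variants(tumour_calls, normal_calls, normal_depth, min_normal_depth=10):
    """Label each tumour variant by comparison with the matched normal sample.

    'germline'  : also called in the normal sample
    'somatic'   : absent from the normal sample, which is well covered at the site
    'uncertain' : absent from the normal sample, but coverage there is too low to tell
    """
    labels = {}
    for variant in tumour_calls:
        if variant in normal_calls:
            labels[variant] = "germline"
        elif normal_depth.get(variant[:2], 0) >= min_normal_depth:
            labels[variant] = "somatic"
        else:
            labels[variant] = "uncertain"
    return labels


tumour_calls = {("2", 209113112, "C", "T"), ("13", 500, "G", "A"), ("17", 900, "T", "C")}
normal_calls = {("13", 500, "G", "A")}
normal_depth = {("2", 209113112): 60, ("13", 500): 45, ("17", 900): 4}

labels = classify_tumour_variants(tumour_calls, normal_calls, normal_depth)
for variant in sorted(labels):
    print(variant, labels[variant])
```

Without the matched normal, the chromosome 13 variant in this toy example would be indistinguishable from a somatic change unless it happened to be listed in a polymorphism database, which is the problem the paragraph above describes.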

Relevance for the clinical use of WES

Gene discovery is an essential starting point for both understanding the genetic mechanisms underlying diseases and for providing clues to therapeutic approaches. Gene-specific treatments are currently ongoing worldwide, and several successful gene therapy trials aim to correct inborn errors for diseases such as immune deficiencies, metabolic disorders and, more recently, thalassaemia [87–93]. Local delivery of the replacement gene is also being tested in human clinical trials for several forms of hereditary blindness such as Leber congenital amaurosis and retinitis pigmentosa [94]. Also, genetic testing for common mutations in recessive disorders such as Tay-Sachs disease has proved to be of benefit both for diagnosis and carrier detection [5]. For complex traits, understanding the genetic alterations in disease variability and resistance to treatment in a given individual could revolutionise care and may soon make the concept of personalised medicine a reality. WES is paving the way to identifying driver mutations in cancer as well as the genetic events leading to metastasis, the primary cause of cancer mortality, and which are potentially amenable to therapeutic targeting. These insights will provide improved means to prevent recurrence and to avoid therapeutic resistance. WES can also complement histopathological analysis by allowing for more accurate diagnosis and improved subgrouping of patients [95]. The clinical applications are thus enormous. With regard to personal genomics, numerous companies already use SNP arrays to offer predictions of common disease risk directly to consumers, which can influence lifestyle choices and decisions to use relatively non-invasive monitoring programmes (eg, imaging). Genome sequencing will greatly improve the specificity of such predictions and adds the ability to detect novel variants, and might lead to an expansion of fetal screening.

Why not use WGS?

WGS is being increasingly used based on its availability and improved cost efficiency. Indeed, in the future, WGS is predicted to be more economical than WES because the capture process is skipped entirely. This technique has the advantage of capturing all of the exome (as some can be missed by the exome capture process), and can provide information on variants in highly evolutionarily conserved non-coding regions and other variants throughout the genome. In addition, WGS using a paired-end approach can be used to detect large structural variants such as large insertions or deletions, inversions and translocations. As the cost of WGS continues to decrease, it will become increasingly popular because of its ability to survey most of the genome as well as additional classes of mutations. However, the amount of data generated from WGS is 100 times more than the already overwhelming amount obtained by WES. The bioinformatics filtering techniques, storage facilities, software and hardware for data analysis will prove a challenge and most ongoing projects initially focus on the exome for the first analysis. It should be noted that WGS is not immune to some of the drawbacks of exome sequencing. There is significant variability in sequencing efficiency across the genome and the fluctuations in coverage will result in many regions of interest being missed. Also, repetitive regions (exonic and others) are difficult to align in either case and can result in missed variants or an excess of variant calls. These problems may be resolved in the future with more uniform library construction, higher sequencing depth, longer reads and longer paired-end fragment sizes. However, in the foreseeable future we are likely to continue facing some gaps in genome coverage. Unless analyses specifically focus on non-coding regions or on structural variation, WES provides most of the benefits of WGS but with lower costs, both for sequencing and for storage and analysis of the data.

Some practical considerations

High-throughput genomic data analysis has traditionally been the domain of the bioinformatician and statistician. Laboratory researchers have gradually embraced genomic technologies such as microarrays, applying them as ‘hypothesis-free’ discovery tools to be followed up by focused experimentation. This type of experimentation was usually accompanied by collaboration with skilled data analysts or the development of custom analysis software. NGS data will undoubtedly become a household item in the near future. What can a researcher or a clinician embarking on a WES or WGS adventure expect to obtain from the sequencing service provider?

A number of excellent targeted sequence capture kits are commercially available, including kits from Agilent (Santa Clara, CA, USA), Illumina (San Diego, CA, USA) and Nimblegen (Madison, WI, USA). Once captured, the DNA fragments need to be sequenced. Currently, ABI (Carlsbad, CA, USA) and Illumina are the two major companies in the sequencing field, but a number of third-generation sequencers are being developed and may enter the field in the near future (table 2). After sequencing, individual sequence reads are typically aligned to the reference genome sequence. Next, positions at which the sample of interest differs from the reference genome are identified as variants. At this point a number of bioinformatic filtering steps are required to separate common benign polymorphisms from potentially deleterious mutations.

Table 2

Comparison of next-generation sequencing instruments

We believe that, while the choice of the optimal technology and analytical pipelines is important, it is secondary to the service provider's experience with the specific technology and willingness to engage in some back-and-forth dialogue with the researcher on custom analysis needs. As an example, at the McGill University and Genome Quebec Innovation Centre, we have to date processed the data from over 300 exomes. This experience is instrumental in identifying systematic false positive and false negative results. False positives most often arise from incorrect mapping and from systematic sequencing errors, for example certain words (combinations of nucleotides) being systematically misread by the sequencer. Both types of error can be removed by comparing each test sample against previously sequenced exomes: because systematic errors recur from sample to sample, calls that are present in more than a certain proportion of all sequenced samples can easily be removed from the final list of variants. False negative results can arise from low overall coverage, poor capture efficiency of certain regions and difficulty in unambiguously aligning repetitive regions. Such missing regions can easily be flagged and reported to the researcher, who may want to follow them up by targeted sequencing.
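The comparison against previously sequenced exomes can be illustrated with a short sketch. This is only an illustration under stated assumptions, not the pipeline used at the centre: the variant key format, the function names and the `max_fraction` cutoff are all hypothetical choices made for the example.

```python
from collections import Counter


def recurrence_counts(previous_exomes):
    """Count in how many previously sequenced exomes each variant was called."""
    counts = Counter()
    for variants in previous_exomes:
        counts.update(set(variants))  # count each variant once per exome
    return counts


def remove_recurrent_calls(sample_variants, previous_exomes, max_fraction=0.05):
    """Drop calls seen in more than max_fraction of the previous exomes.

    max_fraction is an assumed, tunable cutoff, not a value from the article.
    """
    if not previous_exomes:
        return list(sample_variants)
    counts = recurrence_counts(previous_exomes)
    total = len(previous_exomes)
    return [v for v in sample_variants if counts[v] / total <= max_fraction]


# Hypothetical variant keys: (chromosome, position, reference, alternate)
previous = [
    {("1", 1000, "A", "G"), ("2", 500, "C", "T")},
    {("1", 1000, "A", "G")},
    {("1", 1000, "A", "G"), ("3", 42, "G", "A")},
    {("1", 1000, "A", "G")},
]
sample = [("1", 1000, "A", "G"), ("7", 77, "T", "C")]

# The first call recurs in every previous exome, so it is treated as systematic.
kept = remove_recurrent_calls(sample, previous, max_fraction=0.5)
```

A call that recurs in most unrelated samples is more likely to be a mapping or sequencing artefact (or a common polymorphism) than a rare disease-causing mutation, which is why a simple frequency cutoff is a reasonable first filter.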

The final output for each sample is a list of variants that can be easily manipulated in a spreadsheet. In our experience, each sample produces roughly 500 potentially ‘interesting’ protein-coding variants, that is, those that have not been seen in more than 5% of other exomes. Our annotated output file contains basic information on the chromosomal position, nucleotide change, predicted protein change, gene name and gene description. Further annotation includes the Online Mendelian Inheritance in Man (OMIM) entry (if available), the Sorting Intolerant From Tolerant (SIFT)96 prediction of how likely the change is to be damaging, interspecies conservation of each residue, the dbSNP entry and the allele frequency from the 1000 Genomes Project.97 We also provide information on the sequencing quality of each variant and clickable links to the primary sequencing data visualised in the Integrative Genomics Viewer98 and to the graphic display of each position in the UCSC Genome Browser.99 The end user who is interested in a recessive disease and studies a consanguineous family, for example, can then apply simple spreadsheet filtering functions to display only homozygous changes that have never been seen before or that have very low minor allele frequencies. In most cases this will limit the final list to a manageable dozen or fewer candidate variants that can then be followed up manually.
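The spreadsheet filter for a recessive disease in a consanguineous family could equally be expressed in a few lines of code. The sketch below assumes a hypothetical annotated file: the column names (`zygosity`, `maf_1000g`), the gene labels and the 1% frequency cutoff are illustrative assumptions and do not describe the actual output format.

```python
import csv
import io

# Hypothetical annotated output; column names and values are assumptions.
ANNOTATED = """gene,chrom,pos,change,zygosity,maf_1000g
GENE_A,1,1000,p.Arg10Trp,hom,
GENE_B,2,2000,p.Gly20Ser,het,0.001
GENE_C,3,3000,p.Leu30Pro,hom,0.12
GENE_D,4,4000,p.Ala40Val,hom,0.002
"""


def candidate_recessive_variants(rows, max_maf=0.01):
    """Keep homozygous variants that are novel (no frequency) or rare."""
    kept = []
    for row in rows:
        if row["zygosity"] != "hom":
            continue  # a recessive model in a consanguineous family: homozygous only
        maf = row["maf_1000g"].strip()
        if maf == "" or float(maf) <= max_maf:
            kept.append(row)
    return kept


rows = list(csv.DictReader(io.StringIO(ANNOTATED)))
candidates = candidate_recessive_variants(rows)
candidate_genes = [row["gene"] for row in candidates]
```

Here an empty frequency field is read as "never seen before"; a real file might encode this differently, so that convention would need checking against the provider's documentation.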

Of course, as the number of samples and the complexity of a disorder increase, at some point a switch from a spreadsheet to a dedicated bioinformatician may be necessary. However, for Mendelian disorders a spreadsheet-savvy researcher should be quite successful in analysing exome sequencing results.

Some final rules of thumb: choose a friendly experienced sequencing centre; longer reads are better than shorter reads as they reduce false positives from mapping ambiguity; paired-end reads are better than single-end for the same reason; and 30× median coverage of the target may be sufficient, but 100× coverage is much safer as it ensures that variants can confidently be determined across a higher proportion of the exome.
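To make the coverage rule of thumb concrete, the sketch below summarises a list of per-base depths by its median and by the fraction of target bases that reach a minimum depth. The 10× calling threshold and the toy depth values are assumptions chosen for illustration; they are not figures from this article.

```python
from statistics import median


def coverage_summary(depths, min_depth=10):
    """Return (median depth, fraction of target bases with depth >= min_depth).

    min_depth is an assumed threshold for confident variant calling.
    """
    if not depths:
        raise ValueError("depths must not be empty")
    callable_bases = sum(1 for d in depths if d >= min_depth)
    return median(depths), callable_bases / len(depths)


# Toy per-base depths for two hypothetical samples over the same ten target bases.
shallow = [0, 4, 8, 25, 30, 30, 32, 35, 40, 60]
deep = [3, 15, 30, 80, 100, 100, 105, 110, 130, 200]

shallow_median, shallow_fraction = coverage_summary(shallow)
deep_median, deep_fraction = coverage_summary(deep)
```

Because coverage is uneven across the target, the median alone says little about the poorly captured regions; reporting the fraction of bases above a calling threshold alongside it shows how much of the exome can actually be assessed.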

Ethical issues raised by WES

The increased ability to share large amounts of individual-specific genetic information across borders puts a new twist on perennial ethical issues such as consent, feedback, protection of privacy and the governance of research.100–103 Informed consent is needed from participants in research and has been a guiding principle of medical investigation since the mid-20th century. This conception of consent, along with the concomitant power to withdraw from research without prejudice, arose originally in the context of biomedical research. It had the aim of protecting participants from abuse and from potential physical harm, and focused on clinical interventions and the collection of samples rather than on data collection per se. From its inception, informed consent was strongly concerned with the protection of individuals.

Genomics research, however, moves away from these origins on several counts. The information that is derived from DNA is a powerful personal identifier and can provide information, not just on the individual but also on the individual's relatives and ethnic groups, in a format that is easy to share across international borders. Although samples and data have personal identifiers removed, individuals may still be re-identifiable because of the richness of the data derived from the analysis. The data produced are often shared informally among researchers, but more formal mechanisms have been put in place by funders to ensure the rapid sharing of NGS data, such as the requirements to deposit data sets in open access archives.104 Examples are the European Genome-phenome Archive and dbGaP (NIH-USA). The complexity of genomics research, together with the difficulty of providing precise specifications for future use of data, has prompted serious concerns about whether any consent to such research can be adequately ‘informed’. There is a pressing need to learn from insights gained elsewhere, such as in genetic counselling and in family studies. Likewise, calls to involve the community in consent pose ethical issues about individual and group rights which may be different for communities across the globe.

Reporting findings back to the participants may be considered to be an important part of building and maintaining public trust in research.102 105–107 Providing participants with information about the general findings of research, such as publications based on the research, is an uncontroversial and welcome practice. In contrast, informing a single individual of his or her results remains controversial in many areas of research and particularly in the area of WGS. There appears to be some agreement that, where there is a serious treatable condition, researchers have a moral obligation to feed this information back to research participants.108 In cases where findings are of a less serious nature, are untreatable or of uncertain significance, the potential benefits for participants of being informed need to be balanced against the participant's right not to know. The thoughtful handling of such issues is of clear relevance to the maintenance of public trust in the research process and is the subject of ongoing studies in collaboration with some of the initiatives exploring WES/WGS in rare diseases and cancer in North America. Not surprisingly, an ethical investigation component has been added to each of these initiatives to try and assess the impact of the findings on families and to determine ways for appropriate return of information and protection of privacy.

Conclusions and future directions

Vast amounts of clinical, biological and sequencing data are now being generated by an expanding number of research efforts on a scale that could only be imagined a few years ago. Interpreting these data and translating the findings to improve healthcare is a challenge in itself. In addition to developing locus-specific databases and large data warehouses for NGS datasets, there is a major need to create dedicated databases to enhance the clinical interpretation process. Indeed, the development of analysis techniques to cope with the millions of variants called per genome will be a high priority, as will the development of techniques that can combine data about different rare variants into one analysis. The advances we have described highlight the important implications that particular mutations, discovered through NGS and WES, can have for medical management and for tailoring therapy to the genetic background of a given individual in a vast array of diseases. Furthermore, definitive connections (for example, a clearly functional mutation in a single gene conferring a strongly increased risk of a disease) would provide validated therapeutic targets for the pharmaceutical industry, and genetic discovery could be the most likely avenue for ameliorating the ongoing crisis in global drug development. This prediction assumes that rare variants will be found that have large influences on rare and common diseases, that their biological functions will be obvious and that locus and allelic heterogeneity will not prevent insights into the mechanisms of disease. How often these assumptions will hold is currently unknown and will largely determine the rate of discovery in the coming years.

References

Watson JD, Crick FH. The structure of DNA. Cold Spring Harb Symp Quant Biol 1953;18:123–31.
Watson JD, Crick FH. Molecular structure of nucleic acids; a structure for deoxyribose nucleic acid. Nature 1953;171:737–8.
Inoue T, Orgel LE. A nonenzymatic RNA polymerase model. Science 1983;219:859–62.
Maxam AM, Gilbert W. A new method for sequencing DNA. Proc Natl Acad Sci U S A 1977;74:560–4.
Sanger F, Nicklen S, Coulson AR. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A 1977;74:5463–7.
Sachidanandam R, Weissman D, Schmidt SC, Kakol JM, Stein LD, Marth G, Sherry S, Mullikin JC, Mortimore BJ, Willey DL, Hunt SE, Cole CG, Coggill PC, Rice CM, Ning Z, Rogers J, Bentley DR, Kwok PY, Mardis ER, Yeh RT, Schultz B, Cook L, Davenport R, Dante M, Fulton L, Hillier L, Waterston RH, McPherson JD, Gilman B, Schaffner S, Van Etten WJ, Reich D, Higgins J, Daly MJ, Blumenstiel B, Baldwin J, Stange-Thomann N, Zody MC, Linton L, Lander ES, Altshuler D. A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature 2001;409:928–33.
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann N, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la Bastide M, Dedhia N, Blocker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjakob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowski J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Collins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, de Jong P, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ. Initial sequencing and analysis of the human genome. Nature 2001;409:860–921.
McPherson JD, Marra M, Hillier L, Waterston RH, Chinwalla A, Wallis J, Sekhon M, Wylie K, Mardis ER, Wilson RK, Fulton R, Kucaba TA, Wagner-McPherson C, Barbazuk WB, Gregory SG, Humphray SJ, French L, Evans RS, Bethel G, Whittaker A, Holden JL, McCann OT, Dunham A, Soderlund C, Scott CE, Bentley DR, Schuler G, Chen HC, Jang W, Green ED, Idol JR, Maduro VV, Montgomery KT, Lee E, Miller A, Emerling S, Kucherlapati Gibbs R, Scherer S, Gorrell JH, Sodergren E, Clerc-Blankenburg K, Tabor P, Naylor S, Garcia D, de Jong PJ, Catanese JJ, Nowak N, Osoegawa K, Qin S, Rowen L, Madan A, Dors M, Hood L, Trask B, Friedman C, Massa H, Cheung VG, Kirsch IR, Reid T, Yonescu R, Weissenbach J, Bruls T, Heilig R, Branscomb E, Olsen A, Doggett N, Cheng JF, Hawkins T, Myers RM, Shang J, Ramirez L, Schmutz J, Velasquez O, Dixon K, Stone NE, Cox DR, Haussler D, Kent WJ, Furey T, Rogic S, Kennedy S, Jones S, Rosenthal A, Wen G, Schilhabel M, Gloeckner G, Nyakatura G, Siebert R, Schlegelberger B, Korenberg J, Chen XN, Fujiyama A, Hattori M, Toyoda A, Yada T, Park HS, Sakaki Y, Shimizu N, Asakawa S, Kawasaki K, Sasaki T, Shintani A, Shimizu A, Shibuya K, Kudoh J, Minoshima S, Ramser J, Seranski P, Hoff C, Poustka A, Reinhardt R, Lehrach H. A physical map of the human genome. Nature 2001;409:934–41.
Ng SB, Nickerson DA, Bamshad MJ, Shendure J. Massively parallel sequencing and rare disease. Hum Mol Genet 2010;19:R119–24.
Ng SB, Turner EH, Robertson PD, Flygare SD, Bigham AW, Lee C, Shaffer T, Wong M, Bhattacharjee A, Eichler EE, Bamshad M, Nickerson DA, Shendure J. Targeted capture and massively parallel sequencing of 12 human exomes. Nature 2009;461:272–6.
Shendure J, Ji H. Next-generation DNA sequencing. Nat Biotechnol 2008;26:1135–45.
Turner EH, Lee C, Ng SB, Nickerson DA, Shendure J. Massively parallel exon capture and library-free resequencing across 16 genomes. Nat Methods 2009;6:315–16.
Albert TJ, Molla MN, Muzny DM, Nazareth L, Wheeler D, Song X, Richmond TA, Middle CM, Rodesch MJ, Packard CJ, Weinstock GM, Gibbs RA. Direct selection of human genomic loci by microarray hybridization. Nat Methods 2007;4:903–5.
Gnirke A, Melnikov A, Maguire J, Rogov P, LeProust EM, Brockman W, Fennell T, Giannoukos G, Fisher S, Russ C, Gabriel S, Jaffe DB, Lander ES, Nusbaum C. Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing. Nat Biotechnol 2009;27:182–9.
Hodges E, Xuan Z, Balija V, Kramer M, Molla MN, Smith SW, Middle CM, Rodesch MJ, Albert TJ, Hannon GJ, McCombie WR. Genome-wide in situ exon capture for selective resequencing. Nat Genet 2007;39:1522–7.
Levin JZ, Berger MF, Adiconis X, Rogov P, Melnikov A, Fennell T, Nusbaum C, Garraway LA, Gnirke A. Targeted next-generation sequencing of a cancer transcriptome enhances detection of sequence variants and novel fusion transcripts. Genome Biol 2009;10:R115.
Bainbridge MN, Wang M, Burgess DL, Kovar C, Rodesch MJ, D'Ascenzo M, Kitzman J, Wu YQ, Newsham I, Richmond TA, Jeddeloh JA, Muzny D, Albert TJ, Gibbs RA. Whole exome capture in solution with 3 Gbp of data. Genome Biol 2010;11:R62.
Voelkerding KV, Dames SA, Durtschi JD. Next-generation sequencing: from basic research to diagnostics. Clin Chem 2009;55:641–58.
Koboldt DC, Ding L, Mardis ER, Wilson RK. Challenges of sequencing human genomes. Brief Bioinform 2010;11:484–98.
Samuels ME. Saturation of the human phenome. Curr Genomics 2010;11:482–99.
Davies MA, Samuels Y. Analysis of the genome to personalize therapy for melanoma. Oncogene 2010;29:5545–55.
Laurier V, Stoetzel C, Muller J, Thibault C, Corbani S, Jalkh N, Salem N, Chouery E, Poch O, Licaire S, Danse JM, Amati-Bonneau P, Bonneau D, Mégarbané A, Mandel JL, Dollfus H. Pitfalls of homozygosity mapping: an extended consanguineous Bardet-Biedl syndrome family with two mutant genes (BBS2, BBS10), three mutations, but no triallelism. Eur J Hum Genet 2006;14:1195–203.
Nishimura DY, Swiderski RE, Searby CC, Berg EM, Ferguson AL, Hennekam R, Merin S, Weleber RG, Biesecker LG, Stone EM, Sheffield VC. Comparative genomics and gene expression analysis identifies BBS9, a new Bardet-Biedl syndrome gene. Am J Hum Genet 2005;77:1021–33.
Paisán-Ruiz C, Scopes G, Lee P, Houlden H. Homozygosity mapping through whole genome analysis identifies a COL18A1 mutation in an Indian family presenting with an autosomal recessive neurological disorder. Am J Med Genet B Neuropsychiatr Genet 2009;150B:993–7.
Strauss KA, Puffenberger EG, Craig DW, Panganiban CB, Lee AM, Hu-Lince D, Stephan DA, Morton DH. Genome-wide SNP arrays as a diagnostic tool: clinical description, genetic mapping, and molecular characterization of Salla disease in an Old Order Mennonite population. Am J Med Genet A 2005;138A:262–7.
Kuhlenbäumer G, Hullmann J, Appenzeller S. Novel genomic techniques open new avenues in the analysis of monogenic disorders. Hum Mutat 2011;32:144–51.
Chen JM, Férec C, Cooper DN. Revealing the human mutome. Clin Genet 2010;78:310–20.
Patel K, Larson C, Hargreaves M, Schlundt D, Wang H, Jones C, Beard K. Community screening outcomes for diabetes, hypertension, and cholesterol: Nashville REACH 2010 project. J Ambul Care Manage 2010;33:155–62.
Hoischen A, van Bon BW, Gilissen C, Arts P, van Lier B, Steehouwer M, de Vries P, de Reuver R, Wieskamp N, Mortier G, Devriendt K, Amorim MZ, Revencu N, Kidd A, Barbosa M, Turner A, Smith J, Oley C, Henderson A, Hayes IM, Thompson EM, Brunner HG, de Vries BB, Veltman JA. De novo mutations of SETBP1 cause Schinzel-Giedion syndrome. Nat Genet 2010;42:483–5.
Ng SB, Bigham AW, Buckingham KJ, Hannibal MC, McMillin MJ, Gildersleeve HI, Beck AE, Tabor HK, Cooper GM, Mefford HC, Lee C, Turner EH, Smith JD, Rieder MJ, Yoshiura K, Matsumoto N, Ohta T, Niikawa N, Nickerson DA, Bamshad MJ, Shendure J. Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome. Nat Genet 2010;42:790–3.
Pierce SB, Walsh T, Chisholm KM, Lee MK, Thornton AM, Fiumara A, Opitz JM, Levy-Lahad E, Klevit RE, King MC. Mutations in the DBP-deficiency protein HSD17B4 cause ovarian dysgenesis, hearing loss, and ataxia of Perrault syndrome. Am J Hum Genet 2010;87:282–8.
Choi M, Scholl UI, Ji W, Liu T, Tikhonova IR, Zumbo P, Nayir A, Bakkaloğlu A, Ozen S, Sanjad S, Nelson-Williams C, Farhi A, Mane S, Lifton RP. Genetic diagnosis by whole exome capture and massively parallel DNA sequencing. Proc Natl Acad Sci U S A 2009;106:19096–101.
Lalonde E, Albrecht S, Ha KC, Jacob K, Bolduc N, Polychronakos C, Dechelotte P, Majewski J, Jabado N. Unexpected allelic heterogeneity and spectrum of mutations in Fowler syndrome revealed by next-generation exome sequencing. Hum Mutat 2010;31:918–23.
Gilissen C, Arts HH, Hoischen A, Spruijt L, Mans DA, Arts P, van Lier B, Steehouwer M, van Reeuwijk J, Kant SG, Roepman R, Knoers NV, Veltman JA, Brunner HG. Exome sequencing identifies WDR35 variants involved in Sensenbrenner syndrome. Am J Hum Genet 2010;87:418–23.
Ng SB, Buckingham KJ, Lee C, Bigham AW, Tabor HK, Dent KM, Huff CD, Shannon PT, Jabs EW, Nickerson DA, Shendure J, Bamshad MJ. Exome sequencing identifies the cause of a Mendelian disorder. Nat Genet 2010;42:30–5.
Johnson JO, Gibbs JR, Van Maldergem L, Houlden H, Singleton AB. Exome sequencing in Brown-Vialetto-van Laere syndrome. Am J Hum Genet 2010;87:567–9; author reply 569–70.
Musunuru K, Pirruccello JP, Do R, Peloso GM, Guiducci C, Sougnez C, Garimella KV, Fisher S, Abreu J, Barry AJ, Fennell T, Banks E, Ambrogio L, Cibulskis K, Kernytsky A, Gonzalez E, Rudzicz N, Engert JC, DePristo MA, Daly MJ, Cohen JC, Hobbs HH, Altshuler D, Schonfeld G, Gabriel SB, Yue P, Kathiresan S. Exome sequencing, ANGPTL3 mutations, and familial combined hypolipidemia. N Engl J Med 2010;363:2220–7.
Krawitz PM, Schweiger MR, Rödelsperger C, Marcelis C, Kölsch U, Meisel C, Stephani F, Kinoshita T, Murakami Y, Bauer S, Isau M, Fischer A, Dahl A, Kerick M, Hecht J, Köhler S, Jäger M, Grünhagen J, de Condor BJ, Doelken S, Brunner HG, Meinecke P, Passarge E, Thompson MD, Cole DE, Horn D, Roscioli T, Mundlos S, Robinson PN. Identity-by-descent filtering of exome sequence data identifies PIGV mutations in hyperphosphatasia mental retardation syndrome. Nat Genet 2010;42:827–9.
Anastasio N, Ben-Omran T, Teebi A, Ha KC, Lalonde E, Ali R, Almureikhi M, Der Kaloustian VM, Liu J, Rosenblatt DS, Majewski J, Jerome-Majewska LA. Mutations in SCARF2 are responsible for Van Den Ende-Gupta syndrome. Am J Hum Genet 2010;87:553–9.
Erlich Y, Edvardson S, Hodges E, Zenvirt S, Thekkat P, Shaag A, Dor T, Hannon GJ, Elpeleg O. Exome sequencing and disease-network analysis of a single family implicate a mutation in KIF1A in hereditary spastic paraparesis. Genome Res 2011;21:658–64.
Glazov EA, Zankl A, Donskoi M, Kenna TJ, Thomas GP, Clark GR, Duncan EL, Brown MA. Whole-exome re-sequencing in a family quartet identifies POP1 mutations as the cause of a novel skeletal dysplasia. PLoS Genet 2011;7:e1002027.
Tsurusaki Y, Osaka H, Hamanoue H, Shimbo H, Tsuji M, Doi H, Saitsu H, Matsumoto N, Miyake N. Rapid detection of a mutation causing X-linked leucoencephalopathy by exome sequencing. J Med Genet 2011;48:606–9.
Bolze A, Byun M, McDonald D, Morgan NV, Abhyankar A, Premkumar L, Puel A, Bacon CM, Rieux-Laucat F, Pang K, Britland A, Abel L, Cant A, Maher ER, Riedl SJ, Hambleton S, Casanova JL. Whole-exome-sequencing-based discovery of human FADD deficiency. Am J Hum Genet 2010;87:873–81.
Caliskan M, Chong JX, Uricchio L, Anderson R, Chen P, Sougnez C, Garimella K, Gabriel SB, dePristo MA, Shakir K, Matern D, Das S, Waggoner D, Nicolae DL, Ober C. Exome sequencing reveals a novel mutation for autosomal recessive non-syndromic mental retardation in the TECR gene on chromosome 19p13. Hum Mol Genet 2011;20:1285–9.
Walsh T, Shahin H, Elkan-Miller T, Lee MK, Thornton AM, Roeb W, Abu Rayyan A, Loulus S, Avraham KB, King MC, Kanaan M. Whole exome sequencing and homozygosity mapping identify mutation in the cell polarity protein GPSM2 as the cause of nonsyndromic hearing loss DFNB82. Am J Hum Genet 2010;87:90–4.
Johnson JO, Mandrioli J, Benatar M, Abramzon Y, Van Deerlin VM, Trojanowski JQ, Gibbs JR, Brunetti M, Gronka S, Wuu J, Ding J, McCluskey L, Martinez-Lage M, Falcone D, Hernandez DG, Arepalli S, Chong S, Schymick JC, Rothstein J, Landi F, Wang YD, Calvo A, Mora G, Sabatelli M, Monsurrò MR, Battistini S, Salvi F, Spataro R, Sola P, Borghero G, Galassi G, Scholz SW, Taylor JP, Restagno G, Chiò A, Traynor BJ; ITALSGEN Consortium. Exome sequencing reveals VCP mutations as a cause of familial ALS. Neuron 2010;68:857–64.
Wang JL, Yang X, Xia K, Hu ZM, Weng L, Jin X, Jiang H, Zhang P, Shen L, Guo JF, Li N, Li YR, Lei LF, Zhou J, Du J, Zhou YF, Pan Q, Wang J, Wang J, Li RQ, Tang BS. TGM6 identified as a novel causative gene of spinocerebellar ataxias using exome sequencing. Brain 2010;133:3510–18.
Isidor B, Lindenbaum P, Pichon O, Bézieau S, Dina C, Jacquemont S, Martin-Coignard D, Thauvin-Robinet C, Le Merrer M, Mandel JL, David A, Faivre L, Cormier-Daire V, Redon R, Le Caignec C. Truncating mutations in the last exon of NOTCH2 cause a rare skeletal disorder with osteoporosis. Nat Genet 2011;43:306–8.
Simpson MA, Irving MD, Asilmaz E, Gray MJ, Dafou D, Elmslie FV, Mansour S, Holder SE, Brain CE, Burton BK, Kim KH, Pauli RM, Aftimos S, Stewart H, Kim CA, Holder-Espinasse M, Robertson SP, Drake WM, Trembath RC. Mutations in NOTCH2 cause Hajdu-Cheney syndrome, a disorder of severe and progressive bone loss. Nat Genet 2011;43:303–5.
Sobreira NL, Cirulli ET, Avramopoulos D, Wohler E, Oswald GL, Stevens EL, Ge D, Shianna KV, Smith JP, Maia JM, Gumbs CE, Pevsner J, Thomas G, Valle D, Hoover-Fong JE, Goldstein DB. Whole-genome sequencing of a single proband together with linkage analysis identifies a Mendelian disease gene. PLoS Genet 2010;6:e1000991.
Lupski JR, Reid JG, Gonzaga-Jauregui C, Rio Deiros D, Chen DC, Nazareth L, Bainbridge M, Dinh H, Jing C, Wheeler DA, McGuire AL, Zhang F, Stankiewicz P, Halperin JJ, Yang C, Gehman C, Guo D, Irikat RK, Tom W, Fantin NJ, Muzny DM, Gibbs RA. Whole-genome sequencing in a patient with Charcot-Marie-Tooth neuropathy. N Engl J Med 2010;362:1181–91.
Notarangelo LD, Giliani S, Mazza C, Mella P, Savoldi G, Rodriguez-Pérez C, Mazzolari E, Fiorini M, Duse M, Plebani A, Ugazio AG, Vihinen M, Candotti F, Schumacher RF. Of genes and phenotypes: the immunological and molecular spectrum of combined immune deficiency. Defects of the gamma(c)-JAK3 signaling pathway as a model. Immunol Rev 2000;178:39–48.
↵
1. de Villartay JP,
2. Lim A,
3. Al-Mousa H,
4. Dupont S,
5. Déchanet-Merville J,
6. Coumau-Gatbois E,
7. Gougeon ML,
8. Lemainque A,
9. Eidenschenk C,
10. Jouanguy E,
11. Abel L,
12. Casanova JL,
13. Fischer A,
14. Le Deist F
. A novel immunodeficiency associated with hypomorphic RAG1 mutations and CMV infection. J Clin Invest 2005;115:3291–9.
OpenUrl CrossRef PubMed Web of Science
↵
1. McCusker C,
2. Hotte S,
3. Le Deist F,
4. Hirschfeld AF,
5. Mitchell D,
6. Nguyen VH,
7. Gagnon R,
8. Mazer B,
9. Turvey SE,
10. Jabado N
. Relative CD4 lymphopenia and a skewed memory phenotype are the main immunologic abnormalities in a child with Omenn syndrome due to homozygous RAG1-C2633T hypomorphic mutation. Clin Immunol 2009;131:447–55.
OpenUrl CrossRef PubMed
↵
1. Corneo B,
2. Moshous D,
3. Gungor T,
4. Wulffraat N,
5. Philippet P,
6. Le Deist FL,
7. Fischer A,
8. de Villartay JP
. Identical mutations in RAG1 or RAG2 genes leading to defective V(D)J recombinase activity can cause either T-B-severe combined immune deficiency or Omenn syndrome. Blood 2001;97:2772–6.
OpenUrl Abstract/FREE Full Text
↵
1. Vissers LE,
2. de Ligt J,
3. Gilissen C,
4. Janssen I,
5. Steehouwer M,
6. de Vries P,
7. van Lier B,
8. Arts P,
9. Wieskamp N,
10. del Rosario M,
11. van Bon BW,
12. Hoischen A,
13. de Vries BB,
14. Brunner HG,
15. Veltman JA
. A de novo paradigm for mental retardation. Nat Genet 2010;42:1109–12.
OpenUrl CrossRef PubMed Web of Science
↵
1. Cirulli ET,
2. Goldstein DB
. Uncovering the roles of rare variants in common disease through whole-genome sequencing. Nat Rev Genet 2010;11:415–25.
OpenUrl CrossRef PubMed Web of Science
↵
1. Wang K,
2. Li M,
3. Hakonarson H
. Analysing biological pathways in genome-wide association studies. Nat Rev Genet 2010;11:843–54.
OpenUrl CrossRef PubMed Web of Science
↵
1. Hakonarson H,
2. Qu HQ,
3. Bradfield JP,
4. Marchand L,
5. Kim CE,
6. Glessner JT,
7. Grabs R,
8. Casalunovo T,
9. Taback SP,
10. Frackelton EC,
11. Eckert AW,
12. Annaiah K,
13. Lawson ML,
14. Otieno FG,
15. Santa E,
16. Shaner JL,
17. Smith RM,
18. Onyiah CC,
19. Skraban R,
20. Chiavacci RM,
21. Robinson LJ,
22. Stanley CA,
23. Kirsch SE,
24. Devoto M,
25. Monos DS,
26. Grant SF,
27. Polychronakos C
. A novel susceptibility locus for type 1 diabetes on Chr12q13 identified by a genome-wide association study. Diabetes 2008;57:1143–6.
OpenUrl Abstract/FREE Full Text
↵
1. Maher B
. Personal genomes: the case of the missing heritability. Nature 2008;456:18–21.
OpenUrl PubMed Web of Science
↵
1. Stankiewicz P,
2. Lupski JR
. Structural variation in the human genome and its role in disease. Annu Rev Med 2010;61:437–55.
OpenUrl CrossRef PubMed Web of Science
↵
1. Dickson SP,
2. Wang K,
3. Krantz I,
4. Hakonarson H,
5. Goldstein DB
. Rare variants create synthetic genome-wide associations. PLoS Biol 2010;8:e1000294.
OpenUrl CrossRef PubMed
↵
1. Ley TJ,
2. Mardis ER,
3. Ding L,
4. Fulton B,
5. McLellan MD,
6. Chen K,
7. Dooling D,
8. Dunford-Shore BH,
9. McGrath S,
10. Hickenbotham M,
11. Cook L,
12. Abbott R,
13. Larson DE,
14. Koboldt DC,
15. Pohl C,
16. Smith S,
17. Hawkins A,
18. Abbott S,
19. Locke D,
20. Hillier LW,
21. Miner T,
22. Fulton L,
23. Magrini V,
24. Wylie T,
25. Glasscock J,
26. Conyers J,
27. Sander N,
28. Shi X,
29. Osborne JR,
30. Minx P,
31. Gordon D,
32. Chinwalla A,
33. Zhao Y,
34. Ries RE,
35. Payton JE,
36. Westervelt P,
37. Tomasson MH,
38. Watson M,
39. Baty J,
40. Ivanovich J,
41. Heath S,
42. Shannon WD,
43. Nagarajan R,
44. Walter MJ,
45. Link DC,
46. Graubert TA,
47. DiPersio JF,
48. Wilson RK
. DNA sequencing of a cytogenetically normal acute myeloid leukaemia genome. Nature 2008;456:66–72.
OpenUrl CrossRef PubMed Web of Science
↵
1. Pleasance ED,
2. Cheetham RK,
3. Stephens PJ,
4. McBride DJ,
5. Humphray SJ,
6. Greenman CD,
7. Varela I,
8. Lin ML,
9. Ordonez GR,
10. Bignell GR,
11. Ye K,
12. Alipaz J,
13. Bauer MJ,
14. Beare D,
15. Butler A,
16. Carter RJ,
17. Chen L,
18. Cox AJ,
19. Edkins S,
20. Kokko-Gonzales PI,
21. Gormley NA,
22. Grocock RJ,
23. Haudenschild CD,
24. Hims MM,
25. James T,
26. Jia M,
27. Kingsbury Z,
28. Leroy C,
29. Marshall J,
30. Menzies A,
31. Mudie LJ,
32. Ning Z,
33. Royce T,
34. Schulz-Trieglaff OB,
35. Spiridou A,
36. Stebbings LA,
37. Szajkowski L,
38. Teague J,
39. Williamson D,
40. Chin L,
41. Ross MT,
42. Campbell PJ,
43. Bentley DR,
44. Futreal PA,
45. Stratton MR
. A comprehensive catalogue of somatic mutations from a human cancer genome. Nature 2010;463:191–6.
OpenUrl CrossRef PubMed Web of Science
↵
1. Stratton MR,
2. Campbell PJ,
3. Futreal PA
. The cancer genome. Nature 2009;458:719–24.
OpenUrl CrossRef PubMed Web of Science
↵
1. Maher CA,
2. Kumar-Sinha C,
3. Cao X,
4. Kalyana-Sundaram S,
5. Han B,
6. Jing X,
7. Sam L,
8. Barrette T,
9. Palanisamy N,
10. Chinnaiyan AM
. Transcriptome sequencing to detect gene fusions in cancer. Nature 2009;458:97–101.
OpenUrl CrossRef PubMed Web of Science
↵
1. Maher CA,
2. Palanisamy N,
3. Brenner JC,
4. Cao X,
5. Kalyana-Sundaram S,
6. Luo S,
7. Khrebtukova I,
8. Barrette TR,
9. Grasso C,
10. Yu J,
11. Lonigro RJ,
12. Schroth G,
13. Kumar-Sinha C,
14. Chinnaiyan AM
. Chimeric transcript discovery by paired-end transcriptome sequencing. Proc Natl Acad Sci U S A 2009;106:12353–8.
OpenUrl Abstract/FREE Full Text
↵
1. Mardis ER,
2. Ding L,
3. Dooling DJ,
4. Larson DE,
5. McLellan MD,
6. Chen K,
7. Koboldt DC,
8. Fulton RS,
9. Delehaunty KD,
10. McGrath SD,
11. Fulton LA,
12. Locke DP,
13. Magrini VJ,
14. Abbott RM,
15. Vickery TL,
16. Reed JS,
17. Robinson JS,
18. Wylie T,
19. Smith SM,
20. Carmichael L,
21. Eldred JM,
22. Harris CC,
23. Walker J,
24. Peck JB,
25. Du F,
26. Dukes AF,
27. Sanderson GE,
28. Brummett AM,
29. Clark E,
30. McMichael JF,
31. Meyer RJ,
32. Schindler JK,
33. Pohl CS,
34. Wallis JW,
35. Shi X,
36. Lin L,
37. Schmidt H,
38. Tang Y,
39. Haipek C,
40. Wiechert ME,
41. Ivy JV,
42. Kalicki J,
43. Elliott G,
44. Ries RE,
45. Payton JE,
46. Westervelt P,
47. Tomasson MH,
48. Watson MA,
49. Baty J,
50. Heath S,
51. Shannon WD,
52. Nagarajan R,
53. Link DC,
54. Walter MJ,
55. Graubert TA,
56. DiPersio JF,
57. Wilson RK,
58. Ley TJ
. Recurring mutations found by sequencing an acute myeloid leukemia genome. N Engl J Med 2009;361:1058–66.
OpenUrl CrossRef PubMed Web of Science
↵
1. Pleasance ED,
2. Stephens PJ,
3. O'Meara S,
4. McBride DJ,
5. Meynert A,
6. Jones D,
7. Lin ML,
8. Beare D,
9. Lau KW,
10. Greenman C,
11. Varela I,
12. Nik-Zainal S,
13. Davies HR,
14. Ordonez GR,
15. Mudie LJ,
16. Latimer C,
17. Edkins S,
18. Stebbings L,
19. Chen L,
20. Jia M,
21. Leroy C,
22. Marshall J,
23. Menzies A,
24. Butler A,
25. Teague JW,
26. Mangion J,
27. Sun YA,
28. McLaughlin SF,
29. Peckham HE,
30. Tsung EF,
31. Costa GL,
32. Lee CC,
33. Minna JD,
34. Gazdar A,
35. Birney E,
36. Rhodes MD,
37. McKernan KJ,
38. Stratton MR,
39. Futreal PA,
40. Campbell PJ
. A small-cell lung cancer genome with complex signatures of tobacco exposure. Nature 2010;463:184–90.
OpenUrl CrossRef PubMed Web of Science
↵
1. Lee W,
2. Jiang Z,
3. Liu J,
4. Haverty PM,
5. Guan Y,
6. Stinson J,
7. Yue P,
8. Zhang Y,
9. Pant KP,
10. Bhatt D,
11. Ha C,
12. Johnson S,
13. Kennemer MI,
14. Mohan S,
15. Nazarenko I,
16. Watanabe C,
17. Sparks AB,
18. Shames DS,
19. Gentleman R,
20. de Sauvage FJ,
21. Stern H,
22. Pandita A,
23. Ballinger DG,
24. Drmanac R,
25. Modrusan Z,
26. Seshagiri S,
27. Zhang Z
. The mutation spectrum revealed by paired genome sequences from a lung cancer patient. Nature 2010;465:473–7.
OpenUrl CrossRef PubMed Web of Science
↵
1. Jones S,
2. Zhang X,
3. Parsons DW,
4. Lin JC,
5. Leary RJ,
6. Angenendt P,
7. Mankoo P,
8. Carter H,
9. Kamiyama H,
10. Jimeno A,
11. Hong SM,
12. Fu B,
13. Lin MT,
14. Calhoun ES,
15. Kamiyama M,
16. Walter K,
17. Nikolskaya T,
18. Nikolsky Y,
19. Hartigan J,
20. Smith DR,
21. Hidalgo M,
22. Leach SD,
23. Klein AP,
24. Jaffee EM,
25. Goggins M,
26. Maitra A,
27. Iacobuzio-Donahue C,
28. Eshleman JR,
29. Kern SE,
30. Hruban RH,
31. Karchin R,
32. Papadopoulos N,
33. Parmigiani G,
34. Vogelstein B,
35. Velculescu VE,
36. Kinzler KW
. Core signaling pathways in human pancreatic cancers revealed by global genomic analyses. Science 2008;321:1801–6.
OpenUrl Abstract/FREE Full Text
↵
1. Parsons DW,
2. Jones S,
3. Zhang X,
4. Lin JC,
5. Leary RJ,
6. Angenendt P,
7. Mankoo P,
8. Carter H,
9. Siu IM,
10. Gallia GL,
11. Olivi A,
12. McLendon R,
13. Rasheed BA,
14. Keir S,
15. Nikolskaya T,
16. Nikolsky Y,
17. Busam DA,
18. Tekleab H,
19. Diaz LA Jr.,
20. Hartigan J,
21. Smith DR,
22. Strausberg RL,
23. Marie SK,
24. Shinjo SM,
25. Yan H,
26. Riggins GJ,
27. Bigner DD,
28. Karchin R,
29. Papadopoulos N,
30. Parmigiani G,
31. Vogelstein B,
32. Velculescu VE,
33. Kinzler KW
. An integrated genomic analysis of human glioblastoma multiforme. Science 2008;321:1807–12.
OpenUrl Abstract/FREE Full Text
↵
1. Sjöblom T,
2. Jones S,
3. Wood LD,
4. Parsons DW,
5. Lin J,
6. Barber TD,
7. Mandelker D,
8. Leary RJ,
9. Ptak J,
10. Silliman N,
11. Szabo S,
12. Buckhaults P,
13. Farrell C,
14. Meeh P,
15. Markowitz SD,
16. Willis J,
17. Dawson D,
18. Willson JK,
19. Gazdar AF,
20. Hartigan J,
21. Wu L,
22. Liu C,
23. Parmigiani G,
24. Park BH,
25. Bachman KE,
26. Papadopoulos N,
27. Vogelstein B,
28. Kinzler KW,
29. Velculescu VE
. The consensus coding sequences of human breast and colorectal cancers. Science 2006;314:268–74.
OpenUrl Abstract/FREE Full Text
↵
1. Jones S,
2. Hruban RH,
3. Kamiyama M,
4. Borges M,
5. Zhang X,
6. Parsons DW,
7. Lin JC,
8. Palmisano E,
9. Brune K,
10. Jaffee EM,
11. Iacobuzio-Donahue CA,
12. Maitra A,
13. Parmigiani G,
14. Kern SE,
15. Velculescu VE,
16. Kinzler KW,
17. Vogelstein B,
18. Eshleman JR,
19. Goggins M,
20. Klein AP
. Exomic sequencing identifies PALB2 as a pancreatic cancer susceptibility gene. Science 2009;324:217.
OpenUrl Abstract/FREE Full Text
↵
1. Thomas RK,
2. Nickerson E,
3. Simons JF,
4. Jänne PA,
5. Tengs T,
6. Yuza Y,
7. Garraway LA,
8. LaFramboise T,
9. Lee JC,
10. Shah K,
11. O'Neill K,
12. Sasaki H,
13. Lindeman N,
14. Wong KK,
15. Borras AM,
16. Gutmann EJ,
17. Dragnev KH,
18. DeBiasi R,
19. Chen TH,
20. Glatt KA,
21. Greulich H,
22. Desany B,
23. Lubeski CK,
24. Brockman W,
25. Alvarez P,
26. Hutchison SK,
27. Leamon JH,
28. Ronan MT,
29. Turenchalk GS,
30. Egholm M,
31. Sellers WR,
32. Rothberg JM,
33. Meyerson M
. Sensitive mutation detection in heterogeneous cancer specimens by massively parallel picoliter reactor sequencing. Nat Med 2006;12:852–5.
OpenUrl CrossRef PubMed Web of Science
↵
1. Bignell GR,
2. Greenman CD,
3. Davies H,
4. Butler AP,
5. Edkins S,
6. Andrews JM,
7. Buck G,
8. Chen L,
9. Beare D,
10. Latimer C,
11. Widaa S,
12. Hinton J,
13. Fahey C,
14. Fu B,
15. Swamy S,
16. Dalgliesh GL,
17. Teh BT,
18. Deloukas P,
19. Yang F,
20. Campbell PJ,
21. Futreal PA,
22. Stratton MR
. Signatures of mutation and selection in the cancer genome. Nature 2010;463:893–8.
OpenUrl CrossRef PubMed Web of Science
↵
1. Dalgliesh GL,
2. Furge K,
3. Greenman C,
4. Chen L,
5. Bignell G,
6. Butler A,
7. Davies H,
8. Edkins S,
9. Hardy C,
10. Latimer C,
11. Teague J,
12. Andrews J,
13. Barthorpe S,
14. Beare D,
15. Buck G,
16. Campbell PJ,
17. Forbes S,
18. Jia M,
19. Jones D,
20. Knott H,
21. Kok CY,
22. Lau KW,
23. Leroy C,
24. Lin ML,
25. McBride DJ,
26. Maddison M,
27. Maguire S,
28. McLay K,
29. Menzies A,
30. Mironenko T,
31. Mulderrig L,
32. Mudie L,
33. O'Meara S,
34. Pleasance E,
35. Rajasingham A,
36. Shepherd R,
37. Smith R,
38. Stebbings L,
39. Stephens P,
40. Tang G,
41. Tarpey PS,
42. Turrell K,
43. Dykema KJ,
44. Khoo SK,
45. Petillo D,
46. Wondergem B,
47. Anema J,
48. Kahnoski RJ,
49. Teh BT,
50. Stratton MR,
51. Futreal PA
. Systematic sequencing of renal carcinoma reveals inactivation of histone modifying genes. Nature 2010;463:360–3.
OpenUrl CrossRef PubMed Web of Science
↵
1. Morin RD,
2. Johnson NA,
3. Severson TM,
4. Mungall AJ,
5. An J,
6. Goya R,
7. Paul JE,
8. Boyle M,
9. Woolcock BW,
10. Kuchenbauer F,
11. Yap D,
12. Humphries RK,
13. Griffith OL,
14. Shah S,
15. Zhu H,
16. Kimbara M,
17. Shashkin P,
18. Charlot JF,
19. Tcherpakov M,
20. Corbett R,
21. Tam A,
22. Varhol R,
23. Smailus D,
24. Moksa M,
25. Zhao Y,
26. Delaney A,
27. Qian H,
28. Birol I,
29. Schein J,
30. Moore R,
31. Holt R,
32. Horsman DE,
33. Connors JM,
34. Jones S,
35. Aparicio S,
36. Hirst M,
37. Gascoyne RD,
38. Marra MA
. Somatic mutations altering EZH2 (Tyr641) in follicular and diffuse large B-cell lymphomas of germinal-center origin. Nat Genet 2010;42:181–5.
OpenUrl CrossRef PubMed Web of Science
↵
1. Purow B,
2. Schiff D
. Advances in the genetics of glioblastoma: are we reaching critical mass? Nat Rev Neurol 2009;5:419–26.
OpenUrl CrossRef PubMed Web of Science
↵
1. Yan XJ,
2. Xu J,
3. Gu ZH,
4. Pan CM,
5. Lu G,
6. Shen Y,
7. Shi JY,
8. Zhu YM,
9. Tang L,
10. Zhang XW,
11. Liang WX,
12. Mi JQ,
13. Song HD,
14. Li KQ,
15. Chen Z,
16. Chen SJ
. Exome sequencing identifies somatic mutations of DNA methyltransferase gene DNMT3A in acute monocytic leukemia. Nat Genet 2011;43:309–15.
OpenUrl CrossRef PubMed Web of Science
↵
1. Ley TJ,
2. Ding L,
3. Walter MJ,
4. McLellan MD,
5. Lamprecht T,
6. Larson DE,
7. Kandoth C,
8. Payton JE,
9. Baty J,
10. Welch J,
11. Harris CC,
12. Lichti CF,
13. Townsend RR,
14. Fulton RS,
15. Dooling DJ,
16. Koboldt DC,
17. Schmidt H,
18. Zhang Q,
19. Osborne JR,
20. Lin L,
21. O'Laughlin M,
22. McMichael JF,
23. Delehaunty KD,
24. McGrath SD,
25. Fulton LA,
26. Magrini VJ,
27. Vickery TL,
28. Hundal J,
29. Cook LL,
30. Conyers JJ,
31. Swift GW,
32. Reed JP,
33. Alldredge PA,
34. Wylie T,
35. Walker J,
36. Kalicki J,
37. Watson MA,
38. Heath S,
39. Shannon WD,
40. Varghese N,
41. Nagarajan R,
42. Westervelt P,
43. Tomasson MH,
44. Link DC,
45. Graubert TA,
46. DiPersio JF,
47. Mardis ER,
48. Wilson RK
. DNMT3A mutations in acute myeloid leukemia. N Engl J Med 2010;363:2424–33.
OpenUrl CrossRef PubMed Web of Science
↵
1. Wesolowska A,
2. Dalgaard MD,
3. Borst L,
4. Gautier L,
5. Bak M,
6. Weinhold N,
7. Nielsen BF,
8. Helt LR,
9. Audouze K,
10. Nersting J,
11. Tommerup N,
12. Brunak S,
13. Sicheritz-Ponten T,
14. Leffers H,
15. Schmiegelow K,
16. Gupta R
. Cost-effective multiplexing before capture allows screening of 25 000 clinically relevant SNPs in childhood acute lymphoblastic leukemia. Leukemia 2011;25:1001–6.
OpenUrl CrossRef PubMed Web of Science
↵
1. Grossmann V,
2. Kohlmann A,
3. Klein HU,
4. Schindela S,
5. Schnittger S,
6. Dicker F,
7. Dugas M,
8. Kern W,
9. Haferlach T,
10. Haferlach C
. Targeted next-generation sequencing detects point mutations, insertions, deletions and balanced chromosomal rearrangements as well as identifies novel leukemia-specific fusion genes in a single procedure. Leukemia 2011;25:671–80.
OpenUrl CrossRef PubMed Web of Science
↵
1. Kohlmann A,
2. Grossmann V,
3. Klein HU,
4. Schindela S,
5. Weiss T,
6. Kazak B,
7. Dicker F,
8. Schnittger S,
9. Dugas M,
10. Kern W,
11. Haferlach C,
12. Haferlach T
. Next-generation sequencing technology reveals a characteristic pattern of molecular mutations in 72.8% of chronic myelomonocytic leukemia by detecting frequent alterations in TET2, CBL, RAS, and RUNX1. J Clin Oncol 2010;28:3858–65.
OpenUrl Abstract/FREE Full Text
↵
1. Meyerson M,
2. Gabriel S,
3. Getz G
. Advances in understanding cancer genomes through second-generation sequencing. Nat Rev Genet 2010;11:685–96.
OpenUrl CrossRef PubMed Web of Science
↵
1. Navin N,
2. Krasnitz A,
3. Rodgers L,
4. Cook K,
5. Meth J,
6. Kendall J,
7. Riggs M,
8. Eberling Y,
9. Troge J,
10. Grubor V,
11. Levy D,
12. Lundin P,
13. Månér S,
14. Zetterberg A,
15. Hicks J,
16. Wigler M
. Inferring tumor progression from genomic heterogeneity. Genome Res 2010;20:68–80.
OpenUrl Abstract/FREE Full Text
↵
1. Cavazzana-Calvo M,
2. Hacein-Bey S,
3. de Saint Basile G,
4. Gross F,
5. Yvon E,
6. Nusbaum P,
7. Selz F,
8. Hue C,
9. Certain S,
10. Casanova JL,
11. Bousso P,
12. Deist FL,
13. Fischer A
. Gene therapy of human severe combined immunodeficiency (SCID)-X1 disease. Science 2000;288:669–72.
OpenUrl Abstract/FREE Full Text
↵
1. Hacein-Bey-Abina S,
2. Le Deist F,
3. Carlier F,
4. Bouneaud C,
5. Hue C,
6. De Villartay JP,
7. Thrasher AJ,
8. Wulffraat N,
9. Sorensen R,
10. Dupuis-Girod S,
11. Fischer A,
12. Davies EG,
13. Kuis W,
14. Leiva L,
15. Cavazzana-Calvo M
. Sustained correction of X-linked severe combined immunodeficiency by ex vivo gene therapy. N Engl J Med 2002;346:1185–93.
OpenUrl CrossRef PubMed Web of Science
↵
1. Aiuti A,
2. Cattaneo F,
3. Galimberti S,
4. Benninghoff U,
5. Cassani B,
6. Callegaro L,
7. Scaramuzza S,
8. Andolfi G,
9. Mirolo M,
10. Brigida I,
11. Tabucchi A,
12. Carlucci F,
13. Eibl M,
14. Aker M,
15. Slavin S,
16. Al-Mousa H,
17. Al Ghonaium A,
18. Ferster A,
19. Duppenthaler A,
20. Notarangelo L,
21. Wintergerst U,
22. Buckley RH,
23. Bregni M,
24. Marktel S,
25. Valsecchi MG,
26. Rossi P,
27. Ciceri F,
28. Miniero R,
29. Bordignon C,
30. Roncarolo MG
. Gene therapy for immunodeficiency due to adenosine deaminase deficiency. N Engl J Med 2009;360:447–58.
OpenUrl CrossRef PubMed Web of Science
↵
1. Bordignon C,
2. Notarangelo LD,
3. Nobili N,
4. Ferrari G,
5. Casorati G,
6. Panina P,
7. Mazzolari E,
8. Maggioni D,
9. Rossi C,
10. Servida P,
11. Ugazio AG,
12. Mavilio F
. Gene therapy in peripheral blood lymphocytes and bone marrow for ADA-immunodeficient patients. Science 1995;270:470–5.
OpenUrl Abstract/FREE Full Text
↵
1. Ott MG,
2. Schmidt M,
3. Schwarzwaelder K,
4. Stein S,
5. Siler U,
6. Koehl U,
7. Glimm H,
8. Kuhlcke K,
9. Schilz A,
10. Kunkel H,
11. Naundorf S,
12. Brinkmann A,
13. Deichmann A,
14. Fischer M,
15. Ball C,
16. Pilz I,
17. Dunbar C,
18. Du Y,
19. Jenkins NA,
20. Copeland NG,
21. Luthi U,
22. Hassan M,
23. Thrasher AJ,
24. Hoelzer D,
25. von Kalle C,
26. Seger R,
27. Grez M
. Correction of X-linked chronic granulomatous disease by gene therapy, augmented by insertional activation of MDS1-EVI1, PRDM16 or SETBP1. Nat Med 2006;12:401–9.
OpenUrl CrossRef PubMed Web of Science
↵
1. Cartier N,
2. Hacein-Bey-Abina S,
3. Bartholomae CC,
4. Veres G,
5. Schmidt M,
6. Kutschera I,
7. Vidaud M,
8. Abel U,
9. Dal-Cortivo L,
10. Caccavelli L,
11. Mahlaoui N,
12. Kiermer V,
13. Mittelstaedt D,
14. Bellesme C,
15. Lahlou N,
16. Lefrere F,
17. Blanche S,
18. Audit M,
19. Payen E,
20. Leboulch P,
21. l'Homme B,
22. Bougneres P,
23. Von Kalle C,
24. Fischer A,
25. Cavazzana-Calvo M,
26. Aubourg P
. Hematopoietic stem cell gene therapy with a lentiviral vector in X-linked adrenoleukodystrophy. Science 2009;326:818–23.
OpenUrl Abstract/FREE Full Text
↵
1. Cavazzana-Calvo M,
2. Payen E,
3. Negre O,
4. Wang G,
5. Hehir K,
6. Fusil F,
7. Down J,
8. Denaro M,
9. Brady T,
10. Westerman K,
11. Cavallesco R,
12. Gillet-Legrand B,
13. Caccavelli L,
14. Sgarra R,
15. Maouche-Chrétien L,
16. Bernaudin F,
17. Girot R,
18. Dorazio R,
19. Mulder GJ,
20. Polack A,
21. Bank A,
22. Soulier J,
23. Larghero J,
24. Kabbara N,
25. Dalle B,
26. Gourmel B,
27. Socie G,
28. Chrétien S,
29. Cartier N,
30. Aubourg P,
31. Fischer A,
32. Cornetta K,
33. Galacteros F,
34. Beuzard Y,
35. Gluckman E,
36. Bushman F,
37. Hacein-Bey-Abina S,
38. Leboulch P
. Transfusion independence and HMGA2 activation after gene therapy of human β-thalassaemia. Nature 2010;467:318–22.
OpenUrl CrossRef PubMed Web of Science
↵
1. Maguire AM,
2. High KA,
3. Auricchio A,
4. Wright JF,
5. Pierce EA,
6. Testa F,
7. Mingozzi F,
8. Bennicelli JL,
9. Ying GS,
10. Rossi S,
11. Fulton A,
12. Marshall KA,
13. Banfi S,
14. Chung DC,
15. Morgan JI,
16. Hauck B,
17. Zelenaia O,
18. Zhu X,
19. Raffini L,
20. Coppieters F,
21. De Baere E,
22. Shindler KS,
23. Volpe NJ,
24. Surace EM,
25. Acerra C,
26. Lyubarsky A,
27. Redmond TM,
28. Stone E,
29. Sun J,
30. McDonnell JW,
31. Leroy BP,
32. Simonelli F,
33. Bennett J
. Age-dependent effects of RPE65 gene therapy for Leber's congenital amaurosis: a phase 1 dose-escalation trial. Lancet 2009;374:1597–605.
OpenUrl CrossRef PubMed Web of Science
↵
1. Aparicio SA,
2. Huntsman DG
. Does massively parallel DNA resequencing signify the end of histopathology as we know it? J Pathol 2010;220:307–15.
OpenUrl PubMed Web of Science
↵
1. Ng PC,
2. Henikoff S
. SIFT: predicting amino acid changes that affect protein function. Nucl Acids Res 2003;31:3812–14.
OpenUrl Abstract/FREE Full Text
↵
1000 Genomes Project Consortium. A map of human genome variation from population-scale sequencing. Nature 2010;467:1061–73.
OpenUrl CrossRef PubMed Web of Science
↵
1. Robinson JT,
2. Thorvaldsdóttir H,
3. Winckler W,
4. Guttman M,
5. Lander ES,
6. Getz G,
7. Mesirov JP
. Integrative genomics viewer. Nat Biotechnol 2011;29:24–6.
OpenUrl CrossRef PubMed Web of Science
↵
1. Kent WJ,
2. Sugnet CW,
3. Furey TS,
4. Roskin KM,
5. Pringle TH,
6. Zahler AM,
7. Haussler D
. The human genome browser at UCSC. Genome Res 2002;12:996–1006.
OpenUrl Abstract/FREE Full Text
↵
1. Kaye J,
2. Boddington P,
3. de Vries J,
4. Hawkins N,
5. Melham K
. Ethical implications of the use of whole genome methods in medical research. Eur J Hum Genet 2010;18:398–403.
OpenUrl CrossRef PubMed Web of Science
↵
1. Lowrance WW,
2. Collins FS
. Ethics. Identifiability in genomic research. Science 2007;317:600–2.
OpenUrl Abstract/FREE Full Text
↵
1. Caulfield T,
2. McGuire AL,
3. Cho M,
4. Buchanan JA,
5. Burgess MM,
6. Danilczyk U,
7. Diaz CM,
8. Fryer-Edwards K,
9. Green SK,
10. Hodosh MA,
11. Juengst ET,
12. Kaye J,
13. Kedes L,
14. Knoppers BM,
15. Lemmens T,
16. Meslin EM,
17. Murphy J,
18. Nussbaum RL,
19. Otlowski M,
20. Pullman D,
21. Ray PN,
22. Sugarman J,
23. Timmons M
. Research ethics recommendations for whole-genome research: consensus statement. PLoS Biol 2008;6:e73.
OpenUrl CrossRef PubMed
↵
1. McGuire AL,
2. Caulfield T,
3. Cho MK
. Research ethics and the challenge of whole-genome sequencing. Nat Rev Genet 2008;9:152–6.
OpenUrl PubMed Web of Science
↵
1. Kaye J,
2. Heeney C,
3. Hawkins N,
4. de Vries J,
5. Boddington P
. Data sharing in genomics: re-shaping scientific practice. Nat Rev Genet 2009;10:331–5.
OpenUrl CrossRef PubMed Web of Science
↵
1. Knoppers BM,
2. Brand AM
. From community genetics to public health genomics – what's in a name? Public Health Genomics 2009;12:1–3.
OpenUrl CrossRef Web of Science
↵
1. Lacroix M,
2. Nycum G,
3. Godard B,
4. Knoppers BM
. Should physicians warn patients' relatives of genetic risks? CMAJ 2008;178:593–5.
OpenUrl FREE Full Text
↵
1. Knoppers BM,
2. Joly Y,
3. Simard J,
4. Durocher F
. The emergence of an ethical duty to disclose genetic research results: international perspectives. Eur J Hum Genet 2006;14:1170–8.
OpenUrl CrossRef PubMed Web of Science
↵
1. Wolf SM,
2. Lawrenz FP,
3. Nelson CA,
4. Kahn JP,
5. Cho MK,
6. Clayton EW,
7. Fletcher JG,
8. Georgieff MK,
9. Hammerschmidt D,
10. Hudson K,
11. Illes J,
12. Kapur V,
13. Keane MA,
14. Koenig BA,
15. Leroy BS,
16. McFarland EG,
17. Paradise J,
18. Parker LS,
19. Terry SF,
20. Van Ness B,
21. Wilfond BS
. Managing incidental findings in human subjects research: analysis and recommendations. J Law Med Ethics 2008;36:219–48, 1.
OpenUrl CrossRef PubMed Web of Science

Footnotes

Funding: This work was supported by the McGill University Health Center Research Institute. NJ is the recipient of a Chercheur Boursier award from Fonds de Recherche en Santé du Quebec. JM is a recipient of a Canada Research Chair. EL is funded by the Canadian Institute for Health Research.
Competing interests: None.
Patient consent: Obtained.
Provenance and peer review: Not commissioned; internally peer reviewed.

[1] ↵
Watson JD,
Crick FH
. The structure of DNA. Cold Spring Harb Symp Quant Biol 1953;18:123–31.
OpenUrl Abstract/FREE Full Text

[2] Watson JD,

[3] Crick FH

[4] ↵
Watson JD,
Crick FH
. Molecular structure of nucleic acids; a structure for deoxyribose nucleic acid. Nature 1953;171:737–8.
OpenUrl CrossRef PubMed Web of Science

[5] Watson JD,

[6] Crick FH

[7] ↵
Inoue T,
Orgel LE
. A nonenzymatic RNA polymerase model. Science 1983;219:859–62.
OpenUrl Abstract/FREE Full Text

[8] Inoue T,

[9] Orgel LE

[10] ↵
Maxam AM,
Gilbert W
. A new method for sequencing DNA. Proc Natl Acad Sci U S A 1977;74:560–4.
OpenUrl Abstract/FREE Full Text

[11] Maxam AM,

[12] Gilbert W

[13] ↵
Sanger F,
Nicklen S,
Coulson AR
. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A 1977;74:5463–7.
OpenUrl Abstract/FREE Full Text

[14] Sanger F,

[15] Nicklen S,

[16] Coulson AR

[17] ↵
Sachidanandam R,
Weissman D,
Schmidt SC,
Kakol JM,
Stein LD,
Marth G,
Sherry S,
Mullikin JC,
Mortimore BJ,
Willey DL,
Hunt SE,
Cole CG,
Coggill PC,
Rice CM,
Ning Z,
Rogers J,
Bentley DR,
Kwok PY,
Mardis ER,
Yeh RT,
Schultz B,
Cook L,
Davenport R,
Dante M,
Fulton L,
Hillier L,
Waterston RH,
McPherson JD,
Gilman B,
Schaffner S,
Van Etten WJ,
Reich D,
Higgins J,
Daly MJ,
Blumenstiel B,
Baldwin J,
Stange-Thomann N,
Zody MC,
Linton L,
Lander ES,
Altshuler D
. A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature 2001;409:928–33.
OpenUrl CrossRef PubMed Web of Science

[18] Sachidanandam R,

[19] Weissman D,

[20] Schmidt SC,

[21] Kakol JM,

[22] Stein LD,

[23] Marth G,

[24] Sherry S,

[25] Mullikin JC,

[26] Mortimore BJ,

[27] Willey DL,

[28] Hunt SE,

[29] Cole CG,

[30] Coggill PC,

[31] Rice CM,

[32] Ning Z,

[33] Rogers J,

[34] Bentley DR,

[35] Kwok PY,

[36] Mardis ER,

[37] Yeh RT,

[38] Schultz B,

[39] Cook L,

[40] Davenport R,

[41] Dante M,

[42] Fulton L,

[43] Hillier L,

[44] Waterston RH,

[45] McPherson JD,

[46] Gilman B,

[47] Schaffner S,

[48] Van Etten WJ,

[49] Reich D,

[50] Higgins J,

[51] Daly MJ,

[52] Blumenstiel B,

[53] Baldwin J,

[54] Stange-Thomann N,

[55] Zody MC,

[56] Linton L,

[57] Lander ES,

[58] Altshuler D

[60] Lander ES,

[61] Linton LM,

[62] Birren B,

[63] Nusbaum C,

[64] Zody MC,

[65] Baldwin J,

[66] Devon K,

[67] Dewar K,

[68] Doyle M,

[69] FitzHugh W,

[70] Funke R,

[71] Gage D,

[72] Harris K,

[73] Heaford A,

[74] Howland J,

[75] Kann L,

[76] Lehoczky J,

[77] LeVine R,

[78] McEwan P,

[79] McKernan K,

[80] Meldrim J,

[81] Mesirov JP,

[82] Miranda C,

[83] Morris W,

[84] Naylor J,

[85] Raymond C,

[86] Rosetti M,

[87] Santos R,

[88] Sheridan A,

[89] Sougnez C,

[90] Stange-Thomann N,

[91] Stojanovic N,

[92] Subramanian A,

[93] Wyman D,

[94] Rogers J,

[95] Sulston J,

[96] Ainscough R,

[97] Beck S,

[98] Bentley D,

[99] Burton J,

[100] Clee C,

[101] Carter N,

[102] Coulson A,

[103] Deadman R,

[104] Deloukas P,

[105] Dunham A,

[106] Dunham I,

[107] Durbin R,

[108] French L,

[109] Grafham D,

[110] Gregory S,

[111] Hubbard T,

[112] Humphray S,

[113] Hunt A,

[114] Jones M,

[115] Lloyd C,

[116] McMurray A,

[117] Matthews L,

[118] Mercer S,

[119] Milne S,

[120] Mullikin JC,

[121] Mungall A,

[122] Plumb R,

[123] Ross M,

[124] Shownkeen R,

[125] Sims S,

[126] Waterston RH,

[127] Wilson RK,

[128] Hillier LW,

[129] McPherson JD,

[130] Marra MA,

[131] Mardis ER,

[132] Fulton LA,

[133] Chinwalla AT,

[134] Pepin KH,

[135] Gish WR,

[136] Chissoe SL,

[137] Wendl MC,

[138] Delehaunty KD,

[139] Miner TL,

[140] Delehaunty A,

[141] Kramer JB,

[142] Cook LL,

[143] Fulton RS,

[144] Johnson DL,

[145] Minx PJ,

[146] Clifton SW,

[147] Hawkins T,

[148] Branscomb E,

[149] Predki P,

[150] Richardson P,

[151] Wenning S,

[152] Slezak T,

[153] Doggett N,

[154] Cheng JF,

[155] Olsen A,

[156] Lucas S,

[157] Elkin C,

[158] Uberbacher E,

[159] Frazier M,

[160] Gibbs RA,

[161] Muzny DM,

[162] Scherer SE,

[163] Bouck JB,

[164] Sodergren EJ,

[165] Worley KC,

[166] Rives CM,

[167] Gorrell JH,

[168] Metzker ML,

[169] Naylor SL,

[170] Kucherlapati RS,

[171] Nelson DL,

[172] Weinstock GM,

[173] Sakaki Y,

[174] Fujiyama A,

[175] Hattori M,

[176] Yada T,

[177] Toyoda A,

[178] Itoh T,

[179] Kawagoe C,

[180] Watanabe H,

[181] Totoki Y,

[182] Taylor T,

[183] Weissenbach J,

[184] Heilig R,

[185] Saurin W,

[186] Artiguenave F,

[187] Brottier P,

[188] Bruls T,

[189] Pelletier E,

[190] Robert C,

[191] Wincker P,

[192] Smith DR,

[193] Doucette-Stamm L,

[194] Rubenfield M,

[195] Weinstock K,

[196] Lee HM,

[197] Dubois J,

[198] Rosenthal A,

[199] Platzer M,

[200] Nyakatura G,

[201] Taudien S,

[202] Rump A,

[203] Yang H,

[204] Yu J,

[205] Wang J,

[206] Huang G,

[207] Gu J,

[208] Hood L,

[209] Rowen L,

[210] Madan A,

[211] Qin S,

[212] Davis RW,

[213] Federspiel NA,

[214] Abola AP,

[215] Proctor MJ,

[216] Myers RM,

[217] Schmutz J,

[218] Dickson M,

[219] Grimwood J,

[220] Cox DR,

[221] Olson MV,

[222] Kaul R,

[223] Shimizu N,

[224] Kawasaki K,

[225] Minoshima S,

[226] Evans GA,

[227] Athanasiou M,

[228] Schultz R,

[229] Roe BA,

[230] Chen F,

[231] Pan H,

[232] Ramser J,

[233] Lehrach H,

[234] Reinhardt R,

[235] McCombie WR,

[236] de la Bastide M,

[237] Dedhia N,

[238] Blocker H,

[239] Hornischer K,

[240] Nordsiek G,

[241] Agarwala R,

[242] Aravind L,

[243] Bailey JA,

[244] Bateman A,

[245] Batzoglou S,

[246] Birney E,

[247] Bork P,

[248] Brown DG,

[249] Burge CB,

[250] Cerutti L,

[251] Chen HC,

[252] Church D,

[253] Clamp M,

[254] Copley RR,

[255] Doerks T,

[256] Eddy SR,

[257] Eichler EE,

[258] Furey TS,

[259] Galagan J,

[260] Gilbert JG,

[261] Harmon C,

[262] Hayashizaki Y,

[263] Haussler D,

[264] Hermjakob H,

[265] Hokamp K,

[266] Jang W,

[267] Johnson LS,

[268] Jones TA,

[269] Kasif S,

[270] Kaspryzk A,

[271] Kennedy S,

[272] Kent WJ,

[273] Kitts P,

[274] Koonin EV,

[275] Korf I,

[276] Kulp D,

[277] Lancet D,

[278] Lowe TM,

[279] McLysaght A,

[280] Mikkelsen T,

[281] Moran JV,

[282] Mulder N,

[283] Pollara VJ,

[284] Ponting CP,

[285] Schuler G,

[286] Schultz J,

[287] Slater G,

[288] Smit AF,

[289] Stupka E,

[290] Szustakowski J,

[291] Thierry-Mieg D,

[292] Thierry-Mieg J,

[293] Wagner L,

[294] Wallis J,

[295] Wheeler R,

[296] Williams A,

[297] Wolf YI,

[298] Wolfe KH,

[299] Yang SP,

[300] Yeh RF,

[301] Collins F,

[302] Guyer MS,

[303] Peterson J,

[304] Felsenfeld A,

[305] Wetterstrand KA,

[306] Patrinos A,

[307] Morgan MJ,

[308] de Jong P,

[309] Catanese JJ,

[310] Osoegawa K,

[311] Shizuya H,

[312] Choi S,

[313] Chen YJ

[314] ↵
McPherson JD,
Marra M,
Hillier L,
Waterston RH,
Chinwalla A,
Wallis J,
Sekhon M,
Wylie K,
Mardis ER,
Wilson RK,
Fulton R,
Kucaba TA,
Wagner-McPherson C,
Barbazuk WB,
Gregory SG,
Humphray SJ,
French L,
Evans RS,
Bethel G,
Whittaker A,
Holden JL,
McCann OT,
Dunham A,
Soderlund C,
Scott CE,
Bentley DR,
Schuler G,
Chen HC,
Jang W,
Green ED,
Idol JR,
Maduro VV,
Montgomery KT,
Lee E,
Miller A,
Emerling S,
Kucherlapati Gibbs R,
Scherer S,
Gorrell JH,
Sodergren E,
Clerc-Blankenburg K,
Tabor P,
Naylor S,
Garcia D,
de Jong PJ,
Catanese JJ,
Nowak N,
Osoegawa K,
Qin S,
Rowen L,
Madan A,
Dors M,
Hood L,
Trask B,
Friedman C,
Massa H,
Cheung VG,
Kirsch IR,
Reid T,
Yonescu R,
Weissenbach J,
Bruls T,
Heilig R,
Branscomb E,
Olsen A,
Doggett N,
Cheng JF,
Hawkins T,
Myers RM,
Shang J,
Ramirez L,
Schmutz J,
Velasquez O,
Dixon K,
Stone NE,
Cox DR,
Haussler D,
Kent WJ,
Furey T,
Rogic S,
Kennedy S,
Jones S,
Rosenthal A,
Wen G,
Schilhabel M,
Gloeckner G,
Nyakatura G,
Siebert R,
Schlegelberger B,
Korenberg J,
Chen XN,
Fujiyama A,
Hattori M,
Toyoda A,
Yada T,
Park HS,
Sakaki Y,
Shimizu N,
Asakawa S,
Kawasaki K,
Sasaki T,
Shintani A,
Shimizu A,
Shibuya K,
Kudoh J,
Minoshima S,
Ramser J,
Seranski P,
Hoff C,
Poustka A,
Reinhardt R,
Lehrach H
. A physical map of the human genome. Nature 2001;409:934–41.
OpenUrl CrossRef PubMed Web of Science

[427] Ng SB, Nickerson DA, Bamshad MJ, Shendure J. Massively parallel sequencing and rare disease. Hum Mol Genet 2010;19:R119–24.

[432] Ng SB, Turner EH, Robertson PD, Flygare SD, Bigham AW, Lee C, Shaffer T, Wong M, Bhattacharjee A, Eichler EE, Bamshad M, Nickerson DA, Shendure J. Targeted capture and massively parallel sequencing of 12 human exomes. Nature 2009;461:272–6.

[446] Shendure J, Ji H. Next-generation DNA sequencing. Nat Biotechnol 2008;26:1135–45.

[449] Turner EH, Lee C, Ng SB, Nickerson DA, Shendure J. Massively parallel exon capture and library-free resequencing across 16 genomes. Nat Methods 2009;6:315–16.

[455] Albert TJ, Molla MN, Muzny DM, Nazareth L, Wheeler D, Song X, Richmond TA, Middle CM, Rodesch MJ, Packard CJ, Weinstock GM, Gibbs RA. Direct selection of human genomic loci by microarray hybridization. Nat Methods 2007;4:903–5.

[468] Gnirke A, Melnikov A, Maguire J, Rogov P, LeProust EM, Brockman W, Fennell T, Giannoukos G, Fisher S, Russ C, Gabriel S, Jaffe DB, Lander ES, Nusbaum C. Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing. Nat Biotechnol 2009;27:182–9.

[483] Hodges E, Xuan Z, Balija V, Kramer M, Molla MN, Smith SW, Middle CM, Rodesch MJ, Albert TJ, Hannon GJ, McCombie WR. Genome-wide in situ exon capture for selective resequencing. Nat Genet 2007;39:1522–7.

[495] Levin JZ, Berger MF, Adiconis X, Rogov P, Melnikov A, Fennell T, Nusbaum C, Garraway LA, Gnirke A. Targeted next-generation sequencing of a cancer transcriptome enhances detection of sequence variants and novel fusion transcripts. Genome Biol 2009;10:R115.
