Whitehead Human Genome Project and SNP Consortium Announce Collaboration To Identify New Genetic Markers for Disease and Enhance Utility of Human Genome "Working Draft"

July 11, 2000

Tags: Genetics + Genomics, Awards + Announcements

BETHESDA, Md., and CHICAGO, Ill. — The Human Genome Project (HGP) and The SNP Consortium today announced plans to generate a new set of human DNA sequence information that will contribute 125,000 to 250,000 validated and useful genetic markers known as single nucleotide polymorphisms, or SNPs. The information also will enhance the HGP's "working draft" sequence of the human genome.

All data generated through the collaborative effort will be made publicly available without restrictions on their use, consistent with the current data release policies of the Human Genome Project and of The SNP Consortium. The collaborative effort between HGP and The SNP Consortium is being funded by NHGRI and The SNP Consortium, and is expected to be completed in December 2000.

This collaborative effort takes advantage of the recently announced "working draft" sequence, representing the vast majority of the sequence of the human genome (the full set of genetic instructions, encoded in long strands of DNA, that are contained in the 24 chromosomes). By comparing newly generated sequence data to the "working draft," it will be possible to accelerate the construction of a higher-density SNP map; this map will in turn facilitate identification of genetic variations associated with common diseases from Alzheimer's to heart disease and diabetes. At the same time, the data generated will help improve the "working draft" itself.

Three academic genome research centers — the Whitehead Institute for Biomedical Research in Cambridge, Mass., Washington University School of Medicine in St. Louis, and the Sanger Centre in Hinxton, UK — will participate in this collaboration.

The centers will isolate two million DNA fragments (each about 6,000 base pairs long) from the human genome and determine the sequence of approximately 500 base pairs at both ends of the fragments, resulting in paired sequences of known distance from each other.
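As a rough illustration of the scale described above, the following sketch totals the bases read from both ends of the fragments. The fragment count and read length come from this announcement; the function name and the figure of roughly 3 billion base pairs for the human genome, used only for the coverage estimate, are assumptions made for illustration.

```python
def paired_end_totals(fragments, read_length, genome_size):
    """Return (total reads, total bases read, fold coverage of the genome)."""
    reads = fragments * 2          # one read from each end of every fragment
    bases = reads * read_length    # total bases sequenced across all reads
    return reads, bases, bases / genome_size


FRAGMENTS = 2_000_000        # from the announcement
READ_LENGTH = 500            # approximate bases read at each fragment end
GENOME_SIZE = 3_000_000_000  # assumption: rough size of the human genome

reads, bases, coverage = paired_end_totals(FRAGMENTS, READ_LENGTH, GENOME_SIZE)
print(f"{reads:,} reads, {bases:,} bases, about {coverage:.2f}x coverage")
```

Under these assumptions the effort would read about two billion bases in total, which is less than one pass over the genome. That fits its stated purpose of finding variation against the existing draft rather than sequencing the genome again.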

The sequences then will be compared to human genome DNA sequences already in GenBank (www.ncbi.nlm.nih.gov/genome/seq), a publicly accessible repository of genome sequence data, to identify SNPs. In addition, the paired-end information will help span some gaps in the human genome "working draft," enhancing the value of the draft. This paired-end approach has been used to advantage in sequencing the genomes of lower organisms, such as bacteria and the fruit fly Drosophila melanogaster.
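The comparison step described above can be sketched in a few lines. This is a minimal illustration only, not the centers' actual method: it assumes each read has already been placed on the reference at a known offset with no insertions or deletions, and the function name and both sequences are made up. A real analysis would also need alignment, base quality scores, and validation before calling a difference a SNP.

```python
def find_snp_candidates(reference, read, offset):
    """Compare a read to the reference at a known offset.

    Returns a list of (reference position, reference base, read base)
    for every position where the two sequences differ.
    """
    if offset < 0 or offset + len(read) > len(reference):
        raise ValueError("read does not fit within the reference at this offset")
    candidates = []
    for i, base in enumerate(read):
        ref_base = reference[offset + i]
        if base != ref_base:
            candidates.append((offset + i, ref_base, base))
    return candidates


reference = "ACGTACGTACGTACGT"  # made-up stand-in for draft sequence
read = "ACGAACGT"               # made-up read differing at one base
print(find_snp_candidates(reference, read, 4))  # prints [(7, 'T', 'A')]
```

Each reported mismatch is only a candidate: it could reflect a true variation between donors or a sequencing error, which is why the announcement stresses that identified SNPs will be validated before deposit.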

"As a physician as well as a laboratory scientist, I am excited about the potential of this collaboration to expedite the discovery of genetic information that will lead to improved diagnosis and treatment of disease," says Francis Collins, M.D., Ph.D., director of the National Human Genome Research Institute (NHGRI) of the National Institutes of Health. "This collaboration will yield a bumper crop of genetic variations. As a bonus, it will also improve the assembly of the human genome sequence so that it is closer to the highly polished ‘finished’ form that is our goal."

The DNA to be sequenced will come from 24 anonymous, unrelated donors with diverse geographic origins, making the new sequences a rich source of SNPs. As SNPs are identified, they will be validated, mapped, and deposited in the publicly accessible database, dbSNP.

A high-density map of SNPs, the single base pair variations that occur on average once every 1,000 base pairs throughout human DNA, is expected to be a valuable research tool that will help scientists pinpoint genetic differences that predispose some people, but not others, to disease and that underlie variability in individual response to treatment. In turn, novel diagnostics and drugs can be developed that are tailored to patients' genetic profiles.

The SNP Consortium will file provisional patent applications on newly identified and mapped SNPs solely to establish the dates of discovery, but no patents will be allowed to issue, keeping the data freely available for the unrestricted use of researchers worldwide.

"The collaboration between the Human Genome Project and The SNP Consortium shows that public-private cooperation can be an efficient means for developing basic research tools essential for the application of genetic information to the understanding and treatment of disease," says Arthur Holden, chairman and chief executive officer of the consortium, formed in April 1999. "Through this collaboration, The SNP Consortium will be able to contribute up to 50 percent more SNPs to the public domain than otherwise would have been possible under our original scientific plan."

The SNP Consortium's initial two-year plan had been to identify 300,000 SNPs and map at least 150,000 SNPs, evenly distributed throughout the genome. An exponential increase in the amount of human genetic sequence data that has become available from the Human Genome Project over the past 15 months has enabled the consortium to proceed at a much faster pace than originally envisioned. To date, the consortium has identified over 140,000 SNPs and mapped 102,719 SNPs. With the Human Genome Project collaboration, the total number of validated and useful SNPs mapped may exceed 750,000 by December 2000.

The Human Genome Project (HGP) is an international research effort to characterize the genomes of human and selected model organisms through complete mapping and sequencing of their DNA, to develop technologies for genomic analysis, to examine the ethical, legal, and social implications of human genetics research, and to train scientists who will be able to utilize the tools and resources developed through the HGP to pursue biological studies that will improve human health.

The international Human Genome Sequencing Consortium, which has been organized to meet the HGP goal to determine the sequence of the euchromatic portion of the human genome, on June 26 announced that it had assembled a "working draft" of the human genome. On that same day, a private sector effort carried out by Celera Genomics, using a different but complementary strategy, announced its "first assembly" of the human genome. The international consortium is on track to produce the "finished," highly polished reference version by 2003.

The HGP consortium includes scientists at 16 institutions in France, Germany, Japan, China, Great Britain and the United States. Participants in the international consortium have all adhered to the project's quality standards and to the daily data release policy. The Human Genome Project is funded by grants from government agencies and public charities in the several countries.

The SNP Consortium is organized as a non-profit entity whose goal is to create and make publicly available a high-quality SNP map of the human genome. The consortium's members include the medical research charity The Wellcome Trust; 10 pharmaceutical companies including AstraZeneca PLC, Aventis Pharma, Bayer AG, Bristol-Myers Squibb Company, F. Hoffmann-La Roche, Glaxo Wellcome PLC, Novartis, Pfizer Inc, Searle (now part of Pharmacia), and SmithKline Beecham PLC; Motorola, Inc.; IBM; and Amersham Pharmacia Biotech. Academic centers including the Whitehead Institute for Biomedical Research, Washington University School of Medicine in St. Louis, the Wellcome Trust's Sanger Centre, Stanford Human Genome Center, and Cold Spring Harbor Laboratory are involved in SNP identification and analysis.

Note to reporters: For definitions of "working draft" and related terms, please see www.nhgri.nih.gov/news/human_genome_facts.html


Communications and Public Affairs
Phone: 617-258-6851
Email: newsroom@wi.mit.edu

Whitehead Institute is a world-renowned non-profit research institution dedicated to improving human health through basic biomedical research. Wholly independent in its governance, finances, and research programs, Whitehead shares a close affiliation with Massachusetts Institute of Technology through its faculty, who hold joint MIT appointments.

© Whitehead Institute for Biomedical Research, 455 Main Street, Cambridge, MA 02142