New Methods Identify Thousands of New DNA Sequences Missing from Reference Map of the Human Genome


Wednesday, April 21, 2010


By using new approaches, researchers have discovered 2,363 new DNA sequences corresponding to 720 regions on the human genome. These sequences represent segments that were not charted in the reference map of the human genome.

"A large portion of those sequences are either missing, fragmented or misaligned when compared to results from next-generation sequencing genome assemblies on the same samples," said Dr. Evan Eichler, senior author on the findings published online in advance of print today, April 19, in Nature Methods. Eichler is a University of Washington (UW) professor of genome sciences and an investigator with the Howard Hughes Medical Institute. "These findings suggest that new genome assemblies based solely on next-generation sequencing might miss many of these sites."

Dr. Jeffrey M. Kidd was lead author of the article, which describes the new techniques the research team used to find some of the missing sequences. Kidd headed the study in Eichler's lab while earning his Ph.D. at the UW, and is now a postdoctoral fellow at Stanford University.

"Over the past several years, the extent to which the structure of the genome varies among humans has become clearer. This variation suggested that there must be portions of the human genome where DNA sequences had yet to be discovered, annotated and characterized," he said. "We hope that these sequences ultimately will be included as part of future releases of the reference human genome sequence."

The reference genome assembly is a yardstick -- or standard for comparison -- for studies of human genetics. The human reference genome was first created in 2001 and is updated every couple of years, Kidd explained. It is a mosaic of DNA sequences derived from several individuals. He went on to say that about 80 percent of the reference genome came from eight people, and that one of them alone accounts for more than 66 percent of the total.

Along with their collaborators at Agilent, the team designed ways to examine these newly identified sequences in a panel of people representing populations from around the world. The researchers found that, in some cases, the number of copies of these sequences varied from person to person. The fact that a person can have one or more copies, or no copy at all, of a particular DNA sequence may account for why these sequences were missing from the reference genome. The researchers also found that some of these sequences were common in some populations and rare in others, depending on which part of the globe a person's ancestors came from.

"Each segment of the reference genome is from a single person, and reflects the genome of that individual. If the donor sample was missing a sequence that many other people have, that sequence would not be represented in the reference genome," Kidd explained. "That is why some of the positions on the reference genome represent rare structural configurations or entirely omit sequences found in the majority of people."

Kidd said that the study published in today's Nature Methods used information from nine individuals, representing various world populations, to search for and fill in some of the missing pieces.

By looking at genomes from seven kinds of animals, the researchers were also able to show that some of the newly identified DNA sequences appear to have been conserved during the evolution of mammals and man. The animals whose genomes were studied were chimpanzee, Bornean orangutan, Rhesus monkey, house mouse, Norway rat, dog, and horse.

"Some of the sequences were present in several different species, but were absent from the reference genome," Kidd said. "Some of the sequences present in several mammals actually correspond to sites of variations in humans -- some people have retained a particular sequence, and others have lost it."

The researchers also developed a method to accurately genotype many of the newly found DNA sequences and created a way to look at variations in the number of copies of these sequences, thereby opening up regions of the human genome previously inaccessible to such studies.

"Scientists can now begin trying to understand the functional importance of these sequences and their variations," Kidd said.

The 1000 Genomes Project (an international effort to fully sequence the genomes of a thousand anonymous individuals) and other genome studies are amassing vast amounts of data on DNA sequences that are then mapped to the reference genome, he added. Any study that improves the completeness and quality of the reference genome assembly, he continued, will benefit these projects and lead to a fuller picture of the extent of human genomic variation.

The findings are published in Nature Methods as "Characterization of missing human genome sequences and copy-number polymorphic insertions."
