Quantcast
issue
Read articles, including Science News stories written for ages 9-14, on the SNK website.
More than a chicken, fewer than a grape
Exact tally of human genes remains elusive
A+ A- Text Size

Exact tally of human genes remains elusive

By Tina Hesman Saey

Web edition: October 13, 2010
Print edition: November 6, 2010; Vol.178 #10 (p. 5)

Enlarge
Picking out and counting the genes among the billions of DNA letters (shown here as four different colors) that make up the human genome has proved daunting.
M. Tama/Getty Images

BOSTON — No one really knows all the genetic parts needed to make a human being.

Exactly how many genes make up the human genome remains a mystery, even though scientists announced the completion of the Human Genome Project a decade ago. The project to decipher the genetic blueprint of humans was supposed to reveal all of the protein-producing genes needed to build a human body.

“Not only do we not know what all the genes are, we don’t even know how many there are,” Steven Salzberg of the University of Maryland in College Park said October 11 during a keynote address at the Beyond the Genome conference, held in Boston. Most estimates place the human gene count in the neighborhood of 22,000 genes, which falls between the number of genes in a chicken and the number in a grape.

Grape plants have 30,434 genes, by the latest count. Chickens have 16,736 genes, a number Salzberg said will likely grow as scientists put the finishing touches on the chicken genome. As in humans, the gene totals for each species are not as precise as they seem and are subject to revision.

The most accurate estimate of the human gene count is the RefSeq database maintained by the U.S. National Institutes of Health, Salzberg said. He laid out arguments for favoring this estimate, such as its inclusion of all confirmed genes to date, in a paper published in May in Genome Biology. By the RefSeq count, humans have 22,333 genes. But another government database lists 38,621 human genes. And a different project called Gencode currently recognizes 21,671.

Such disparate numbers stem from the fact that genes comprise only about 1 percent of the 3 billion As, Ts, Gs and Cs that make up the human genetic instruction book. And the genes aren’t conveniently laid out as single, continuous stretches of genetic code. Instead, human genes are found in protein-encoding pieces called exons, interspersed with stretches of DNA that don’t make protein. These spacers are called introns.

Enlarge
Estimates of the number of human genes have fallen and narrowed in range over the years as research has shed more light on how genes are arrayed along DNA.
M. Pertea and S. Salzberg/Genome Biology 2010, Adapted by E. Feliciano

To make matters worse, each exon in a gene codes for only a portion of a protein. Cells can mix and match different combinations of exons to make various proteins.

Traditionally, scientists have used computer programs to sift through billions of DNA letters and pinpoint the locations of genes. The programs have improved over the years, but they still aren’t as good as people at plucking exons from the sea of introns and figuring out how those protein-encoding segments are spliced together, said Clara Amid, a computational biologist at the Wellcome Trust Sanger Institute in Hinxton, England.

Amid is involved in the Gencode project, an effort to identify all the human genes and the many permutations of those genes that can lead to a dizzying number of proteins. She and her colleagues pick out genes the old-fashioned way — by hand. The researchers get plenty of clues where genes are from computerized gene-finders, studies that sequence RNA produced by genes, and from comparisons of human DNA to the genomes of other animals. Synthesizing all that information allows people to accurately find and mark the locations of genes, a process scientists call annotation. “The best computerized methods could replicate the manual annotation only 40 to 50 percent of the time,” Amid said October 12 at the Beyond the Genome conference.

The Gencode team isn’t finished with its work; several chromosomes still need the human touch. Gencode’s current count is 21,671 human genes. “The number will go up, definitely,” Amid said. Already the team has located several new genes on chromosome 4 thanks to data from RNA-sequencing projects, she said.

Exactly how many new genes might be located by sequencing RNA instead of DNA is anyone’s guess. Scientists who sequenced RNA from fruit flies discovered 1,938 new genes, Brenton Graveley from the University of Connecticut Health Center in Farmington said at the conference.

The Mammalian Gene Collection, one effort to catalog all of the full-length RNA versions of genes, lists 18,877 human genes. That number is likely to represent the lower boundary of the gene count, Salzberg said.

If new RNA sequencing methods detect the same proportion of new genes in people as were found in fruit flies, the human genome could gain about 3,000 more genes in addition to those already confirmed by RefSeq. “That would be an exciting result,” Salzberg said. “I’d be surprised, but we like surprises in science.”

Back Story: How Many Genes in a Chicken?

Source: M. Pertea and S. Salzberg/Genome Biology 2010; Credit: T. Dubé, chicken icon: Pinare/Shutterstock, human icon: Mysontuna/Shutterstock

Though simple organisms generally have relatively small genomes, gene number is not necessarily correlated to complexity. Here are a few different organisms, along with their current estimated gene counts.

Comment
Print Friendly and PDF

Presentations at the Beyond the Genome conference, Boston, October 11-13, 2010.
[Go to]

M. Pertea and S. Salzberg. Between a chicken and a grape: estimating the
number of human genes. Genome Biology. Vol. 11, May 5, 2010, p. 206. [Go to]


Mammalian Gene Collection:
[Go to]

RefSeq database:
[Go to]

Entrez Gene database:
[Go to]

Gencode:
[Go to]

Comments (1)

Please alert Science News to any inappropriate posts by clicking the REPORT SPAM link within the post. Comments will be reviewed before posting.

  • On DNA, Genes, And RNAs
    Who Is What, What Is Who


    A. From "More than a chicken, less than a grape"
    A decade after the completion of the Human Genome Project, the exact number of human genes remains elusive.
    sciencenews.org

    "...computerized gene-finders, studies that sequence RNA produced by (DNA) genes..."


    B. Who Is What, What Is Who

    Disentangling who is what and what is who may advance the capability of science to comprehend genetics better.


    C.
    Suggested 2010 Genetics Concepts Update
    Genetic Diversity In Humans And In Cells

    A cell, each and every cell, hosts thousands of not-celled organisms, both RNA and DNA organisms.

    Exploring genetic diversity
    Readjust concepts and comprehension

    "03.2010 Updated Life Manifest"
    the-scientist.com
    Organisms that we host:

    a. 3rd stratum cellular organisms

    Our bacterial symbionts: There are ten times more bacteria colonizing a human than the number of human cells in the body (10^14 versus 10^13, respectively). Circa 900 taxa. The structures of their communities vary tremendously. The gut might be considered New York City, whereas the skin is perhaps more like Memphis


    b. 2nd stratum cellular organisms, DNA multigenes, genomes, cooperative operational work-patterns for the RNA genes, evolved and maintained by the RNAs for carrying on life processes.

    The RNAs work pattern genes, the cooperative-linked DNA genome-members, are distributed unevenly across the chromosomes. Each chromosome contains various gene-rich and gene-poor regions, which seem to be correlated with chromosome bands and GC-content. The significance of these nonrandom patterns of gene density is not well understood.


    c. 1st stratum not-celled primal Earth ORGANISMS, the individual RNA genes and the above DNA genome members genes.

    There are estimated to be between 21,000 and 26,000 human protein-coding DNA genes, organisms, linked members of the DNA genome.

    In addition to protein coding DNA genes, the human genome contains thousands of RNA genes, including tRNA, ribosomal RNA, microRNA, and other non-coding RNA genes.

    Surprisingly, the number of human DNA genes seems to be less than a factor of two greater than that of many much simpler organisms, such as the roundworm and the fruit fly. However, RNAs in human cells make extensive use of alternative splicing to produce several different proteins from a single DNA gene, and the human proteome is thought to be much larger than those of the aforementioned organisms. Besides, most human DNA genes have multiple exons, and human introns are frequently much longer than the flanking exons.


    Dov Henis
    (Comments From The 22nd Century)
    the-scientist.com user/profile

    Cosmic Evolution Simplified
    the-scientist
    Seed of Human-Chimp Genomes Diversity
    pulse.yahoo.com
    Evolution, Natural Selection, Derive From Cosmic Expansion
    darwiniana.com
    Dov Henis Dov Henis
    Oct. 27, 2010 at 4:35am
Registered readers are invited to post a comment. To encourage fruitful discussion, please keep your comments relevant, brief and courteous. Offensive, irrelevant, nonsensical and commercial posts will not be published. (All links will be removed from comments.)

You must register with Science News to add a comment. To log-in click here. To register as a new user, follow this link.

Follow Us