[Home]   [Full version]  

Darwin's famous finches and Venter's marine microbes

Mar 13 ,General Science


Although the Galápagos finches were to play a pivotal role in the inception of Darwin’s theory of evolution through natural selection, he had no inkling of their significance when he collected them during his voyage on the HMS Beagle.

Similarly, it is hard to predict the impact the vast amount of marine microbial DNA – collected during the Sorcerer II Global Ocean Sampling Expedition by J. Craig Venter, Ph.D., and his team – will have on our understanding of the natural world.

"If anything, this is just the beginning," says Gerard Manning, Ph.D., director of the Razavi Newman Center for Bioinformatics at the Salk Institute for Biological Studies. "We’re starting to explore this trove of sequences now, but it may be decades before we fully understand it all."

Just like the famous ornithologist John Gould who had to classify the Galápagos finches before they led Darwin on the right track, Manning and many others have been busy during the last couple of months wading through roughly 7.7 million sequenced snippets of sea-borne genomic DNA to impose order on the flood of data and to classify the identified proteins.

Their findings are detailed in series of papers, published in this week’s online edition of the journal Public Library of Science Biology.

The authors are plying the rapidly emerging trade of metagenomics (also known as environmental genomics) that seeks to examine genomic snapshots taken directly from the environment.

"Metagenomics allows us to sample the 99 percent of all bacteria that won’t grow in the lab," explains Manning. "GOS opens a huge window into biological and genomic diversity and, within this diversity, to better understand many of the fundamentals of biology." he adds.

Expanding the universe of protein families

But instead of whole genomes, metagenomics produces a whole grab bag of bits and pieces for which scientists have to develop new methods to extract meaning. In one of the papers, an array of scientists, spearheaded by first author Shibu Yooseph, Ph.D., and his colleagues at the Craig Venter Institute, compared every DNA fragment with every other available DNA fragment to produce clusters of related sequences. This exhaustive analysis predicted more than 6 million proteins in the GOS data – nearly twice the number of all proteins ever described before – and laid the groundwork for further studies.

Manning, a co-author on Yooseph’s paper, looked at the other side of the coin. He ran all the public sequences and GOS data against Pfam, a collection of signature profiles for all known protein families. Each of these profiles is an average of all known members of a certain protein family.

"Instead of starting with a human kinase to find a bacterial kinase, for example, you start with all of them together, which makes the search much more sensitive, but also very computationally expensive," Manning says. "We did almost 350 million comparisons, which is probably an order of magnitude or two more than anybody has ever done before."

Manning and co-author Yufeng Zhai, Ph.D., a bioinformatics programmer in the Razavi Newman Center for Bioinformatics at the Salk, could only accomplish this rather gargantuan task with the help of Time Logic, a company in Carlsbad, California. The company specializes in hardware that accelerates genomic searches. "We only have one of their accelerators, but Time Logic stepped up and lent us eight more," says Manning. The final computation took two weeks, but would have taken well over a century on a traditional computer.

The Salk scientists could assign over half of all GOS sequences to known protein families, and discovered that certain protein profiles are more popular in the ocean or on land. For example, gram-positive bacteria are best known for their hardy spores, but this ability has been entirely lost in their marine relatives.. Flagella, whip-like extensions propelling bacteria forward and pili, short extensions used to exchange genetic material between bacteria (also known as microbial sex), are also less frequent in marine environments.

"By comparing our findings with the Yooseph clusters, we also discovered hundreds of new gene families that hadn’t even been seen before," says Zhai and adds that by adding the diverse GOS data to known profiles, "we were able to make them more sensitive and diverse, and so increase their power to categorize novel sequences."

Diversity of microbial kinases

In a separate study, Manning, Zhai, and first author Natarajan Kannan, Ph.D., a postdoctoral researcher in the lab of HHMI investigator and UCSD professor Susan S. Taylor, Ph.D., traded the breadth of the ocean survey for the depth of a single protein domain. They zoomed in on kinases, extremely well studied enzymes, which control every aspect of eukaryotic cell biology and are important cancer drug targets. They control the activity of proteins and small molecules by attaching tiny phosphate groups to them. By contrast, much less has been known about their bacterial counterparts.

Again and again, the researchers combed the GOS data for bacterial kinases, each time rebuilding their domain profiles by including the new members found in the previous round. All in all, they dug up 45,000 protein kinase sequences that fell into 20 distinct families, of which the
eukaryotic protein kinases are just one. The additional 19 families spanned a huge range and included several that had never been described before.

"Prokaryotic protein-like kinases were considered to be some sort of niche players, but actually they are more prevalent and widespread than histidine kinases," explains Manning. Bacteria were thought to rely mostly on histidine kinases, which are structurally different from protein kinases, for all their signaling needs.

Even though the different kinase families had very little similarity in their sequence, it emerged that 10 key residues were conserved in almost all kinase families, fingering them as being at the core of what it means to be a kinase. Seven of those had been previously known to be important in human kinases, but the other three were unexpected finds.

The other surprising finding was just how innovative and plastic the different families were, even with these core residues, as one or another family had found ways to eliminate any but one of the 10 key residues. Using structural modeling, and patterns of sequence conservation, Kannan was able to show that loss of one key residue could be compensated by changes around other conserved regions of the protein, and that some of these changes in bacterial kinases are also seen in specific human kinases.

Says Manning, "By looking at all these very distant microbial relatives we can understand more even about human kinases and their relationship to cancer and other diseases. We go out into the ocean, we find all this diversity and analyzing what’s new and what’s not new reflects back on the things we thought we knew well."

Research done at the Salk Institute was supported by the Razavi-Newman Foundation.

Source: Salk Institute

Related stories:

New discovery about growth factor can be breakthrough for cancer research
A research team at the Ludwig Institute and Uppsala University has discovered an entirely new signal path for a growth factor that is of crucial importance for the survival and growth of cancer cells. This discovery, published in today’s issue of Nature Cell Biology, opens up an entirely new landscape for research on breast and prostate cancer, among other types.
Researchers uncover cancer survival secrets
A team of Monash University researchers has uncovered the role of a family of enzymes in the mutation of benign or less aggressive tumours into more aggressive, potentially fatal, cancers in the human body.
Researchers unveil vital key to cancer
University of Manchester scientists have uncovered the 3-D structure of Mps1 -- a protein that regulates the number of chromosomes during cell division and thus has an essential role in the prevention of cancer -- which will lead to the design of safer and more effective therapies.
Biology enters 'The Matrix' through new computer language
Ever since the human genome was sequenced less than 10 years ago, researchers have been able to access a dizzying plethora of genomic information with a simple click of a mouse. This digitizing of genomic data—and its public access—is something that would have been unthinkable a generation earlier.
Discovery of a new signaling mechanism may lead to novel anti-inflammatory therapy
A team of researchers at the University of California, San Diego School of Medicine has uncovered a new signaling mechanism used to activate protein kinases that are critical for the body's inflammatory response. Their work will be published in the July 18 online edition of Science (Science Express).
Crossed (evolutionary) signals?
What do humans and single-celled choanoflagellates have in common? More than you'd think. New research into the choanoflagellate genome shows these ancient organisms have similar levels of proteins that cells in more complex organisms, including humans, use to communicate with each other.
SEX4, starch and phosphorylation
Some of the new molecular mechanisms and regulatory components in starch metabolism have been identified by Dr. Samuel Zeeman and his colleagues. Dr. Zeeman, of the Institute of Plant Sciences, ETH Zurich, in Switzerland, who is the 2007 recipient of the Charles Albert Shull Award, will be presenting this work at the opening Awards Symposium of the annual meeting of the American Society of Plant Biologists in Mérida, Mexico (June 27, 2:30 PM). Mutational and structural analyses by Dr. Zeeman and his colleagues have revealed that starch degradation in Arabidopsis leaves at night differs significantly from the versions traditionally described in textbooks. Specifically, mutations at the Starch Excess 4 (SEX4), Maltose Excess 1 (MEX1) and other loci produce plants unable to metabolize starch to a usable form.
Protein discovery may bolster antibiotic development
A team of scientists from Queen’s University has discovered the first ever three-dimensional structure of a protein family that may help in developing more effective antibiotics.

News discussion:

General Science news

[Home]   [Full version]