Can we ever know the true measure of microbial diversity in the sea or of proteins in nature?
In three new metagenomic studies published online in the open-access journal PLoS Biology, Craig Venter and his team take advantage of the vast amount of microbial sequence data collected during their Sorcerer II Global Ocean Sampling (GOS) expedition to reveal an unprecedented level of genetic and protein diversity in marine microbes.
Venter's team combined the expedition's latest bounty, 6.5 million sequencing "reads," with data previously collected during a pilot study in the Sargasso Sea. The result is a geographically diverse environmental genomic dataset of 6.3 billion base pairstwice the size of the human genome.
The first paper and accompanying poster by Douglas B. Rusch and colleagues describe the immense amount of microbial diversity in the seas, and discuss howor ifthat diversity is structured and what might be shaping that structure. The second paper by Shibu Yooseph and colleagues studies the 6.12 million proteins identified in the GOS sequences to see if we're close to discovering all the proteins in nature. In the third study, Natarajan Kannan, Susan S. Taylor, Gerard Manning, and colleagues present their classification of 45,000 kinases (including 16,000 from the GOS dataset) into 20 distinct families, revealing their structural and functional diversity and an unexpected role for kinases in prokaryotic signaling.
This collection also includes an accessible and nontechnical summary of the broad significance of this research by Liza Gross. Some unexpected intellectual property challenges have arisen from this project, and these are explored in a feature by Henry Nicholls. A "challenge series" essay by Jonathan Eisen provides insight into the issues surrounding the field of metagenomics today.
To host all the additional metadata that surround metagenomic studies, a new database, CAMERA, has been establish
Contact: Natalie Bouaravong
Public Library of Science