At Argonne National Laboratory, computational biologists Folker Meyer and Elizabeth Glass view charts of metagenomic data analysed using grid computing resources.
Image courtesy of ANL.
MG-RAST, which came online in 2008, is a free, fully automated online service for annotating the metagenome (the set of fragments of sequenced DNA) of an environmental sample. It currently has more than 1,500 users and houses more than 2,600 private and about 300 public metagenome datasets.
Researchers upload their sample’s metagenome, and MG-RAST uses a variety of computing resources (Argonne’s 800-core cluster, TeraGrid and cloud computing) to compare the DNA fragments to those from every other sample in the system, as well as to gene sequences in several other publicly available databases. Through its relationship with the nonprofit organization Fellowship for the Interpretation of Genomes on its “Project to Annotate 1,000 Genomes,” the MG-RAST team also has access to a large base of smaller curated genome datasets. The software uses similarity to known genes to guide the reconstruction of the various species in the sample and to provide information on their functions.
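To give a feel for what "similarity to known genes" means in practice, here is a minimal sketch in Python. It is not MG-RAST's code, and it swaps in a much simpler technique than a production pipeline would use: it scores a DNA fragment against each reference sequence by the overlap of their short subsequences (k-mers) and reports the best match above a threshold. The function names, the threshold and the toy reference sequences are all assumptions made for illustration.

```python
from typing import Dict, Optional, Set, Tuple

# Hypothetical toy reference "database": label -> known gene sequence.
# These sequences are invented for illustration, not real genes.
TOY_REFERENCE: Dict[str, str] = {
    "gene_A": "ATGGCGTACGTTAGC",
    "gene_B": "TTTTTTTTCCCCCCCC",
}


def kmers(seq: str, k: int = 4) -> Set[str]:
    """Return the set of all length-k substrings of a DNA sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity of two k-mer sets (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def annotate(fragment: str,
             reference: Dict[str, str],
             k: int = 4,
             min_score: float = 0.3) -> Tuple[Optional[str], float]:
    """Return (best reference label, score); label is None below min_score."""
    frag_kmers = kmers(fragment, k)
    best_label: Optional[str] = None
    best_score = 0.0
    for label, ref_seq in reference.items():
        score = jaccard(frag_kmers, kmers(ref_seq, k))
        if score > best_score:
            best_label, best_score = label, score
    if best_score < min_score:
        # Too dissimilar to anything known: leave the fragment unclassified.
        return None, best_score
    return best_label, best_score
```

Calling `annotate(fragment, TOY_REFERENCE)` returns the label of the most similar reference sequence, or `None` when nothing in the reference set is close enough. That second case mirrors the situation where a sample contains organisms that no database yet covers.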
The databases do not contain the genome of every species of microbe, often making it difficult to classify the organisms in a sample. “It is estimated there are at least 200 major groups of bacteria, and we (the public sector) only have genome data for about 10 of them,” said Eisen.
“Although there is still much work ahead, metagenomics provides a powerful new tool to help researchers better understand microbes they cannot grow in the lab,” Meyer said. “Metagenomics is more or less unleashing our ability to study the genomics of microbes from all sorts of environments across the planet.”
—Amelia Williamson, for iSGTW