 |
|
Distribution of BLAST jobs from one Pyrosequencing run (96,866 jobs total, 29 April to 7 May 2009) with the glide-in factory configured for: three TeraGrid resources, one OSG resource, one RENCI resource, and one UNC-CH resource.
Image courtesy of John McGee, Jason Reilly, and Mats Rynge (RENCI)
|
“We created an application that communicates with RENCI's TeraGrid Science Gateway,” said Jason Reilly, a RENCI senior research software developer. “For the user, it’s very simple — just log in and the application maps the data input to specific tasks to be done. The beauty is you don’t have to submit commands over and over again. You can run hundreds or even thousands of operations and you only have to submit the command once.”
The custom application created by Reilly was dubbed BLASTMaster because it builds on the Basic Local Alignment Search Tool (BLAST) used to search sequence databases. BLASTMaster divides commands into tasks and pushes the work to RENCI's TeraGrid Science Gateway, which submits, monitors, and manages the compute workload on systems that are part of TeraGrid’s nationwide network of high performance machines, and to OSG machines. After entering the initial commands, the researchers merely had to wait for their results.
“Large computer farms that we might use are often composed of heterogeneous smaller clusters,” said Wang. “The BLASTMaster tool and a Web services environment is particularly useful to those of us without much experience using compute clusters. It gives us a uniform interface to submit jobs, which greatly enhances our productiveness.”
The sequence analysis work used TeraGrid resources at Purdue University (West Layfayette, IN), OSG resources at RENCI (Chapel Hill, NC) and a cluster in the University of North Carolina at Chapel Hill computer science department supported by the National Institutes of Health. The work has real-world value taken straight from recent headlines about the H1N1 virus.
“Knowing the genomic sequence of a human virus allows for quicker diagnostics to identify infections,” said Delwart. “Quicker diagnostics can lead to more informed decisions on how an emerging virus is spread and how to control it. Knowing the sequence can also help make vaccines or anti-virals against that virus.”
—Karen Green, RENCI
|