iSGTW - International Science Grid This Week
iSGTW - International Science Grid This Week
Null

Home > iSGTW 15 October 2008 > iSGTW Feature - DZero first consumer of opportunistic storage in OSG

Feature - Opportunistic storage increases grid job success rate


Processed events by DZero per week from May to October 2008. The vertical scale goes to 12M events. (Click image for per site representation.)

Image courtesy of DZero.

The DZero high-energy physics experiment at Fermilab, an Open Science Grid user, typically submits 60,000-100,000 simulation jobs per week at 23 sites. The experiment’s application executables make many requests for input data in quick succession.  Due to the lack of storage local to the processing sites, up until recently much of DZero’s simulated data had to transfer in real-time over the wide area network, leading to high latencies, job timeouts and job failures.

OSG worked with member institutions to allow DZero to use opportunistic storage, that is, idle storage on shared machines, at several sites. This represents the first successful deployment of opportunistic storage on OSG, and opens the door for other OSG Virtual Organizations. With allocations of up to 1 TB at sites where it processes jobs, DZero has increased its job success rate from roughly 30% to upwards of 85%.

Hosting storage resources is often tricky, especially for smaller grid sites, both in terms of hardware and professional expertise, says Abhishek Singh-Rana, coordinator of the Virtual Organizations group in OSG, which helps science communities achieve good results using the OSG. For this reason, the VO group negotiated with the larger OSG science communities, US ATLAS and US CMS, to allow other OSG communities to use their storage resources opportunistically. So far, DZero has used storage at six US-LHC Tier-2 sites, and is looking for more.

Tape robot

Image courtesy of Fermilab.  

Opportunities

Work to improve DZero’s job efficiency began in early July and by early August the experiment was producing about 3.7M events per week.  By the second week of September, production reached a record 11.0M events, a 130% increase in its average weekly OSG production rate for the past year.

DZero’s success demonstrates the OSG’s commitment to establishing relationships with its user communities in order to benefit all members.

“We are committed to the goals of the OSG, and that includes the development of opportunistic resources,” says Ken Bloom, manager of the CMS Tier-2 centers in the US. “When the OSG works well, all VOs can benefit.  If we can help get opportunistic storage working for DZero, then maybe DZero sites will make some of their storage opportunistically available to CMS, and if we can make good use of that, the reward will be well worth the effort.”

Marcia Teckenbrock, Open Science Grid

The OSG is continuing to work with its stakeholders and resource providers to improve the mechanism for using opportunistic storage. CDF and SBGrid have also expressed an interest in using opportunistic storage in the future. See the recommendations based on the DZero use scenario for how OSG sites can enable opportunistic storage.

Added 16 October 2008:

DZero's push for storage local to processing nodes on the grid was pioneereed by Joel Snow, of Langston University and Fermilab. Snow studied the low efficiency (high failure rate) problem, determined that local storage elements would be the key for improving the efficiency, and worked with OSG to implement a solution.

Tags:



Null
 iSGTW 1 September 2010

Feature - The forecast before the storm

Q&A - Joe Hellerstein on cloud programming

Q&A - People behind EGI: Steve Brewer steps in as the voice of the user

Poll of the week - Rock stars of scientific computing

Videos of the week - NoHardware.com destroys server huggers' equipment

 Announcements

Symposium on Authentication Technologies for Research and Education abstracts due

Grace Hopper early bird registration due

Gordon Conference 2010 abstracts due

Jobs in distributed computing

 Subscribe

Enter your email address to subscribe to iSGTW.

Unsubscribe

 iSGTW Blog Watch

Keep up with the grid’s blogosphere

 Mark your calendar

September 2010

August 29-Sept 3, CERN School of Computing

2-3, Citizen Cyberscience Summit

6-8, IASTED in Botswana

6-9, PRACE Training Week

6-10, GridKa School 2010

13-15, CaBIG

13-16, UK All Hands Meeting

14-17, EGI Technical Forum

20-24, Cluster 2010

27-29, ICT 2010

21-23, Cybera Summit 2010

More calendar items . . .

FooterINFSOMEuropean CommissionDepartment of EnergyNational Science Foundation RSSHeadlines | Site Map