| Terabytes of data were moved from the Tier-0 site at CERN to Tier-1 sites across Europe (seven sites), America (one site in America and one in Canada) and Asia (one site in Taiwan). Data transfer rates reached the expected maximum during early September. Image courtesy of ATLAS |
Quasi real-time, says Bos, will be more than sufficient to keep the chain moving. “It’s good that we could achieve this transfer in hours, but with the real data there will be a greater delay. We will need to do regular calibrations and the processing and subsequent data transport will have to wait for that. The data chain is designed to cope with this delay and there are sufficient disk buffers at the Tier-0 stage to keep the data for as long as several days.”
Domino data chain of events The data chain is triggered only when a particle enters the ATLAS detector and produces a signal in its sensitive layers. “We know these particles must be muons, because everything else is stopped by the meters of clay above the detector. Only muons can get through and only muons can trigger the detector,” Bos explains. Once the particle has been detected, the domino data chain is set in motion. The raw data is sent to the Tier-0 center, the computer center at CERN, where it is recorded onto tape before being sent to a different part of the center for reconstruction. “The reconstructed data also goes to tape,” says Bos. “It is then exported to all the Tier-1s, and when it arrives it is exported by the Tier-1s to their Tier-2s. When it arrives at the Tier-2s the physicists can pick it up and say, ‘wow, look at that’”. Bos says that while researchers from different Tier-2 centers will be analyzing the data in different ways, they will all be analyzing the same data. “The data analyses are bound to be different, but if you say ‘give me the data from Run 123, Event 345 on Sunday morning’, they should all be able to provide the same thing.”
This data chain will be used to share ATLAS data with more than 1900 physicists from more than 160 participating institutions in 35 countries. The massive data transfer requirements are supported in Europe and the U.S. by the Enabling Grids for E-sciencE and Open Science Grid infrastructures. - Cristy Burne, iSGTW |