 |
|
Internet2 and the Open Science Grid have formed a partnership to deploy advanced network monitoring and diagnostic tools on the grid.
Image courtesy of invoice-factoring.com.au.
|
Network performance problems bedevil scientists and network operators alike. Troubleshooting requires a multi-pronged approach, and problem resolution often drags – holding back scientific progress.
To address this in the grid community, Internet2, the leading U.S. networking consortium in the research and education community, has formed a partnership with the Open Science Grid to deploy a set of advanced network monitoring and diagnostic tools on the grid.
Network users and operators solve problems most quickly when given transparent access to the entire network path involved. Typically, however, separate organizations operate different segments along a full network path and each so-called “domain” controls access to its own diagnostic and performance data. Some organizations make their performance data available while others do not. Without this data, troubleshooters find it extremely difficult to isolate problems.
Adding to the difficulty, performance degradation is often due to factors seemingly unrelated to the network. For example, an improperly configured host or an application’s inadequate internal buffer limits can masquerade as a network bottleneck.
A global collaboration including ESnet, GÉANT, Internet2, and RNP has developed an open, modular infrastructure of services and applications called perfSONAR that enables the gathering and sharing of network performance information, and facilitates troubleshooting of problems across network domains. OSG and Internet2 are working together to begin deploying perfSONAR-ps tools, an easy-to-install implementation of the perfSONAR protocols, in the grid community. Grid operators and network engineers can use these tools to check network performance on specific compute nodes, storage elements or network segments. They will be able to quickly identify problems that impact network performance, and isolate the suspect domain or network segment so that its operators can make corrections in a timely manner.
|