iSGTW - International Science Grid This Week
iSGTW - International Science Grid This Week
Null

Home > iSGTW 28 May 2008 > iSGTW Feature - Relationships in texts UNC

Feature - Claim jumping through scientific literature


Image courtesy of karnatakaforest.gov.in

Finding relationships in texts from diverse fields

Researchers in search of higher productivity and more effective cross-disciplinary collaboration can add a new tool to their kit.

Catherine Blake, an assistant professor in the School of Information and Library Science at the University of North Carolina , has developed a method for retrieving, analyzing and finding relationships in published research texts from multiple disciplines. Claim Jumping through Scientific Literature, as it is called, enables researchers in a given field to collect relevant published information from other fields of study.

Blake’s team analyzed 162,000 documents from a variety of scientific fields using a process called dependency parsing, which analyzes the grammatical structure of sentences to find meaningful relationships among words in different documents.

“Generating these results on a high-end desktop computer would have required about 15,000 hours, or seven and a half years of 40-hour work weeks,” says Blake.

The Renaissance Computing Institute (RENCI), a joint venture between several North Carolina universities and the state government, provided Blake with technical expertise, onsite resources, and connections to Open Science Grid’s nationwide network of computing resources. The work was done through RENCI’s Faculty Fellows program, an effort funded by UNC Chapel Hill that partners faculty members on the Chapel Hill campus with experts and resources available through RENCI.

Catherine Blake of UNC-Chapel Hill

 Image courtesy of UNC

Blake and her research team now generate the sentence structure of millions of sentences in about 28 hours using Open Science Grid resources.

“It is an ideal project for the grid because the job is already carved up into small pieces that can use computer resources anywhere, " Blake says. “Reducing the processing time has enabled us to investigate otherwise prohibitively complex research questions.”

Blake and RENCI are currently working to provide the text processing as a service to researchers in natural language processing.

“Allowing researchers to access computationally expensive tools such as these without the initial start-up time would be of enormous value,” she adds. 


- Karen Green, RENCI

Tags:



Null
 iSGTW 1 September 2010

Feature - The forecast before the storm

Q&A - Joe Hellerstein on cloud programming

Q&A - People behind EGI: Steve Brewer steps in as the voice of the user

Poll of the week - Rock stars of scientific computing

Videos of the week - NoHardware.com destroys server huggers' equipment

 Announcements

Symposium on Authentication Technologies for Research and Education abstracts due

Grace Hopper early bird registration due

Gordon Conference 2010 abstracts due

Jobs in distributed computing

 Subscribe

Enter your email address to subscribe to iSGTW.

Unsubscribe

 iSGTW Blog Watch

Keep up with the grid’s blogosphere

 Mark your calendar

September 2010

August 29-Sept 3, CERN School of Computing

2-3, Citizen Cyberscience Summit

6-8, IASTED in Botswana

6-9, PRACE Training Week

6-10, GridKa School 2010

13-15, CaBIG

13-16, UK All Hands Meeting

14-17, EGI Technical Forum

20-24, Cluster 2010

27-29, ICT 2010

21-23, Cybera Summit 2010

More calendar items . . .

FooterINFSOMEuropean CommissionDepartment of EnergyNational Science Foundation RSSHeadlines | Site Map