Making data more searchable, shareable, & citable

A panoramic view of the Institut Laue-Langevin in France. The laboratory recently established a new common data policy to make its data more open for the benefit of researchers, laboratories, funders, and the public. It will come into effect this autumn. Image courtesy Peter Ginter, ILL.

While the scientific paper remains the principal way researchers share their findings, the foundation of a project, its primary raw data, is becoming equally important, if not more so. For science to be truly productive, it needs widespread initiatives that open up access to data, fair policies defining how data is accessed, and clear incentives linked to research impact. Europe's leading neutron source, the Institut Laue-Langevin (ILL), and other central facilities in Europe have been working together to develop a shared infrastructure that makes their data more available to scientists all over the world.

The ILL, nestled within the French Alps in Grenoble, France, feeds neutrons to a suite of 40 high-performance instruments, helping 2,000 visiting researchers perform over 800 experiments. Neutron scattering supports research in fields as diverse as materials science, molecular biology, and nuclear and fundamental physics.

The resulting data avalanche makes it essential that the institute implements a framework for sustainable data management and analysis as part of its service. “There is growing recognition that these data should be networked and preserved for future studies to reuse in replicating and validating scientific conclusions,” said Jean-Francois Perrin, head of IT services at the ILL.

Although the ILL monitors its published papers, of which there are more than 660 per year, the institute also holds many different types of data (raw data and metadata) and uses a range of protocols to collect and store them.

The scope of scientific data management is also broad. Not only does the raw data have to be curated, but its context has to be described (i.e. how, when, and by whom a particular set of data was collected and formatted). This is known as ‘metadata’ (e.g. experimental conditions, instrument type, date or time, compression algorithms, or software code).
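To make the idea of metadata concrete, here is a minimal sketch in Python of what one such record could look like. The class name, field names, and example values are all hypothetical, chosen only to mirror the items listed above; they do not describe the schema used by the ILL or by any particular catalogue.

```python
from dataclasses import dataclass, field, asdict
from datetime import datetime
import json


@dataclass
class DatasetMetadata:
    """Context for one raw dataset: how, when, and by whom it was collected."""
    collected_by: str                       # by whom
    collected_at: datetime                  # when
    instrument_type: str                    # how: the kind of instrument used
    experimental_conditions: dict = field(default_factory=dict)
    compression_algorithm: str = "none"     # how the raw file was compressed
    software: str = ""                      # code used to format the data

    def to_json(self) -> str:
        """Serialize the record so it can be stored next to the raw data."""
        record = asdict(self)
        record["collected_at"] = self.collected_at.isoformat()
        return json.dumps(record, sort_keys=True)


# Hypothetical values, for illustration only.
example = DatasetMetadata(
    collected_by="A. Researcher",
    collected_at=datetime(2012, 6, 1, 9, 30),
    instrument_type="hypothetical-diffractometer",
    experimental_conditions={"temperature_K": 300},
    compression_algorithm="gzip",
    software="hypothetical-reduction-tool 1.0",
)
print(example.to_json())
```

Keeping the record separate from the raw file means the context can be searched and shared even when the data itself is large or still restricted.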

ILL is one of 13 neutron and photon laboratories in Europe, so a joint approach to data policy was important. This ambitious task was undertaken by the PaNdata Open Data Infrastructure project.

Although raw data has been published since the very first experiments were carried out at ILL in 1972, the metadata needed for further analysis and replication was not readily accessible. The solution was to develop a collaborative open access repository in which the community can deposit metadata, which is now provided through the ICAT catalogue. “The fact that our data is openly accessible, strongly contributes to collaboration between scientists. Open access serves science and the scientists by creating opportunities and a better reward of our users’ work,” said Perrin.

What does open mean?

Deciding what constitutes ‘open’ is particularly important when developing a policy. The concept of open data was first established over fifty years ago, but a formal definition was set out only recently by the Open Knowledge Foundation Working Group on open data in science: “a piece of data is open if anyone is free to use, reuse, and redistribute it, subject only, at most, to the requirement to attribute and/or share-alike.”

Before developing a data policy, an organization has to fully understand and audit its requirements for data availability and management. The ILL undertook a year-and-a-half-long consultation with its users, from April 2010 until November 2011, before publishing its Common Data Policy, which will be applied this autumn.

International and national funding bodies are now also introducing policies to encourage a culture of data sharing. The National Science Foundation, in the US, requires data management plans, including provisions for access and sharing, to be submitted in conjunction with grant applications.

As the open data movement matures, one of the main barriers to accessing research data, highlighted in an EC survey published in April 2012, is a lack of national or regional data policies. However, trusted experts are busy developing strategies for handling their data. In May 2012, one of the largest international scientific collaborations, the Compact Muon Solenoid (CMS) experiment at the Large Hadron Collider, announced its policy to manage and share its unique data.

But how do organizations balance the need for ‘openness’ in science with confidentiality and security concerns? To safeguard researchers from being pre-empted, the ILL proposes a three-year embargo period.

“The ILL provides the beam, the instruments and experts, but it’s the user who produces the idea of the experiment and prepares the samples. After the experiment, the researcher needs time before releasing the data. This can often take a while or even necessitate more than one single experiment, and three years corresponds to this necessary gestation period, and also the typical duration of a PhD,” said Professor Helmut Schober, science director at the ILL. The CMS policy has a similar embargo period.
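As a rough illustration of how a repository might apply such an embargo, the sketch below computes the date on which a dataset would become open. It assumes the three years are counted from the date of the experiment, which the article does not specify; the function names are hypothetical and are not taken from any ILL or CMS software.

```python
from datetime import date

EMBARGO_YEARS = 3  # the three-year period described above


def embargo_end(experiment_date: date, years: int = EMBARGO_YEARS) -> date:
    """Return the first day on which the data may be released.

    Assumption: the embargo clock starts on the experiment date.
    """
    try:
        return experiment_date.replace(year=experiment_date.year + years)
    except ValueError:
        # 29 February in a non-leap target year: fall back to 28 February.
        return experiment_date.replace(year=experiment_date.year + years, day=28)


def is_open(experiment_date: date, today: date) -> bool:
    """True once the embargo period has fully elapsed."""
    return today >= embargo_end(experiment_date)


print(embargo_end(date(2012, 10, 1)))                 # 2015-10-01
print(is_open(date(2012, 10, 1), date(2014, 1, 1)))   # False
```

A real policy would also need to say who can lift the embargo early and whether it can be extended, neither of which this sketch attempts to model.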

Perrin said that for the policy to be successful, scientific publications will need to cite explicitly not only other publications, but also experimental data and the teams that produce it. “Often scientists would like to access this information too, reuse the raw data, and as yet, a 'static' image of a graph in a traditional journal doesn't allow this,” said Perrin.

Making data citable on its own

An enhanced publication is a totally new way of publishing in which a traditional publication (a book, article, or a report) is enriched with additional information. An enhanced publication relies on the linking possibilities of the web and includes objects such as primary research results, audio, or video fragments. Image courtesy SURF.

A number of metrics exist for measuring a publication's impact (H-Index, Impact Factor), but there are far fewer recognized methods for making data citable. Several European projects (DataCite, OpenAIREplus, and Opportunities for Data Exchange) are helping to incentivize and assist data sharing. Just as you can cite other sources of information, such as articles and books, DataCite is creating a scholarly structure for identifying and referring to data that will facilitate recognition and reward for data producers.
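To show what citing a dataset "just like an article" could look like in practice, here is a small Python sketch that assembles a citation string from a handful of fields. The pattern used, creators, year, title, publisher, then a persistent identifier, is an assumption modelled on common data-citation practice, and the function name, the example names, and the DOI are made-up placeholders rather than real records.

```python
from typing import List


def format_data_citation(creators: List[str], year: int, title: str,
                         publisher: str, identifier: str) -> str:
    """Build a one-line dataset citation.

    Assumed pattern: Creators (Year): Title. Publisher. Identifier
    """
    names = "; ".join(creators)
    return f"{names} ({year}): {title}. {publisher}. {identifier}"


# Placeholder values, for illustration only.
citation = format_data_citation(
    creators=["Researcher, A.", "Scientist, B."],
    year=2012,
    title="Hypothetical neutron scattering dataset",
    publisher="Example Facility",
    identifier="doi:10.5072/example",
)
print(citation)
# Researcher, A.; Scientist, B. (2012): Hypothetical neutron scattering dataset. Example Facility. doi:10.5072/example
```

The persistent identifier is what lets the citation keep pointing at the same dataset even if the hosting repository reorganizes its pages.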

Linking peer-reviewed research publications and datasets is also important. The OpenAIRE project provides a large-scale repository in which European researchers can deposit and access articles and data. The OpenAIREplus initiative builds on it: this FP7-EC-funded project extends the OpenAIRE infrastructure to cover scientific data and is developing the concept of enhanced publications (EPs), in which research papers link to supplementary data.

Tim Smith, group leader of Collaboration and Information Services at CERN, has been heavily involved in the OpenAIRE and OpenAIREplus projects from the start, and said, “An EP is a compound object which groups together the paper with all associated items such as metadata, datasets, persons involved (by reference), and subsidiary articles.”
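Smith's description of a compound object can be sketched as a simple data structure. The Python below is only an illustration of the idea: the class, its field names, and the identifiers are hypothetical and do not reflect the actual OpenAIREplus data model.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class EnhancedPublication:
    """A paper grouped together with the items associated with it."""
    paper: str                                           # identifier of the paper
    metadata: Dict[str, str] = field(default_factory=dict)
    datasets: List[str] = field(default_factory=list)    # dataset identifiers
    persons: List[str] = field(default_factory=list)     # held by reference, not embedded
    subsidiary_articles: List[str] = field(default_factory=list)

    def linked_items(self) -> List[str]:
        """Every identifier the paper links out to."""
        return self.datasets + self.persons + self.subsidiary_articles


# Made-up identifiers, for illustration only.
ep = EnhancedPublication(
    paper="paper:example-001",
    metadata={"title": "Hypothetical study"},
    datasets=["dataset:example-raw", "dataset:example-reduced"],
    persons=["person:researcher-a"],
    subsidiary_articles=["paper:example-002"],
)
print(len(ep.linked_items()))  # 4
```

Storing people and datasets as references rather than copies means the same dataset or author record can be shared by many publications.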

On June 11th 2012, OpenAIRE hosted a workshop on EPs and on developing data policies, ‘Linking Open Access publications to data – policy development and implementation’, in conjunction with the Nordbib Conference 2012 in Copenhagen. Smith also said that a network of national open access desks is on hand to advise repositories on best practice and users on where to store and how to access publications. “While infrastructure and repositories are a necessary base, policies are there for vision and consistency, and a network of help desks is there to get the momentum going,” Smith said.

If e-science is to offer solutions to grand societal challenges and mysteries of the universe, it is clear that a new data dissemination model and common global policies will be needed to aid cross-disciplinary research communities.

Comments

Enhanced Publications in the Netherlands

We are very happy to see that our work on Enhanced Publications has received international attention. At SURF here in the Netherlands we started our work on Enhanced Publications back in 2008 in the SURFshare programme. One of the results was a film explaining the concept and allowing researchers to share their experiences. We are happy to see that the image from this video is being reused. We would however like to point out that the image is courtesy of SURF and not OpenAIRE. We are very happy to see that the concept and knowledge is being carried further in the OpenAIRE plus project.
Read more about the Dutch experiences: www.surf.nl/enhancedpublications.
