
Preserving the confusion of tongues


An engraving of the Confusion of Tongues, by Gustave Doré (1865), depicting the Biblical tale that explains the multiplicity of languages as a way to halt the building of the ambitious Tower of Babel. Image courtesy Wikimedia.

Languages and cultures have changed throughout history as a result of contact between communities and changes in living conditions. Many of the languages people once spoke, and the cultures they lived in, have disappeared over the last few centuries, leaving only about 6,500 languages.

However, because of globalization and technological innovation in recent decades, the rate of change and endangerment of languages and cultures has increased so sharply that about one language now dies every week. Since 96% of languages are spoken by only 3% of people, extinction mainly affects those areas where many languages are each spoken by only a few people.

Language change, and with it cultural change, also affects widely spoken languages such as English. Here the causes are quite different trends, such as the immigration of growing numbers of people and creolization.

"Unique creations of evolution"

Since languages can be seen as unique creations of evolution, designed to help people survive in their environments, we lose a treasure of mankind with every language that dies. We also seem to be observing a blurring of structural differences between languages. We are facing a gigantic loss of knowledge and cannot assume that these trends will stop.

More than ever, we need to document our languages and, if we do it properly, also the cultural background against which they are spoken. Language documentation can be used to maintain diversity where possible, to better understand how languages are constructed, and to transfer knowledge to future generations. Since we cannot foresee what future generations will do with this information, we need to carry out this documentation work carefully and also take care to preserve our digital records.

This was the basic motivation for the DOBES program, which started in 2000 and now covers about 50 multinational and multidisciplinary teams of linguists, anthropologists, musicologists, ethno-biologists, and others. They are documenting more than 70 languages from all over the world, from Iwaidjan on the Cobourg Peninsula in Northern Australia and Totoli in Sulawesi, Indonesia, to Gorani in Northwest Iran and Awetí in Mato Grosso, Brazil.

No longer as simple as compiling a wordlist

Language documentation is no longer seen as merely producing a description of the grammar and some wordlists; it now needs to be based on large amounts of primary data, such as audio and video recordings of speakers. Video recordings in particular also capture the environment in which languages are spoken.

These media recordings are transcribed to a certain extent, and a free translation into one of the major languages is created. For some material, morphosyntactic glossing - that is, annotation of linguistic content (morphology, syntax, semantics) - is added to describe part of the linguistic system. Other types of information can be added through special analyses, such as descriptions of gestures or anthropological phenomena.

This annotation work is very time consuming since it has to be done manually. For higher-level linguistic annotations, it can take more than 100 times real time. In addition, lexica are derived and, where possible, sketch grammars are added. Conceptual spaces, in which major concepts are related to one another, allow documenters and language community members to access the documentation material from a cultural point of view.
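To put the figure of 100 times real time into perspective, here is a rough back-of-the-envelope sketch in Python. Only the ratio of 100 comes from the talk; the function name and the ten-hour corpus are illustrative assumptions, not DOBES figures.

```python
def annotation_hours(recording_hours: float, ratio: float = 100.0) -> float:
    """Estimate manual annotation effort as a multiple of recording length.

    `ratio` is the annotation-time-to-recording-time factor; 100 is the
    lower bound quoted for higher-level linguistic annotation.
    """
    if recording_hours < 0 or ratio < 0:
        raise ValueError("recording_hours and ratio must be non-negative")
    return recording_hours * ratio


# Hypothetical corpus: 10 hours of recordings annotated at 100x real time.
hours = annotation_hours(10)
print(f"{hours:.0f} hours of annotation work")
```

Under these assumptions, even a small corpus would need on the order of a thousand hours of manual work, which is why only part of the recorded material receives deep annotation.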

It is now well understood that digital data will be lost within a short time unless it is deposited in a digital repository that fulfills a number of criteria. This is why, from the beginning, the DOBES project has been associated with a digital archive that takes care of bit-stream preservation and format curation.

Four copies at large data centers

Bit-stream preservation is supported by generating four external, dynamic copies of all data objects at large remote data centers, and by setting up 10 regional archives in places where the recorded languages are spoken.
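Keeping several copies only helps if they can be checked against one another. The following is a minimal sketch of such a check in Python, comparing SHA-256 checksums of an object and its copies. It is an illustration under stated assumptions, not a description of how the DOBES archive works: the function names are hypothetical, and the copies are assumed to be reachable as local file paths.

```python
import hashlib
from pathlib import Path
from typing import Iterable


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def copies_agree(original: Path, copies: Iterable[Path]) -> bool:
    """Check that every copy has the same checksum as the original.

    Hypothetical helper: a mismatch means at least one bit stream differs.
    """
    reference = sha256_of(original)
    return all(sha256_of(copy) == reference for copy in copies)
```

A mismatch does not say which copy is damaged, only that the bit streams differ. With four copies, a majority comparison of the checksums could be used to identify the odd one out.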

The use of standards is the basis for the long-term interpretability of the data. Metadata is important too, since it allows objects to be related to each other and provides the contextual and provenance information needed to interpret an object.

The DOBES program can also be seen as a very successful start to the eHumanities era, since it helped change the scientific culture and since the archived data can now be used for cross-linguistic studies.

The goals and principles of DOBES were an excellent starting point for the infrastructure work being done in CLARIN, one of the projects selected for the ESFRI roadmap. CLARIN’s goal is to establish an integrated and interoperable domain of language resources and tools, the persistence of which will be guaranteed by a network of strong service centers. An early example of these integration efforts is the Virtual Language Observatory, which also covers, for example, all DOBES data.

This is a brief summary of the talk "Language and culture documentation in DOBES" given by Peter Wittenburg at APAN in Mumbai this week.


