What’s the generic term for a carbonated beverage?
Your answer to this question will be determined by where you live. Answers vary wildly, from soda, pop, and coke (even if the soda pop is not Coca-Cola) in various states of the US to ‘soft drink’ or ‘fizzy drink’ on the other side of the Atlantic.
So it will come as no surprise to anyone who has travelled and tried to order a carbonated beverage that a language can vary from region to region. Similarly, anyone who knows how ‘to google’, heard a ‘vuvuzela’, used 'the grid' or witnessed a ‘bromance’ will know that a language can change over time as well; new words regularly appear in our vernacular, words change their meaning and old words slowly die away.
Dictionaries try to keep on top of changes; around 2,000 new words were added to the New Oxford American Dictionary in 2010 alone.
But beyond this there is no comprehensive, systematic analysis of vocabulary or word formation - why do some words thrive, while other combinations of letters will never be accepted into a language?
An interdisciplinary team from the University of Würzburg in Germany is addressing this problem by building a 'metadictionary' containing the core units of all the words used in the last 500 years in the German language, including those specific to regional dialects. The goal, the researchers said, is to develop and test methods and algorithms for detecting and understanding variance.
The core units of words, the smallest part that conveys any meaning, are called ‘morphemes’. For example, the word ‘craftsmanship’ can be broken down into three morphemes: craft + a 'gap element' s + man + ship, with each morpheme modifying the meaning of the total word.
When the researchers chopped up the words into their morphemes, they built a network where each morpheme was a node and each connection denoted that two morphemes were adjacent in a word. (See image, below)
“Not surprisingly, all networks showed the signposts of small world, scale free, hierarchical networks,” said Joerg Schultz from the University of Würzburg.
“A key feature of such networks is the existence of highly connected nodes called hubs. When we compared these nodes between the networks, we identified a change over time, that is, from 15th century over the 18th to today, but also between different regions at the same time. Thus, we could recall cultural changes by analyzing the morpheme networks,” Schultz said.
The researchers also looked at how the morphemes connected in words – and how these connections were gained and lost over time. They also saw how a new meaning is integrated into a language.
“We found that evolution of a meaning usually happens at the border of the network. Still there were exceptions, morphemes which were lost from the language though highly connected as well as morphemes which 'invaded' the language," Schultz said.
The team working on the project includes computer scientists and linguists, unsurprisingly, but Schultz is actually a biologist. “For us as biologists, the morpheme concept was quite familiar as there is a similar structure in biology: a protein is frequently composed of more than one functional part. These so-called domains can be seen as analogous to the morphemes which compose words,” said Schultz.
The team has already seen that models for understanding variance in language are similar to those that describe variance in genomics.
“Preliminary tests have shown that the corresponding networks have similar properties. This could be due to the fact that the generative processes behind evolution of language and genome might be comparable,” Dietmar Seipel, the computer scientist on the project, reported earlier this year at the International Symposium on Grids and Clouds and the Open Grid Forum in Taipei, Taiwan.
The researchers used publically available digitized dictionaries, such as the Middle High German Dictionaries, the early High German Dictionaries and dictionaries on regional dialects, and developed methods to transform the information from the dictionaries into code, while retaining all information, such as the part of speech, gender and inflectional detail. Analyzing different dictionary entries for sentence structure and parts of speech is difficult, because “there is a lot of structural variance,” Seipel said.
Nevertheless, the team managed to develop an annotation tool, using the declarative programming language PROLOG, so that keywords and other data could be extracted from dictionary entries. The annotated dictionary entries are stored in XML format, in accordance with the guidelines for electronic text encoding and interchange.
“We can apply the results of our dictionary analyses in a second step to text corpora of Middle High German texts and early new High German texts – starting with Luther and the mass of German literary texts – available soon in the TextGrid digital repository,” the linguist on the project, Werner Wegstein, said. TextGrid is technically oriented long-term archive embedded in a grid infrastructure.
From this, “we expect new insights into the combinability of basic units, qualitatively as well as quantitatively, because dictionaries of the German language do not register complex morpheme structures, they only list entries with complex structures showing specific additional semantic features,” said Wegstein.
For example, a German dictionary might not explain the regular compounds, such as ‘haus-dach’ meaning rooftop, while including the irregular compounds such as ‘haus-tür’ meaning house-door, or the door by which people enter a house, specifically.
The researchers’ algorithms for analyzing texts and detecting variance can likely be applied to other languages as well, because morphemes are “common in all Indo-European languages and in one way or another in other types of languages as well,” said Seipel.
The team has plans to use the British National Corpus and Dr Johnson’s Dictionary of English from 1755 in a future project. “It would be interesting to see how the relations we research have developed in English.”
Comments
innovative post
Ecopolitan EC can be a 99-years leasehold Punggol EC growth situated on Punggol Go walking with Area nineteen. It can be based right adjacent to Punggol MRT Station. Ecopolitan EC
Hello there, just became
Hello there, just became aware of your blog through Google, and found that it is truly informative. I’m going to watch out for brusselsI will be grateful if you continue this in future.Many people will be benefited from your writing. Cheers!
Another excite info Sekolah Belajar Forex FBS Indonesia and don't forget click nice review Konsumen Cerdas Paham Perlindungan Konsumen, and don't forget click nice review myblogpost and good Cipto Junaedy dan juga Cipto Junaedy and good review Iconia PC tablet dengan Windows 8 and also nulis that interesting. Enjoy it!
great {post|publish|submit|article|write-up}
Corals @ Keppel Fresh could be the most current inclusion in the Keppel Fresh location by Keppel Institution. Offering distinctive architectural mastery by means of internationally renowned builder Daniel Libeskind.Corals at Keppelbay
“Not surprisingly, all
“Not surprisingly, all networks showed the signposts of small world, scale free, hierarchical networks,” said Joerg Schultz from the University of Würzburg. data could be extracted from dictionary entries. The annotated dictionary entries are stored in XML format, in accordance with the guidelines for electronic text encoding and interchange.
great writeup
Corals @ Keppel These types of is the most recent add-on inside the Keppel Clean area through Keppel Company. Showcasing exclusive buildings through globally renowned builder Daniel Libeskind, Corals @ Keppel Bay is placed for being the following milestone in Singapore’s Southeast Place.
impressive {post|publish|submit|article|write-up}
ang mo kio bunch real estate freehold tongeng developement found from within Section 20 inside Seletar arrived real estate enclave. Belgravia Villa
At first, calls trickled in
At first, calls trickled in from end users, but grew steadily, tripling in volume to 30-40 calls a day in the last year, Carroll says. Its employees and its phone system were “maxxed out,” mainly because the original phone system was never designed for call center use. The only option was to invest in new technology.
peepVain
belgravia villas
belgravia villas Belgravia Villas is located in a central location where there are plenty of amenities in the vicinity within a short driving distance to Ang Mo Kio Hub, NEX, Compass Point, Orchard and Bugis area. Dining, shopping and recreation options for you and your loved ones are just a stone’s throw away.
In extremely rare cases,
In extremelyrare cases, untreated marijuana detox symptoms have proven fatal. Marijuana detox is also capable of producing some seriously nasty side effects, showing up as early as the first day. thc detox None of them has anything to offer for those who are trying to kick a marijuana habit. Are y windows 7 professional upgrade product key
039;ll remember to bookmark it
I'll remember to bookmark it and keep returning to see additional of the helpful information .windows 7 professional product key
The Best Articles
Konsumen Cerdas paham perlindungan konsumen cerita dewasa posisi nungging mahasiswa berjilbab cerita dewasa calon sekertaris vs bos cerita dewasa mahasiswi cantik saksi perkosaan di area parkir kampus kitab keilmuan nusantara me2x mania porn news tutorial miau alat peningkat performa, speed dan power kendaraan9power
Ummmmm....The meaning of a
Ummmmm....The meaning of a word can so grossly mutate over a period of time much briefer than biological evolution.
bathroom glass tile | wicker dining chairs | green bedroom | Ashley Furniture Home Office Desks | home office chairs | Temporary Airbrush Tattoos | Kanji Tattoos | men's back tattoo | tattoo saying quotes | bad henna tattoo | Harry one direction tattoo | Brazil hibiscus tattoo | full body tattoo | beautiful home decoration
Say thanks a ton for yet
Say thanks a ton for yet another wonderful post. The spot else can anybody get that type of information in that ideal way of writing? I've a presentation next week, and additionally I am in the google search for this kind of info.פסיכומטרי
I really enjoy simply reading all of your weblogs
I really enjoy simply reading all of your weblogs. Simply wanted to inform you that you have people like me who appreciate your work. Definitely a great post. Hats off to you! The information that you have provided is very helpful.
konsumen cerdas paham perlindungan konsumen
cibadak
sukabumi
sewa mobil murah jakarta
The blog is really
The blog is really appreciable and i like to keep on visiting this site once again that it would help me in further thanks for sharing the info.קורס פסיכומטרי
I think this Voter ID law is
I think this Voter ID law is a good initiative to prevent voter fraud; moreover this helps to identify the person. But the problem is that the cost for the documentation to obtain this Voter ID is too high. Texas Guitar Repair
This article gives the light
This article gives the light in which we can observe the reality. Boca Raton Plumber
So it will come as no
So it will come as no surprise to anyone who has travelled and tried to order a carbonated produkcja przemysłowa beverage prawo i społeczeństwo that a language can vary from region kultura i sztuka to region. Similarly, anyone who knows how ‘to google’, heard a ‘vuvuzela’, used 'the grid' or witnessed zdrowie i uroda a ‘bromance’ will know that a language can marketing change over time as well; new words regularly appear katalog stron in our vernacular, words change their meaning and old words slowly die away.
internet i komputery
Hi,Thanks for your comments.
Hi,
Thanks for your comments.
I'm not sure if this was intentional, but I am enjoying the irony of this discussion. Perhaps we will only understand whether we can in fact combine 'meta' and 'dictionary' after a metadictionary has been constructed by this team.
To the first poster: before dismissing this team's work based on my article alone, and even going so far as to to compare it to phrenology (in my opinion, a false analogy), you might like to read something by the researchers themselves, such as this paper.
From my understanding, their research would fit into the category of theoretical lexicography, though they never used this term themselves in disucssion with me.
Kind regards,
Jacqui Hayes
Metacommentary
@ Anonymous:
Actually, there not only can be, but there *IS* a word named 'metadictionary'. They just used it. If you know anything about what a dictionary is, then you know that they are collections of common word usage. Dictionaries are descriptive, not proscriptive.
Besides, according to the Oxford English Dictionary a 'dictionary' is "a book that lists the words of a language in alphabetical order and gives their word meaning", while 'meta' can mean "denoting something of a higher or second order kind: metalanguage". It seems to me the term 'metadictionary' is a perfect usage here.
Can bad science replace good art?
This reminds me of the "science" of phrenology. If one studies the bumps on Shakespear's (the orginal spelling) skull, I might become capable of understanding how he wrote Hamlet.
We have always known that the changes in the meaning of words must necessarily change more slowly than the changes that the words describe. But the concept of relating biology to the way one human language is used to incorporate other languages, and to explain how a new combination of words' morphenes becomes common usage (in describing a new idea) is a false analogy.
I think our friends need some grounding in how a dictionary is compiled, and what kind of people are instrumental in their compilation. e.g. http://en.wikipedia.org/wiki/William_Chester_Minor
I take it that, as you are trying to say; a German dictionary might not explain the regularly USEAGE of compound words, such as ‘haus-dach’ meaning rooftop, while including the less regularly USED compounds such as ‘haus-tür’ meaning house-door, or the door by which people enter a house. Quite true. But usage changes constantly both by time and place. One only has to compare between editions to see how much. Dictionaries can explain social acceptance of new meanings like "apple". But they can never explain new incorporations into a group of languages like "lateral thinking". That would require a anthropologic study.
I can understand your interest in this knd of endevour, especially as a lover of English. So please be careful, especially when using a leader like "Constructing a metadictionary". There can be no word named meta- dictionary. To use it displays an ignorance of the word dictionary.
If what you are attempting to describe here is that the boys are playing with the grid to do a little theoretical lexicography, that's fine. (I'm using this definition http://en.wikipedia.org/wiki/Lexicography NB the use of "metalexicography", which displays a misunderstanding of the word lexicography)
The interesting thing in all of this is that lexiocography is an art or craft. It's not a science. And there's an increasing amount of some very bad science, as this report illustrates.
Post new comment