Wednesday, June 10, 2009

Million Words in the English Language?

(from CNN) English contains more words than any other language on the planet and will add its millionth word early Wednesday, according to the Global Language Monitor, a Web site that uses a math formula to estimate how often words are created.

The site estimates the millionth word will be added Wednesday at 5:22 a.m. Its live ticker counted 999,985 English words as of early Tuesday evening.

The "Million Word March," however, has made the man who runs this word-counting project something of a pariah in the linguistic community. Some linguists say it's impossible to count the number of words in a language because languages are always changing, and because defining what counts as a word is a fruitless endeavor.

Paul J.J. Payack, president and chief word analyst for the Global Language Monitor, says he doesn't include all new words in his count. Words must make sense in at least 60 percent of the world to be official, he said. And they must make sense to different communities of people. A new technology term that's only understood in Silicon Valley wouldn't count as a mainstream word, he said.

His computer models check a total of 5,000 Web sites, dictionaries, scholarly publications and news articles to see how frequently words are used, he said. A word must make 25,000 appearances to be deemed legitimate.
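The two thresholds Payack describes can be read as a simple filter. The sketch below is only an illustration of that reading, not the Global Language Monitor's actual formula, which the article does not give. The `Candidate` fields, the `qualifies` function, and all the sample figures are invented for the example.

```python
from dataclasses import dataclass

# Thresholds quoted in the article.
MIN_APPEARANCES = 25_000   # "A word must make 25,000 appearances"
MIN_WORLD_SHARE = 0.60     # "at least 60 percent of the world"


@dataclass
class Candidate:
    word: str
    appearances: int     # hypothetical: sightings across the monitored sources
    world_share: float   # hypothetical: fraction of the world where the word makes sense


def qualifies(c: Candidate) -> bool:
    """Return True when a candidate meets both quoted thresholds."""
    return c.appearances >= MIN_APPEARANCES and c.world_share >= MIN_WORLD_SHARE


# Invented sample data, for illustration only.
candidates = [
    Candidate("defriend", 40_000, 0.75),        # frequent and widely understood
    Candidate("valley-jargon", 90_000, 0.10),   # frequent, but only understood locally
    Candidate("rareword", 1_200, 0.80),         # widely understood, but too rare
]

accepted = [c.word for c in candidates if qualifies(c)]
print(accepted)
```

Under these made-up numbers only the first candidate passes. The second fails for the reason Payack gives about a term understood only in Silicon Valley.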

Payack said news events have also fueled the rapid expansion of English, which he said has more words than any other language. Mandarin Chinese comes in second with about 450,000 words, he said.

English terms like "Obamamania," "defriend," "wardrobe malfunction," "zombie banks," "shovel ready" and "recessionista" have all grown out of recent news cycles about the presidential election, the economic crash, online networking or a sports event, he said. Other languages might not have developed new terms to deal with such phenomena, he said.

Language experts who spoke with CNN said they disapprove of Payack's count, but they agree that English generally has more words than most, if not all, languages.

"This is stuff that you just can't count," said Jesse Sheidlower, editor at large of the Oxford English Dictionary. "No one can count it, and to pretend that you can is totally disingenuous. It simply can't be done." The Oxford English Dictionary has about 600,000 entries, Sheidlower said. But that by no means includes all words, he said.

Linguists and lexicographers run into further complications when trying to count words that are spelled one way but can have several meanings, said Allan Metcalf, an English professor at MacMurray College in Illinois, and an officer at the American Dialect Society. "The word bear, b-e-a-r -- is that two words or one, for example? You have a noun that's a wild creature and then you have b-e-a-r, which means to bear left or to bear right, and there's many other things," he said. "So you really can't be exact about a millionth word."

Payack said he doesn't consider his to be the definitive count, just an interesting estimation based on set criteria he has helped develop. "It's always an estimation," he said. "It's like the height of Mount Everest is an estimation. The height of Mount Everest has changed five times in my lifetime because as we get better tools, the estimates get better."
