Polysemizer

The mighty Polysemizer, making sure all the words are widely known.

1. polysemy -- (the ambiguity of an individual word or phrase that can be used (in different contexts) to express two or more different meanings) (from WordNet).

A polysemy count indicates how common a word is. Specifically, it is the number of distinct senses a word has in a language; "love" has a polysemy count of 6, because you can love cooking and children, make love, score 9 love in a game of squash, and so forth. Polysemy values are analogous to "familiarity" values, and, for most words, the two are practically the same. Familiarity, however, is uncovered by analyzing large corpora of text and counting how often words occur. Polysemy comes out of what I suppose you could call "intralinguistic" analysis.

So, what I want is a Polysemizer, something to take a given amount of text and tag the words by their polysemy values, and show the result in a coherent way. For instance, the more common words could be shown in black, the less common in shades of gray. This way, when I'm writing advertising or some other text that needs to be easy to understand for the widest range of people of many ages, I can make sure that my vocabulary stays simple, concrete, and easy to fathom, avoiding words which might be perfectly normal to me but odd to others. It would also be helpful with other writing tasks.

The overall tool would be fairly simple to build, since polysemy isn't dependent on knowing the correct part of speech (in most cases; I guess homonyms are a problem): just tokenize a text and look up the polysemy count via the Lingua::Wordnet Perl module, transform that into a color value per word, and display as HTML. I think it could be built in a day or two.
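The steps above (tokenize, look up, color, display) can be sketched roughly as follows. This is only an illustration under loud assumptions: it is written in Python rather than Perl, it does not use Lingua::Wordnet, and the names `tokenize`, `gray_for`, `polysemize`, and `SAMPLE_COUNTS` are made up for the example. The lookup is a tiny stand-in table holding just the "love" count from above; a real version would ask WordNet for the number of senses. The cutoff of 10 senses for "fully black" is an arbitrary choice too.

```python
import html
import re

# Stand-in for a WordNet lookup (an assumption for this sketch). Only "love"
# comes from the text above; every other word falls back to a count of 0.
SAMPLE_COUNTS = {"love": 6}


def tokenize(text):
    """Split text into word chunks and non-word chunks, losing nothing."""
    return re.findall(r"[A-Za-z']+|[^A-Za-z']+", text)


def gray_for(count, max_count=10):
    """Map a polysemy count to a CSS gray: many senses is black, few is light."""
    capped = min(max(count, 0), max_count)
    value = 200 - (200 * capped) // max_count  # 0 is black, 200 is light gray
    return "#{0:02x}{0:02x}{0:02x}".format(value)


def polysemize(text, lookup):
    """Wrap each word in a span colored by the count that lookup(word) returns."""
    parts = []
    for token in tokenize(text):
        if token[0].isalpha() or token[0] == "'":
            count = lookup(token.lower())
            parts.append(
                '<span style="color:{}" title="{}">{}</span>'.format(
                    gray_for(count), count, html.escape(token)
                )
            )
        else:
            # Whitespace and punctuation pass through, escaped for HTML.
            parts.append(html.escape(token))
    return "".join(parts)


if __name__ == "__main__":
    lookup = lambda word: SAMPLE_COUNTS.get(word, 0)
    print(polysemize("I love squash.", lookup))
```

Passing the lookup in as a function keeps the coloring and HTML steps separate from wherever the counts come from, so the stand-in table could later be swapped for a real WordNet query without touching the rest.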


Ftrain.com

Ftrain.com is the website of Paul Ford and his pseudonyms.

© 1974-2007 Paul Ford
