(no subject)
Apr. 17th, 2008 01:22 pm![[identity profile]](https://www.dreamwidth.org/img/silk/identity/openid.png)
I'm trying to create a vocabulary list from an e-text by means of histogram analysis, so I can concentrate on the most frequently used words.
But my program is unable to link the various forms of Russian verbs, nouns and adjectives.
Does anyone know a public domain algorithm (or a list of sets of related morphemes) that can do this with reasonable accuracy?
Pim