Accents to texts
Sep. 4th, 2007 12:44 am
Hello,
I wrote a program (Akcentigilo [English frontend]) which, given a text in Russian, puts accents over (many) words. Moreover, if you click on a word, it shows some translations in Esperanto and English.
You can use it from your browser, but not from Microsoft Internet Explorer for now!
I find it very useful for my reading. Please don't try to hack the server hard, and don't submit texts that are too big... Otherwise I'll limit it.
no subject
Date: 2007-09-04 06:40 am (UTC)
I just tried to accentuate the following text from Wikipedia (http://ru.wikipedia.org/wiki/Microsoft):
Все началось в прошлом веке, в далеком 1975 году,...
История развития Microsoft затрагивается в фильме «Пираты Кремниевой долины».
So, the problems are:
- double accentuation, for example in: году, статью, было.
- missing accents, for example: далёком, прочитав, через, затрагивается, пираты, долины. Putting an accent over ё isn't strictly necessary because it's always stressed, but it may still be useful. If that causes a problem, either combining Unicode code points could help, or you could show the stress in a different way (say, with color, bold or underlining). I'd also double-check whether any verb forms (participles, reflexive, imperative) are missing.
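The combining-code-point idea can be sketched in a few lines of Python. This is only an illustration under my own assumptions, not how Akcentigilo works: the function name add_stress and its index-based interface are made up here. It inserts U+0301 (COMBINING ACUTE ACCENT) right after the stressed letter, which also works for ё because the mark simply stacks on the existing character.

```python
import unicodedata

COMBINING_ACUTE = "\u0301"  # Unicode COMBINING ACUTE ACCENT


def add_stress(word: str, letter_index: int) -> str:
    """Return word with a combining acute accent after the given letter.

    add_stress and letter_index are hypothetical names for this sketch.
    The word is normalized to NFC first so that letters such as ё are a
    single code point and the index counts letters, not combining marks.
    """
    word = unicodedata.normalize("NFC", word)
    if not 0 <= letter_index < len(word):
        raise IndexError("letter_index is outside the word")
    return word[: letter_index + 1] + COMBINING_ACUTE + word[letter_index + 1 :]


# ё is the letter at index 3 in "далёком"
stressed = add_stress("далёком", 3)
print(stressed)
```

Whether the accent renders well over ё depends on the font, so a color or bold fallback may still be worth having.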
And you should still limit the size and other things. Whatever's hackable will be hacked. Don't ask not to :)
no subject
Date: 2007-09-04 08:07 am (UTC)
Double accentuation means that the accent may fall in either of those two positions, depending on the case. If it puts an accent over ё, this means that the letter could be a ё or an accented е. Checking in Zaliznjak may give you some details about why such accents appear. For instance, it writes ка'ка'я, not only кака'я. :)
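To make the double marking concrete, here is a minimal Python sketch of how a word with several possible stress positions could be printed in the ка'ка'я notation. It is an assumption-laden illustration, not the program's actual code: mark_stress_candidates is a hypothetical name, and the positions are passed in by hand instead of being looked up in a dictionary.

```python
def mark_stress_candidates(word, positions, mark="'"):
    """Put `mark` after every letter whose index is in `positions`.

    Hypothetical helper for illustration: when a form can be stressed in
    more than one place, all candidate positions are marked at once.
    """
    wanted = set(positions)
    out = []
    for index, letter in enumerate(word):
        out.append(letter)
        if index in wanted:
            out.append(mark)
    return "".join(out)


# Two candidate positions, as in the example above
print(mark_stress_candidates("какая", [1, 3]))  # ка'ка'я
# A single position gives the usual single accent
print(mark_stress_candidates("какая", [3]))     # кака'я
```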
Sure, the Zaliznjak dictionary doesn't have all Russian words. Some are missing. But I'm not trying to write a dictionary - I only wrote the engine that _may_ use such a dictionary.
So you found "bugs" I already expected. I don't feel I can take on the job of improving a Russian dictionary, and I don't think many native Russian speakers are going to notice and provide a full fix for those problems, apart from a very few people, who may well not even know about this program.
I don't think I want to limit the program for now. First, I want to check whether it can run normally without limits. If I find any problems I cannot solve by fixing the program, I will set some limits. Let's see what happens. :)
But accentuating a full page of text is a normal task it should be able to do. It isn't that I want users to send only single sentences to be accentuated. :)
Thank you for your comments!