HtmlKanjiMarker: my red grades

HtmlKanjiMarker

This program was actually made quite some time ago, but I haven’t blogged about it.

HtmlKanjiMarker reads a local HTML file and then marks all unknown kanji red. The list of known kanji is taken from two sources. The first is the “Max grade” textbox on the upper right. I entered 4 because I’ve studied all Jouyou kanji of grade 4 and below. The second is a text file, “ExtraKnownKanji.txt”. The file should contain all the kanji you’ve learned that fall outside the textbox range.
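To make the idea concrete, here is a minimal sketch in Python of the marking step described above. It is not the program's actual code: the names `GRADE_OF`, `known_kanji` and `mark_unknown` are made up for illustration, the grade table is a tiny sample rather than the real Jouyou lists, and treating "kanji" as the CJK Unified Ideographs block is an approximation.

```python
import re

# Hypothetical grade table: a tiny sample only, not the real Jouyou lists.
GRADE_OF = {"一": 1, "山": 1, "海": 2, "駅": 3, "械": 4, "識": 5}

KANJI = re.compile(r"[\u4e00-\u9fff]")  # CJK Unified Ideographs (approximation)
TAG_SPLIT = re.compile(r"(<[^>]*>)")     # capturing group keeps the tags

def known_kanji(max_grade, extra=""):
    """Kanji up to max_grade, plus any kanji found in `extra`
    (for example the contents of ExtraKnownKanji.txt)."""
    known = {k for k, g in GRADE_OF.items() if g <= max_grade}
    known.update(ch for ch in extra if KANJI.match(ch))
    return known

def mark_unknown(html, known):
    """Wrap every kanji not in `known` in a red span, leaving tags untouched."""
    def wrap(match):
        ch = match.group(0)
        return ch if ch in known else f'<span style="color:red">{ch}</span>'
    parts = TAG_SPLIT.split(html)
    # Even indices are text between tags; odd indices are the tags themselves.
    return "".join(p if i % 2 else KANJI.sub(wrap, p)
                   for i, p in enumerate(parts))

page = '<p title="識">山と識</p>'
print(mark_unknown(page, known_kanji(4)))  # 識 in the text is wrapped in red
print(mark_unknown(page, known_kanji(5)))  # "what if I learn grade 5?"
```

Splitting on tags first keeps the sketch from rewriting kanji inside attributes. A real page would be better served by a proper HTML parser, since this regular expression does not handle scripts, comments, or a `>` inside an attribute value.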

Using this program, I can visually see how effective my current kanji knowledge is for a certain page. It also makes hunting for new kanji easy. Lastly, it can answer questions such as “what if I learn all grade 5 kanji?” (Just change the “Max grade” textbox.)

Programming the algorithm naively made the marking very slow. This is because an HTML page contains tons of characters, and there are tens of thousands of kanji to check against. I actually benchmarked and overhauled the algorithm several times. I originally wanted to write about the algorithm changes, but I’ve lost interest by now :) .
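The post does not say what the algorithm changes were, so the following is only a guess at the kind of difference involved, not the program's real code. It contrasts a naive list scan with a hash-set lookup; the function names are hypothetical, and the kanji test again assumes the CJK Unified Ideographs block.

```python
def is_kanji(ch):
    """True for characters in the CJK Unified Ideographs block (approximation)."""
    return "\u4e00" <= ch <= "\u9fff"

def unknown_naive(text, known_list):
    """Scan the whole known list for every kanji on the page.
    Cost grows with len(text) * len(known_list)."""
    return [ch for ch in text if is_kanji(ch) and ch not in known_list]

def unknown_fast(text, known_list):
    """Same result, but membership tests use a hash set built once,
    so each lookup takes roughly constant time."""
    known = set(known_list)
    return [ch for ch in text if is_kanji(ch) and ch not in known]

sample = "山と海と識"
print(unknown_naive(sample, ["山", "海"]))
print(unknown_fast(sample, ["山", "海"]))
```

Both functions return the same list. Only the cost of each membership test differs, and that matters once the page and the kanji list are both large.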

So, here are some generated Wikipedia pages as seen through my eyes, with 1249 known kanji: Newton, September 11 2001 attacks, Wikipedia. As you can see, I’m still quite far from literacy…

Keep running, and if you get tired, walk. A small rest is also fine, just don’t surrender!

6 Responses to “HtmlKanjiMarker: my red grades”

  1. Vadim Tarasov Says:

    Hi, looks very good

    Where can i get this program?

    Thanks,

    Vadim

  2. Agro Rachmatullah Says:

    Thanks for the comments :)

    It’s currently nowhere on the web… Because you’re interested, I’ll package it and post the link on another blog post. I’ll inform you by mail so you won’t miss it…

    Agro

  3. Vadim Tarasov Says:

    Thanks a lot. will be really looking forward to it.

    Best regards,

    Vadim

  4. HtmlKanjiMarker 0.01 « Singularity on the Plane Says:

    [...] HtmlKanjiMarker is a program that reads a local HTML file and then marks all unknown kanji red. If you’re studying kanji, this program might be interesting and useful. [...]

  5. Tom Hodgers Says:

    Hi Agro,
    I’m interested in using HtmlKanjiMarker. Could you please let me have the info about its availability.

    Thanks,

    Tom

  6. Agro Rachmatullah Says:

    I’ve actually released it on:

    http://agro1986.wordpress.com/2007/02/23/htmlkanjimarker-001/

    Enjoy :)
