TLUG Mailing List

[tlug] [OT/long] Yet another JMdict front-end

Date: Sun, 30 Jul 2006 18:21:58 -0600

From: Matt Gushee <matt@example.com>

Subject: [tlug] [OT/long] Yet another JMdict front-end

User-agent: Thunderbird 1.5.0.5 (X11/20060729)
[apologies if this appears twice--I sent it earlier from the wrong account]
This really has nothing to do with Linux, but I know many of you are interested in Japanese and Japanese dictionaries, and many of you also are knowledgeable about Web design & development, so I thought I'd let you know about a little project I have begun, and solicit some feedback.
To make a long story short, a couple of weeks ago I was looking for an online Kanji dictionary, and couldn't find one I really liked. Or rather, I couldn't find an *interface* I really liked. So I decided to create my own. The site is intended to be fast, easy to navigate, and aesthetically pleasing. The target audience is English-speaking learners of Japanese (intermediate-to-advanced?), so the emphasis is on providing easy access to phrases including the target kanji, readings, and definitions--basically the kind of info you find in edict/jmdict. It currently does not provide information of more scholarly interest, such as Nelson index numbers and all that, though such info could maybe be added later on as an "advanced option."
Another thing you should know is that my site makes heavy use of the latest in Web-standard[*] technology. It is an AJAX application, so you need at least a browser that has JavaScript enabled and supports XMLHttpRequest. So recent Gecko-based browsers should be fine, along with IE 5.5+ (?) and Opera 8+. Since the whole point of the project is to develop a nicer interface to content that is easily available elsewhere, I don't feel obligated to create an alternative for older browsers, but of course I provide links to other online Kanji dictionaries.
Here's the URL: <http://matt.gushee.net:8250/index.html>. That's probably temporary, so even if you really like it, please don't post any links to it just yet. If you have comments and don't want to clutter this list, just send an e-mail to <matt@example.com>.
SOME ISSUES TO CONSIDER
=======================
First of all, the title. I am tentatively calling the thing "楽漢摘." I like to think it's rather a clever pun, but if any native Japanese speakers are reading this, I'd like to know how it sounds to you. Is it just a wake-wakaranai gaijin joke? Please don't worry about offending me--I will be happy to change the title if it is too weird.
Now on to more substantive issues:

Indexing approach
-----------------
There will probably be several indexes in the future, but currently I provide one way to look up Kanji: a traditional radical/stroke-count index. Specifically, you select the radical stroke count, then the radical itself, then the stroke count for the whole character, then the specific character that you want. Although it is a linear process and thus easy to understand in principle, it has the disadvantage that people don't know by heart how many strokes are in a character, and the count can be very hard to figure out for the more complex ones. In a printed dictionary it's less of a problem because you can easily shift your eyes to another part of the page; in a browser I think it will be awkward at best.
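
The four-step lookup described above maps naturally onto a nested table. What follows is only a minimal sketch in Python, not the site's actual code: the sample entries, the `build_index` and `nearby` names, and the idea of a stroke-count tolerance are all assumptions made for illustration. The tolerance is one possible way to soften the "how many strokes is it really?" problem.

```python
from collections import defaultdict

# Hypothetical sample data:
# (kanji, radical, radical stroke count, total stroke count)
ENTRIES = [
    ("休", "人", 2, 6),
    ("体", "人", 2, 7),
    ("作", "人", 2, 7),
    ("泳", "水", 4, 8),
    ("海", "水", 4, 9),
]


def build_index(entries):
    """Nest entries as radical strokes -> radical -> total strokes -> [kanji]."""
    index = defaultdict(lambda: defaultdict(lambda: defaultdict(list)))
    for kanji, radical, radical_strokes, total_strokes in entries:
        index[radical_strokes][radical][total_strokes].append(kanji)
    return index


def nearby(index, radical_strokes, radical, total_strokes, tolerance=1):
    """Return kanji whose total stroke count is within `tolerance` of a guess."""
    # .get() is used so that a miss does not create empty entries in the index.
    by_total = index.get(radical_strokes, {}).get(radical, {})
    found = []
    for count in range(total_strokes - tolerance, total_strokes + tolerance + 1):
        found.extend(by_total.get(count, []))
    return found


INDEX = build_index(ENTRIES)
print(INDEX[2]["人"][7])            # exact lookup
print(nearby(INDEX, 2, "人", 6))    # guess of 6 strokes, give or take one
```

With a tolerance of one, a user who miscounts by a single stroke still sees the character they are after.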
What other alternatives might work well (when you don't know the pronunciation)? I've seen Jim Breen's "multi-radical" method and was initially resistant to it for a couple of reasons: first, it is non-linear, and thus is superficially more complex than the radicals/strokes method.
Second, I have been taught (for both Chinese and Japanese) that the radical is the "meaning" component, and that in general a character has exactly one radical. At any rate, I believe the radical has etymological significance, and that understanding which part of the Kanji is the radical can contribute to an overall mastery of the language. And a single-radical dictionary index reinforces that understanding.
But I'm thinking that a multi--can I say "component" instead of "radical"? Then maybe I could set aside the philosophical objection. Anyway, a well-designed multi-thing index might after all be an easier way to look up Kanji.
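
A multi-component lookup can be thought of as a subset test: a character matches if it contains every component selected so far. The sketch below is a rough illustration under assumptions, not Jim Breen's implementation; the `COMPONENTS` table is hypothetical hand-made data and `lookup` is an invented name.

```python
# Hypothetical component table: each kanji maps to the set of parts it contains.
COMPONENTS = {
    "休": {"人", "木"},
    "体": {"人", "木", "一"},
    "林": {"木"},
    "明": {"日", "月"},
}


def lookup(selected, table=COMPONENTS):
    """Return kanji containing every selected component, in table order."""
    wanted = set(selected)
    # `wanted <= parts` is True when wanted is a subset of parts.
    return [kanji for kanji, parts in table.items() if wanted <= parts]


print(lookup(["木"]))          # broad: everything containing 木
print(lookup(["人", "木"]))    # narrower: each extra component shrinks the list
```

The order in which components are picked does not matter, which is the non-linear property mentioned above.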
Strokes/radicals index navigation
---------------------------------
If I decide to go to a multi-component index, this might not matter any more. But for the moment, there is an issue with the index menus: in view of the fact that the user will often not be sure how many strokes there are in a character, I have created dynamic menus such that ... actually it's best if you try it out. Basically, if you move your mouse over an item in one row of the menu, the next row is *temporarily* displayed. Thus, let's say you have chosen a given radical. There is a row of numbers representing stroke counts of characters with that radical; if you run your mouse along that row you can easily see what characters exist for each stroke count.
So, do you think this is (a) useful, and (b) intuitive? It would be a lot easier to make the menus so that the next row only changes when you click something. But if people find the transient display a very helpful feature, I will make it work.
Presentation of results
-----------------------
Currently when you select a Kanji, a request goes to the server, which returns a document containing all phrases that start with that Kanji. This document is dumped into a table with 3 columns: [Kanji] Phrase, Reading, and Definitions. This is reasonable in some cases, but sometimes the response document is quite large, so I think some kind of chunking and/or filtering would be helpful. It gets worse if we want to look up all phrases *containing* the selected character. My server-side script can indeed do that, but sometimes it's just way too much data, so I've disabled that behavior for the moment.
Another issue with the result sets is that they're not sorted in any useful way--actually I believe they are ordered according to the JMdict entry sequence number.
So, how can I improve the processing and presentation of the results?
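
One possible direction, sketched in Python under assumptions: filter the rows, sort them by reading, and hand them out one page at a time. The row layout follows the three columns described above, but the sample rows, the `select` and `page` names, and the page size are hypothetical. The sort uses plain code-point order on the kana reading, which only approximates dictionary order.

```python
# Hypothetical result rows: (phrase, reading, definitions).
ROWS = [
    ("漢字", "かんじ", "Chinese characters"),
    ("漢文", "かんぶん", "Chinese classical writing"),
    ("悪漢", "あっかん", "rascal; villain"),
    ("漢方", "かんぽう", "traditional Chinese medicine"),
]


def select(rows, kanji, starts_only=True):
    """Keep phrases that start with (or merely contain) the given kanji."""
    if starts_only:
        return [row for row in rows if row[0].startswith(kanji)]
    return [row for row in rows if kanji in row[0]]


def page(rows, number, size=2):
    """Sort by reading, then phrase, and return one zero-based page of results."""
    ordered = sorted(rows, key=lambda row: (row[1], row[0]))
    start = number * size
    return ordered[start:start + size]


matches = select(ROWS, "漢")
print(page(matches, 0))   # first chunk
print(page(matches, 1))   # next chunk, fetched only if the user asks for it
```

Paging of this kind would also make the "all phrases containing the character" mode less overwhelming, since only one chunk is sent per request.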

Miscellaneous technical stuff
-----------------------------
Preparing the index: my list of radicals is derived from Jim Breen's KANJIDIC, but since his data is prepared for a multi-radical lookup system, I can't automatically extract a radicals-and-strokes index, so I am currently creating the index manually. That's why it's so incomplete, of course. Does anyone know of another database somewhere that lists each kanji by (single) radical and stroke count?
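
If a data file does list a radical number and a stroke count for each kanji, building the index becomes mechanical. The sketch below assumes a KANJIDIC-style line layout in which a field `B<n>` gives the radical number and `S<n>` the total stroke count; that layout is an assumption here and should be checked against the KANJIDIC documentation. The sample lines are illustrative only, not copied from the real file, and `parse_line` and `build_index` are invented names.

```python
from collections import defaultdict

# Illustrative lines written in the ASSUMED layout; not real KANJIDIC data.
SAMPLE = """\
# comment line
休 3559 B9 S6
体 424e B9 S7
海 3324 B85 S9
"""


def parse_line(line):
    """Return (kanji, radical number, stroke count), or None if not parseable."""
    fields = line.split()
    if not fields or fields[0].startswith("#"):
        return None
    radical = strokes = None
    for field in fields[1:]:
        # Only the first B<n> and the first S<n> field are used.
        if radical is None and field[0] == "B" and field[1:].isdigit():
            radical = int(field[1:])
        elif strokes is None and field[0] == "S" and field[1:].isdigit():
            strokes = int(field[1:])
    if radical is None or strokes is None:
        return None
    return fields[0], radical, strokes


def build_index(text):
    """Map radical number -> stroke count -> [kanji]."""
    index = defaultdict(lambda: defaultdict(list))
    for line in text.splitlines():
        parsed = parse_line(line)
        if parsed is not None:
            kanji, radical, strokes = parsed
            index[radical][strokes].append(kanji)
    return index


INDEX = build_index(SAMPLE)
print(sorted(INDEX))     # radical numbers found in the sample
print(INDEX[9][7])       # kanji under radical 9 with 7 strokes
```

Radical numbers would still need to be mapped to radical glyphs and radical stroke counts for the menus, which is a small fixed table.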
Glyphs for radicals: if my understanding of the KANJIDIC documentation is correct, there is a glyph of each radical in Japanese Kanji, but some of them only exist in JISX-0212. If so, you either have to require the user to have a JISX-0212 font, use images to represent some radicals, or use substitute glyphs from JISX-0208. The last option is not really acceptable, I don't think. E.g., 化 for 人偏??
Nice Japanese font: this is purely subjective, of course, but I find Mincho rather ugly. I have a font family called DFKaisho which I find to be an excellent combination of elegance and readability; my stylesheet specifies it for some of the Kanji display elements (with "serif" as a fallback, of course). But in the interest of a more beautiful Kanji-browsing experience, are there other Kaisho or similar fonts that are widely used? Let me know their names and I'll stick 'em in the stylesheet. Or tell me to just use Mincho if that's your view. But be advised: I am very stubborn about fonts.
[*] Using the term 'standard' to include some de facto standards as well
    as official published ones.

--
Matt Gushee
: Bantam - lightweight file manager : matt.gushee.net/software/bantam/ :
: RASCL's A Simple Configuration Language :     matt.gushee.net/rascl/ :