Mailing List Archive
tlug.jp Mailing List tlug archive tlug Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]Re: [tlug] kanji dictionary-eijiro
- Date: 30 Mar 2003 23:22:30 +0900
- From: Edward Middleton <edwardmiddleton@example.com>
- Subject: Re: [tlug] kanji dictionary-eijiro
- References: <20030311063049.GA4279@example.com> <3E6D84AF.39603078@example.com> <3E6D9071.2DBD50F6@example.com> <Pine.LNX.4.51.0303111644100.2246@example.com> <20030311172716.34c6541a.plate@example.com> <Pine.LNX.4.51.0303121014310.2246@example.com> <20030312110307.256a91d9.plate@example.com> <Pine.LNX.4.51.0303121114410.2246@example.com> <1048433690.13430.73.camel@example.com> <15998.16391.734303.517395@example.com> <1048464844.3301.39.camel@example.com>
- Organization:
After using the EIJIRO dictionary in epwing form as described in my previous post I found that the script for converting the EIJIRO removes the kanji readings. This saves about 10 meg on the compressed dictionary but is not really desirable for no Japanese speakers. #du -s * 110458 eijiro 103822 eijiro-old I have made some modification to the script so that the readings are kept. Below is a diff of the files. --- ../eijiro-fpw1.0b.orig/eijiro-fpw.pl 2003-03-30 23:04:29.000000000 +0900 +++ eijiro-fpw.pl 2003-03-30 23:07:51.000000000 +0900 @@ -395,7 +395,6 @@ my(@example.com, $char, $next, $mean, $yorei); $mean = shift(@example.com); - $mean =~ s/\x81\x6f(?:$hiragana|\x81[\x5e\x69\x6a]| )+\x81\x70//g; @example.com = unpack('C*', $mean); $mean = ''; @@ -429,6 +428,10 @@ $mean .= '['; } elsif (0x6e == $next) { # $mean .= ']'; + } elsif (0x6f == $next) { # -> + $mean .= pack('CC',0x81,0x6b); + } elsif (0x70 == $next) { #-> + $mean .= pack('CC',0x81,0x6c); } elsif (0x7b == $next) { # $mean .= '+'; } elsif (0x7c == $next) { # -- Edward Middleton <edwardmiddleton@example.com>
- References:
- [tlug] kanji dictionary
- From: benny k.
- Re: [tlug] kanji dictionary
- From: Anthony Cunningham
- Re: [tlug] kanji dictionary
- From: Anthony Cunningham
- Re: [tlug] kanji dictionary
- From: Joe Larabell
- Re: [tlug] kanji dictionary
- From: Ulrich Plate
- Re: [tlug] kanji dictionary
- From: Joe Larabell
- Re: [tlug] kanji dictionary
- From: Ulrich Plate
- Re: [tlug] kanji dictionary
- From: Joe Larabell
- Re: [tlug] kanji dictionary
- From: Edward Middleton
- Re: [tlug] kanji dictionary
- From: Marcus Metzler
- Re: [tlug] kanji dictionary
- From: Edward Middleton
Home | Main Index | Thread Index
- Prev by Date: Re: [tlug] weird file format
- Next by Date: [tlug] CAT5e CAT6 and Gigabit Ethernet
- Previous by thread: Re: [tlug] kanji dictionary
- Next by thread: Re: [tlug] kanji dictionary
- Index(es):
Home Page Mailing List Linux and Japan TLUG Members Links