Re: EDICT and edict.el
- To: tlug@example.com
- Subject: Re: EDICT and edict.el
- From: turnbull@example.com (Stephen J. Turnbull)
- Date: Fri, 1 Nov 96 09:55 JST
- In-reply-to: <19961031101736937.AAA279@example.com> (darren@example.com)
- Reply-To: tlug@example.com
- Sender: owner-tlug
>>>>> "Darren" == Darren Cook <darren@example.com> writes:

    >> I looked at converting EDICT to a sort of escape-less JIS,
    >> using say "<>" as JIS-in/JIS-out codes, and interfacing to the
    >> glimpse text

    Darren> The problem with this is telling the difference between a
    Darren> genuine < and one that means the start of JIS. Similarly
    Darren> between > (ascii code 62) to mean JIS-out, and a kanji
    Darren> whose first byte is a 62.

No problem; "grep -c '[<>]' $EDICT_DICTIONARIES" => 2. I would just
quote those with '\' or something.

This is intended as a special purpose facility using an existing
English-oriented (and therefore ASCII-oriented) indexing package on a
very specialized database, not as a general purpose facility. For
that, Jeff Friedl's lookup package
(ftp://turnbull.sk.tsukuba.ac.jp/pub/linux/packages/Japanese/lookup*,
you can also get it at ftp://ftp.cc.monash.ac.jp/pub/nihongo) looks
like a much better bet.

If I really wanted a general purpose facility, I'd use one of straight
JIS, 2-byte EUC, or maybe even "ISO-2022-* MIME-quoted-printable".

Steve

--
Stephen John Turnbull
University of Tsukuba
Yaseppochi-Gumi
Institute of Policy and Planning Sciences
http://turnbull.sk.tsukuba.ac.jp/
Tennodai 1-1-1, Tsukuba, 305 JAPAN
turnbull@example.com
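
For illustration only, here is a minimal Python sketch of the "escape-less JIS" idea discussed in the message: "<" and ">" act as JIS-in/JIS-out, and a literal "<", ">" or "\" in the ASCII text is quoted with "\". Several things here are assumptions and not part of the message: the input is taken to be EUC-JP containing only ASCII and 2-byte JIS X 0208 characters, the function names `encode` and `decode` are hypothetical, and the extra "\" placed before a kanji whose first 7-bit byte is ">" or "\" is just one possible way to handle the ambiguity Darren raises.

```python
def encode(euc: bytes) -> bytes:
    """Convert EUC-JP bytes to 7-bit text using < and > as JIS-in/JIS-out.

    Assumption: only ASCII and 2-byte JIS X 0208 characters are present.
    """
    out = bytearray()
    in_jis = False
    i = 0
    while i < len(euc):
        b = euc[i]
        if b < 0x80:
            # ASCII byte: leave JIS mode first, then quote special characters.
            if in_jis:
                out += b">"
                in_jis = False
            if b in b"<>\\":
                out += b"\\"
            out.append(b)
            i += 1
        elif (0xA1 <= b <= 0xFE and i + 1 < len(euc)
              and 0xA1 <= euc[i + 1] <= 0xFE):
            # Two-byte character: strip the high bits to get 7-bit JIS bytes.
            if not in_jis:
                out += b"<"
                in_jis = True
            hi, lo = b & 0x7F, euc[i + 1] & 0x7F
            # Assumed extension: quote a pair whose first byte would look
            # like JIS-out (">") or like the quote character itself.
            if hi in b">\\":
                out += b"\\"
            out += bytes((hi, lo))
            i += 2
        else:
            raise ValueError(f"unsupported byte 0x{b:02x} at offset {i}")
    if in_jis:
        out += b">"
    return bytes(out)


def decode(data: bytes) -> bytes:
    """Inverse of encode(): recover the EUC-JP bytes."""
    out = bytearray()
    in_jis = False
    i = 0
    while i < len(data):
        b = data[i]
        if not in_jis:
            if b == ord("<"):
                in_jis = True          # unquoted "<" means JIS-in
                i += 1
            elif b == ord("\\"):
                out.append(data[i + 1])  # quoted literal character
                i += 2
            else:
                out.append(b)
                i += 1
        elif b == ord(">"):
            in_jis = False             # ">" at a pair boundary means JIS-out
            i += 1
        else:
            if b == ord("\\"):
                i += 1                 # quoted pair: skip the quote byte
            out += bytes((data[i] | 0x80, data[i + 1] | 0x80))
            i += 2
    return bytes(out)


if __name__ == "__main__":
    line = "日本語 /Japanese <language>/".encode("euc_jp")
    packed = encode(line)
    print(packed)
    print(decode(packed) == line)
```

Because the decoder consumes JIS bytes strictly in pairs, a ">" can only be ambiguous as the first byte of a pair, which is why this sketch quotes only that case. With only a couple of lines in the dictionaries containing a literal "<" or ">", the quoting overhead on the ASCII side would be negligible.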