
Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [tlug] Japanese encoding
- Date: Fri, 23 Aug 2002 15:17:27 +1000 (EST)
- From: Jim Breen <jwb@example.com>
- Subject: Re: [tlug] Japanese encoding
[Brett Robson (Re: [tlug] Japanese encoding) writes:]
>In a loose moment Jim wrote:
>> > the details at:
>> http://www.csse.monash.edu.au/~jwb/wwwjdicinf.html#examp_tag
>> "the collection is in need of considerable editing."
>>
>> I am going to have a very large amount of free time from next week, I'm
>> happy to help.
It's a little early to turn it loose for hand editing. I looked into
possibly setting up a CVS system, but really I want Windblows, Mac, etc. people
in on it too if they can contribute. What I think I'll do eventually is
have it up for rsync collection (rsync is available for Windows), and have
a very standard way of submitting updates so I can run them through a utility.
In the meantime, I'd like to do some more reduction of duplicates using
software. I have tracked down and eliminated the straight replications
in the Japanese text. What I'd like to do it zoom in on things like:
$B;dC#$O$h$/$$$C$7$g$K$*Ck$r?)$Y$^$9!#(B
$B;dC#$O$h$/0l=o$K$*Ck$r?)$Y$^$9!#(B
and knock out the first because $B0l=o$K(B/$B$$$C$7$g$K(B are the same. At
present I'm doing this by eye, noting where the English sentences are the same.
Examples like:
$B;dC#$O%+%L!<$rhttp://www.csse.monash.edu.au/~jwb/)
Computer Science & Software Engineering, Tel: +61 3 9905 3298
P.O Box 26, Monash University, Fax: +61 3 9905 5146
Clayton VIC 3800, Australia $B%8%`!&%V%j!<%s(B@$B%b%J%7%eBg3X(B
Home |
Main Index |
Thread Index