
Mailing List Archive
Re: [tlug] Translating old to new kanji forms using tr
On 6/28/05, Danny Wilde <fuzakenbo@example.com> wrote:
> I've done something like this once, some time ago. The solution I used was
> to write a script in Perl. You could just write loads of s/kanji1/kanji2/,
> for example (s/A/B/ means "replace A with B" in Perl), or you could stuff
> all the kanji into an associative array and match them using a regular
> expression. I think (not sure) that the newer versions of perl have a \cJ
> operator which matches one Japanese character, so if the above-mentioned
> character overlap is the cause of the problem, then it would be solved that
> way.
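Perl 5.8 and newer handle multi-byte encodings properly in regular
expressions, provided the data has been decoded into character strings
(for example with the Encode module or an :encoding I/O layer). A pattern
then matches whole characters rather than individual bytes, so the
overlap problem does not arise.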
-Josh
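
For illustration, here is a minimal sketch of the associative-array approach
from the quoted message, assuming Perl 5.8 or newer and a UTF-8 encoded
script. The four-character mapping and the modernize() helper are
hypothetical examples, not a complete old-to-new table and not code from
this thread.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use utf8;                              # the literals in this file are UTF-8
binmode(STDOUT, ':encoding(UTF-8)');   # encode characters again on output

# Illustrative mapping only (old form => new form); a real table is far larger.
my %old_to_new = (
    '國' => '国',
    '學' => '学',
    '體' => '体',
    '舊' => '旧',
);

# Build a character class from the keys. None of these characters is
# special inside [...], so no escaping is needed here.
my $class = join '', keys %old_to_new;

# Hypothetical helper: replace every old form that appears in the table.
sub modernize {
    my ($text) = @_;
    $text =~ s/([$class])/$old_to_new{$1}/g;
    return $text;
}

print modernize('舊國の學校'), "\n";
```

Because the strings are character strings rather than raw bytes, each kanji
is matched as one unit. Text read from a file would need decoding first,
for example by opening it with an :encoding(...) layer that names the
file's actual encoding.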