Mailing List Archive
tlug.jp Mailing List tlug archive tlug Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]Re: [tlug] Japanese regex question
- Date: Thu, 25 Aug 2005 11:50:08 +0900
- From: Brett Robson <b-robson@example.com>
- Subject: Re: [tlug] Japanese regex question
- References: <20050825102343.AFF0.B-ROBSON@example.com> <d8fcc08005082419271014a104@example.com>
On Wed, 24 Aug 2005 22:27:28 -0400 Josh Glover <jmglov@example.com> wrote: > On 8/24/05, Brett Robson <b-robson@example.com> wrote: > > > I don't know how much experience you have with J-regex but the biggest > > issue is anchoring. Because it's double byte you can't be sure you're > > matching from the first byte of a character. > > I am fairly certain that with Unicode, Perl 5.8 regular expressions > handle the multi-byte encoding properly and do not treat strings as > arrays of bytes. > But he said he was using raw 2022 encoding and those numbers look correct for katakana in 2022. I was thinking though that he would probably be better off converting to unicode internally for that exact reason. Brett
- Follow-Ups:
- Re: [tlug] Japanese regex question
- From: Josh Glover
- Re: [tlug] Japanese regex question
- From: Jonathan Byrne
- References:
- Re: [tlug] Japanese regex question
- From: Brett Robson
- Re: [tlug] Japanese regex question
- From: Josh Glover
Home | Main Index | Thread Index
- Prev by Date: Re: [tlug] Japanese regex question
- Next by Date: Re: [tlug] Linux Laptop
- Previous by thread: Re: [tlug] Japanese regex question
- Next by thread: Re: [tlug] Japanese regex question
- Index(es):
Home Page Mailing List Linux and Japan TLUG Members Links