Mailing List Archive
tlug.jp Mailing List tlug archive tlug Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]Re: [tlug] [tlug-digest] searching for kanji strings, ignore punctuation and end of lines. Text indexing and retrival in unicode.
- Date: Sat, 14 Jan 2006 11:19:39 +0900
- From: Josh Glover <jmglov@example.com>
- Subject: Re: [tlug] [tlug-digest] searching for kanji strings, ignore punctuation and end of lines. Text indexing and retrival in unicode.
- References: <200601130511.k0D5BxWg015897@example.com> <43C84B5A.7000703@example.com>
On 14/01/06, David Riggs <dariggs@example.com> wrote: > On the other hand, instead of searching each time, is there a text > indexing and search system which works with unicode? All I find googling > around is commerical stuff which seems orientated towards western languages. I don't know about this, but Perl's regexp engine handles Unicode and multi-line strings. Give Perl a whirl. (Sorry.) -Josh
- Follow-Ups:
- References:
Home | Main Index | Thread Index
- Prev by Date: Re: [tlug] Searching for kanji strings: Use UTF-8
- Next by Date: Re: [tlug] UTF-8 makes multi-byte ignorant UNIX tools play nicemulti-byte characters
- Previous by thread: Re: [tlug] Use a shell that groks UTF-8
- Next by thread: Re: [tlug] UTF-8 makes multi-byte ignorant UNIX tools play nicemulti-byte characters
- Index(es):
Home Page Mailing List Linux and Japan TLUG Members Links