Mailing List Archive
tlug.jp Mailing List tlug archive tlug Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][tlug] Character encoding stuff
- Date: Fri, 31 May 2013 00:25:43 +0900
- From: Brian Chandler <brian@example.com>
- Subject: [tlug] Character encoding stuff
- User-agent: Mozilla/5.0 (X11; Linux i686; rv:17.0) Gecko/20130330 Thunderbird/17.0.5
I write bits of my website in PHP, and am always bumping up against character set issues. Here are (is??) a plurality of questions.(1) In particular, when scraping jigsaw puzzle manufacturer websites, I want to know what characters I'm looking at. Things like "Is that cross a *multiplication sign, *lowercase-x, *capital-X, *zenkaku-x, *zenkaku-X, or who knows what (х for example, and I managed to type that one in). I started looking on the web, then realised I actually wrote a primitive one myself: for examplehttp://imaginatorium.org/svc/unicode.php?ins=x%C3%97%D1%85But it would be nice to get more than just numbers: stuff like "Cyrillic", "Punctuation" etc. Any suggestions for useful tools, either Web-based or a screen utility I can run in Linux?(2) I user gedit, which is sort of fine, but it does Really Stupid (sorry, I mean "clever") display tricks, trying to guess how things should be shown depending on surrounding characters. So paste in the following two lines, and the two marus appear completely different (in size: both are circles):これは、○です。(マル) But this is exactly the same character: ○ Are there any suggestions of editors more suited to multi-script work?There are a few other things, but I'd better go an watch детараме хиро now. (That came out wrong...)Brian Chandler
- Follow-Ups:
- Re: [tlug] Character encoding stuff
- From: Nguyen Viet Cuong
- [tlug] Character encoding stuff
- From: Stephen J. Turnbull
- Re: [tlug] Character encoding stuff
- From: Darren Cook
Home | Main Index | Thread Index
- Prev by Date: [tlug] [announcement] June 8 TLUG Technical Meeting
- Next by Date: Re: [tlug] Character encoding stuff
- Previous by thread: [tlug] [announcement] June 8 TLUG Technical Meeting
- Next by thread: Re: [tlug] Character encoding stuff
- Index(es):
Home Page Mailing List Linux and Japan TLUG Members Links