Mailing List Archive
tlug.jp Mailing List tlug archive tlug Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]Re: [tlug] Search MySQL for Japanese Names]
- Date: Fri, 30 Oct 2009 13:23:10 +0900
- From: 黒鉄章 <akira@example.com>
- Subject: Re: [tlug] Search MySQL for Japanese Names]
- References: <5634e9210910191749m675cdf8cl3ca73efa0fcbeccb@example.com> <36e8d89d0910191858j2ba89691lb10648d0465fc109@example.com> <5634e9210910270038m4bbb9528hbec50722666a2007@example.com> <36e8d89d0910290227y3735ba72o2376f592d857a84e@example.com> <5634e9210910292034i149d90avd18a11ff168543cc@example.com>
>> My next interest would be spread of names in the real population. Who >> knows how the results of the above would be weighted then... > > Hard data to get too. When I was at Tokyo Gaidai they had access to > a full copy of the NTT directory. It would have been nice to do some > frequency measures on names, and geographical dispersions on > family names, but there was an embargo on any publications > drawing on the data. They said it was because of "privacy". Well, in the case of rare names some privacy can be broken through practical assumptions. I once read that the US census doesn't open the stats on baby names that occur less than, say, a 100 times a year. > In a year or so i'll be working on a major expansion of the lexicon(s) > used by MeCab et al.. I'll probably be starting with NAIST-JDIC. I'm > less interested in correct POS tagging and more in correctly > identifying compounds. I want 米軍 to be recognized; not come up > as 米 + 軍. Cool. But I'll have to get a server with more memory and as much cache as possible to fit these expanding dictionaries in memory..... damn, intractable Japanese-parsing requirements. Akira
- Follow-Ups:
- Re: [tlug] Search MySQL for Japanese Names]
- From: Edward Middleton
- References:
- Re: [tlug] Search MySQL for Japanese Names]
- From: Jim Breen
- Re: [tlug] Search MySQL for Japanese Names]
- From: 黒鉄章
- Re: [tlug] Search MySQL for Japanese Names]
- From: Jim Breen
- Re: [tlug] Search MySQL for Japanese Names]
- From: 黒鉄章
- Re: [tlug] Search MySQL for Japanese Names]
- From: Jim Breen
Home | Main Index | Thread Index
- Prev by Date: Re: [tlug] Search MySQL for Japanese Names]
- Next by Date: Re: [tlug] Search MySQL for Japanese Names]
- Previous by thread: Re: [tlug] Search MySQL for Japanese Names]
- Next by thread: Re: [tlug] Search MySQL for Japanese Names]
- Index(es):
Home Page Mailing List Linux and Japan TLUG Members Links