Mailing List Archive
tlug.jp Mailing List tlug archive tlug Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]Belated thanks (was: Re: [tlug] Mail archiving question)
- Date: Mon, 5 Nov 2007 20:59:51 +1100
- From: "Jim Breen" <jimbreen@example.com>
- Subject: Belated thanks (was: Re: [tlug] Mail archiving question)
On 04/08/2007, Stephen J. Turnbull <stephen@example.com> wrote: > Jim Breen writes: > > > I foolishly volunteered to help set up a searchable > > email archive for the Honyaku mailing list (A few > > TLUGers are also on that list.) My current task is to > > extract the essential headers (From, Subject, Date, ...) > > and the body of the email, convert them to UTF-8 and > > store them as one file per email. I am working on a collection > > of about 40,000 accumulated emails from the last 18 months. [...] > MHonArc may have an appropriate option. MHonArc worked well, although it did more than I wanted, e.g. dressing each email up in prettified HTML. Also it turned all the Japanese and Chinese into entity codes. My ultimate solution was to run each email through MHonArc, then pipe the output through htlml2text, and then through ascii2uni to recover the Japanese/Chinese as UTF8. Thanks to Stephen and Josh for the suggestions. Cheers Jim -- Jim Breen Honorary Senior Research Fellow Clayton School of Information Technology, Monash University, VIC 3800, Australia http://www.csse.monash.edu.au/~jwb/
Home | Main Index | Thread Index
- Prev by Date: RE: [tlug] grub/possible sata issue
- Next by Date: [tlug] [Announcement] TLUG Technical Meeting 2007-11-10
- Previous by thread: RE: [tlug] grub/possible sata issue: What is MIMO?
- Next by thread: [tlug] A Swap Question
- Index(es):
Home Page Mailing List Linux and Japan TLUG Members Links