[TheForge] searchable theforge archive revisited
Michael H. Murphy
blacksmith at comcast.net
Wed Nov 24 18:33:20 EST 2004
I hate to say this, but I think the e-mail addresses should be munged. That
will involve a lot more work, but our members might want to get in contact
with some of the people who wrote the stuff.
Delete signatures.
What kind of search engine do you envision? A straight text search for
keywords, or indexing the archive with something like Google's new indexing
software?
Murf
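
To make the two choices in the question above concrete, here is a minimal sketch, not anything the list has settled on. It contrasts a straight keyword scan, which rereads every message per query, with a small inverted index built once up front. The function names and the sample messages are made up for illustration.

```python
import re
from collections import defaultdict

def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())

def straight_search(messages, keyword):
    """Straight text search: scan every message again for each query."""
    kw = keyword.lower()
    return sorted(msg_id for msg_id, text in messages.items()
                  if kw in tokenize(text))

def build_index(messages):
    """Indexed search, step 1: map each word to the ids of messages using it."""
    index = defaultdict(set)
    for msg_id, text in messages.items():
        for word in tokenize(text):
            index[word].add(msg_id)
    return index

def indexed_search(index, keyword):
    """Indexed search, step 2: a single dictionary lookup per query."""
    return sorted(index.get(keyword.lower(), set()))

# Hypothetical sample data, keyed by message id.
messages = {
    1: "Forge welding flux",
    2: "Anvil height question",
    3: "Flux for forge welding",
}
index = build_index(messages)
print(straight_search(messages, "flux"))
print(indexed_search(index, "flux"))
```

Both calls return the same message ids. The scan costs nothing to set up but gets slower as the archive grows, while the index costs one pass to build and then answers each query with a lookup.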
> -----Original Message-----
> From: theforge-bounces at mailman.qth.net
> [mailto:theforge-bounces at mailman.qth.net] On Behalf Of terry l. ridder
> Sent: Wednesday, November 24, 2004 2:45 PM
> To: theforge at mailman.qth.net
> Subject: [TheForge] searchable theforge archive revisited
>
> hello;
>
> while working on the searchable theforge archive a couple questions
> have come up. comments and suggestions welcomed.
>
> 0. e-mail addresses in theforge archives. should they be:
>    a. deleted
>       i.e. luser@example.com => ""
>    b. munged
>       i.e. luser@example.com => luser at example dot com
> 1. urls. should they be:
>    a. validated.
>       i.e. check to see the url still resolves to a valid web page.
>    b. left as is.
> 2. signatures
>    a. deleted
>       i.e. anything after the trigraph '-- ' is deleted including the
>       trigraph.
>    b. left as is.
>
>
> general comments on the searchable theforge archive.
> 0. blank lines are being deleted.
> 1. the various footers inserted by qth.net are being deleted.
> 2. lines which contain only '>' are being deleted.
>
> --
> terry l. ridder ><>
>
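
The cleanup steps discussed in this thread can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code actually used for the archive: it assumes addresses appear in the usual user@host form, that the signature delimiter is a line consisting of exactly '-- ', and that lines are separated by newlines. All function names are hypothetical. It implements munging (option 0b), signature deletion (option 2a), and the removal of blank lines and lines containing only '>'. URL validation is left out because it needs network access.

```python
import re

# Assumption: addresses look like user@host.tld.
EMAIL_RE = re.compile(
    r"\b([A-Za-z0-9._%+-]+)@([A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+)\b"
)

def munge_addresses(text):
    """Option 0b: luser@example.com => luser at example dot com."""
    def repl(match):
        user, host = match.group(1), match.group(2)
        return f"{user} at {host.replace('.', ' dot ')}"
    return EMAIL_RE.sub(repl, text)

def strip_signature(lines):
    """Option 2a: drop the '-- ' delimiter line and everything after it."""
    for i, line in enumerate(lines):
        if line == "-- ":
            return lines[:i]
    return lines

def drop_noise(lines):
    """Drop blank lines and lines that contain only '>'."""
    return [ln for ln in lines if ln.strip() and ln.strip() != ">"]

def clean_message(body):
    """Apply signature removal, noise removal, then address munging."""
    lines = body.split("\n")
    lines = strip_signature(lines)
    lines = drop_noise(lines)
    return munge_addresses("\n".join(lines))

sample = "hello luser@example.com\n\n>\nsee you\n-- \nterry\nme@example.org"
print(clean_message(sample))
```

The signature is stripped before the other steps so that the trailing space in '-- ' is still intact when it is matched. Switching from munging to deletion (option 0a) would only change the replacement function to return an empty string.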