[TheForge] searchable theforge archive revisited
Frederick Faller
f_faller at yahoo.com
Wed Nov 24 16:37:16 EST 2004
Terry,
Are you doing this all by hand?
Which archive files (i.e. how far back) are you doing?
Just the files from qth.net?
Including the ones from the very beginning on the
old system?
I have a program that I wrote that takes the original
files and does an auto cleanup and puts them in a
quick searchable format. If we could compile an entire
archive and make it so it could easily be updated, it
would all fit on a single CD and we could add this
search program to make it usable. The search program
allows boolean searches (i.e. search for "bolts" +
"square head" but not "New York")
Let me know how I can help.
Frederick Faller
--- "terry l. ridder" <terrylr at blauedonau.com> wrote:
> hello;
>
> while working on the searchable theforge archive a
> couple questions
> have come up. comments and suggestions welcomed.
>
> 0. e-mail addresses in theforge archives. should
> they be:
> a. deleted
> i.e. luser at example.com => ""
> b. munged
> i.e. luser at example.com => luser at example
> dot com
> 1. urls. should they be:
> a. validated.
> i.e. check to see the url still resolves to
> a valid web page.
> b. left as is.
> 2. signatures
> a. deleted
> i.e. anything after the trigraph '-- ' is
> deleted including the
> trigraph.
> b. left as is.
>
>
> general comments on the searchable theforge archive.
> 0. blank lines are being deleted.
> 1. the various footers inserted by qth.net are being
> deleted.
> 2. lines which contain only '>' are being deleted.
>
> --
> terry l. ridder ><>
> _______________________________________________
> Manage membership or unsubscribe at:
> http://mailman.qth.net/mailman/listinfo/theforge
> theforge mail list group photo site is
> http://www.photoaccess.com
> Login: blacksmithblacksmith at hotmail.com
> password: anvil
> ___________
>
>
>
=====
Frederick W. Faller
Shiloh Forge Ironware http://users.rcn.com/ffaller/SFI_web.htm
www.immerland.com
__________________________________
Do you Yahoo!?
Yahoo! Mail - Helps protect you from nasty viruses.
http://promotions.yahoo.com/new_mail
More information about the TheForge
mailing list