[TheForge] searchable theforge archive revisited

terry l. ridder terrylr at blauedonau.com
Wed Nov 24 14:44:35 EST 2004


hello;

while working on the searchable theforge archive a couple questions
have come up. comments and suggestions welcomed.

0. e-mail addresses in theforge archives. should they be:
     a. deleted
        i.e. luser at example.com => ""
     b. munged
        i.e. luser at example.com => luser at example dot com
1. urls. should they be:
     a. validated.
        i.e. check to see the url still resolves to a valid web page.
     b. left as is.
2. signatures
     a. deleted
        i.e. anything after the trigraph '-- ' is deleted including the
             trigraph.
     b. left as is.


general comments on the searchable theforge archive.
0. blank lines are being deleted.
1. the various footers inserted by qth.net are being deleted.
2. lines which contain only '>' are being deleted.

-- 
terry l. ridder ><>


More information about the TheForge mailing list