[TheForge] searchable theforge archive draft design
terry l. ridder
terrylr at blauedonau.com
Thu Nov 25 02:21:06 EST 2004
hello;
since there is still confusion about what the searchable theforge
archive is, i have put together this draft design document.
the theforge mailing list has basically three 'epochs'.
0. wugate epoch
1. majordomo qth.net epoch
2. mailman qth.net epoch
how epochs relate to time.
0. wugate epoch 1995 - dec 1998
1. majordomo qth.net epoch dec 1998 - dec 2001
2. mailman qth.net epoch dec 2001 - present
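as a rough sketch, the date ranges above could be turned into an epoch
lookup like this. the function name epoch_for and the EPOCH_STARTS list
are made up for illustration. months run 0-11 so that december is 11.
one assumption to note: a changeover month (dec 1998, dec 2001) is filed
under the later epoch, since the list above does not say which side it
belongs to.

```python
# sketch only: epoch numbers and date ranges come from the list above.
# assumption: a changeover month belongs to the later epoch.

# (epoch number, (first year, first month)); months are 0-11, so dec is 11
EPOCH_STARTS = [
    (1, (1998, 11)),  # majordomo qth.net epoch starts dec 1998
    (2, (2001, 11)),  # mailman qth.net epoch starts dec 2001
]


def epoch_for(year, month):
    """return 0, 1 or 2 for a message dated in the given year and month."""
    if year < 1995:
        raise ValueError("the wugate epoch starts in 1995")
    epoch = 0  # wugate epoch
    for number, start in EPOCH_STARTS:
        # tuples compare year first, then month
        if (year, month) >= start:
            epoch = number
    return epoch
```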
each epoch has its own message format, so the messages of each epoch
are processed very differently.
for each epoch theforge messages will be sorted by month and year.
the non-blank lines of each message will be numbered, starting from 0.
example:
0 date:
1 from:
2 to:
3 subject:
4 hello;
5 this is an example.
6 > forge_member at example dot com wrote
  >
7 > what is a good make of tig welder?
  >
  --
8 forge_member
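the numbering in the example could be sketched like this. the function
name number_lines is hypothetical. the rule that a line holding only
quote markers (">") or the signature delimiter ("--") counts as blank is
an assumption read off the example, not something stated elsewhere.

```python
import re

# assumption: empty lines, lines with only ">" markers, and the "--"
# signature delimiter are all treated as blank and get no number.
UNNUMBERED = re.compile(r"^[>\s]*$|^--\s*$")


def number_lines(message_text):
    """return (line_number, line_text) pairs for the non-blank lines."""
    numbered = []
    for line in message_text.splitlines():
        if UNNUMBERED.match(line):
            continue
        # the next number is simply how many lines were kept so far
        numbered.append((len(numbered), line))
    return numbered


sample = """date:
from:
to:
subject:

hello;
this is an example.
> forge_member at example dot com wrote
>
> what is a good make of tig welder?
>
--
forge_member"""

for number, text in number_lines(sample):
    print(number, text)
```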
each numbered line of a message will be inserted into a mysql database.
the table layout is roughly.
epoch  month  year          msg_number  line_number  line_text
0-2    0-11   1995-present  0-?         0-?          the line text
msg_number is the number of the message among those received in a
given month.
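a minimal sketch of that table follows. sqlite3 stands in for mysql here
only so the sketch runs without a server. the table name forge_lines,
the column types, the primary key, and the helper store_message are all
assumptions. only the six column names and their ranges come from the
layout above.

```python
import sqlite3

# sqlite3 is a stand-in for mysql; types and key are assumptions.
SCHEMA = """
CREATE TABLE forge_lines (
    epoch       INTEGER NOT NULL,  -- 0-2
    month       INTEGER NOT NULL,  -- 0-11
    year        INTEGER NOT NULL,  -- 1995-present
    msg_number  INTEGER NOT NULL,  -- 0-?, counted within the month
    line_number INTEGER NOT NULL,  -- 0-?, counted within the message
    line_text   TEXT    NOT NULL,
    PRIMARY KEY (epoch, year, month, msg_number, line_number)
)
"""


def store_message(db, epoch, month, year, msg_number, numbered_lines):
    """insert one row per (line_number, line_text) pair."""
    db.executemany(
        "INSERT INTO forge_lines VALUES (?, ?, ?, ?, ?, ?)",
        [(epoch, month, year, msg_number, n, text) for n, text in numbered_lines],
    )


db = sqlite3.connect(":memory:")
db.execute(SCHEMA)
store_message(db, 2, 10, 2004, 0, [(0, "date:"), (1, "hello;")])
print(db.execute("SELECT COUNT(*) FROM forge_lines").fetchone()[0])
```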
a search for a phrase would find the lines which match.
each matching line carries its epoch, month, year and msg_number, so
the original message can be found and displayed.
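the lookup could be sketched as below, again with sqlite3 standing in
for mysql. the table name forge_lines, the functions find_messages and
fetch_message, and the sample rows are made up for illustration. the
sketch prints the stored lines only. blank lines are not in the table,
so showing the message exactly as received would mean using the key to
find the original message file.

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.execute(
    "CREATE TABLE forge_lines (epoch INTEGER, month INTEGER, year INTEGER,"
    " msg_number INTEGER, line_number INTEGER, line_text TEXT)"
)
# made-up sample rows, two messages in the same month
rows = [
    (2, 10, 2004, 0, 0, "subject: welders"),
    (2, 10, 2004, 0, 1, "what is a good make of tig welder?"),
    (2, 10, 2004, 1, 0, "subject: anvils"),
    (2, 10, 2004, 1, 1, "looking for an anvil."),
]
db.executemany("INSERT INTO forge_lines VALUES (?, ?, ?, ?, ?, ?)", rows)


def find_messages(db, phrase):
    """return (epoch, month, year, msg_number) for messages with a matching line."""
    # note: % and _ inside the phrase act as LIKE wildcards in this sketch
    return db.execute(
        "SELECT DISTINCT epoch, month, year, msg_number FROM forge_lines"
        " WHERE line_text LIKE ? ORDER BY year, month, msg_number",
        ("%" + phrase + "%",),
    ).fetchall()


def fetch_message(db, key):
    """return the stored lines of one message, in line_number order."""
    found = db.execute(
        "SELECT line_text FROM forge_lines WHERE epoch = ? AND month = ?"
        " AND year = ? AND msg_number = ? ORDER BY line_number",
        key,
    ).fetchall()
    return [text for (text,) in found]


for key in find_messages(db, "tig welder"):
    print(key, fetch_message(db, key))
```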
options for searching the archive.
all epochs
a single epoch
a single month
a single year
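one way these options could map onto a query is sketched below. the
function build_search and the table name forge_lines are hypothetical.
the sketch assumes that an option left as None means "do not filter on
it", so passing only the phrase searches all epochs, and that options
may be combined (for example a single month within a single year).

```python
def build_search(phrase, epoch=None, month=None, year=None):
    """return (sql, params); a filter left as None is not applied."""
    clauses = ["line_text LIKE ?"]
    params = ["%" + phrase + "%"]
    for column, value in (("epoch", epoch), ("month", month), ("year", year)):
        if value is not None:
            # column names are fixed here; only values come from the caller
            clauses.append(column + " = ?")
            params.append(value)
    sql = (
        "SELECT epoch, month, year, msg_number, line_number, line_text"
        " FROM forge_lines WHERE " + " AND ".join(clauses)
    )
    return sql, params


# all epochs
print(build_search("tig welder"))
# a single epoch and a single year
print(build_search("tig welder", epoch=2, year=2004))
```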
poor ascii art
+--------+     +-------+     +-------+
| apache |<--->| perl  |<--->| mysql |
| httpd  |     | cgi   |     | db    |
|        |     | script|     |       |
+---+----+     +-------+     +-------+
    |-----------------+----------------+
    |                 |                |
+---+----+     +------+----+     +-----+---+
| wugate |     | majordomo |     | mailman |
| epoch  |     | epoch     |     | epoch   |
| msgs   |     | msgs      |     | msgs    |
+--------+     +-----------+     +---------+
--
terry l. ridder ><>