
[tools-yak@collab] Tools-yak archive indexer

To: tools-yak@xxxxxxxxxxxxxxxxxxx
From: "Peter P. Jones" <ppj@xxxxxxxxxxxxxxxxxxxxx>
Date: Sat, 04 Oct 2003 22:38:31 +0100
Message-id: <3F7F4BE7.16292.C181B1@localhost>
Hi,
I've created an indexer in Perl that scrapes words and phrases from 
the tools-yak archive (and potentially any other archive with similar 
HTML markup, such as yak) and puts them into a simple DB table. Each 
word or phrase is stored alongside the full link, including the nid, 
and the nid on its own.
It can be found at
http://www.concept67.fsnet.co.uk/temp/bo_idxr_031.zip
Rudimentary instructions are near the top of the Perl script.
It uses DBI to connect to MySQL on my machine, but the DBI parameters 
can be changed.    (01)
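
As a rough illustration of the idea only (this is not the script in the 
zip), here is a minimal sketch in Python rather than Perl, with sqlite3 
standing in for DBI/MySQL. The paragraph markup in SAMPLE_HTML, the 
example.org links, the word_index table, the "nidNN" form of the nid, and 
all function names are assumptions made up for the example; the real 
archive HTML and table layout may differ.

```python
import re
import sqlite3
from html.parser import HTMLParser

# Hypothetical markup: each paragraph carries an anchor whose href ends in
# a "#nidNN" fragment. The real archive HTML may look different.
SAMPLE_HTML = """
<p>I've created an indexer in perl
<a href="http://example.org/mesg.cgi?i=abc#nid01">(01)</a></p>
<p>Comments welcomed.
<a href="http://example.org/mesg.cgi?i=abc#nid03">(03)</a></p>
"""


class ParagraphScraper(HTMLParser):
    """Collect (text, link, nid) for each <p> that contains a nid anchor."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._text = []
        self._link = None
        self._in_nid_anchor = False

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self._text, self._link = [], None
        elif tag == "a":
            href = dict(attrs).get("href") or ""
            if "#nid" in href:
                self._link = href
                self._in_nid_anchor = True

    def handle_endtag(self, tag):
        if tag == "a":
            self._in_nid_anchor = False
        elif tag == "p" and self._link:
            nid = self._link.rsplit("#", 1)[1]
            self.rows.append((" ".join(self._text), self._link, nid))
            self._link = None

    def handle_data(self, data):
        # Skip the visible "(01)" label inside the nid anchor itself.
        if not self._in_nid_anchor:
            self._text.append(data)


def build_index(html, conn):
    """Store one row per distinct word per paragraph: word, full link, nid."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS word_index (word TEXT, link TEXT, nid TEXT)"
    )
    scraper = ParagraphScraper()
    scraper.feed(html)
    for text, link, nid in scraper.rows:
        for word in sorted(set(re.findall(r"[a-z0-9']+", text.lower()))):
            conn.execute(
                "INSERT INTO word_index VALUES (?, ?, ?)", (word, link, nid)
            )
    conn.commit()


def lookup(conn, word):
    """Return (link, nid) pairs for every paragraph containing the word."""
    cur = conn.execute(
        "SELECT link, nid FROM word_index WHERE word = ? ORDER BY nid",
        (word.lower(),),
    )
    return cur.fetchall()


conn = sqlite3.connect(":memory:")
build_index(SAMPLE_HTML, conn)
print(lookup(conn, "indexer"))
```

Swapping the sqlite3 connection for another database driver would be the 
rough equivalent of changing the DBI parameters mentioned above. This 
sketch indexes single words only; phrases would need an extra tokenizing 
step.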

It's the start of something I'm working on very slowly, but I thought 
folks might like to play with it and, almost certainly, make 
improvements.
It is released under the LGPL.    (02)

Comments welcomed.    (03)

-- 
Peter    (04)

-- 
This message is archived at:    (05)

http://collab.blueoxen.net/forums/cgi-bin/mesg.cgi?a=tools-yak&i=3F7F4BE7.16292.C181B1@localhost    (06)