You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
35 lines
1.6 KiB
35 lines
1.6 KiB
ht://Dig contributed scripts
|
|
|
|
This directory tree contains perl and shell programs that attempt to
|
|
do things with the generated databases. Most of these were written
|
|
for a very specific purpose for the specific version of ht://Dig that
|
|
was current at that point. This means that some of these programs
|
|
will be severely broken! Do not expect them to work; use them only as
|
|
examples of the types of things you can do with the ht://Dig
|
|
databases.
|
|
|
|
More contributed work is available on the ht://Dig website:
|
|
<http://www.htdig.org/contrib/>
|
|
|
|
What's here:
|
|
|
|
acroconv.pl An external converter script that uses acroread to parse PDFs
|
|
autorun An example of automating the database building
|
|
changehost A script to change hostnames of URLs in the databases
|
|
conv_doc.pl A sample script to use the conversion features of external_parsers
|
|
doclist List the information in the doc db (or after a certain date)
|
|
ewswrap Two sample htsearch wrappers to emulate Excite for Web
|
|
Servers (EWS) and to simplify queries
|
|
handler.pl A sample external_protocols script to handle HTTP/HTTPS using curl
|
|
htparsedoc A sample shell script to parse Word documents
|
|
multidig A set of scripts to simplify updating multiple databases
|
|
parse_doc.pl A general external parser script that handles MS Word documents
|
|
(among others)
|
|
run-robot.sh Another example of automating the database building
|
|
scriptname An example of using htsearch within dynamic SSI pages
|
|
status.pl Build a status page of last 5 runs and top 10
|
|
servers (by # URLs)
|
|
urlindex Build an index of all the URLs in the database
|
|
whatsnew Build a "what's new" page with custom header and footer
|
|
wordfreq Build a list of words and frequency in the database
|