You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

35 lines
1.6 KiB

ht://Dig contributed scripts
This directory tree contains perl and shell programs that attempt to
do things with the generated databases. Most of these were written
for a very specific purpose for the specific version of ht://Dig that
was current at that point. This means that some of these programs
will be severely broken! Do not expect them to work; use them only as
examples of the types of things you can do with the ht://Dig
databases.
More contributed work is available on the ht://Dig website:
<http://www.htdig.org/contrib/>
What's here:
acroconv.pl An external converter script that uses acroread to parse PDFs
autorun An example of automating the database building
changehost A script to change hostnames of URLs in the databases
conv_doc.pl A sample script to use the conversion features of external_parsers
doclist List the information in the doc db (or after a certain date)
ewswrap Two sample htsearch wrappers to emulate Excite for Web
Servers (EWS) and to simplify queries
handler.pl A sample external_protocols script to handle HTTP/HTTPS using curl
htparsedoc A sample shell script to parse Word documents
multidig A set of scripts to simplify updating multiple databases
parse_doc.pl A general external parser script that handles MS Word documents
(among others)
run-robot.sh Another example of automating the database building
scriptname An example of using htsearch within dynamic SSI pages
status.pl Build a status page of last 5 runs and top 10
servers (by # URLs)
urlindex Build an index of all the URLs in the database
whatsnew Build a "what's new" page with custom header and footer
wordfreq Build a list of words and frequency in the database