|
Step 3 for Unix PubCrawler:
|
Please report problems to pubcrawlerREMOVECAPShelp@gmail.com
1. Open your crontab-file by entering the following command:
crontab -e
2. Set up a schedule for PubCrawler. For example if you enter the following line:
10 07 * * 1-5 "pubcrawler.pl"PubCrawler would be started at 07:10 hours every weekday (Monday to Friday = days 1-5 of the week.
10 07 * * 1-5 "/home/jsmith/pubcrawler.pl"Your home directory may be substituted with the variable $HOME:
10 07 * * 1-5 "$HOME/pubcrawler.pl"If your configuration file could not be found, specify it in your crontab entry with the -c option:
10 07 * * 1-5 "$HOME/pubcrawler.pl" -c "$HOME/pubcrawler.config"
You can change....
The number of days each document is listed (default: 10)
The number of documents to display per page (default: 20)
The maximum age of documents to consider retrieving (default: 100 days)
The name of the output file, and its location (default: pubcrawler_output.html)
These options are all specified in the pubcrawl.config file. You can also specify Web proxy information, and make use of additional network checking features. Full details are given in the pubcrawler.config file and on the Technical Description page..... and many others.