GAMMA

Step 3 for Unix PubCrawler:
Automating your searches.

Please report problems to pubcrawlerREMOVECAPShelp@gmail.com
 

Automating your searches

When you have completed Steps ALPHA and BETA and have created a search strategy of your own, you should set it up to run daily. Each day, PubCrawler compares the new search results with those from the previous day and puts the new entries on the results Web page for you to peruse.

1. Open your crontab file by entering the following command:

crontab -e

2. Set up a schedule for PubCrawler. For example, if you enter the following line:

10 07 * * 1-5 "pubcrawler.pl"
PubCrawler will be started at 07:10 every weekday (Monday to Friday = days 1-5 of the week).
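
The schedule part of a crontab entry consists of five whitespace-separated fields (minute, hour, day of month, month, day of week) followed by the command to run. As a rough illustration, the sketch below splits the example entry into those parts. The function name split_entry is hypothetical and not part of PubCrawler or cron.

```shell
#!/bin/sh
# Hypothetical helper (not part of PubCrawler): split a crontab entry into
# its five schedule fields and the command that follows them.
split_entry() {
    # `read` assigns one whitespace-separated word per variable; the last
    # variable receives the rest of the line, i.e. the command.
    read -r minute hour day_of_month month day_of_week command <<EOF
$1
EOF
    printf 'minute=%s\n'       "$minute"
    printf 'hour=%s\n'         "$hour"
    printf 'day_of_month=%s\n' "$day_of_month"
    printf 'month=%s\n'        "$month"
    printf 'day_of_week=%s\n'  "$day_of_week"
    printf 'command=%s\n'      "$command"
}

# The example entry from step 2: 07:10 on days 1-5 (Monday to Friday).
split_entry '10 07 * * 1-5 "pubcrawler.pl"'
```

An asterisk in a field means "every value", so the two asterisks here stand for every day of the month and every month.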

NOTE: In some cases you may need to give the full path to the script in the crontab entry, for example:
10 07 * * 1-5 "/home/jsmith/pubcrawler.pl"
Your home directory may be substituted with the variable $HOME:
10 07 * * 1-5 "$HOME/pubcrawler.pl"
If PubCrawler cannot find your configuration file, specify its location in your crontab entry with the -c option:
10 07 * * 1-5 "$HOME/pubcrawler.pl" -c "$HOME/pubcrawler.config"
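
Because cron runs jobs unattended, a wrong path in the entry is easy to miss. Before saving an entry like the ones above, it may help to confirm that the files it names exist. The following is a minimal sketch, assuming the script and the configuration file sit directly in your home directory as in the examples. The function name check_entry is hypothetical and not part of PubCrawler.

```shell
#!/bin/sh
# Hypothetical helper (not part of PubCrawler): verify the files a crontab
# entry refers to, then print the entry so it can be pasted into `crontab -e`.
check_entry() {
    script=$1
    config=$2
    if [ ! -f "$script" ]; then
        echo "missing script: $script" >&2
        return 1
    fi
    if [ ! -r "$config" ]; then
        echo "missing or unreadable configuration file: $config" >&2
        return 1
    fi
    # Same schedule as in the examples: 07:10 on weekdays.
    printf '10 07 * * 1-5 "%s" -c "%s"\n' "$script" "$config"
}

# Assumed locations; adjust them to wherever you installed PubCrawler.
check_entry "$HOME/pubcrawler.pl" "$HOME/pubcrawler.config" \
    || echo "Fix the paths before editing your crontab."
```

The sketch only prints the entry and never modifies your crontab, so it is safe to run repeatedly.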

Please schedule your searches during NCBI's off-peak hours (1am to 1pm GMT)!
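
Cron normally interprets the hour field in the local time zone of your machine, whereas the off-peak window is given in GMT, so you may need to convert before choosing an hour. The sketch below checks whether a given GMT hour lies inside the window. The function name is_off_peak is hypothetical, and treating 1am as included and 1pm as excluded is an assumption.

```shell
#!/bin/sh
# Hypothetical helper (not part of PubCrawler): succeed if the given GMT
# hour (0-23) falls inside the off-peak window of 1am to 1pm GMT.
# Assumption: 01:00 is inside the window and 13:00 is outside it.
is_off_peak() {
    hour=${1#0}        # drop one leading zero so that "07" is read as 7
    hour=${hour:-0}    # a bare "0" becomes empty above; restore it
    [ "$hour" -ge 1 ] && [ "$hour" -lt 13 ]
}

# Current hour in GMT/UTC, as printed by the standard `date` utility.
now=$(date -u +%H)
if is_off_peak "$now"; then
    echo "GMT hour $now is inside the off-peak window."
else
    echo "GMT hour $now is outside the off-peak window."
fi
```

Running the script at the local time you intend to put in your crontab shows whether that time falls inside the window.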
 


For power users....

PubCrawler provides many ways to customize the output.

You can change....

The number of days each document is listed (default: 10)
The number of documents to display per page (default: 20)
The maximum age of documents to consider retrieving (default: 100 days)
The name and location of the output file (default: pubcrawler_output.html)
.... and many others.
These options are all specified in the pubcrawler.config file. You can also specify Web proxy information and make use of additional network checking features. Full details are given in the pubcrawler.config file and on the Technical Description page.


Last modified at $Date: 2018/05/25 12:12:10 $