Guide to what the different columns are? and set refresh?
« on: June 08, 2012, 09:54:55 AM »
Hello,

When you run the crawler from command line, you get the following output...
Resuming the last session (last updated: 1970-01-01 00:00:00)
1 | 67 | 52.6 | 0:00:01 | 0:01:28 | 1 | 1,418.6 Kb | 1 | 0 | 1418
20 | 48 | 1,077.2 | 0:00:04 | 0:00:10 | 1 | 1,760.2 Kb | 17 | 213 | 1760
40 | 28 | 2,207.0 | 0:00:09 | 0:00:06 | 1 | 1,879.3 Kb | 37 | 313 | 1879

and so on.

What does each column mean?
I can work out that column 4 is time taken, and 5 is estimated time to finish?
In the next release could these be added to the guide/readme?

On another note...
Previously a line was displayed every 20 URLs; in version 6 a line is displayed for every single URL:
995 | 2199 | 47,218.5 | 0:16:33 | 0:36:35 | 2 | 11,901.7 Kb | 651 | 5971 | 1781
996 | 2198 | 47,377.8 | 0:16:36 | 0:36:38 | 2 | 11,998.1 Kb | 652 | 6148 | 97
997 | 2197 | 47,544.7 | 0:16:38 | 0:36:41 | 2 | 12,185.8 Kb | 653 | 6332 | 187

Where is the setting to change how often a line appears?
I run the crawler as a cron job and have all cron output e-mailed to me, so it now produces a very large e-mail, roughly 20x larger than before.

Many Thanks,

Rob.
Re: Guide to what the different columns are? and set refresh?
« Reply #1 on: June 09, 2012, 02:55:43 PM »
Hello,

The output interval cannot be configured: a line is shown every 5 seconds or every 20 URLs, whichever comes first.
We recommend disabling the system cron e-mail for this task once you have made sure it is running correctly (you will still get the e-mail notification sent by the generator itself).
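If you would rather keep the cron e-mail, one possible workaround is to pipe the crawler output through a small filter that drops most progress lines. The following is only a sketch, not part of the generator: the function name `thin_progress` and the `every` parameter are made up for illustration, and it assumes a progress line is any line with ten pipe-separated columns, as in the sample output above.

```python
def thin_progress(lines, every=20):
    """Return the lines with most progress lines removed.

    Non-progress lines are kept as they are. Of the progress lines,
    only every `every`-th one is kept, plus the final one, which is
    appended at the end so the finishing state is still visible.
    """
    kept = []
    pending_last = None  # most recent progress line that was skipped
    count = 0
    for line in lines:
        line = line.rstrip("\n")
        # Assumption: a progress line has exactly ten columns, so nine "|".
        if line.count("|") != 9:
            kept.append(line)
            continue
        count += 1
        if count % every == 0:
            kept.append(line)
            pending_last = None
        else:
            pending_last = line
    if pending_last is not None:
        kept.append(pending_last)
    return kept
```

To use it as a filter, a script could call `thin_progress(sys.stdin)` and print each returned line, with the cron command piped into that script.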
Re: Guide to what the different columns are? and set refresh?
« Reply #2 on: June 13, 2012, 04:40:00 PM »
Just to bump the question of what the columns are.

Link | Links left this level | Links for next level | Time | Est. time left | Memory usage | ...but what are the other columns? And have I got the current ones right?

Many Thanks,

Rob.
Re: Guide to what the different columns are? and set refresh?
« Reply #3 on: June 14, 2012, 01:26:30 PM »
URLs scanned | URLs left (current depth level only) | downloaded bytes | time spent | estimated time left | depth level | memory usage | URLs queued | memory usage change
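For anyone who wants to process the output in a script, here is a rough sketch of how a progress line could be split into named fields. It is not taken from the generator itself; `parse_progress_line` and the field names are made up for illustration. Note that the list above names nine columns while the sample output has ten, so the sketch only names the first seven columns and the last one (assuming "memory usage change" is the final column), and leaves columns 8 and 9 as raw values.

```python
# Names for the first seven columns, following the list above.
LEADING = [
    "urls_scanned",
    "urls_left",
    "downloaded",
    "time_spent",
    "time_left",
    "depth_level",
    "memory_usage",
]


def parse_progress_line(line):
    """Split one progress line into a dict, or return None if it is not one."""
    parts = [p.strip() for p in line.split("|")]
    if len(parts) != 10:
        return None  # e.g. the "Resuming the last session" line
    row = dict(zip(LEADING, parts[:7]))
    # Columns 8 and 9: one of them is "URLs queued", the other is not
    # named in the list, so both are kept unlabelled here.
    row["unmapped"] = parts[7:9]
    # Assumption: the last name in the list matches the last column.
    row["memory_usage_change"] = parts[9]
    return row
```

The values are left as strings (including the thousands separators and the "Kb" suffix), since how they should be converted depends on what you want to do with them.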