Results
Site Mappers
Comparison of 5 Site Mappers
Site: VRC Control Site
Settings: generally the default settings
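| Tool | Time: Begin | Time: Run (s) | Requests: Num | Requests: Freq (s) | Bytes: Total | Bytes: 200 | Status 200 | Status 206 | Status 301 | Status 304 | Status 403 | Status 404 | Status 500 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|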
| SM | 14:58:23 | 69 | 270 | 0.256 | 1043405 | 1041121 | 253 | 0 | 2 | 0 | 5 | 8 | 2 |
| C | 14:39:59 | 14 | 229 | 0.061 | 1436901 | 1429204 | 207 | 0 | 1 | 0 | 13 | 6 | 2 |
| X | 14:57:14 | 511 | 415 | 1.231 | 8300920 | 7963589 | 379 | 19 | 1 | 8 | 4 | 4 | 0 |
| PM | 15:49:31 | 169 | 581 | 0.291 | 2385332 | 2383718 | 557 | 0 | 1 | 16 | 3 | 4 | 0 |
| SMP | 08:42:42 | 28 | 259 | 0.108 | 3603415 | 3573757 | 251 | 2 | 0 | 1 | 2 | 3 | 0 |
The Tool column identifies the site mappers we tested: SiteMapper (SM), Custo (C), SiteXpert (X), PowerMapper (PM), and Site Map Pro (SMP). The Time columns give the start time and the run time in seconds for each test. The Requests columns give the number of requests and the frequency, that is, the average interval in seconds between requests (run time divided by number of requests). The Bytes columns give the total number of bytes for all pages requested by the tool and the number of bytes for pages returned with status code 200. The Status Codes columns give the number of pages in each status code category found by the tool. These are HTTP status codes: 200 = OK, 206 = Partial Content, 301 = Moved Permanently, 304 = Not Modified, 403 = Forbidden, 404 = Not Found, and 500 = Internal Server Error. The full set of HTTP status codes and definitions is available at: http://www.ietf.org/rfc/rfc2616.txt.
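As a quick cross-check of the table, the sketch below recomputes the Freq column as run time divided by number of requests and confirms that the status code counts in each row add up to the number of requests. The numbers are copied from the table; the function names and the rounding tolerance are illustrative choices, not part of the original test procedure.

```python
# Values copied from the comparison table: tool, run time (s), number of
# requests, reported frequency (s per request), and the seven status-code
# counts in the order 200, 206, 301, 304, 403, 404, 500.
ROWS = [
    ("SM", 69, 270, 0.256, [253, 0, 2, 0, 5, 8, 2]),
    ("C", 14, 229, 0.061, [207, 0, 1, 0, 13, 6, 2]),
    ("X", 511, 415, 1.231, [379, 19, 1, 8, 4, 4, 0]),
    ("PM", 169, 581, 0.291, [557, 0, 1, 16, 3, 4, 0]),
    ("SMP", 28, 259, 0.108, [251, 2, 0, 1, 2, 3, 0]),
]


def seconds_per_request(run_seconds, num_requests):
    """Average interval between requests, in seconds."""
    return run_seconds / num_requests


def row_is_consistent(row, tolerance=0.001):
    """Check the reported Freq value and the status-code total for one row.

    The tolerance is an assumption that allows for the table's 3-decimal rounding.
    """
    _tool, run_seconds, num_requests, reported_freq, counts = row
    freq_ok = abs(seconds_per_request(run_seconds, num_requests) - reported_freq) < tolerance
    counts_ok = sum(counts) == num_requests
    return freq_ok and counts_ok


if __name__ == "__main__":
    for row in ROWS:
        tool, run_seconds, num_requests = row[0], row[1], row[2]
        freq = round(seconds_per_request(run_seconds, num_requests), 3)
        print(tool, freq, row_is_consistent(row))
```

A lower Freq value means the tool issued requests more rapidly, so the column is a rough indicator of the load each mapper places on the server.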
Site Mappers
Tool: Custo
Sites: DPM Tutorial, VRC
Notes: Custo provides a limited amount of raw data, but it does export valuable data that can be imported into a database or otherwise manipulated in an external application:
- Raw data: start time, end time, and address counts for retrieved, processed, accepted, and failed
- XML export: site file information (file name, size, type, modification date, and path)
- Text export: file paths, external links
- HTML export: site map, error list
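| Site | Size | Time: Begin | Time: Run | Addresses: Ret | Addresses: Proc | Addresses: Acc | Addresses: Fail | File Types: total | File Types: htm | File Types: image | File Types: pdf | File Types: other | Ext Links: good | Ext Links: 404 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|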
| DPM | 5.8 MB | 10:44:05 | 0:0:45 | 597 | 597 | 585 | 12 | 479 | 85 | 392 | 1 | 1 | 101 | 5 |
| VRC | 644 KB | 14:03:45 | 0:03:38 | 292 | 292 | 274 | 18 | 110 | 61 | 47 | 1 | 1 | 164 | 3 |
Run is the run time, shown as h:mm:ss. Addresses: Ret = Retrieved, Proc = Processed, Acc = Accepted, Fail = Failed.
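The Custo figures can be sanity-checked in the same spirit. The sketch below converts the Run values to seconds, assuming they are h:mm:ss strings (an inference from their format), and checks two relationships that hold in the two rows shown but are not stated by the tool: accepted plus failed equals processed and retrieved, and the per-type file counts add up to the file total. The dictionary keys and function names are hypothetical.

```python
# Values copied from the Custo table; the key names are illustrative.
CUSTO_ROWS = [
    {"site": "DPM", "run": "0:0:45", "ret": 597, "proc": 597, "acc": 585, "fail": 12,
     "total": 479, "htm": 85, "image": 392, "pdf": 1, "other": 1},
    {"site": "VRC", "run": "0:03:38", "ret": 292, "proc": 292, "acc": 274, "fail": 18,
     "total": 110, "htm": 61, "image": 47, "pdf": 1, "other": 1},
]


def run_to_seconds(text):
    """Convert a run time such as '0:03:38' to seconds (assumes h:mm:ss)."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def addresses_balance(row):
    """Accepted + failed should match processed and retrieved (observed, not documented)."""
    return row["acc"] + row["fail"] == row["proc"] == row["ret"]


def file_types_balance(row):
    """The htm, image, pdf and other counts should add up to the file total."""
    return row["htm"] + row["image"] + row["pdf"] + row["other"] == row["total"]


if __name__ == "__main__":
    for row in CUSTO_ROWS:
        print(row["site"], run_to_seconds(row["run"]),
              addresses_balance(row), file_types_balance(row))
```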
