Results

Back to:  VRC Approach  |  VRC Tool Box

Control Site Testing

Site Mappers

Comparison of 5 Site Mappers
Site-VRC Control Site
Settings-generally the default settings

Tool
Time
Requests
Bytes
Status Codes
Begin
Run
Num
Freq
Total
200
200
206
301
304
403
404
500
SM 14:58:23 69 270 0.256 1043405 1041121 253 0 2 0 5 8 2
C 14:39:59 14 229 0.061 1436901 1429204 207 0 1 0 13 6 2
X 14:57:14 511 415 1.231 8300920 7963589 379 19 1 8 4 4 0
PM 15:49:31 169 581 0.291 2385332 2383718 557 0 1 16 3 4 0
SMP 08:42:42 28 259 0.108 3603415 3573757 251 2 0 1 2 3 0

The Tool column identifies the Site Mappers we tested: SiteMapper (SM), Custo (C), SiteXpert (X), PowerMapper (PM), and Site Map Pro (SMP). The Time columns note the start time and run time in seconds for the test. The Requests columns documents the number of requests and the frequency – a request was made every n seconds. The Bytes column notes the total number of bytes for all pages requested by the tool, and the number of bytes for pages with status code 200. The Status Codes columns note the number of pages in each status code category found by the tool. These are HTTP status codes that equate to: 200 = okay, 206 = Partial Content, 301 = Moved Permanently, 304 = Not Modified, 403 = Forbidden, 404 = Not Found, and 500 = Internal Server Error. The full set of HTTP status codes and definitions is available at: http://www.ietf.org/rfc/rfc2616.txt.

 

Simulated Testing

Site Mappers

Tool: Custo
Sites: DPM Tutorial, VRC

Notes: Custo provides a limited amount of raw data, but does export valuable data that can be imported into a database or otherwise manipulated in an external application:
 —Raw data: Start time, end time, addresses counts for retrived, processed, accepted and failed
 —XML export: Site file information (file name, size, type, modification date, and path)
 —Text export: file paths, external links
 —HTML export: site map, error list

Site
Time
Addresses
File Types
Ext Links
Name
Size
Begin
Run
Ret
Proc
Acc
Fail
total
htm
image
pdf
other
good
404
DPM 5.8 MB 10:44:05 0:0:45 597 597 585 12 479 85 392 1 1 101 5
VRC 644 KB 14:03:45 0:03:38 292 292 274 18 110 61 47 1 1 164 3

Run is run time in seconds, Addresses: Ret=Retrieved , Proc=Processed, Acc=Accepted, Fail=Failed.