Excerpt | ||
---|---|---|
| ||
Instructions on how to run JWAT-Tools. |
Installing and running
To install JWAT-Tools simply unpack the archive.
To run JWAT-Tools use the Windows or Linux scripts included in the package.
The scripts can be called from any location.
Info | ||||
---|---|---|---|---|
| ||||
jwattools.cmd jwattools_debug.cmd jwattools_debug_suspended.cmd |
Info | ||||
---|---|---|---|---|
| ||||
jwattools.sh jwattools_debug.sh jwattools_debug_suspended.sh |
Options
The command line interface has changed yet again for v0.5.6.
The main help page only lists command and global options.
Use jwattools help <command> to show a command's usage.
Code Block | ||||
---|---|---|---|---|
| ||||
C:\Java\workspace\jwat-tools>target\jwat-tools-0.5.6-SNAPSHOT\jwattools.cmd
JWATTools v0.5.6
usage: JWATTools <command> [<args>]
Commands:
arc2warc convert ARC to WARC
cdx create a CDX index (unsorted)
compress compress
decompress decompress
extract extract ARC/WARC record(s)
interval interval extract
pathindex create a heritrix path index (unsorted)
test test validity of ARC/WARC/GZip file(s)
unpack unpack multifile GZip
See 'jwattools help <command>' for more information on a specific command.
C:\Java\workspace\jwat-tools> |
Command line interface for v0.5.5.
Code Block | ||||
---|---|---|---|---|
| ||||
C:\Java\workspace\jwat-tools>target\jwat-tools-0.5.5-SNAPSHOT\jwattools.cmd
JWATTools v0.5.5
Usage: JWATTools [-dte19] command [file ...]
Commands:
arc2warc convert ARC to WARC
cdx create a CDX index (unsorted)
compress compress
decompress decompress
extract extract ARC/WARC record(s)
interval interval extract
pathindex create a heritrix path index (unsorted)
test test validity of ARC/WARC/GZip file(s)
unpack unpack multifile GZip
Options:
-r recursive (currently has no effect)
-w<x> set the amount of worker thread(s) (defaults to 1)
Test options:
-e show errors
-l relaxed URL URI validation
-x to validate text/xml payload (eg. mets)
Compress options:
-1, --fast compress faster
-9, --slow compress better
C:\Java\workspace\jwat-tools> |
You can supply one or more files. Each file can contain * and/or ? wildcards, but only in the filename part of the path. You can use more wildcards at the same time if you want.