Running JWAT-Tools

Installing and running

To install JWAT-Tools simply unpack the archive.

To run JWAT-Tools use the Windows or Linux scripts included in the package.

The scripts can be called from any location.

Windows scripts

jwattools.cmd

jwattools_debug.cmd

jwattools_debug_suspended.cmd

Linux scripts

jwattools.sh

jwattools_debug.sh

jwattools_debug_suspended.sh

Options

The command line interface has changed yet again for v0.5.6.

The main help page only lists command and global options.

Use jwattools help <command> to show a command's usage.

Commandline options (v0.5.6)
C:\Java\workspace\jwat-tools>target\jwat-tools-0.5.6-SNAPSHOT\jwattools.cmd
JWATTools v0.5.6
usage: JWATTools <command> [<args>]
Commands:
   arc2warc    convert ARC to WARC
   cdx         create a CDX index (unsorted)
   compress    compress
   decompress  decompress
   extract     extract ARC/WARC record(s)
   interval    interval extract
   pathindex   create a heritrix path index (unsorted)
   test        test validity of ARC/WARC/GZip file(s)
   unpack      unpack multifile GZip

See 'jwattools help <command>' for more information on a specific command.

C:\Java\workspace\jwat-tools>

Command line interface for v0.5.5.

Commandline options (v0.5.5)
C:\Java\workspace\jwat-tools>target\jwat-tools-0.5.5-SNAPSHOT\jwattools.cmd
JWATTools v0.5.5
Usage: JWATTools [-dte19] command [file ...]
Commands:
   arc2warc     convert ARC to WARC
   cdx          create a CDX index (unsorted)
   compress     compress
   decompress   decompress
   extract      extract ARC/WARC record(s)
   interval     interval extract
   pathindex    create a heritrix path index (unsorted)
   test         test validity of ARC/WARC/GZip file(s)
   unpack       unpack multifile GZip
Options:
   -r      recursive (currently has no effect)
   -w<x>   set the amount of worker thread(s) (defaults to 1)
Test options:
   -e   show errors
   -l   relaxed URL URI validation
   -x   to validate text/xml payload (eg. mets)
Compress options:
   -1, --fast   compress faster
   -9, --slow   compress better

C:\Java\workspace\jwat-tools>

You can supply one or more files. Each file can contain * and/or ? wildcards, but only in the filename part of the path. You can use more wildcards at the same time if you want.