Instructions on how to run JWAT-Tools.
The command line interface is a work in progress, so the arguments and options will be refactored at some point.
Unfortunately, I have a small command line package that also requires refactoring.
Options
The following options are currently available in JWAT-Tools.
    C:\Java\workspace\jwat-tools>target\jwat-tools-0.5.4-SNAPSHOT\jwattools.cmd
    JWATTools v0.5.4
    usage: JWATTools [-dte19] [file ...]
     -t   test validity of ARC, WARC and/or GZip file(s)
     -r   recursive
     -e   show errors
     -d   decompress
     -l   relaxed URL URI validation
     -x   to validate text/xml payload (eg. mets)
     -1   compress faster
     -9   compress better
     -i   interval extract
     -u   unpack multifile gzip
     -c   convert arc to warc
     -C   output CDX
You can supply one or more files. Each file argument can contain * and/or ? wildcards, but only in the filename part of the path. Several wildcards can be combined in the same filename.
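To make the wildcard rule concrete, here is a minimal Python sketch of how a file argument with wildcards in its filename part could be expanded. It is not the JWAT-Tools implementation: the function name `expand_file_argument` is hypothetical, and case-sensitive matching and sorted output are assumptions made for the illustration.

```python
import os
import re


def expand_file_argument(argument):
    """Expand * and ? in the filename part of a path.

    The directory part is used literally, so wildcards there are not expanded.
    Hypothetical helper for illustration only, not part of JWAT-Tools.
    """
    directory, pattern = os.path.split(argument)
    if "*" not in pattern and "?" not in pattern:
        return [argument]  # no wildcards: pass the argument through unchanged

    # Translate the pattern: * matches any run of characters, ? exactly one.
    regex = re.compile(
        "".join(
            ".*" if ch == "*" else "." if ch == "?" else re.escape(ch)
            for ch in pattern
        ),
        re.DOTALL,
    )

    matches = []
    # An empty directory part means the current directory.
    for name in sorted(os.listdir(directory or ".")):
        path = os.path.join(directory, name)
        # Assumption: matching is case-sensitive and only regular files count.
        if regex.fullmatch(name) and os.path.isfile(path):
            matches.append(path)
    return matches
```

With this sketch, an argument such as `archive\a-?.warc.gz` would match `a-1.warc.gz` and `a-2.warc.gz` inside `archive`, while a wildcard placed in the directory part would be treated as a literal directory name.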
...
-c (convert)
Convert ARC files to WARC.
-C (output CDX)
Index one or more ARC/WARC files and output the result in CDX format.