Planned ready March 2012. Production quality of the WARC module with all main features implemented
Note that a final release will follow which may be used for extending the features found here.
Tasks - in progress or planned (can be moved accordingly)
- Refactor Gzip code.
- Refactor HttpResponse parser code.
- During the PWG January teleconference people enquired if the modules would be testet with non-Heritrix produces WARC files. Test data could possibly be obtained from the following sources.
- Stanford
- Tobias/Germany?