
TestBitrepository
Tool | Purpose | Denmark | France | Austria | Spain | Sweden
DeployApplication | Creates deploy scripts from a deploy-config | y




HarvestdatabaseUpdateApplication | Updates HarvestDB schema | y




BuildCompleteSettings | Merges the module settings files in NAS into one large global default settings file. Run as part of the release process. | y




GetFile | Retrieves a file via the ArcRepository interface





GetRecord | Retrieves a (w)arc-record via the ArcRepository interface





LoadDatabaseChecksumArchive | Migration tool from file-based checksums to database-based checksums





ReestablishAdminDatabase | Reestablishes the admin database from an 'admin.data' file





RunBatch | Runs a batch job from the command line





Upload | Uploads a file to the ArcRepository from the command line. (Handy for test data.) | y




ReestablishAdminDatabase | Should be deprecated (?). Reads the old admin.data file.





ClassDependencies | Non-NAS utility (license is not ours)





CreateIndex | CLI to talk to IndexServer via IndexClient





RunChecksum | CLI to get all checksums from a Bitarchive (deprecated)





SendDedupIndexRequestToIndexserver | Asynchronously starts dedup indexing on an IndexServer and then exits. Tue Hejlskov Larsen, is this what you use to generate deduplication indexes?







MakeIndex, FindRelevantCrawllogLines, Heritrix1Constants, JMXProxy, DeduplicateToCDXApplication, ResetFailedFiles, ARCReaderUtils | Runs a CDX extraction on a single file in a remote ArcRepository





FindRelevantCrawllogLines | Finds crawl-log lines matching a given domain name in a local metadata file





JMXProxy | "This tool will simply reregister all MBeans that match the given query from the JMX hosts read in settings, using its own platformmbeanserver. It will then wait forever."





DeduplicateToCDXApplication | Extracts CDX records for deduplicate annotations from a local crawl log file





ResetFailedFiles | Utility for WaybackIndexer to reset files that have failed more than 3 times so they can be retried





ARCReaderUtils | Splits an ARC file (not WARC) and dumps the results to a directory





ArcWrap






ExtractCDX






JMSBroker






WriteBytesToFile






FTPValidator








SimpleCmdlineTool








ArcMerge








ArchiveExtractCDX








WARCExtractCDX








ReformatTranslationFile








MailValidator








DigestIndexer








MakeNewMetadataFile








FindDomainsForCrawllogExtraction








CheckDuplicateReduction








StandaloneApplicationReduced








SchedulerDatabaseBuilder








MigrateDefaultHarvestDatabase






CreateCDXMetadataFile






Heritrix3ControllerTest






H3LaunchTest






HarvesterQueueControl






HarvestDatabaseValidator






HarvestTemplateApplication






CheckDomainCrawltraps






CheckTrapsInFile





