ARC to WARC Migration with CDX Index and w...
(1)
Workflow for migrating ARC to WARC and comparing the CDX index files (Linux).
The workflow has an input port “input_directory” which is a local path to the directory containing the ARC files, and an input port “output_directory” which is the directory where the workflow outputs are created. The files in the input directory are migrated using the “arc2warc_migration_cli” tool service component to perform the migration. The “cdx_creator_arc” and “cdx_creator_warc” tool service components creat...
Created: 2014-07-09
Credits:
Sven