Content from the SCAPE group

Search filter terms
Filter by category
Filter by type
Filter by tag
Filter by user
Filter by licence
Filter by wsdl
Results per page:
Sort by:
Showing 72 results. Use the filters on the left and the search box below to refine the results.
Uploader

Workflow Matchbox Evaluation (1)

Thumb
Matchbox evaluation against ground truth. The evaluation process first creates the matchbox output and ground truth lists. It then counts each page tuple from the matchbox output that is in the ground truth as correctly identified tuple (true positive). Those that are not in the ground truth are counted as incorrectly identified tuples (false positives), and finally, those that are in the ground truth but not in the matchbox output are counted as missed tuples (false negatives). The precision...

Created: 2012-10-02 | Last updated: 2012-10-02

Credits: User Sven

Uploader

Workflow Hadoop Large Document Collection Data Prep... (1)

Thumb
Workflow for preparing large document collections for data analysis. Different types of hadoop jobs (Hadoop-Streaming-API, Hadoop Map/Reduce, and Hive) are used for specific purposes. The *PathCreator components create text files with absolute file paths using the unix command 'find'. The workflow then uses 1) a Hadoop Streaming API component (HadoopStreamingExiftoolRead) based on a bash script for reading image metadata using Exiftool, 2) the Map/Reduce component (HadoopHocrAvBlockWidthMapR...

Created: 2012-08-17 | Last updated: 2012-08-18

Credits: User Sven

Uploader

Workflow Hadoop hOCR parser (1)

Thumb
Big data processing: chaining Hadoop jobs using Taverna. This workflow demonstrates a simple way of linking different hadoop job components using the standard output of the hadoop jobs. It is not for thought for productive use, but for demonstration using small data sets. The code for the hadoop jobs is available on Github: tb-lsdr-hocrparser and tb-lsdr-seqfilecreator.

Created: 2012-08-07 | Last updated: 2012-08-07

Credits: User Sven

Uploader

Workflow Find Duplicates using Matchbox command lin... (1)

Thumb
The workflow takes a list of digital documents as input, extracts SIFT features using image processing algorithms, creates dictionary of visual words, generates BoW (Bag of Words) histogramms and finds duplicates. The count of parallel threads can be passed as a parameter. Finally search results are stored in a text file that contains a list of possible duplicates with associated similarity score. This score values are spread between 0 (low similarity) and 1 (high similarity). Image compariso...

Created: 2012-07-31 | Last updated: 2012-07-31

Credits: User Roman

Workflow Simple executable plan example (1)

Thumb
A simple executable plan containing a migration action component and a QA component. The migration action uses imagemagick convert. The QA component consists of two characterisation components (fits) and another QA components (imagemagick compare).

Created: 2012-06-29

Credits: User Markus Plangg

Workflow Mp3ToWav Mig+QA (2)

Thumb
Migrates Mp3 files from input List to Wav files and performs QA. The QA steps include File Format Validation, Significant Property Comparison and migrationQA file content comparison (all CLI). This workflow matches solution SO4 on the SCAPE opf wiki, see http://wiki.opf-labs.org/display/SP/SO4+Audio+mp3+to+wav+Migration+and+QA+Workflow

Created: 2012-05-21 | Last updated: 2013-04-24

Credits: User Bolette Jurik

Uploader

Workflow A heuristic measure for detecting undesire... (2)

Thumb
The workflow takes TIFF image instances as input, applies a list of JP2 compression parameter values, executes OCR using an open source OCR engine, evaluates the results, and creates a diagram visualising the results. Dependencies on external tools for the tool service components: Tesseract ImageMagick Kakadu Gnuplot Dependencies on external Java libraries of beanshells: Apache commons lang

Created: 2012-02-06 | Last updated: 2012-03-09

Credits: User Sven

Workflow FFmpeg convert audio2aac (SOAP) (1)

Thumb
Converts supported audio files to AAC using FFmpeg through a SOAP webservice.

Created: 2012-01-16 | Last updated: 2012-01-16

Credits: User Rui Castro

Workflow ImageMagick convert image2jp2 (SOAP) (1)

Thumb
Converts supported image files to JPEG2000 using ImageMagick through a SOAP webservice.

Created: 2012-01-16 | Last updated: 2012-01-16

Credits: User Rui Castro

Workflow ImageMagick convert image2jp2 (REST) (1)

Thumb
Converts supported image files to JPEG2000 using ImageMagick through a REST webservice.

Created: 2012-01-16 | Last updated: 2012-01-16

Credits: User Rui Castro

Results per page:
Sort by: