Tag Results

Items tagged with "duplicates search" (2)


Workflows (2)

Workflow Find Duplicates using Matchbox command lin... (1)

The workflow takes a list of digital documents as input, extracts SIFT features using image processing algorithms, creates a dictionary of visual words, generates BoW (Bag of Words) histograms, and finds duplicates. The number of parallel threads can be passed as a parameter. Finally, the search results are stored in a text file that lists possible duplicates with their associated similarity scores. Scores range from 0 (low similarity) to 1 (high similarity). Image compariso...

Created: 2012-07-31 | Last updated: 2012-07-31

Credits: User Roman
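
The duplicate-finding step of the Find Duplicates workflow above can be illustrated with a minimal sketch. It assumes the BoW histograms have already been computed and uses histogram intersection as the similarity measure. That measure is an assumption chosen because it yields scores between 0 and 1; the listing does not say which metric Matchbox uses. The function names and the `threshold` parameter are hypothetical.

```python
from itertools import combinations


def normalize(hist):
    """Scale a histogram of visual-word counts so its bins sum to 1."""
    total = sum(hist)
    if total == 0:
        return [0.0] * len(hist)
    return [count / total for count in hist]


def similarity(hist_a, hist_b):
    """Histogram intersection: 0 (no shared visual words) to 1 (same distribution).

    Assumption: this is an illustrative metric, not necessarily the one Matchbox uses.
    """
    a, b = normalize(hist_a), normalize(hist_b)
    return sum(min(x, y) for x, y in zip(a, b))


def find_duplicates(histograms, threshold=0.9):
    """Return (name_a, name_b, score) for each pair scoring at or above threshold."""
    results = []
    for (name_a, ha), (name_b, hb) in combinations(sorted(histograms.items()), 2):
        score = similarity(ha, hb)
        if score >= threshold:
            results.append((name_a, name_b, score))
    # Highest-scoring candidate pairs first.
    return sorted(results, key=lambda r: r[2], reverse=True)
```

Calling `find_duplicates` with a dictionary that maps document names to histograms returns the candidate pairs, highest score first, which could then be written to a text file one pair per line.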

Workflow MatchboxHadoopAPI (1)

The workflow MatchboxHadoopApi.t2flow enables use of the Matchbox tool on Hadoop with Taverna. The workflow is based on Python scripts and the Hadoop Streaming API, included in the "pythonwf" folder of the pc-qa-matchbox project on GitHub (https://github.com/openplanets/scape/tree/master/pc-qa-matchbox/hadoop/pythonwf). For this workflow we assume that the digital collection is located on HDFS and that we have a list of input files in the format "hdfs:///user/training/collection/00000032.jp2" - one ro...

Created: 2013-11-05

Credits: User Roman, Network-member SCAPE
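
As a rough illustration of how a Hadoop Streaming mapper might consume the input list described for MatchboxHadoopAPI above, the sketch below reads HDFS paths from an iterable of lines and emits tab-separated key/value records. It assumes the truncated description means one path per line. The function names and the choice of file name as key are hypothetical and are not taken from the pythonwf scripts.

```python
import sys
from urllib.parse import urlparse


def parse_input_line(line):
    """Return (hdfs_path, file_name) for one input row, or None if it is blank or not an hdfs:// URI."""
    path = line.strip()
    if not path:
        return None
    parsed = urlparse(path)
    if parsed.scheme != "hdfs":
        return None
    file_name = parsed.path.rsplit("/", 1)[-1]
    return path, file_name


def mapper(lines, out=sys.stdout):
    """Emit 'file_name<TAB>hdfs_path' per valid row, the key/value layout Hadoop Streaming expects."""
    for line in lines:
        record = parse_input_line(line)
        if record is not None:
            hdfs_path, file_name = record
            out.write("%s\t%s\n" % (file_name, hdfs_path))
```

In a streaming job the script would call `mapper(sys.stdin)`, since Hadoop Streaming feeds each input split to the mapper on standard input.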
