Semantic clustering (with alpha-clustering) of SPARQL query results over RDF version of videolectures.net dataset

Created: 2012-01-29 15:52:57      Last updated: 2012-01-30 16:10:25

The workflow uses RapidMiner extension named RMonto (http://semantic.cs.put.poznan.pl/RMonto/) to perform clustering of SPARQL query results based on chosen semantic similarity measure. The measure used in this particualr workflow is a kernel that exploits membership of clustered individuals to OWL classes from a background ontology ("Epistemic" kernel from [1]). Since the semantics of the backgound ontology is used in this way, we use the name "semantic clustering". This particular kernel is based on the commitee of features (ontology classes) that are specified in the paremeter of "Create TBox features" operator. The SPARQL query is entered in a parameter of "SPARQL selector" operator. The clustering operator (alpha-clustering) allows to specify which of the query variables are to be used as clustering criteria. If more than one variable is used than the results are clustered in the way that kind of multifaceted hierarchy is computed over them, which is dynamically determined based on the membership of individuals to OWL classes. More on such functionality may be found in [3]. Operator "Build knowledge base" is responsible for collecting data either from OWL files or SPARQL endpoints or RDF repositories and provide it to the subsequent operators in a workflow. In this workflow, this operator retrieves the data from a Sesame/OWLIM repository whose address is given as the operator's parameter. The repository stores the Digital Multimedia Repositories Ontology (DMRO), and DMRO-based RDF version of videolectures.net dataset (see: http://www.e-lico.eu/?q=node/288, http://www.ecmlpkdd2011.org/challenge.php).

[1] C. d'Amato, N. Fanizzi, F. Esposito.  Non-parametric Statistical Learning Methods for Inductive Classifiers in Semantic Knowledge Bases.   ICSC 2008, IEEE International Conference on Semantic Computing, August 4-7, 2008, Santa Clara, CA, USA.

[2] G. Antzoulatos, M. N. Vrahatis: Alpha-Clusterable Sets. ECML/PKDD (1) 2011: 108-123

[3] Lawrynowicz A., Potoniec J., Konieczny L., Madziar M., Nowak A. and K. T. Pawlak:ASPARAGUS - A System for Automatic SPARQL Query Results Aggregation Using Semantics. In ICCCI (1). Vol. 6922 of LNCS, Springer, (2011): 304-313

Information Preview

Information Run

Not available


Information Workflow Components

Unavailable

Information Workflow Type

RapidMiner

Information Uploader

Information License

All versions of this Workflow are licensed under:

Information Version 1 (of 1)

Information Credits (0)

(People/Groups)

None

Information Attributions (0)

(Workflows/Files)

None

Information Tags (8)

Log in to add Tags

Information Shared with Groups (2)

Information Featured In Packs (1)

Log in to add to one of your Packs

Information Attributed By (0)

(Workflows/Files)

None

Information Favourited By (0)

No one

Information Statistics

 

Citations (0)

None


Version History

In chronological order:



Reviews Reviews (0)

No reviews yet

Be the first to review!



Comments Comments (1)

Log in to make a comment

  • Monday 15 May 2017 12:32:21 (UTC)

    This experiment cannot be run in the current version of Rapidminer with ontology extension.




Workflow Other workflows that use similar services (0)

There are no workflows in myExperiment that use similar services to this Workflow.