Semantic clustering (with alpha-clustering) of SPARQL query results over RDF version of videolectures.net dataset
The workflow uses RapidMiner extension named RMonto (http://semantic.cs.put.poznan.pl/RMonto/) to perform clustering of SPARQL query results based on chosen semantic similarity measure. The measure used in this particualr workflow is a kernel that exploits membership of clustered individuals to OWL classes from a background ontology ("Epistemic" kernel from [1]). Since the semantics of the backgound ontology is used in this way, we use the name "semantic clustering". This particular kernel is based on the commitee of features (ontology classes) that are specified in the paremeter of "Create TBox features" operator. The SPARQL query is entered in a parameter of "SPARQL selector" operator. The clustering operator (alpha-clustering) allows to specify which of the query variables are to be used as clustering criteria. If more than one variable is used than the results are clustered in the way that kind of multifaceted hierarchy is computed over them, which is dynamically determined based on the membership of individuals to OWL classes. More on such functionality may be found in [3]. Operator "Build knowledge base" is responsible for collecting data either from OWL files or SPARQL endpoints or RDF repositories and provide it to the subsequent operators in a workflow. In this workflow, this operator retrieves the data from a Sesame/OWLIM repository whose address is given as the operator's parameter. The repository stores the Digital Multimedia Repositories Ontology (DMRO), and DMRO-based RDF version of videolectures.net dataset (see: http://www.e-lico.eu/?q=node/288, http://www.ecmlpkdd2011.org/challenge.php).
[1] C. d'Amato, N. Fanizzi, F. Esposito. Non-parametric Statistical Learning Methods for Inductive Classifiers in Semantic Knowledge Bases. ICSC 2008, IEEE International Conference on Semantic Computing, August 4-7, 2008, Santa Clara, CA, USA.
[2] G. Antzoulatos, M. N. Vrahatis: Alpha-Clusterable Sets. ECML/PKDD (1) 2011: 108-123
[3] Lawrynowicz A., Potoniec J., Konieczny L., Madziar M., Nowak A. and K. T. Pawlak:ASPARAGUS - A System for Automatic SPARQL Query Results Aggregation Using Semantics. In ICCCI (1). Vol. 6922 of LNCS, Springer, (2011): 304-313
Preview
Run
Not available
Workflow Components
Unavailable
Reviews (0)
Other workflows that use similar services (0)
There are no workflows in myExperiment that use similar services to this Workflow.
Comments (1)
Log in to make a comment
This experiment cannot be run in the current version of Rapidminer with ontology extension.