Difference between revisions of "Thesaurus"

From OpenKM Documentation

Latest revision as of 14:07, 12 February 2014

A thesaurus is a book that lists words grouped together according to similarity of meaning (containing synonyms and sometimes antonyms), in contrast to a dictionary, which contains definitions and pronunciations.


Warning: Thesaurus support is only available starting from OpenKM 5.0.

In Information Science, Library Science, and Information Technology, specialized thesauri are designed for information retrieval. They are a type of controlled vocabulary used for indexing or tagging. For more information about the term, see http://en.wikipedia.org/wiki/Thesaurus.


Note: OpenKM can work with thesauri defined in the .owl and .rdfs formats.

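Protégé (http://protege.stanford.edu/) is a useful tool for creating and maintaining your own thesaurus. Subject-specific thesauri can also be found on the Internet, for example Agrovoc from the FAO (http://aims.fao.org/website/AGROVOC-Thesaurus/sub), the NASA Thesaurus (http://www.sti.nasa.gov/thesfrm1.htm), the UNESCO Thesaurus (http://www2.ulcc.ac.uk/unesco/), and the Public Administration Thesaurus (http://redined.r020.com.ar/es/index.php?tema=55), among others. Some of them are free and some have usage restrictions.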

To configure OpenKM to use a thesaurus, set the following properties in OpenKM.cfg:

kea.thesaurus.owl.file
kea.thesaurus.base.url
kea.thesaurus.tree.root
kea.thesaurus.tree.childs
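
A quick way to confirm that all four properties are present is a small check like the one below. This is a minimal sketch that assumes the configuration can be parsed as a standard Java properties file; the class ThesaurusConfigCheck and its method are hypothetical and not part of OpenKM.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;
import java.util.Properties;

// Hypothetical helper for illustration only; not part of OpenKM.
public class ThesaurusConfigCheck {
    // The four property names listed above.
    static final String[] REQUIRED_KEYS = {
        "kea.thesaurus.owl.file",
        "kea.thesaurus.base.url",
        "kea.thesaurus.tree.root",
        "kea.thesaurus.tree.childs"
    };

    // Returns the required keys that are absent or blank in the given configuration text.
    static List<String> missingKeys(String configText) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(configText));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        List<String> missing = new ArrayList<>();
        for (String key : REQUIRED_KEYS) {
            String value = props.getProperty(key);
            if (value == null || value.trim().isEmpty()) {
                missing.add(key);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Sample content using the placeholder values from this page;
        // the two query properties are deliberately left out.
        String sample = "kea.thesaurus.owl.file=vocabulary/file.owl\n"
                      + "kea.thesaurus.base.url=http://www.someweb.org\n";
        System.out.println("Missing: " + missingKeys(sample));
    }
}
```

With the sample text in main, the two query properties are reported as missing; an empty list means all four are set.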

Set OWL file

kea.thesaurus.owl.file=vocabulary/file.owl

Set base URL

kea.thesaurus.base.url=http://www.someweb.org

Query to get root node

kea.thesaurus.tree.root=SELECT DISTINCT UID, TEXT FROM {UID} Y {OBJECT}, {UID} rdfs:label {TEXT} ; [rdfs:subClassOf {CLAZZ}] where not bound(CLAZZ)
 and lang(TEXT)="en" USING NAMESPACE foaf=<http://xmlns.com/foaf/0.1/>, dcterms=<http://purl.org/dc/terms/>, 
 rdf=<http://www.w3.org/1999/02/22-rdf-syntax-ns#>, owl=<http://www.w3.org/2002/07/owl#>, rdfs=<http://www.w3.org/2000/01/rdf-schema#>,
 skos=<http://www.w3.org/2004/02/skos/core#>, dc=<http://purl.org/dc/elements/1.1/>
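
The query above is wrapped across several lines for readability. If the configuration is read as a standard Java properties file (an assumption here, not something this page states), a value that spans several lines needs a trailing backslash on every line except the last; without it, the value ends at the first line break. The sketch below illustrates that behaviour with java.util.Properties, using a shortened stand-in for the query and a hypothetical class name.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Properties;

// Hypothetical demo class; not part of OpenKM.
public class ContinuationDemo {
    // Parses the text as Java properties and returns the root query value.
    static String rootQuery(String configText) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(configText));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return props.getProperty("kea.thesaurus.tree.root");
    }

    public static void main(String[] args) {
        // Trailing backslash: the wrapped lines are joined into one value.
        String continued = "kea.thesaurus.tree.root=SELECT DISTINCT UID, TEXT \\\n"
                         + " FROM {UID} rdfs:label {TEXT}\n";
        // No backslash: the second line is parsed as a separate entry.
        String broken = "kea.thesaurus.tree.root=SELECT DISTINCT UID, TEXT\n"
                      + " FROM {UID} rdfs:label {TEXT}\n";
        System.out.println(rootQuery(continued));
        System.out.println(rootQuery(broken));
    }
}
```

Keeping each query on a single line avoids the issue entirely.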

Query to get child nodes

kea.thesaurus.tree.childs=SELECT DISTINCT UID, TEXT FROM {UID} rdfs:subClassOf {CLAZZ}, {UID} rdfs:label {TEXT} where xsd:string(CLAZZ) =
 "RDFparentID" and lang(TEXT)="en" USING NAMESPACE foaf=<http://xmlns.com/foaf/0.1/>, dcterms=<http://purl.org/dc/terms/>, 
 rdf=<http://www.w3.org/1999/02/22-rdf-syntax-ns#>, owl=<http://www.w3.org/2002/07/owl#>, rdfs=<http://www.w3.org/2000/01/rdf-schema#>,
 skos=<http://www.w3.org/2004/02/skos/core#>, dc=<http://purl.org/dc/elements/1.1/>
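
The literal "RDFparentID" in the child query reads like a placeholder for the identifier of the node whose children are requested. This page does not say so explicitly, so treat it as an assumption. Under that assumption, the substitution could be sketched as follows; the class name, method, and sample identifier are all hypothetical.

```java
// Hypothetical demo class; not part of OpenKM.
public class ChildQueryDemo {
    // Placeholder text as it appears in the child query above.
    static final String PLACEHOLDER = "RDFparentID";

    // Returns the query with the placeholder replaced by the parent node's identifier.
    static String childQueryFor(String template, String parentId) {
        return template.replace(PLACEHOLDER, parentId);
    }

    public static void main(String[] args) {
        // Shortened version of the child query (namespace declarations omitted).
        String template = "SELECT DISTINCT UID, TEXT FROM {UID} rdfs:subClassOf {CLAZZ}, "
            + "{UID} rdfs:label {TEXT} where xsd:string(CLAZZ) = \"RDFparentID\" "
            + "and lang(TEXT)=\"en\"";
        // Made-up parent identifier, for illustration only.
        System.out.println(childQueryFor(template, "http://www.someweb.org#c_1234"));
    }
}
```

If the assumption holds, the placeholder text must stay exactly as written when the query is adapted to another thesaurus.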

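You may also be interested in:

* A Method for Converting Thesauri to RDF/OWL: http://thesauri.cs.vu.nl/
* OWL to SKOS online converter: http://owl.cs.manchester.ac.uk/owltoskos/
* SKOS to OWL online converter: http://www.ebusiness-unibw.org/tools/skos2owl/
* SKOS Simple Knowledge Organization System: http://www.w3.org/2004/02/skos/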

Also see the "Thesaurus full example" page for a step-by-step guide.