2019-03-12
Google KG: Entity Pages, Disambiguation
external-id
props are an excellent source (over 3700)The Linked Open Data (LOD) Cloud contained 28 datasets in 2007
16136 links between datasets, 30B triples
Number of Creative Works and Cultural Institutions in Wikidata:
skos:Concept
)Frоm Getty LOD - “Excel-driven Ontology Generation™”
Example: PubMed Field Statistics
Pros:
Cons:
dct:
property is a subprop of dc:
, and such inference may be useless to youHow to do ontology engineering?
Onto ETL tools evaluation
TODO links
name | language | playground, source, distribution |
---|---|---|
shex.js | js | http://rawgit.com/shexSpec/shex.js/master/doc/shex-simple.html, https://github.com/shexSpec/shex.js/ |
ShEx NPM | js | https://www.npmjs.com/package/shex |
ShEx-validator | js | https://github.com/HW-SWeL/ShEx-validator |
Validata | js | http://hw-swel.github.io/Validata/, https://www.w3.org/2015/03/ShExValidata/, https://github.com/HW-SWeL/Validata |
ShExJava | java | http://shexjava.lille.inria.fr/, https://github.com/iovka/shex-java, https://gforge.inria.fr/projects/shex-impl/ |
RDFShape, ShaclEx | scala | http://rdfshape.weso.es/, http://shaclex.herokuapp.com/, https://github.com/labra/rdfshape, https://github.com/labra/shaclex |
TrucHLe | scala | https://github.com/TrucHLe/SHACL |
PyShEx | python | https://github.com/hsolbrig/PyShEx |
shex.rb | ruby | https://github.com/ruby-rdf/shex |
ShExkell | haskell | https://github.com/weso/shexkell |
Semantic enrichment is the adding of value to a dataset by increasing the amount of queryable information it contains and/or decreasing the data’s noisiness. This is done in several ways:
Redundant data is deduplicated and fused to produce a single master dataset without conflicts. selection of representative single fields (eg logo), * accumulation of multiple fields (eg names, transactions), * aggregation of summary fields (eg count or total amount)