KnolBase is a federated knowledge base that contains approximately two billion
facts about 100 million entities.
KnolBase uses the in-memory SLING frame store for representing knowledge.
Every day a new knowledge base is built from a Wikidata live-feed as well as
a number of other information sources:
Wikipedia (aliases, photos, infoboxes, summaries, categories, popularity, 30 languages)
KnolBase is built nightly using a pipeline similar to the one descriped
but it uses the reconcile_items task instead of fuse_items to
build the knowledge base.
This requires a number of additional data set to be build from external sources,
making this much more complex to run. Instead of building this yourself,
it is recommened downloading the prebuilt KnolBase knowledge base from
ringgaard.com using this command:
sling fetch --dataset kb,nametab,phrasetab
You can use the knowledge base browser for viewing the following information
in the knowledge base:
Item id, name, and description (click the Wikidata QID to navigate to the Wikidata page for the item)
Summary from Wikipedia (click on the Wikipedia link to navigate to Wikipedia page)
Item properties (click property value to navigate to item)
Photos (click on photo to view gallery)
Cross references (click reference value to visit external site)
If you type or paste a name or alias for an entity in the search
field, it shows the items matching the query. Clicking an item from the
search list will show the item, or press Enter to show the first item in the
The following types of searches can be performed:
By default, the browser do prefix matching finding items with names or aliases
that starts with the query string.
If the query ends with period, only items with names or aliases that completely
match the query string are shown.
You can do keyword matching by adding a question mark to the end of the query.
This searches for matches in the names, aliases, description, summary, and a
number of other properties.
Keyword search also searches for offline items, e.g. articles, genes,
chemical compounds, stars, and galaxies.
The order of the search terms are not important.
Only items matching all search terms (except for stopwords) will be matched.
Keyword search also searches over a number of different languages, currently
English, Danish, German, French, Spanish, and Italian.
You can search for items by their item id(s), e.g. Wikidata QID. An item can have
multiple ids, if it has been merged during the reconciliation process.
Items can be looked up by their external ids. These are identifiers for the
item in other systems. If an item does not have a Wikidata QID, it is
assigned ids based on its external ids, e.g. an item with CVR number 41641908
will have P1059/41641908 as its id.
You can see the current list of indexed identifiers here.
You can also use mnemonics to search cross-referenced properties,
e.g. searching for "cvr:41641908" will find the item with CVR number 41641908.
W3C Reconciliation Service API
KnolBase has a W3C Reconciliation Service API
so it can be used as a reconciliation service for OpenRefine by
specifying https://ringgaard.com/reconcile/ as the service endpoint.
The KnolBase reconciliation service is much faster than the default Wikibase reconciler
because of the SLING in-memory architecture.
The KnolBase reconciliation service supports the following APIs:
Reconciliation service (matching names to items in the knowledge base)
Preview service (preview panels for items)
Suggest service (suggest entities, types, and properties)
Data Extension service (extend rows with data from the knowledge base)