Why Star Wars in OWL?

OWL

OWL is the w3c standard for building ontologies for the Semantic Web.

An ontology is a knowledge model, containing classes, perhaps instances in a domain, and the properties that relate them.

The idea of the Semantic Web is to have a Worldwide web of data, traversable and computable by computers. An alternative term is “Linked data”.

OWL was developed using “Description Logics” that can be computed over, giving us the benefits of inference and reasoning. In OWL DL, all assertions are logical axioms that have a well understood formal meaning.

It brings with it many other benefits, on top of being a well understood standard.

Inference

We can use a tool known as a reasoner to infer knowledge about our Universe.

Simple type inference

One of the simplest examples of this is type inference based on the transitivity of subclasses.

The Millenium Falcon is a YT-1300f_Light_Freighter
YT-1300f_Light_Freighter is a subclass of Freighter
Freighter is a subclass of Ship
Ship is a subclass of Vehicle ... etc

Which we can use when reasoning about (querying) Vehicles,

Vehicle and ownedBy value Han_Solo

result includes the Millenium Falcon.

This works everywhere, within class descriptions

Tobias_Beckett -> hadRole some Thief
Thief is a subclass of Criminal

So we can ask for all Criminals

hadRole some Criminal

results include Beckett, thieves, smugglers, pirates and more

This gives us some freedom in modelling - see Specificity below

Property inference

Subproperties

Exactly the same as classes, properties are arranged in a hierarchy.

eg an event in which people have different outcomes:

Boba_vs_Bib: Murder
    of Bib_Fortuna
    victoryOf Boba_Fett
    witnessedBy Fennec_Shand

We can ask who were the participants in this event:

participatedIn value Boba_vs_Bib

result includes all 3 because of, victoryOf and witnessedBy are all subproperties of participant (and participatedIn is the inverse of participant)

Transitivity

Transitivity is very useful for spatial, temporal and other properties.

eg. Who was killed in the Mid Rim?

inverse(of) some (Killing and during some (at value Mid_Rim))

results includes Qui-Gon Jinn who was killed in Theed, on Naboo, in the Mid Rim

eg. Who is related to Kylo Ren?

relatedTo value Kylo_Ren

results includes Shmi

Consistency checking

Assertions must be consistent with the rest of the model.

eg There can be no individual Female Crolute as all of the species are Male.

The reasoner lets us prove this.

Crolute and (gender some Female)

Equivalent to owl:Nothing, which is the empty set - ie no individuals could be a member of this class.

If we created a Female Crolute, it would make the ontology inconsistent.

Open World

This is an ongoing story with gaps being filled all the time.

OWL implements an open world model where anything can be true until there is a statement that makes that impossible.

For example, 2 individuals could be the same unless we make them distinct explicitly or by inference (OWL does not make the “Unique Name Assumption”).

Unique Name Assumption (not made in OWL)

It feels odd to say Darth Vader won a podrace on Tatooine. It’s more intuitive to separate the two personas and talk about them in their own contexts. Then OWL allows us to state that Vader and Anakin are the same individual:

Darth_Vader hadRole some Dark_Lord
Capture_of_Tantive_IV participant Darth_Vader

Anakin_Skywalker hadRole some Jedi_Knight
Boonta_Eve_Classic_32BBY victoryOf Anakin_Skywalker

Darth_Vader sameAs Anakin_Skywalker

Disjoints

By default, OWL allows individuals to be members of any number of Classes. Disjoints allow us to assert that members of one Class cannot be members of another Class. This can assist with consistency checking - helpful in catching modelling errors. But, there are also useful queries we might want to make that require disjoints to work, eg:

(hadRole some Jedi) and not(Human)

results include Aayla_Secura

This only works because there are disjoints between Living_Things (at each level below that including Humanoids)

Negative assertions

We assert those that we know to be force sensitive so we can ask:

Human and (connectedTo some The_Force)

But, if we ask for those that aren’t Force sensitive:

Human and not (connectedTo some The_Force)

We do not get any results.

This is because in OWL we would have to assert that someone was not force sensitive - just omitting the information about them being force sensitive is not enough.

In this Universe, it would be quite strong to assert someone was not force sensitive

at what level is someone “sensitive”?
it might just not be apparent within the stories we know
or intentionally hidden by an individual

So, knowing someone is not force sensitive is not the same as not knowing if they are force sensitive (maybe they don’t know themselves)

It can still be useful to ask “who is not known to be force sensitive?”. Especially as a check during authoring. We can use set arithmetic to ask this question as 2 queries:

1) instance of Living_thing 2) instances of connectedTo some The_Force

Then, we simply remove all results of query 2 from the results of query 1.

results

Or Events C-3PO was without R2-D2

Properties create a network or graph

Any property can be navigated as a graph, whether or not they are transitive.

eg Event sequence (after/before) and containment (during)

Specificity

We have ways of modelling that can leave out unknown details or keep our level of commitment low.

Choosing a level of commitment

Classification allows control over how specific we wish to be:

eg Killing and (of some (hadRole some Fighter / Trooper / Stormtrooper / Scout Trooper))

Naming

There is no need to name everything:

eg - we can talk about Rey’s parents without having to name them.

Rey -> hadFather some ( Human and wasCloneOf value Darth_Sidious)

Ranges

eg People can be born in a range of years - eg Din Djarin was born before 19BBY

Din -> born some int [<="-19"(int)]

Modularity

An ontology built in modules allows:

focused engineering - keep parts of each story together in one place - eg rogue one
optimisation - expensive modelling/reasoning can be separated off - eg hasPart, hasTrait
layering of knowledge - detail can be moved into other modules - eg manufacturers
visibility - out of Universe or non-public knowledge can be in their own models - eg behind-the-scenes.owl.ttl
extensibility - responsibility for content can be distributed to other fans

There is a tradeoff with the fluidity of developing one ontology

See modularisation for more details.

Referencing

We can reference any number of sources to our entities/assertions - currently Wookipedia

We could reference images/videos - many ontology tools recognise common formats eg jpg. But, we would need permissions on images and hosting or hot linking etc so this is out of scope.

star-wars-ontology

An OWL ontology describing events, characters and places in the Star Wars Universe