AI Knowledge Representation Guide
AI Knowledge Representation Guide
1. Declarative Knowledge:
o Declarative knowledge is to know about something.
AI knowledge cycle:
An Artificial intelligence system has the following components for displaying intelligent behavior:
o Perception
o Learning
o Knowledge Representation and Reasoning
o Planning
The above diagram is showing how an AI system can interact with the real world and what components help it to show
intelligence. AI system has Perception component by which it retrieves information from its environment. It can be visual,
audio or another form of sensory input. The learning component is responsible for learning from data captured by
Perception comportment. In the complete cycle, the main components are knowledge representation and Reasoning.
These two components are involved in showing the intelligence in machine-like humans. These two components are
independent with each other but also coupled together. The planning and execution depend on analysis of Knowledge
representation and reasoning.
Approaches to knowledge representation:
There are mainly four approaches to knowledge representation, which are givenbelow:
1. Simple relational knowledge:
o It is the simplest way of storing facts which uses the relational method, and each fact about a set of the object is
set out systematically in columns.
o This approach of knowledge representation is famous in database systems where the relationship between
different entities is represented.
o This approach has little opportunity for inference.
Example: The following is the simple relational knowledge representation.
Player1 65 23
Player2 58 18
Player3 75 24
2. Inheritable knowledge:
o In the inheritable knowledge approach, all data must be stored into a hierarchy of classes.
o All classes should be arranged in a generalized form or a hierarchal manner.
o In this approach, we apply inheritance property.
o Elements inherit values from other members of a class.
o This approach contains inheritable knowledge which shows a relation between instance and class, and it is called
instance relation.
o Every individual frame can represent the collection of attributes and its value.
o In this approach, objects and values are represented in Boxed nodes.
o We use Arrows which point from objects to their values.
o Example:
3. Inferential knowledge:
o Inferential knowledge approach represents knowledge in the form of formal logics.
o This approach can be used to derive more facts.
o It guaranteed correctness.
o Example: Let's suppose there are two statements:
a. Marcus is a man
b. All men are mortal
Then it can represent as;
man(Marcus)
∀x = man (x) ----------> mortal (x)s
4. Procedural knowledge:
o Procedural knowledge approach uses small programs and codes which describes how to do specific things, and
how to proceed.
o In this approach, one important rule is used which is If-Then rule.
o In this knowledge, we can use various coding languages such as LISP language and Prolog language.
o We can easily represent heuristic or domain-specific knowledge using this approach.
o But it is not necessary that we can represent all cases in this approach.
Requirements for knowledge Representation system:
A good knowledge representation system must possess the following properties.
1. Representational Accuracy:
KR system should have the ability to represent all kind of required knowledge.
2. Inferential Adequacy:
KR system should have ability to manipulate the representational structures to produce new knowledge
corresponding to existing structure.
3. Inferential Efficiency:
The ability to direct the inferential knowledge mechanism into the most productive directions by storing
appropriate guides.
4. Acquisitional efficiency- The ability to acquire the new knowledge easily using automatic methods.
Techniques of knowledge representation
There are mainly four ways of knowledge representation which are given as follows:
1. Logical Representation
2. Semantic Network Representation
3. Frame Representation
4. Production Rules
1. Logical Representation
Logical representation is a language with some concrete rules which deals with propositions and has no ambiguity in
representation. Logical representation means drawing a conclusion based on various conditions. This representation lays
down some important communication rules. It consists of precisely defined syntax and semantics which supports the
sound inference. Each sentence can be translated into logics using syntax and semantics.
Syntax:
o Syntaxes are the rules which decide how we can construct legal sentences in the logic.
o It determines which symbol we can use in knowledge representation.
o How to write those symbols.
Semantics:
o Semantics are the rules by which we can interpret the sentence in the logic.
o Semantic also involves assigning a meaning to each sentence.
Logical representation can be categorized into mainly two logics:
Propositional Logics
Predicate logics
Advantages of logical representation:
1. Logical representation enables us to do logical reasoning.
2. Logical representation is the basis for the programming languages.
Disadvantages of logical Representation:
1. Logical representations have some restrictions and are challenging to work with.
2. Logical representation technique may not be very natural, and inference may not be so efficient.
2. Semantic Network Representation
Semantic networks are alternative of predicate logic for knowledge representation. In Semantic networks, we can
represent our knowledge in the form of graphical networks. This network consists of nodes representing objects and arcs
which describe the relationship between those objects. Semantic networks can categorize the object in different forms
and can also link those objects. Semantic networks are easy to understand and can be easily extended.
This representation consist of mainly two types of relations:
IS-A relation (Inheritance)
Kind-of-relation
Example: Following are some statements which we need to represent in the form of nodes and arcs.
Statements:
a. Jerry is a cat.
b. Jerry is a mammal
c. Jerry is owned by Priya.
d. Jerry is brown colored.
In the above diagram, we have represented the different type of knowledge in the form of nodes and arcs. Each object is
connected with another object by some relation.
Drawbacks in Semantic representation:
1. Semantic networks take more computational time at runtime as we need to traverse the complete network tree to
answer some questions. It might be possible in the worst case scenario that after traversing the entire tree, we
find that the solution does not exist in this network.
2. Semantic networks try to model human-like memory (Which has 1015 neurons and links) to store the information,
but in practice, it is not possible to build such a vast semantic network.
3. These types of representations are inadequate as they do not have any equivalent quantifier, e.g., for all, for
some, none, etc.
4. Semantic networks do not have any standard definition for the link names.
5. These networks are not intelligent and depend on the creator of the system.
Advantages of Semantic network:
1. Semantic networks are a natural representation of knowledge.
2. Semantic networks convey meaning in a transparent manner.
3. These networks are simple and easily understandable.
3. Frame Representation
A frame is a record like structure which consists of a collection of attributes and its values to describe an entity in the
world. Frames are the AI data structure which divides knowledge into substructures by representing stereotypes
situations. It consists of a collection of slots and slot values. These slots may be of any type and sizes. Slots have names
and values which are called facets.
Facets: The various aspects of a slot is known as Facets. Facets are features of frames which enable us to put constraints
on the frames. Example: IF-NEEDED facts are called when data of any particular slot is needed. A frame may consist of any
number of slots, and a slot may include any number of facets and facets may have any number of values. A frame is also
known as slot-filter knowledge representation in artificial intelligence.
Frames are derived from semantic networks and later evolved into our modern-day classes and objects. A single frame is
not much useful. Frames system consist of a collection of frames which are connected. In the frame, knowledge about an
object or event can be stored together in the knowledge base. The frame is a type of technology which is widely used in
various applications including Natural language processing and machine visions.
Example: 1
Let's take an example of a frame for a book
Slots Filters
Year 1996
Page 1152
Example 2:
Let's suppose we are taking an entity, Peter. Peter is an engineer as a profession, and his age is 25, he lives in city London,
and the country is England. So following is the frame representation for this:
Slots Filter
Name Peter
Profession Doctor
Age 25
Weight 78
Example:
Z
Father Parent
X Y
Gfather
Example:
male
Isa Isa
X human
Isa
female
The inference that “for every action of giving, there is an action of taking”in clausal
logic written as
Action
E take
ESNet
Action
E give
The inference rule such as “an actor of taking action is also the recipientof the action”
can be easily represented in clausal logic as:
M.Ramakoteswara
– Here E Rao
is aM.Tech(CSE)
variable representing an event where an action of taking is Page 9
happening.
Artificial Intelligence Unit-4
ESNet
The inference rule such as “an actor of taking action is also the recipientof the action”
can be easily represented in clausal logic as:
– Here E is a variable representing an event where an action of taking is
happening.
– Recipient(E, Y) Acton(E,
Action take), Actor (E, Y)
ESNet E take
Recipient
Actor
Example 2:
e E Recipient
Actor Action Actor
Action
john take Y
Contradiction
Part_of
P X
Isa
Part_of
Y
Deduction in ESNet
CD representation of a sentence is not built using words in the sentencerather built using
conceptual primitives which give the intended meanings of words.
CD provides structures and specific set of primitives from whichrepresentation can be built.
Primitive Acts of CD theory
● ATRANS Transfer of an abstract relationship (i.e. give)
● PTRANS Transfer of the physical location of an object (e.g., go)
● PROPEL Application of physical force to an object (e.g. push)
● MOVE Movement of a body part by its owner (e.g. kick)
● GRASP Grasping of an object by an action (e.g. throw)
● INGEST Ingesting of an object by an animal (e.g. eat)
● EXPEL Expulsion of something from the body of an animal
(e.g. cry)
It should be noted that this representation is same for different sayingwith same
meaning. For example
Few conventions
● Arrows indicate directions of dependency
● Double arrow indicates two way link between actor and action.
Rule 5: PP PA
● It describes the relationship between a PP and a PA that is asserted to
describe it.
– PA represents states of PP such as height, health etc.
● Example: John is fat
CD Rep John weight (> 80)
Rule 6: PP PA
● It describes the relationship between a PP and an attribute that already has
been predicated of it.
– Direction is towards PP being described.
– Example: Smart John
CD Rep John smart
It describes the relationship between one conceptualization and anotherthat causes it.
General inferences are stored with each primitive Act thus reducing the number of
inferences that need to be stored explicitly with each concept.
For example, from a sentence “John killed Mike”, we can infer that “Mikeis dead”.
Example: The verbs {give, take, steal, donate} involve a transfer ofownership of an object.
– If any of them occurs, then inferences about who now has theobject and who once had the
object may be important.
– In a CD representation, these possible inferences can be statedonce and associated with the
primitive ACT “ATRANS”.
– Sentence interpretation is that Bill informed John that he (Bill) willdo something to break
john’s nose.
– Bill did (said) so in order that John will believe that if he (john) does some other thing
(different from what Bill wanted) then Billwill break John’s nose.
Rules are to be carefully designed for each primitive action in order toobtain semantically
correct interpretation.
Many verbs may fall under different primitive ACTs, and it becomesdifficult to find correct
primitive in the given context.
The CD representation becomes complex requiring lot of storage formany simple actions.
For example, the sentence “John bet Mike that Indian cricket team willwin incoming
world cup” will require huge CD structure.
Conceptual P ar s i n g
Conceptual parsing is required for generating CD representation fromsource sentences in
natural language.
– Syntactic processor extracts main verb and noun along with syntactic category of the verb
(transitive or intransitive) from the sentence.
– Once the correct entry from dictionary is chosen, CD processor analyses the rest of sentence
Rule 1: If PP in ‘with PP’ phrase is non-animate and CD Act requires instrument then the
sentence is of Type1, where PP (hammer) is resolved to instrument.
Rule 2: If PP in ‘with PP’ phrase is animate and CD Act requires instrument then the
sentence is of Type2, where PP (Mike) is resolved as co-actor.
Case2: If PPs in both the sentences are non-animate, then they have to be resolved using
semantic lexicon.
Such association of word senses could be found in Word-Net and then disambiguation is
possible.
Case 3: If PPs in the sentences are animate, then they have to be resolved using semantic
lexicon and context.
In these sentences, Mike, butterflies and dog are animate PPs and can be resolved as
follows:
– Mike is easily resolved to co-actor of John as both are human and have
similar characteristics
Word-Net can be used to check if butterfly and garden has some commonsense.
The scripts are useful in describing certain stereotyped situations such as going to theater
It consists of set of slots containing default values along with some information about the type
of values similar to frames.
It differs from FS as the values of the slots in scripts must be ordered and have more
specialized roles.
In real world situations, we see that event tends to occur in known patterns because of
clausal relationship to the occurrence of events
Script Components:
Each script contains the following main components.
– Entry Conditions: Must be satisfied before events in the script can occur.
– Track: Specific variation on more general pattern in the script. Different tracks may
share many components of the same script but not all.
– Scenes: The sequence of events that occur. Events are represented in conceptual
dependency form.
Script Invocation:
If we have a sentence like “John went to theater to pick his daughter”,then invoking this
script will lead to many wrong answers.
Getting significance from the story is not straightforward. However, someheuristics can be
applied to get the value.
Advantages / Disadvantages of Script
Advantages
Disadvantage
To deal with inflexibility, smaller modules called memory organizationpackets (MOP) can be
combined in a way that is appropriate for the situation.
CYC
What is CYC?
An ambitious attempt to form a very large knowledge base aimed atcapturing
commonsense reasoning.
– Then we (humans) can conclude Napoleon never new that Wellington had died.How do we
do this?
We require special implicit knowledge or commonsense such as:
Brittleness
– Specialized knowledge bases are brittle. Hard to encode new situations and non-graceful degradation in
performance. Commonsense based knowledge bases should have a firmer foundation.
Form and Content
– Knowledge representation may not be suitable for AI. Commonsense strategies could point out
where difficulties in content may affect the form.
Shared Knowledge
– Should allow greater communication among systems with common bases and
assumptions.
How is CYC coded?
By hand.
– LISP like.
– Frame based
– Multiple inheritance
This theory analyzes the surface syntactic structure of sentences by studying the
combination of deep cases (i.e. semantic roles)
For instance, the verb "give" in English requires an Agent (A) and Object (O), and a
Beneficiary (B); e.g. "Jones (A) gave money (O) to the school (B).
According to Fillmore, each verb selects a certain number of deep cases which
form its case frame.
Case frames are subject to certain constraints, such as that a deep case can occur only
once per sentence.
Some of the cases are obligatory and others are optional. Obligatory cases may not be
deleted, at the risk of producing ungrammatical sentences.
Fillmore (1968) puts forwards the following hierarchy for a universal subject
selection rule:
That means that if the case frame of a verb contains an agent, this one is realized as the subject of an
active sentence; otherwise, the deep case following the agent in the hierarchy (i.e. Instrumental) is
promoted to subject.
The influence of case grammar on contemporary linguistics has been significant, to the
extent that numerous linguistic theories incorporate deep roles in one or other form, such
as the so-called Thematic structure in Government and Binding theory. It has also inspired
the development of frame-based representations in AI research
SEMATIC WEB
The Semantic Web is an extension of the World Wide Web through standards set by the
World Wide Web Consortium (W3C).
The goal of the Semantic Web is to make Internet data machine-readable. To enable the
encoding of semantics with the data, technologies such as Resource
Description Framework (RDF) and Web Ontology Language (OWL) are used.
For example, ontology can describe concepts, relationships between entities, and
categories of things.
These embedded semantics offer significant advantages such as reasoning over data and
operating with heterogeneous data sources.
These standards promote common data formats and exchange protocols on the Web,
fundamentally the RDF.
According to the W3C, "The Semantic Web provides a common framework that allows data
to be shared and reused across application, enterprise, and community boundaries."
The Semantic Web is therefore regarded as an integrator across different content and
information applications and systems.
The term was coined by Tim Berners-Lee for a web of data (or data web) that can be
processed by machines—that is, one in which much of the meaning is machine- readable.
While its critics have questioned its feasibility, proponents argue that applications in library
and information science, industry, biology and human sciences research have already proven
the validity of the original concept.
In addition to the classic “Web of documents” W3C is helping to build a technology stack
to support a “Web of data,” the sort of data you find in databases.
The ultimate goal of the Web of data is to enable computers to do more useful work and to
develop systems that can support trusted interactions over the network.
The term “Semantic Web” refers to W3C’s vision of the Web of linked data.
Semantic Web technologies enable people to create data stores on the Web, build
vocabularies, and write rules for handling data.
Linked data are empowered by technologies such as XML, XML Schema, RDF, RDF
Schema, SPARQL, OWL, and SKOS.