Resource Description Framework (RDF) Mod

Resource Description Framework (RDF) Model and Syntax Specification
REC-rdf-syntax-19990222
Resource Description Framework
(RDF) Model and Syntax Specification
W3C Recommendation 22 February 1999
This Version:
Newest Version:
Editors:
Ora Lassila

Nokia Research Center
Ralph R. Swick

World Wide Web Consortium
Document Status
W3C
MIT
INRIA
Keio
), All Rights Reserved. W3C
liability,
trademark
document
use
and
software
licensing
rules apply.
Status of This Document
This document has been reviewed by W3C Members and other interested
parties and has been endorsed by the Director as a
W3C Recommendation
. It is
a stable document and may be used as reference material or cited as
a normative reference from other documents. W3C's role in making the
Recommendation is to draw attention to the specification and to promote
its widespread deployment. This enhances the functionality and
interoperability of the Web.
The list of know errors in this specification is available at
Comments on this specification may be sent to
www-rdf-comments@w3.org
>.
The archive of public comments is available at
Table of Contents
Introduction
Basic RDF
Containers
Statements About Statements
Formal Model for RDF
Formal Grammar for RDF
Examples
Acknowledgements
Appendix A: Glossary
Appendix B: Transporting RDF
Appendix C: Notes about Usage
Appendix D: References
Appendix E: Changes From Previous Version
1. Introduction
The World Wide Web was originally built
for human consumption, and although everything on it is
machine-readable
this data is not
machine-understandable
. It is very hard to automate
anything on the Web, and because of the volume of information the Web contains,
it is not possible to manage it manually. The solution proposed here is
to use
metadata
to describe the data contained on the Web. Metadata
is "data about data" (for example, a library catalog is metadata,
since it describes publications) or specifically in the context of this
specification "data describing Web resources". The distinction
between "data" and "metadata" is not an absolute one;
it is a distinction created primarily by a particular application, and
many times the same resource will be interpreted in both ways simultaneously.
Resource Description Framework (RDF) is a foundation for processing
metadata; it provides interoperability between applications that exchange
machine-understandable information on the Web. RDF emphasizes facilities
to enable automated processing of Web resources. RDF can be used in a variety
of application areas; for example: in
resource discovery
to provide
better search engine capabilities, in
cataloging
for describing
the content and content relationships available at a particular Web site,
page, or digital library, by
intelligent software agents
to facilitate
knowledge sharing and exchange, in
content rating
, in describing
collections of pages
that represent a single logical "document",
for describing
intellectual property rights
of Web pages, and for
expressing the
privacy preferences
of a user as well as the
policies
of a Web site. RDF with
digital signatures
will be
key to building the "Web of Trust" for electronic commerce, collaboration,
and other applications.
This document introduces a model for representing RDF metadata as well
as a syntax for encoding and transporting this metadata in a manner that
maximizes the interoperability of independently developed Web servers and
clients. The syntax presented here uses the Extensible Markup Language
[XML]: one of the goals of RDF is to make it possible to specify semantics
for data based on XML in a standardized, interoperable manner. RDF and
XML are complementary: RDF is a model of metadata and only addresses by
reference many of the encoding issues that transportation and file storage
require (such as internationalization, character sets, etc.). For these
issues, RDF relies on the support of XML. It is also important to understand
that this XML syntax is only one possible syntax for RDF and that alternate
ways to represent the same RDF data model may emerge.
The broad goal of RDF is to define a mechanism for describing resources
that makes no assumptions about a particular application domain, nor defines
(a priori) the semantics of any application domain. The definition of the
mechanism should be domain neutral, yet the mechanism should be suitable
for describing information about any domain.
This specification will be followed by other documents that will complete
the framework. Most importantly, to facilitate the definition of metadata,
RDF will have a class system much like many object-oriented programming
and modeling systems. A collection of classes (typically authored for a
specific purpose or domain) is called a
schema
. Classes are organized
in a hierarchy, and offer extensibility through subclass refinement. This
way, in order to create a schema slightly different from an existing one
it is not necessary to "reinvent the wheel" but one can just
provide incremental modifications to the base schema. Through the sharability
of schemas RDF will support the reusability of metadata definitions. Due
to RDF's incremental extensibility, agents processing metadata will be
able to trace the origins of schemata they are unfamiliar with back to
known schemata and perform meaningful actions on metadata they weren't
originally designed to process. The sharability and extensibility of RDF
also allows metadata authors to use multiple inheritance to "mix"
definitions, to provide multiple views to their data, leveraging work done
by others. In addition, it is possible to create RDF instance data based
on multiple schemata from multiple sources (i.e., "interleaving"
different types of metadata). Schemas may themselves be written in RDF;
a companion document to this specification,
RDFSchema
], describes one
set of properties and classes for describing RDF schemas.
As a result of many communities coming together and agreeing on basic
principles of metadata representation and transport, RDF has drawn influence
from several different sources. The main influences have come from the
Web standardization community
itself in the form of HTML metadata
and PICS, the
library community
, the
structured document community
in the form of SGML and more importantly XML, and also the
knowledge
representation (KR) community
. There are also other areas of technology
that contributed to the RDF design; these include object-oriented programming
and modeling languages, as well as databases.
While RDF draws from the KR community, readers familiar with that field
are cautioned that RDF does not specify a mechanism for
reasoning
RDF can be characterized as a simple frame system. A reasoning mechanism
could be built on top of this frame system.
2. Basic RDF
2.1. Basic RDF Model
The foundation of RDF is a model for representing named properties
and property values. The RDF model draws on well-established principles
from various data representation communities. RDF properties may be thought
of as attributes of resources and in this sense correspond to traditional
attribute-value pairs. RDF properties also represent relationships
between resources and an RDF model can therefore resemble an
entity-relationship diagram. (More precisely, RDF Schemas —
which are themselves instances of RDF data models — are ER diagrams.)
In object-oriented design terminology, resources correspond to
objects and properties correspond to instance variables.
The RDF data model is a syntax-neutral way of representing RDF
expressions. The data model representation is used to evaluate equivalence
in meaning. Two RDF expressions are equivalent if and only if their data
model representations are the same. This definition of equivalence permits
some syntactic variation in expression without altering the meaning.
(See
Section 6.
for additional discussion
of string comparison issues.)
The basic data model consists of three object types:
Resources
All things being described by RDF expressions are called
resources
A resource may be an entire Web page; such as the HTML document
"http://www.w3.org/Overview.html" for example.
A resource may be a part of a Web page; e.g. a specific HTML or XML element
within the document source. A resource may also be a whole collection of
pages; e.g. an entire Web site. A resource may also be an object that
is not directly accessible via the Web; e.g. a printed book.
Resources are always named by URIs plus optional anchor ids (see
URI
]).
Anything can have a URI; the extensibility of URIs allows the
introduction of identifiers
for any entity imaginable.
Properties
property
is a specific aspect, characteristic, attribute,
or relation used to describe a resource. Each property has a specific
meaning, defines its permitted values, the types of resources it can
describe, and its relationship with other properties. This document
does not address how the characteristics of properties are expressed;
for such information, refer to the
RDF
Schema specification
).
Statements
A specific resource together with a named property plus the value of
that property for that resource is an RDF
statement
These three individual parts of a statement are called, respectively,
the
subject
, the
predicate
, and the
object
The object of a statement (i.e., the property value) can be another
resource or it can be a literal; i.e., a resource (specified by a URI)
or a simple string or other primitive datatype defined by XML. In RDF
terms, a
literal
may have content that is XML markup
but is not further evaluated by the RDF processor. There are some
syntactic restrictions on how markup in literals may be expressed; see
Section 2.2.1.
2.1.1. Examples
Resources are identified by a
resource identifier
A resource identifier is a URI plus an optional anchor id (see
Section
2.2.1.
). For the purposes of this
section, properties will be referred to by a simple name.
Consider as a simple example the sentence:
Ora Lassila is the creator of the resource http://www.w3.org/Home/Lassila.
This sentence has the following parts:
Subject (Resource)
Predicate (Property)
Creator
Object (literal)
"Ora Lassila"
In this document we will diagram an RDF statement pictorially using
directed labeled graphs (also called "nodes and arcs diagrams").
In these diagrams, the nodes (drawn as ovals) represent resources and arcs
represent named properties. Nodes that represent string literals will
be drawn as rectangles. The sentence above would thus be diagrammed as:
Figure 1: Simple node and arc diagram
Note: The direction of the arrow is important. The arc always starts
at the subject and points to the object of the statement.
The simple diagram above may also
be read "
",
or in general "<
subject> HAS