Vidéo pédagogique

Notice

Langue :

Anglais

Crédits

Fabien Gandon (Intervention), Catherine Faron (Intervention), Olivier Corby (Intervention)

Conditions d'utilisation

Unless otherwise specified, the course material is provided under the Creative Commons License BY-NC-ND: the name of the author should always be mentioned; the user can exploit the work except in a commercial context; and he or she cannot make changes to the original work.

DOI : 10.60527/2sgm-8b72

Citer cette ressource :

Fabien Gandon, Catherine Faron, Olivier Corby. Inria. (2016, 8 septembre). 4. Linked Data Principles , in 1. Principles of a Web of Linked Data. [Vidéo]. Canal-U. https://doi.org/10.60527/2sgm-8b72. (Consultée le 19 juillet 2025)

4. Linked Data Principles

Réalisation : 8 septembre 2016 - Mise en ligne : 13 novembre 2018

document 1 document 2 document 3
niveau 1 niveau 2 niveau 3

Descriptif

In this fourth part, we're going to see the principles behind Linked Data.

What we're going to do is to change slightly how we use the Web architecture. The principles say we're going to use HTTP URIs to allow dereferencing the address for naming everything around us. For instance, if I gave a URI to my necktie, I will use an HTTP URI for that, so that if you find that URI, you can call that URI to get data and discover what it is about. Then when a URI is accessed, we provide data about the resource it represents, for instance we provide data about the necktie. Finally, in the data we provide as many links as possible to other data on the Web, for instance to my suit...

Intervention

Gandon

Fabien

Chercheur à l'INRIA de Sophia-Antipolis, FR (en 2016). Directeur de recherche à l'INRIA Sophia Antipolis-Méditerranée, Université Côte d'Azur (en 2021)

Titulaire d'un doctorat en sciences (Informatique, Nice, 2002)

Faron

Catherine

Professeure en poste à Université Côte d'Azur (en 2022)

Maîtresse de conférences en poste à l'Université de Nice-Sophia Antipolis (en 2020)

Autrice d'une thèse en sciences appliquées soutenue à Paris 6 en 1997

Professeur des Universités Université de Côte d'Azur

Rapporteure lors d'une thèse soutenue à l'INSA Lyon en 2024

Présidente du jury d'une thèse en Informatique à Université Côte d'Azur en 2024

Corby

Olivier

Auteur d'une thèse en Informatique à Nice en 1988

Thème

Discipline :

Informatique

Documentation

Documents pédagogiques

Discover who rents a domain name

The Web application below allows you to provide a domain name to see who controls this domain and to which machine calls to this address are routed. Here for instance we called that service on the domain name “dbpedia.org”

https://who.is/whois/dbpedia.org

Choosing a scheme for your URIs

An HTTP URI is a URI created to name anything we want to talk about but that uses the HTTP in order to be “dereferenceable” i.e. so that a person or a software finding that URI (e.g. a Web crawler) may easily learn more about the resource represented by that URI by just making and HTTP call to the HTTP address it provides. We don’t use the term URL (locator) because the thing that is being represented may not be itself on the Web at this address.
For example, I may want to give an HTTP URI to Mytsie (my cat). No matter how hard I try, Mytsie itself will never be “located” on the Web (it is a not a URL) but this adorable cat can be identified on the Web by an HTTP URI and if you ever go to that address you will be provided with a description on the Web about the resource represented by that URI, i.e. my cat.

Now, how do we choose the URIs we are going to use to talk about the things we want to describe? What should be their structure or schema?

The generic form of a URI is

scheme:[//[user:password@]host[:port]][/]path[?query][#fragment]

For classical HTTP URIs we will have a schema of the form:

http://host[:port]][/]path[?query][#fragment]

We already mentioned the importance of choosing well the domain name for the host par of the identifier. But what about the rest of the address?

There is no unique correct answer to that question and here are two documents that discuss the different options with pros and cons. As you will see the answer is neither simple nor closed:

Cool URIS: http://www.w3.org/TR/cooluris/
Issue 57: http://www.w3.org/2001/tag/awwsw/issue57/latest/

In many cases, the objects we want to describe already have some kind of identifier. In theory, you can transform any identifier into an HTTP URI, for instance, just by choosing a transformation (URI scheme) of the form

http:///

For example, if I want to identify cats, I could choose the following minting scheme:

cat;1278 → http://animals.org/cat/1278

Then HTTP content negotiation (conneg) and, possibly, redirections are configured on the server to provide content in XML, RDF, HTML, JSON, etc. to whoever accesses that address.
Of course, depending on the type of identifier you initially had, you may need to use the URI encoding mechanism we introduced before.

To illustrate that first step, we can mention the real example of the digital object identifier (DOI). There is a way to lookup any DOI on the Web through a service implementing a mapping from DOIs to HTTP URIs.
If you take the following DOI for instance:

doi:10.1007/3-540-45741-0_18

You can transform it into the following HTTP URI following the URI minting scheme implemented by doi.org:
http://dx.doi.org/10.1007/3-540-45741-0_18

This HTTP URI will then redirect you to a description of the object identify by the DOI.

So, choosing the URIs will strongly depend on the domain to which the objects you want to describe belong.

However, there are two families of HTTP URIs that can be considered every time you want to choose a naming scheme: the “hash URIs” (long story) and the “slash URIs” and the discussion they led to.

When a URI contains a hash (i.e. the symbol # ), this indicates a fragment in the URI:

http://my.domain.name/my/path#the-fragment

The HTTP standard requires the Web client to remove the fragment before making a request so if you make an HTTP call on this URI it will in fact be performed on the address:

http://my.domain.name/my/path

The use of a fragment has two advantages:

To immediately differentiate, for instance, the name (URL) of a file on the Web containing descriptions and the names (URIs with fragments) of the resources it describes;
The grouping of several descriptions in one file that can be cached and avoid several calls to discover different linked resources.

For example, in one source at the address:

http://fabien.gandon.me/my/objects/cars

I can describe several things:

http://fabien.gandon.me/my/objects/cars#bmw1http://fabien.gandon.me/my/objects/cars#smart1http://fabien.gandon.me/my/objects/cars#tesla1…

It has "the disadvantages of its advantages ": one cannot obtain the description of only one resource since the whole document is retrieved every time the address is accessed and this could be costly in terms of network traffic, memory and processing when the file is large.

The alternative is to use only the path with slashes (i.e. the symbol / ) to generate identifiers. For instance:

http://fabien.gandon.me/my/objects/cars/bmw1

In that case the server needs to implement a redirection to respond to these addresses with an HTTP 303 error code "See Other". This is to indicate that this URI identifies a resource that is not directly available on the Web and to redirect the requester to another URL where a description about that resource is available. A server should not answer directly (HTTP 200 OK) because it would mean the object (the car for instance here) is available on the Web and it can be retrieved through HTTP which is not true. So the server should redirect ( HTTP 303 error code "See Other") the requester to another address where to find data about the object (the car in our example). Again the content negotiation is used to redirect the requester to a URL corresponding to the requested content format. For instance in HTML:

http://fabien.gandon.me/my/objects/cars/bmw1.html

or in XML:

http://fabien.gandon.me/my/objects/cars/bmw1.xml

This alternative, using slashes, allows us to be much more modular in the storage and transfer of descriptions. Here, a Web client can retrieve only the description it is interested in.

Disadvantages include the multiplication (by two) of HTTP calls (the first access and the second one after the redirection) and the fragmentation of the data that requires multiple calls when one wants to retrieve a collection of them.

To summarize, fragments can be used for small datasets where grouping makes sense (unity of content, linked, same life cycle). This option is also the simplest one as it can be implemented, for instance, just by hosting a file on a Web server. The redirection by HTTP 303 is more technical but allows more control over the data served. Finally, nothing prevents you from using and mixing these two options even inside the same dataset.

FREE BOOK ONLINE

Tom Heath and Christian Bizer (2011) Linked Data: Evolving the Web into a Global Data Space (1st edition). Synthesis Lectures on the Semantic Web: Theory and Technology, 1:1, 1-136. Morgan & Claypool. http://linkeddatabook.com/

To go further...

The Web site "Linking Open Data cloud diagram" provides an overview of the linked open data cloud on the Web.

LODStats based on the CKAN dataset metadata registry to obtain a comprehensive picture of the current state of the Data Web
The free HTML version of the book by Tom Heath & Christian Bizer (2011) "Linked Data: Evolving the Web into a Global Data Space". Synthesis Lectures on the Semantic Web: Theory and Technology, 1:1, 1-136. Morgan & Claypool.

The initial document about Linked Data by Tim Berners-Lee, Design Issues, W3C, 2006

Linked Data Mug

Data on the Web Best Practices, W3C Recommendation 31 January 2017: best practices for publication and usage of data on the Web to facilitate interaction between publishers and consumers. This document from 2017 also shows the evolution of W3C activity to facilitate data on the Web in general: https://www.w3.org/TR/dwbp/

Liens

Slides of The 1st part : principles to a web of linked data

Dans la même collection

Vidéo pédagogique

00:03:28

Favoris
3. From pages to resources
GANDON Fabien

FARON Catherine

CORBY Olivier
In this third part, we will see another evolution of the Web, or more precisely, an evolution of the way we use the Web. We will
Internet
Url
Semantic web
Web of data
Web Architecture
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:01:00

Favoris
Demos about a Web of Linked data
GANDON Fabien

FARON Catherine

CORBY Olivier
The BBC Web site uses linked (open) data The Wildlife documentary catalog on the Web site of BBC The Web site of BBC is structured and augmented with both internal and public linked data. In
Internet
DBpedia
BBC
Sindice
Semantic web
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:03:49

Favoris
2. Separating Presentation and Content
GANDON Fabien

FARON Catherine

CORBY Olivier
We now consider one of the first evolutions of the web, to separate the presentation and the content. In 1996, CSS, standing for
Internet
Semantic web
Web of data
Web Architecture
Css
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:02:12

Favoris
5. Stack of Standards and Languages
GANDON Fabien

FARON Catherine

CORBY Olivier
Let us now conclude this first part with an overview of the stack of standards and languages that are used to publish data on the
Internet
W3C
Semantic web
Web of data
Standardization
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:02:53

Favoris
1. Historical Introduction to the Web Architecture
GANDON Fabien

FARON Catherine

CORBY Olivier
Going back in history, back in 1945, Vannevar Bush wrote an article entitled "As we may think". In this article, he
Internet
Semantic web
Web Architecture
History of the web
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3

Voir tout

Avec les mêmes intervenants et intervenantes

Vidéo pédagogique

00:05:29

Favoris
1. RDF Graph Pattern Matching
GANDON Fabien

FARON Catherine

CORBY Olivier
This third part presents the SPARQL (pronounced sparkle) Query Language that enables users to query RDF triple stores. The SPARQL query language enables us to access data
Internet
Linked data
RDF
SPARQL
Semantic web
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:03:49

Favoris
2. Separating Presentation and Content
GANDON Fabien

FARON Catherine

CORBY Olivier
We now consider one of the first evolutions of the web, to separate the presentation and the content. In 1996, CSS, standing for
Internet
Semantic web
Web of data
Web Architecture
Css
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:01:53

Favoris
6.Naming graphs
GANDON Fabien

FARON Catherine

CORBY Olivier
This sequence explain how to name graphs in the RDF model and what is the utility of it. In many applications it is very useful to be
Internet
Linked data
RDF
Semantic web
Data model
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:06:34

Favoris
1. RDFa: an RDF syntax inside HTML
GANDON Fabien

FARON Catherine

CORBY Olivier
The idea of the integration of the web of linked data with other data formats and sources is determined by the fact that the Web is evolving towards all forms of
Internet
HTML
RDF
RDFa
Semantic web
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:05:56

Favoris
1. Describing resources
GANDON Fabien

FARON Catherine

CORBY Olivier
In this second part we will focus on RDF. RDF is the first brick of the Semantic Web Standards Stack and comprises both a model and several serialization syntaxes, to publish data about anything on
Internet
Linked data
RDF
Semantic web
Data model
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:04:54

Favoris
3.Filter, Constraint and Function
GANDON Fabien

FARON Catherine

CORBY Olivier
In the third part, we will see the filters, constraints and functions. It is possible to filter the results of query using
Internet
Linked data
RDF
SPARQL
Semantic web
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:02:40

Favoris
5. R2RML: integration with databases
GANDON Fabien

FARON Catherine

CORBY Olivier
R2RML allows us to integrate data from databases into RDF. There are two ways of transforming a relational database into RDF using R2RML.
Internet
HTML
RDF
Semantic web
Web of data
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:04:49

Favoris
4. Values, Types and Languages
GANDON Fabien

FARON Catherine

CORBY Olivier
This sequence is about the specificities of the RDF model related to typing literal values and resources in an RDF graph and indicating the
Internet
Linked data
RDF
Semantic web
Data model
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:01:44

Favoris
Demos about SPARQL
GANDON Fabien

FARON Catherine

CORBY Olivier
Flint, a SPARQL Query Editor Editors are now available for SPARQL. We present the Flint structured editor which provides syntactic coloration. The editor proposes SPARQL keywords according to the
Internet
SPARQL
Flint
Corese
Semantic web
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:02:04

Favoris
Conclusion of the MOOC Introduction to a Web of Linked Data
GANDON Fabien

FARON Catherine

CORBY Olivier
This video gives a summary of all the notions that have been presented in the 4 parts of the MOOC Introduction to a Web of Linked Data. We saw that we can use HTTP URIs to
Internet
HTML
Linked data
Semantic web
Web of data
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:02:56

Favoris
5. Several Query Forms
GANDON Fabien

FARON Catherine

CORBY Olivier
In the fifth part, we will see several query forms. Until now, we have seen the select where SPARQL query form but there are
Internet
Linked data
RDF
SPARQL
Semantic web
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:03:28

Favoris
3. From pages to resources
GANDON Fabien

FARON Catherine

CORBY Olivier
In this third part, we will see another evolution of the Web, or more precisely, an evolution of the way we use the Web. We will
Internet
Url
Semantic web
Web of data
Web Architecture
13.11.2018
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3