• No se han encontrado resultados

1 Cultural TourApplied to the Cultural HeritageSectorfor

N/A
N/A
Protected

Academic year: 2023

Share "1 Cultural TourApplied to the Cultural HeritageSectorfor"

Copied!
19
0
0

Texto completo

(1)

Cultural Tour

Applied to the Cultural Heritage Sector for

2/27

© Intelligent Software Components, S.A. 2003

(2)

3/27

© Intelligent Software Components, S.A. 2003

The Potential of Semantic Web Technology

ƒ Enable a paradigm switch in searching information

ƒ From

• Information Retrieval

ƒ To

• Question Answering

ƒ This work illustrates an application in this line for one particular domain

Forward

Google: Federico García Lorca

(3)

5/27

© Intelligent Software Components, S.A. 2003

Archivo Virtual: Federico García Lorca

Spain member Organizations

6/27

© Intelligent Software Components, S.A. 2003

The Potential of Semantic Web Technology

ƒ Enable a paradigm switch in searching information

ƒ From

• Information Retrieval

ƒ To

• Question Answering

ƒ This work illustrates an application in this line for one particular domain

Forward

(4)

7/27

© Intelligent Software Components, S.A. 2003

Federico García Lorca

How Does it Work?

Ontology of Cultural Content Knowledge Acquisition Exploitation

Conclusions

A Semantic Portal for the Spanish Silver Age

(5)

9/27

© Intelligent Software Components, S.A. 2003

SOURCES ONTOLOGIES APPLICATION

ACQUISITION EXPLOTATION

ULAN

Places and People

Residencia de Estudiantes.

Revistas

Knowledge Parser Semantic

Portal Duontology Publication

Cultural Content Ontology

The Overall Process

How does it work?

10/27

© Intelligent Software Components, S.A. 2003

Ingredients

ƒ Multiple, heterogeneous sources

ƒ Ontology

ƒ Knowledge Acquisition “Engine”

• Knowledge Parser

• R2O and ODEMapster

ƒ Exploitation of knowledge

• Publishing the results - Duontology

• Semantic Navigation - Hyperlink-based navigation - 3D navigation

• Semantic Search Engine - Keyword based

- NLP queries (not done yet)

- Enriched documents with ontological information (not done yet)

How does it work?

(6)

How Does it Work?

Ontology of Cultural Content Knowledge Acquisition Exploitation

Conclusions

A Semantic Portal for the Spanish Silver Age

Ontology of Cultural Content

ƒ Constructed in collaboration with experts from Residencia de Estudiantes

ƒ Inspired by IFLA and MARC, and also based on general ontologies like SUO and Cyc

ƒ Ontology metrics (after population)

64 concepts

91 properties

60.000 instances

60.000 facts

40Mb in RDF(S) files

ƒ Ontology access and management functionalities built with KPOntology.

• Common API for: JENA1.6, JENA 2.0, Sesame, WebODE

Construction

(7)

13/27

© Intelligent Software Components, S.A. 2003

Ontology of Cultural Content

Illustration

Semantic Portal Definition Ontology of Cultural Content Knowledge Acquisition Exploitation

Conclusions

A Semantic Portal for the Spanish Silver Age

(8)

15/27

© Intelligent Software Components, S.A. 2003

The Sources

Residencia de Estudiantes

ULAN Toponyms and Persons

Supervision

Knowledge Acquisition

ULAN: United List of Artist Names

(9)

17/27

© Intelligent Software Components, S.A. 2003

The Sources

Knowledge Acquisition Residencia de Estudiantes

ULAN Toponyms and Persons

Supervision

18/27

© Intelligent Software Components, S.A. 2003

Residencia de Estudiantes. Revistas

(10)

19/27

© Intelligent Software Components, S.A. 2003

Knowledge Parser® Architecture

ƒ Source Pre-processing

ƒ Information identification

ƒ Ontology population

Pluggable

Strategies Intelligent Population of Ontologies Pre-Processing

Types

Knowledge Acquisition

Different Types of Pre-processing

Text

DOM

Render Layout DOM Text

Source Pre-process Interpretations

Source Preprocess Presentation Structure Text Language

Sources Information Idetification

Operators Identification

Description Hypothesis

Ontology Population

Evaluation Population

Domain

ƒ Plain Text Model

Data

• Regular Expression Check and Retrieval

• Offset References

ƒ DOM/Hypertext

• HTML object identification

• HTTP control and navigation

ƒ NLP

• Basic NLP: Tokenizer, Morphology and Chunk Parsers

• Retrieve phrases using head driven approach

• Basic semantic relations (synonyms, hyponyms, etc.)

ƒ Layout

• Rendered result of a HTML source:

(X,Y) coordinates

• Visual Operators: SAME_ROW,

Knowledge Acquisition

(11)

21/27

© Intelligent Software Components, S.A. 2003

Knowledge Parser® Architecture

ƒ Source Pre-processing

ƒ Information identification

ƒ Ontology population

Pluggable

Strategies Intelligent Population of Ontologies Pre-Processing

Types

Knowledge Acquisition

22/27

© Intelligent Software Components, S.A. 2003

Explicit Extraction Knowledge

Wrapping ontology

ƒ Documents

ƒ Pieces

ƒ Relations

• Semantic

• Layout

ƒ Data Types

• Meaning

• Basic Types

• HTML

Source Preprocess Presentation Structure Text Language

Sources Information Idetification

Operators Identification

Description Hypothesis

Ontology Population

Evaluation Population

Domain Data

Knowledge Acquisition

(12)

23/27

© Intelligent Software Components, S.A. 2003

Operators and Strategies

ƒ Operators

• Check: data types, relations, constraints

• Retrieve: obtains piece or document (precondition)

• Execute: navigate, select, etc…

ƒ Strategies (operators applied for hypothesis construction)

• Greedy: quick but not optimal

• Heuristics: hypothesis construction and pruning

• Optimal Backtracking: covering all search space

Source Preprocess Presentation Structure Text Language

Sources Information Idetification

Operators Identification

Description Hypothesis

Ontology Population

Evaluation Population

Domain Data

Knowledge Acquisition

Back

Knowledge Parser® Architecture

ƒ Source Pre-processing

ƒ Information identification

ƒ Ontology population

Pluggable Strategies

Intelligent Population of Ontologies Pre-Processing

Types

Knowledge Acquisition

(13)

25/27

© Intelligent Software Components, S.A. 2003

Ontology Population

ƒ Actions:

• Create new instance

• Modify existing instance

• Remove existing instance

• Relate existing instances

ƒ Process:

• Hypothesis evaluation

• Population simulation

• Lowest cost simulation algorithm

Source Preprocess Presentation Structure Text Language

Sources Information Idetification

Operators Identification

Description Hypothesis

Ontology Population

Evaluation Population

Domain Data

Knowledge Acquisition

Back

Semantic Portal Definition Ontology of Cultural Content Knowledge Acquisition Exploitation

Conclusions

A Semantic Portal for the Spanish Silver Age

(14)

27/27

© Intelligent Software Components, S.A. 2003

Publishing in a Semantic Portal

Exploitation

Semantic Web Publication: Semantic Portal Need for SW information publication on WWW (for humans)

Person

Partner Document

Milestone

Tool

publication

Knowledge Base Browsable Web Site

Inconveniences of direct publication/translation

• Semantic model is not necessary user-friendly (relations, control attributes)

“Traditional” Publishing

Exploitation

(15)

29/27

© Intelligent Software Components, S.A. 2003 Person

Partner Document

Milestone

Tool

publication

Knowledge Base Browsable Web Site

Person

Partner Deliverable

Plan

Publication Ontology

RDQL

Visualization is independent from the Semantic Model

Decoupled Publishing

Exploitation

Ontology viewRDQL ArchitectureCommand Pattern

Internal format (XML)

ArchitectureCommand Pattern

Person Works at

Partner

V. Richard

Benjamins iSOCO

Works at

Knowledge base

(RDF/RDFS)

Employee English

Richard at iSOCO

Publication model

(RDF/RDFS)

WWW Page (HTML)

RDQL Java

Business Logic

XSL Transformations

30/27

© Intelligent Software Components, S.A. 2003

Semantic Navigation

ƒ Example: “Federico García Lorca”

ƒ Returns instances

ƒ Allows reference consulting

Exploitation

(16)

31/27

© Intelligent Software Components, S.A. 2003

Semantic Navigation and Annotation. Onto-H

ƒ Browsing between ontology and sources

Exploitation

New Ways of Visualising Semantic Web Content

Exploitation

Domain Ontology Visualization

Visualization

Ontology

(17)

33/27

© Intelligent Software Components, S.A. 2003

New Ways of Visualising Semantic Web Content

Exploitation

CREATION PLACES

PEOPLE

PERSON

34/27

© Intelligent Software Components, S.A. 2003

New Ways of Visualising Semantic Web Content

Exploitation

(18)

Semantic Portal Definition Ontology of Cultural Content Knowledge Acquisition Exploitation

Conclusions

A Semantic Portal for the Spanish Silver Age

Conclusions

ƒ Towards a paradigm switch in searching?

ƒ More work to be done

ƒ Detailed failure analysis needed

ƒ Why does Search Engine fail?

• KA limitation - Not in ontology

- Missing/wrong instances

• Query construction - NLP result (too complex) - SeRQL query construction

ƒ Scalability

(19)

37/27

© Intelligent Software Components, S.A. 2003

iSOCO Valencia

Tel +34 96 3467143 Oficina 305

C/ Prof. Beltrán Báguena 4, 46009 Valencia Spain iSOCO Barcelona

Tel +34 93 5677200 C/ Alcalde Barnils 64-68 St. Cugat del Vallès 08190 Barcelona Spain

iSOCO Madrid Tel +34 91 3349797 C/Pedro de Valdivia 10 28006 Madrid Spain

For more information on iSOCO, please contact us at [email protected]

www.isoco.com

Intelligent Software Components, S.A.

Contact Information

Referencias

Documento similar

El projecte del portal - els preliminars El projecte de la creació d'un portal virtual que contingués tota la informació relativa al patrimoni cultural i natural de la vall de Boí

Conclusions Our results show a meaningful increase in the benefits of knowledge acquisition for students using CL, supporting positive group interdependence and creating a

El contenido de la política cultural, su estrategia de implementación, así como las acciones contempladas en los planes de desarrollo cultural deben corresponderse con

Studying aspects of innovation and collaboration legitimizes craft beer as a cultural and creative industry, and also generates knowledge of a sector with economic potential in Baja

Los programas para la protección y conservación del patrimonio natural y cultural en sus características físicas, en sus valores intangibles, expresiones culturales contemporáneas y sus

A Questionnaire Model for Usability Evaluation As the complete questionnaire ontology comprises a large amount of concepts, — ranging from usability evaluation generic knowledge to

An important contribution of the RSR approach to the semantic web exploitation is to provide an instrument for automatic building knowledge bases for different applications such

Namely, ODEWiki interoperates with an existing semantic portal technology (ODESeW), it manages inconsistencies raised because of the distributed nature of knowledge base