• No se han encontrado resultados

1 Cultural TourApplied to the Cultural HeritageSectorfor

N/A
N/A
Protected

Academic year: 2023

Share "1 Cultural TourApplied to the Cultural HeritageSectorfor"

Copied!
19
0
0

Texto completo

(1)

Cultural Tour

Applied to the Cultural Heritage Sector for

2/27

© Intelligent Software Components, S.A. 2003

(2)

3/27

© Intelligent Software Components, S.A. 2003

The Potential of Semantic Web Technology

ƒ Enable a paradigm switch in searching information

ƒ From

• Information Retrieval

ƒ To

• Question Answering

ƒ This work illustrates an application in this line for one particular domain

Forward

Google: Federico García Lorca

(3)

5/27

© Intelligent Software Components, S.A. 2003

Archivo Virtual: Federico García Lorca

Spain member Organizations

6/27

© Intelligent Software Components, S.A. 2003

The Potential of Semantic Web Technology

ƒ Enable a paradigm switch in searching information

ƒ From

• Information Retrieval

ƒ To

• Question Answering

ƒ This work illustrates an application in this line for one particular domain

Forward

(4)

7/27

© Intelligent Software Components, S.A. 2003

Federico García Lorca

How Does it Work?

Ontology of Cultural Content Knowledge Acquisition Exploitation

Conclusions

A Semantic Portal for the Spanish Silver Age

(5)

9/27

© Intelligent Software Components, S.A. 2003

SOURCES ONTOLOGIES APPLICATION

ACQUISITION EXPLOTATION

ULAN

Places and People

Residencia de Estudiantes.

Revistas

Knowledge Parser Semantic

Portal Duontology Publication

Cultural Content Ontology

The Overall Process

How does it work?

10/27

© Intelligent Software Components, S.A. 2003

Ingredients

ƒ Multiple, heterogeneous sources

ƒ Ontology

ƒ Knowledge Acquisition “Engine”

• Knowledge Parser

• R2O and ODEMapster

ƒ Exploitation of knowledge

• Publishing the results - Duontology

• Semantic Navigation - Hyperlink-based navigation - 3D navigation

• Semantic Search Engine - Keyword based

- NLP queries (not done yet)

- Enriched documents with ontological information (not done yet)

How does it work?

(6)

How Does it Work?

Ontology of Cultural Content Knowledge Acquisition Exploitation

Conclusions

A Semantic Portal for the Spanish Silver Age

Ontology of Cultural Content

ƒ Constructed in collaboration with experts from Residencia de Estudiantes

ƒ Inspired by IFLA and MARC, and also based on general ontologies like SUO and Cyc

ƒ Ontology metrics (after population)

64 concepts

91 properties

60.000 instances

60.000 facts

40Mb in RDF(S) files

ƒ Ontology access and management functionalities built with KPOntology.

• Common API for: JENA1.6, JENA 2.0, Sesame, WebODE

Construction

(7)

13/27

© Intelligent Software Components, S.A. 2003

Ontology of Cultural Content

Illustration

Semantic Portal Definition Ontology of Cultural Content Knowledge Acquisition Exploitation

Conclusions

A Semantic Portal for the Spanish Silver Age

(8)

15/27

© Intelligent Software Components, S.A. 2003

The Sources

Residencia de Estudiantes

ULAN Toponyms and Persons

Supervision

Knowledge Acquisition

ULAN: United List of Artist Names

(9)

17/27

© Intelligent Software Components, S.A. 2003

The Sources

Knowledge Acquisition Residencia de Estudiantes

ULAN Toponyms and Persons

Supervision

18/27

© Intelligent Software Components, S.A. 2003

Residencia de Estudiantes. Revistas

(10)

19/27

© Intelligent Software Components, S.A. 2003

Knowledge Parser® Architecture

ƒ Source Pre-processing

ƒ Information identification

ƒ Ontology population

Pluggable

Strategies Intelligent Population of Ontologies Pre-Processing

Types

Knowledge Acquisition

Different Types of Pre-processing

Text

DOM

Render Layout DOM Text

Source Pre-process Interpretations

Source Preprocess Presentation Structure Text Language

Sources Information Idetification

Operators Identification

Description Hypothesis

Ontology Population

Evaluation Population

Domain

ƒ Plain Text Model

Data

• Regular Expression Check and Retrieval

• Offset References

ƒ DOM/Hypertext

• HTML object identification

• HTTP control and navigation

ƒ NLP

• Basic NLP: Tokenizer, Morphology and Chunk Parsers

• Retrieve phrases using head driven approach

• Basic semantic relations (synonyms, hyponyms, etc.)

ƒ Layout

• Rendered result of a HTML source:

(X,Y) coordinates

• Visual Operators: SAME_ROW,

Knowledge Acquisition

(11)

21/27

© Intelligent Software Components, S.A. 2003

Knowledge Parser® Architecture

ƒ Source Pre-processing

ƒ Information identification

ƒ Ontology population

Pluggable

Strategies Intelligent Population of Ontologies Pre-Processing

Types

Knowledge Acquisition

22/27

© Intelligent Software Components, S.A. 2003

Explicit Extraction Knowledge

Wrapping ontology

ƒ Documents

ƒ Pieces

ƒ Relations

• Semantic

• Layout

ƒ Data Types

• Meaning

• Basic Types

• HTML

Source Preprocess Presentation Structure Text Language

Sources Information Idetification

Operators Identification

Description Hypothesis

Ontology Population

Evaluation Population

Domain Data

Knowledge Acquisition

(12)

23/27

© Intelligent Software Components, S.A. 2003

Operators and Strategies

ƒ Operators

• Check: data types, relations, constraints

• Retrieve: obtains piece or document (precondition)

• Execute: navigate, select, etc…

ƒ Strategies (operators applied for hypothesis construction)

• Greedy: quick but not optimal

• Heuristics: hypothesis construction and pruning

• Optimal Backtracking: covering all search space

Source Preprocess Presentation Structure Text Language

Sources Information Idetification

Operators Identification

Description Hypothesis

Ontology Population

Evaluation Population

Domain Data

Knowledge Acquisition

Back

Knowledge Parser® Architecture

ƒ Source Pre-processing

ƒ Information identification

ƒ Ontology population

Pluggable Strategies

Intelligent Population of Ontologies Pre-Processing

Types

Knowledge Acquisition

(13)

25/27

© Intelligent Software Components, S.A. 2003

Ontology Population

ƒ Actions:

• Create new instance

• Modify existing instance

• Remove existing instance

• Relate existing instances

ƒ Process:

• Hypothesis evaluation

• Population simulation

• Lowest cost simulation algorithm

Source Preprocess Presentation Structure Text Language

Sources Information Idetification

Operators Identification

Description Hypothesis

Ontology Population

Evaluation Population

Domain Data

Knowledge Acquisition

Back

Semantic Portal Definition Ontology of Cultural Content Knowledge Acquisition Exploitation

Conclusions

A Semantic Portal for the Spanish Silver Age

(14)

27/27

© Intelligent Software Components, S.A. 2003

Publishing in a Semantic Portal

Exploitation

Semantic Web Publication: Semantic Portal Need for SW information publication on WWW (for humans)

Person

Partner Document

Milestone

Tool

publication

Knowledge Base Browsable Web Site

Inconveniences of direct publication/translation

• Semantic model is not necessary user-friendly (relations, control attributes)

“Traditional” Publishing

Exploitation

(15)

29/27

© Intelligent Software Components, S.A. 2003 Person

Partner Document

Milestone

Tool

publication

Knowledge Base Browsable Web Site

Person

Partner Deliverable

Plan

Publication Ontology

RDQL

Visualization is independent from the Semantic Model

Decoupled Publishing

Exploitation

Ontology viewRDQL ArchitectureCommand Pattern

Internal format (XML)

ArchitectureCommand Pattern

Person Works at

Partner

V. Richard

Benjamins iSOCO

Works at

Knowledge base

(RDF/RDFS)

Employee English

Richard at iSOCO

Publication model

(RDF/RDFS)

WWW Page (HTML)

RDQL Java

Business Logic

XSL Transformations

30/27

© Intelligent Software Components, S.A. 2003

Semantic Navigation

ƒ Example: “Federico García Lorca”

ƒ Returns instances

ƒ Allows reference consulting

Exploitation

(16)

31/27

© Intelligent Software Components, S.A. 2003

Semantic Navigation and Annotation. Onto-H

ƒ Browsing between ontology and sources

Exploitation

New Ways of Visualising Semantic Web Content

Exploitation

Domain Ontology Visualization

Visualization

Ontology

(17)

33/27

© Intelligent Software Components, S.A. 2003

New Ways of Visualising Semantic Web Content

Exploitation

CREATION PLACES

PEOPLE

PERSON

34/27

© Intelligent Software Components, S.A. 2003

New Ways of Visualising Semantic Web Content

Exploitation

(18)

Semantic Portal Definition Ontology of Cultural Content Knowledge Acquisition Exploitation

Conclusions

A Semantic Portal for the Spanish Silver Age

Conclusions

ƒ Towards a paradigm switch in searching?

ƒ More work to be done

ƒ Detailed failure analysis needed

ƒ Why does Search Engine fail?

• KA limitation - Not in ontology

- Missing/wrong instances

• Query construction - NLP result (too complex) - SeRQL query construction

ƒ Scalability

(19)

37/27

© Intelligent Software Components, S.A. 2003

iSOCO Valencia

Tel +34 96 3467143 Oficina 305

C/ Prof. Beltrán Báguena 4, 46009 Valencia Spain iSOCO Barcelona

Tel +34 93 5677200 C/ Alcalde Barnils 64-68 St. Cugat del Vallès 08190 Barcelona Spain

iSOCO Madrid Tel +34 91 3349797 C/Pedro de Valdivia 10 28006 Madrid Spain

For more information on iSOCO, please contact us at smartsearch@isoco.com

www.isoco.com

Intelligent Software Components, S.A.

Contact Information

Referencias

Documento similar

These cultural differences suggest that Mexico and the United States are appropriate settings in which to examine the impact of differing cultural norms regarding trust

La producción de didácticas para trabajar con los adolescentes y jóvenes Down es poco visible; situación que motiva a los autores a presentar esta investigación que espera aportar

 El turisme pot ser un sistema idoni de preservació del patrimoni si les accions turístiques tenen una coherència cultural bàsica: ha de conduir tant al respecte de la

The conversion of what we might term a classic cultural product the book, record, film on videotape) into a digital file or accessible online document and its assimilation into

Cuando Pedro de Valdivia funda Santiago de Chile, establece como estrategia geométrica para la creación de nuevas ciudades, la trama romana universal.. Proponemos una sencilla trama

Completed Cultural Competence Training Entrenamiento de Competencia

Organizado por la Comisión Estatal de Derechos Humanos y el Tecnológico de Monterrey, este diplomado forma parte del Plan de Educación para los Derechos Humanos que será

The second and most ambitious objective of the thesis has been to collect and contrast as much information as possible about the elements implied in the design, management and