Handbook Of Metadata, Semantics And Ontologies
eBook - ePub

Handbook Of Metadata, Semantics And Ontologies

  1. 580 pages
  2. English
  3. ePUB (mobile friendly)
  4. Available on iOS & Android
eBook - ePub

Handbook Of Metadata, Semantics And Ontologies

About this book

Metadata research has emerged as a discipline cross-cutting many domains, focused on the provision of distributed descriptions (often called annotations) to Web resources or applications. Such associated descriptions are supposed to serve as a foundation for advanced services in many application areas, including search and location, personalization, federation of repositories and automated delivery of information. Indeed, the Semantic Web is in itself a concrete technological framework for ontology-based metadata. For example, Web-based social networking requires metadata describing people and their interrelations, and large databases with biological information use complex and detailed metadata schemas for more precise and informed search strategies.

There is a wide diversity in the languages and idioms used for providing meta-descriptions, from simple structured text in metadata schemas to formal annotations using ontologies, and the technologies for storing, sharing and exploiting meta-descriptions are also diverse and evolve rapidly. In addition, there is a proliferation of schemas and standards related to metadata, resulting in a complex and moving technological landscape — hence, the need for specialized knowledge and skills in this area.

The Handbook of Metadata, Semantics and Ontologies is intended as an authoritative reference for students, practitioners and researchers, serving as a roadmap for the variety of metadata schemas and ontologies available in a number of key domain areas, including culture, biology, education, healthcare, engineering and library science.

Contents:

  • Metadata Research: Making Digital Resources Useful Again? (Miguel-Angel Sicilia)
  • Metadata Typology and Metadata Uses (Eva Méndez and Seth van Hooland)
  • The Value and Cost of Metadata (Miltiadis D Lytras, Miguel-Ángel Sicilia and Cristian Cechinel)
  • Metadata Quality (Xavier Ochoa)
  • Ontologies in Systems Theory (Emilia Currás)
  • Introduction to XML and Its Applications (Laura Papaleo)
  • Ontologies and Ontology Languages (Sinuhé Arroyo and Katharina Siorpaes)
  • Topic Maps (Piedad Garrido Picazo and Jesús Tramullas)
  • Methodologies for the Creation of Semantic Data (Tobias Bürger, Elena Simperl and Christoph Tempich)
  • Metadata and Ontologies in e-Learning (Manuel E Prieto Méndez, Víctor H Menéndez Domínguez and Christian L Vidal Castro)
  • Metadata and Ontologies for Health (Gianluca Colombo, Daniele Merico and Michaela Gündel)
  • Agricultural Knowledge Organization Systems: An Analysis of an Indicative Sample (Nikos Palavitsinis and Nikos Manouselis)
  • Metadata and Ontologies for Bioinformatics (E Blanco)
  • Metadata and Ontologies for Mechanical Objects' Design and Manufacturing (Fabio Sartori and Stefania Bandini)
  • Metadata and Ontologies for Emergency Management (Leopoldo Santos-Santos and Tomás Aguado-Gómez)
  • Metadata and Ontologies for Tourism (Dimitris Kanellopoulos)
  • Metadata Standards and Ontologies for Multimedia Content (Tobias Bürger and Michael Hausenblas)
  • Technologies for Metadata Integration and Interoperability (Ricardo Eito-Brun)
  • Technologies for Metadata Extraction (Koraljka Golub, Henk Muller and Emma Tonkin)
  • Technologies for Metadata and Ontology Storage (Mary Parmelee and Leo Obrst)


Readership: Graduates and senior undergraduates in computing or information science; researchers in metadata, semantics and ontologies; practitioners in planning or managing information systems.
Key Features:

  • The first comprehensive guide to metadata and semantics from a broad perspective
  • Up-to-date surveys of metadata, semantics and ontologies in key domains such as healthcare, learning technology and agriculture
  • An integrative account of metadata and semantics that can reach a broad audience, including not only computer scientists and information scientists but also IT managers and technologists in general

Frequently asked questions

Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription.
No, books cannot be downloaded as external files, such as PDFs, for use outside of Perlego. However, you can download books within the Perlego app for offline reading on mobile or tablet. Learn more here.
Perlego offers two plans: Essential and Complete
  • Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
  • Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Both plans are available with monthly, semester, or annual billing cycles.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Yes! You can use the Perlego app on both iOS or Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Yes, you can access Handbook Of Metadata, Semantics And Ontologies by Miguel-Angel Sicilia in PDF and/or ePUB format, as well as other popular books in Computer Science & Artificial Intelligence (AI) & Semantics. We have over one million books available in our catalogue for you to explore.

CHAPTER I.1

METADATA RESEARCH: MAKING DIGITAL RESOURCES USEFUL AGAIN?

Miguel-Angel Sicilia
Department of Computer Science
University of AlcalĆ”, Polytechnic building
Ctra. Barcelona km. 33.6
AlcalĆ” de Henares(Madrid), Spain
[email protected]
The growth of the Web represents also one of its major challenges, as users face the problem of selecting the pages that are most relevant to their taskfrom a vast amount of information. Search engines and microdata are an example of a means towards the end of helping in better targeting search, but there is no universal perfect solution for all information needs. In addition, the last years have witnessed the emergence of the Web of Linked Data, fostered by the increasing adoption of openness as a paradigm for sharing information for the benefit of the commons. Microdata, linked data and other technologies are no other thing than different ways of using metadata to enhance information seeking, targeting and integration. Metadata thus is nowadays the fabric of the Web. Understanding the different forms and arrangements of metadata is in consequence a required skill for researchers and practitioners that aim at understanding and getting value from the Web.
Keywords: Metadata, Linked Data, microdata, terminologies

1. Introduction

ā€œMetadataā€ has become a term frequently used both in academia and also in the professional context. As an indicator of its growing acceptance as a common concept, the Google Scholar service1 estimates more than 1 million results when we formulate a query using only the term. The results’ estimation become more than 30 million if we use instead the non-specialized search service provided by Google. While the common usage of the term seems to be uncontroversial, there is an increasing heterogeneity in the ways metadata is defined, created, managed, and stored. This heterogeneity probably comes from the lack of a precise definition of metadata that captures the main elements behind the application of metadata technologies.
Metadata is commonly defined as ā€œdata about dataā€, according to its etymology. While this definition cannot be considered false, it has the problem of covering too many things and at the same time capturing only part of the aspects that are considered important by researchers and practitioners working with metadata. Following such definition, if I write some data in a piece of paper about an interesting book from my local library, that is a piece of metadata. This naĆÆve example actually can be used to raise several of the important questions revolving around metadata research. For example, for some people metadata only applies to digital information (and this is precisely the focus of interest we take here). Or for some others metadata needs to be formulated with some form of schema or structure that brings a level of standardization or homogeneous use across Web sites and systems.
Another problem with metadata as a concept is that it has been metadata and not meta-information the term that has reached widespread use. There is a conceptual distinction according to which data, information and knowledge are different but interrelated things (Zins, 2007). However, to follow the common use of the term, we refer here to metadata as a generic term of any kind of meta-information also.
Metadata existed many years before the Web was even conceived. However, with the Web metadata has been brought to the hearth of the architecture of cyberspace. Originally the Web was only made up of HTML pages following a simple interlinked structure. But it has evolved into something much more complex in which metadata mixes with the contents of the pages or is arranged as a layer of information that ā€œpointsā€ to the resources described via URIs.2 Also, HTML is not anymore the only way of describing information on the Web. XML first and RDF then, along with some microformats, are the main expression mediums for metadata today.
Understanding what is metadata and how it manifests today in the Web is a key skill for practitioners and researchers in a variety of domains. Here we attempt to succinctly delineate the main characteristics of metadata and the way metadata nowadays conforms a space of information that surrounds the Web.
The rest of this chapter is structured as follows. Section 2 briefly discusses the emergence of metadata as a differentiated area of inquiry. Then, in Section 3 a definition of metadata is provided with the aim of covering in a broad sense such inquiry area. Section 4 then discusses some particular kinds of metadata as illustrations of its diversity. Finally, conclusions and outlook are provided in Section 5.

2. Metadata as a research discipline

During the last years we have been starting to speak of ā€œmetadata researchā€. People have started to define themselves as ā€œmetadata specialistsā€ and there have been international projects that were basically ā€œmetadata aggregationā€ projects. But is there anything as a discipline or area of ā€œmetadata research?ā€. This is difficult to say, as the discipline is not defined by any society or professional organization to our knowledge. While there exist a few scholarly journals that have ā€œmetadataā€ in the title, and conferences that explicitly deal with metadata, delineating the boundaries of the topic is a challenging effort.
It is also difficult to clearly define the object of metadata research. A possible tentative would be that of defining that object as to an engineering discipline. Engineering is the science of design and production, and in this case we aim at devising information mechanisms for a better access to information resources.
An information mechanism can be broadly defined as any technique or method (or sets of them) that provides an organization to other information resources. Having databases of XML records with DublinCore metadata3 is such an information mechanism. It has a defined schema, a format of expression and a way to point to the original resources, e.g. using <dc:identifier>. The Linked Open Data approach in DBPedia is another example (Morsey et al., 2012). In this case it is based on the RDF standard,4 and also follows a set of conventions that make it available via dereferenceable URLs. Many different information mechanisms can be devised for the same or different purposes. And metadata research is about how to make these more effective and efficient for particular purposes.
The purposes are the ā€œbetter accessā€ part of the definition. For example, the Europeana digital library5 is essentially a system built on top of the aggregation of metadata using primarily harvesting mechanisms, starting from the OAI-PMH protocol. Here the ā€œbetter accessā€ means several things, including (a) homogeneous presentation of cultural resource descriptions, (b) a single point of interaction for a large mass of content and (c) some form of quality control in the ingestion process. In this example, it becomes evident that information mechanisms are encompassing not only formats, schemas and database technologies but also approaches to quality, organizational issues. In general, they involve a socio-technical system with procedures, technologies, tools and people.
A further characteristic of metadata research that makes it challenging is that the evolution of the field takes place in the context of the social phenomenon of adopting particular schemas and practices. In that direction, the survival and spread of a particular metadata schema arguably depends to a large extent on its readiness to be easily implemented by the community of practitioners and researchers in that area. In consequence, there may be metadata schemas for a given purpose that are richer than others, but they are also more slowly accepted and used. This may be attributed to different causes, as the difficulty of implementing, how hard is to transition legacy metadata and the degree of openness and transparency of their curators, to name a few. This is a sort of ā€œnatural evolutionā€ of schemas and practices that in some cases cannot be directly related to the technical merits of the different approaches. It related to the social nature of the Web (Berners-Lee et al., 2006).
In consequence, it is difficult to say if metadata research is a scientific discipline in itself with its own theories, assumptions and corpus of commonly accepted knowledge. However, it is clear that metadata research is a field of inquiry that is evolving and growing, and concepts and practice get consolidated with the years. It is in consequence worth the effort looking at the evolution of metadata research and doing an attempt to identify its foundations.

3. Defining metadata

Greenberg (2003) defines metadata as ā€œstructured data about an object that supports functions associated with the designated objectā€. Structure in metadata entails that information is organised systematically, and this is nowadays primarly achieved by the use of metadata schemas. The functions enabled can be diverse, but they are in many cases related to facilitating discovery or search, or to restrict access (e.g. in the case of licensing information) or to combine meta-information to relate resources described separately.
The main characteristic of metadata is its referential nature, i.e., metadata predicates about some other thing (even describing another metadata record). Such ā€˜other thing’ can be considered as ā€˜anything’ from the broadest perspective, but such a view could hardly be useful for bringing semantics to current information systems as the web. Then, we will restrict our discussion to digital resources of a diverse kind. In the scope of the current web, resources can be unambiguously identified by the concept of URI.
For metadata to become an object of scientific inquiry there is a need to make it measurable in its core attributes, beyond measures related to size or availability. Metadata then should be considered to be subject to assessment in several dimensions. They include at least the following:
—Quality
—Richness
—Interoperability
While these three aspects are not independent completely, they look at the problem of having better metadata systems from different angles. Current studies on metadata quality mainly deal with completeness of metadata records and in some cases with the degree of use of controlled vocabularies. However, there is little research on richness, i.e. the amount of useful information or possibilities of interlinking of metadata collections or systems. The problem of richness should be approached at two levels. At the schema level, there are still no metrics for assessing and comparing metadata schemas according to their expressivity and possibilities to convey more detailed information. At the record level, the problem becomes even more challenging, as the final richness depends on the schema, the completeness of the records and also some other aspects that are in many cases domain-dependent.
Interoperability should in theory be taken from granted in metadata systems, however, it is a matter of fact that there are differences. The problem of interoperability starts obviously at the syntactic level. In common, general-purpose metadata schemas, simplicity comes at the cost of reducing possibilities to integrate information. There is a sort of trade-off between using highly generic metadata schemas as Dublin C...

Table of contents

  1. Cover Page
  2. Half Title
  3. Title Page
  4. Copyrights
  5. Preface
  6. Contents
  7. Chapter I.1 Metadata Research: Making Digital Resources Useful Again?
  8. Chapter I.2 Metadata Typology and Metadata Uses
  9. Chapter I.3 The Value and Cost of Metadata
  10. Chapter I.4 Metadata Quality
  11. Chapter I.5 Ontologies in Systems Theory
  12. Chapter II.1 Introduction to XML and Its Applications
  13. Chapter II.2 Ontologies and Ontology Languages
  14. Chapter II.3 Topic Maps
  15. Chapter II.4 Methodologies for the Creation of Semantic Data
  16. Chapter III.1 Metadata and Ontologies in e-Learning
  17. Chapter III.2 Metadata and Ontologies for Health
  18. Chapter III.3 Agricultural Knowledge Organization Systems: An Analysis of an Indicative Sample
  19. Chapter III.4 Metadata and Ontologies for Bioinformatics
  20. Chapter III.5 Metadata and Ontologies for Mechanical Objects’ Design and Manufacturing
  21. Chapter III.6 Metadata and Ontologies for Emergency Management
  22. Chapter III.7 Metadata and Ontologies for Tourism
  23. Chapter III.8 Metadata Standards and Ontologies for Multimedia Content
  24. Chapter IV.1 Technologies for Metadata Integration and Interoperability
  25. Chapter IV.2 Technologies for Metadata Extraction
  26. Chapter IV.3 Technologies for Metadata and Ontology Storage