Data Model Patterns: A Metadata Map
eBook - ePub

Data Model Patterns: A Metadata Map

David C. Hay

Share book
  1. 432 pages
  2. English
  3. ePUB (mobile friendly)
  4. Available on iOS & Android
eBook - ePub

Data Model Patterns: A Metadata Map

David C. Hay

Book details
Book preview
Table of contents
Citations

About This Book

Data Model Patterns: A Metadata Map not only presents a conceptual model of a metadata repository but also demonstrates a true enterprise data model of the information technology industry itself. It provides a step-by-step description of the model and is organized so that different readers can benefit from different parts.

It offers a view of the world being addressed by all the techniques, methods, and tools of the information processing industry (for example, object-oriented design, CASE, business process re-engineering, etc.) and presents several concepts that need to be addressed by such tools.

This book is pertinent, with companies and government agencies realizing that the data they use represent a significant corporate resource recognize the need to integrate data that has traditionally only been available from disparate sources. An important component of this integration is management of the "metadata" that describe, catalogue, and provide access to the various forms of underlying business data. The "metadata repository" is essential to keep track of the various physical components of these systems and their semantics.

The book is ideal for data management professionals, data modeling and design professionals, and data warehouse and database repository designers.

  • A comprehensive work based on the Zachman Framework for information architectureā€”encompassing the Business Owner's, Architect's, and Designer's views, for all columns (data, activities, locations, people, timing, and motivation)
  • Provides a step-by-step description of model and is organized so that different readers can benefit from different parts
  • Provides a view of the world being addressed by all the techniques, methods and tools of the information processing industry (for example, object-oriented design, CASE, business process re-engineering, etc.)
  • Presents many concepts that are not currently being addressed by such tools ā€” and should be

Frequently asked questions

How do I cancel my subscription?
Simply head over to the account section in settings and click on ā€œCancel Subscriptionā€ - itā€™s as simple as that. After you cancel, your membership will stay active for the remainder of the time youā€™ve paid for. Learn more here.
Can/how do I download books?
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
What is the difference between the pricing plans?
Both plans give you full access to the library and all of Perlegoā€™s features. The only differences are the price and subscription period: With the annual plan youā€™ll save around 30% compared to 12 months on the monthly plan.
What is Perlego?
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, weā€™ve got you covered! Learn more here.
Do you support text-to-speech?
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Is Data Model Patterns: A Metadata Map an online PDF/ePUB?
Yes, you can access Data Model Patterns: A Metadata Map by David C. Hay in PDF and/or ePUB format, as well as other popular books in Informatica & Database. We have over one million books available in our catalogue for you to explore.

Information

Year
2010
ISBN
9780080477039
Subtopic
Database
1
ABOUT METADATA MODELS
There once was a fellow named Corey
Whose career was not covered in glory
He had a bad day
When he just couldnā€™t say
Me-ta-da-ta Re-pos-i-TOR-y.

WHAT ARE METADATA?*

During the 1990s, the concept of data warehouse** swept the information technology industry. After many years of trying, it appears finally to be possible for a company to store all of its data in one place for purposes of reporting and analysis. The technology for doing this is still new, and the first attempts have had mixed results, but the effort has been quite serious.
One of the problems that arose from this effort was the realization that if a senior executive is going to ask a giant database a question it is necessary to know just what is in the database and what types of questions to ask. In addition to the data themselves, therefore, it is necessary to keep data about the data. The term coined for ā€œdata about dataā€ during the 1990s was metadata.
Since then, numerous books and magazine articles have been published on this subject, but most have focused on why metadata are important and on technologies and techniques for managing them. What these publications have left out is a clear description of exactly what the stuff is. After a decade, there is still no simple, clear description of metadata in a form that is both comprehensive enough to cover our industry and comprehensible enough that it can be used by people. This book is an attempt to produce such a description.
As with all buzzwords, once invented the term metadata has taken on a life of its own. It is variously described as:
ā€¢ Any data about the organizationā€™s data resource [Brackett 2000, p. 149].
ā€¢ All physical data and knowledge from inside and outside an organization, including information about the physical data, technical and business processes, rules and constraints of the data, and structures of the data used by a corporation [Marco 2000, p. 5].
ā€¢ The detailed description of instance data. The format and characteristics of populated instance data: instances and values, dependent on the role of the metadata recipient [Tannenbaum 2002, p. 93].
Several significant points come out of these definitions. First, as Mr. Marco pointed out there is a difference between business metadata and technical metadata. The business user of metadata is interested in definitions and structures of the language as terms for the types of information to be retrieved. The technician is concerned with the physical technologies used to store and manage data. Both of these points of view are important, and both must be addressed.
Second, the subject is concerned with more than just data. It is, as Mr. Brackett said, ā€œany data about an organizationā€™s data resource.ā€ Once you have started looking at the structure of an organizationā€™s data, you have to also account for its activities, people and organizations, locations, timing and events, and motivation.
Third, as Ms. Tannenbaum pointed out, the ā€œmetaā€ aspect of the question is a matter of point of view. There is metadata relative to the data collected by the business. There is also meta-metadata, which is used to understand and manage the metadata.*
This last point is illustrated in Figure 1-1. Here, the bottom row shows examples of things in the world that are often described in information systems. ā€œJulia Robertsā€ is a real human being. The ā€œWall Street branchā€ of a bank is a physical place were business is performed. Checking account ā€œ09743569ā€ is a particular account held in that bank by a particular customer (Julia Roberts, for example). The customer of that account may then perform an actual ā€œATM Withdrawalā€ at a specific time.
image
Fig. 1-1 Data and metadata.
The next row up shows, in the first three columns, the data that might describe those three things: (1) A Customer has the name ā€œJulia Robertsā€ and the ā€œBirthdateā€ of ā€œ10/28/67ā€. (2) A Branch has the address ā€œ111 Wall Streetā€ and a manager, ā€œSam Sneedā€. (3) The checking account has an account number ā€œ09743569ā€ and a monthly charge, ā€œ$4.50ā€. In the fourth column, the first row from the bottom shows that a particular program, called here ā€œJava codeā€, is responsible for a ā€œWithdrawal Transactionā€. These are the things that would concern a person managing data for a banking business. Note that each of the terms was described as to what it was: customer name, branch manager, account number, and so forth.
The third row from the bottom collects those descriptors and labels them in turn. This is to create what we in the data administration world call the metadata. There are two components to these labels. First are the names of the things of significance being described by the business data, such as the entity classes ā€œCustomerā€ and ā€œBranchā€. Second, each of these is in turn described by attributes, such as ā€œNameā€, ā€œAddressā€, and ā€œBirthdateā€. We also discover, in the case of the bank branch, that there is really an additional entity class, ā€œManagerā€, and that it is related to ā€œBranchā€. (ā€œEach Branch must be managed by exactly one Employee.ā€)
In the checking account column, we see that a checking account is actually the subject of a table in a database. The table is called ā€œCHECKING_ACCOUNTā€ and has columns ā€œAccount_numberā€ and ā€œMonthly_chargeā€. The ATM program described in the second row simply as ā€œJava codeā€ is actually a program module with the name ā€œATM Controllerā€ written in the language ā€œJavaā€. As we can see, the metadata row itself encompasses several different types of objects (ā€œEntity classā€, ā€œAttributeā€, ā€œTableā€, ā€œColumnā€, ā€œProgram moduleā€, and ā€œLanguageā€). The assignment of this book, represented by the top row, is to show how these objects relate to one another.
Metadata donā€™t just describe data. They describe how the organization understands not only its data, but also its activities, people and organizations, geography, timing, and motivation. Yes, metadata describe the entity classes and attributes of an entity-relationship model, and the tables and columns by which these are implemented in a computer system. They also provide, however, structure for describing the activities of the organization and the computerized processes that implement these activities. They describe who has access to data, and why. They describe the types of events and responses that are the nature of an organizationā€™s activities. They describe where the data and processes are, and they describe the motivation and business rules that drive the entire thing. So, from all of this comes the following definition of metadata.
Metadata are the data that describe the structure and workings of an organizationā€™s use of information, and which describe the systems it uses to manage that information.
One anomaly has revealed itself in the line between business data and metadata. The information about what constitutes a legal value for a product category or an account type in the business model is often captured in separate reference tables. To reflect these validation structures, a typical data model often has many ā€œtypeā€ entity classes (ACCOUNT TYPE, STATUS, DAY OF THE WEEK, and so on) describing legal values for attribu...

Table of contents