eBook - ePub

A Practical Guide to Database Design

Name: A Practical Guide to Database Design
ISBN: 9781351265461

Rex Hogan,

414 pages
English
ePUB (mobile friendly)
Available on iOS & Android

eBook - ePub

A Practical Guide to Database Design

Rex Hogan,

About this book

Fully updated and expanded from the previous edition, A Practical Guide to Database Design, Second Edition is intended for those involved in the design or development of a database system or application. It begins by illustrating how to develop a Third Normal Form data model where data is placed "where it belongs". The reader is taken step-by-step through the Normalization process, first using a simple then a more complex set of data requirements. Next, usage analysis for each Logical Data Model is reviewed and a Physical Data Model is produced that will satisfy user performance requirements. Finally, each Physical Data Model is used as input to create databases using both Microsoft Access and SQL Server.

The book next shows how to use an industry-leading data modeling tool to define and manage logical and physical data models, and how to create Data Definition Language statements to create or update a database running in SQL Server, Oracle, or other type of DBMS.

One chapter is devoted to illustrating how Microsoft Access can be used to create user interfaces to review and update underlying tables in that database as well as tables residing in SQL Server or Oracle.

For users involved with Cyber activity or support, one chapter illustrates how to extract records of interest from a log file using PERL, then shows how to load these extracted records into one or more SQL Server "tracking" tables adding status flags for analysts to use when reviewing activity of interest. These status flags are used to flag/mark collected records as "Reviewed", "Pending" (currently being analyzed) and "Resolved". The last chapter then shows how to build a web-based GUI using PHP to query these tracking tables and allow an analyst to review new activity, flag items that need to be investigated, and finally flag items that have been investigated and resolved. Note that the book has complete code/scripts for both PERL and the PHP GUI.

Tools to learn more effectively

Saving Books

Keyword Search

Annotating Text

Listen to it instead

Information

Publisher

Year

Print ISBN

eBook ISBN

Topic

Subtopic

Index

CHAPTER 1

Overview of Databases

1.1 WHAT’S A “DATABASE”?

Today, generally everyone uses a computer in one form or another.

• Home-based computers are frequently used for managing a personal business, update spreadsheets, or complete school assignments. Others use them for email, social interaction with friends and family members, monitoring the Internet for news, or for entertainment.

• Owners of small businesses use spreadsheets and/or software products such as QuickBooks to keep track of personal or business expenses.

• Office environments must gather and store and manage information for a wide range of topics or subjects, such as customers or clients, appointments, or customer orders.

• Business environments must manage a much wider scope of data regarding the information and data needed to run or manage the business.

• Users using computers in government offices need computers to manage their jobs. For those working as analysts in the Department of Defense (DOD) or in the Intelligence Community, the nature of the job is continually expanding, requiring analysts to monitor or track new information or data as it becomes available. Analytical teams continually face the responsibility of analyzing new and evolving forms of information to identify and extract information of relevance using software tools available to them. Often, that means having not much more than the desktop Microsoft Office products ranging from Excel to Microsoft Access.

As the data needed by the user or customer community grow in size, complexity, and importance, the care and feeding of that data requires the use of a database management system (DBMS) to store, manage, and protect it.

A DBMS¹ is a special software package that is designed to define and manage data within one or more databases. Individual databases, in turn, manage the definition of data objects/tables in a given subject area and provide controlled user access to that data.

Examples of DBMSs include Structured Query Language (SQL) Server, Oracle, and Microsoft Access. An SQL Server or Oracle instance would then serve as host to, for example, a personnel database.

1.2 GUARANTEED ACCURACY AND AVAILABILITY OF DATA

A DBMS is, by its very nature, built to guarantee the accuracy and availability of data as updates occur. Updates are bundled as application transactions² that apply all data updates within a logical unit of work³ associated with that application. These updates must be made on an all or nothing basis; either all the updates are applied, or, if a logical or database error occurs, none of the updates are applied, leaving all of the data in a clean consistent state from the user and application perspective.

The application software updating the database issues commands to the database to start a unit of work. If all updates complete successfully, a commit call is issued to make those updates permanent. If in the process of making those updates some condition is found that prevents the update from occurring, a rollback call is made to reverse any updates and put the data back in a logical state representing the data at the beginning of the transaction.

For example, a user might log on to their banking system and start an update to move funds from their savings to checking accounts.

• After logging in and starting the transfer, the software performing the updates first issues a database update to debit the savings account for the specified amount.

• If that update is successful, it issues an update to credit the checking account by that amount.

• Upon successful completion, a commit call is issued to commit the changes and release database locks on the rows being updated. An appropriate message would be sent to the user confirming that the funds transfer was completed.

• If, however, the update to the checking account failed (e.g., the user entered the wrong savings account number), a rollback call would be made to reverse all updates made, and an appropriate error message would be sent to the user. As a result, the database and the underlying data are left in a clean, consistent state.

The ACID⁴ properties (atomicity, consistency, isolation, and durability) of database systems and transactions guarantee the accuracy and availability of data.

1.2.1 Atomicity

The atomicity is the all or nothing requirement when making updates. Either all updates made during the unit or work succeed or no updates are made. This protection includes updates in a unit of work or transaction, device input/output errors, network errors, and power failures.

1.2.2 Consistency

Consistency requires that transactions take the database from one valid state to another. Any and all updates must conform and enforce any referential integrity⁵ constraints defined. (Referential integrity constraints define and control any one-to-many relationships between tables in a database.)

1.2.3 Isolation

Isolation of database updates involves mechanisms that enable multiple concurrent users to simultaneously access and update the same data elements within a database.

As database updates occur, locks are transparently placed on updated rows that prevent subsequent users to access or update those rows until the updating process commits those updates and the locks are released. Any processes requesting access to rows being updated are held/delayed until the updater’s commit point is made.

1.2.4 Durability

This feature/requirement ensures that any updates made by a transaction (i.e., a unit of work completed and updates committed) will survive a subsequent system error or problem, for example, a system failure or a power or disk failure.

Database systems have mechanisms/features that support a full database backup. In addition, database systems log updates to nonvolatile devices (a database log file) as updates are made to the database. If/When necessary, a database can be rebuilt/recovered totally by first using the database backup to recover all data to the point the backup was made, then using the database log to reapply all updates made to the database after that point in time. This subject is covered in more detail in Section 1.6.

1.3 DYNAMIC ALTERATION OF DESIGN

Relational database management system (RDBMS) represent the third generation of DBMS products. As one of their key features, these products give the user the ability to dynamically add or drop columns to data or make other changes live while the database is online and being updated by users. That provides a significant change over the second-generation hierarchical systems that had to be taken down and modified off-line to apply changes. Third-generation systems include products such as SQL Server, Oracle, and Microsoft Access.

Note that MySQL is touted as an RDBMS and it has many relational-like features. However, it has significant limitations that, in my opinion, prevent it from being classified as a true RDBMS.

For example, each table in MySQL is implemented as a flat file with indexes as needed to support data retrieval. If/When any changes are required, for example, a column is to be added, MySQL creates a new temporary table with the new column, copies all records from the original file to the new, and then deletes and renames the old and new files accordingly.

In a former role, I prototyped a MySQL implementation for a data collection application running a UNIX (Solaris) server. As the prototype progressed, it was no surprise to find that I needed to add new columns to the MySQL table to help track information about what was being collected. I found that the time requirements to make changes to a MySQL table with a million rows were anything but transparent.

As a work around, I then made what I hoped was a one-time modification to the table adding spare columns (Spare1, Spare2, Spare3, etc.) with the plan of renaming these columns if/when needed to reflect application-specific, meaningful names. That helped, but even then I found that MySQL required/used too much overhead for managing large tables.

The ability to dynamically change table definitions can, in most products, be made using that product’s database administrator (DBA) graphical user interface, or by working at the command line by issuing commands using the product’s data definition language (DDL). The DBA user interface is much easier and quicker to use, but when supporting mission-critical applications, change management procedures are used to control updates across multiple environments and platforms, each with their own copy and version of the application database.

• A Development platform is used to design, develop, and test individual software components and tables within a database.

• Incremental changes are made by manually running DDL changes at the command prompt.

• All incremental changes are accumulated as they are applied, creating a change package with all modifications needed for that release.

• When all changes have been made and tested for a software release, a Test platform is used.

• With the test system database configured for the older software release, the change package is applied and the software release is tested to ensure all updates have been correctly applied and the software works as intended.

• If errors ar...

Cover
Half Title
Title Page
Copyright Page
Dedication
Table of Contents
Introduction
Author
CHAPTER 1 ■ Overview of Databases
CHAPTER 2 ■ Data Normalization
CHAPTER 3 ■ Database Implementation
CHAPTER 4 ■ Normalization and Physical Design Exercise
CHAPTER 5 ■ The erwin Data Modeling Tool
CHAPTER 6 ■ Using Microsoft Access
CHAPTER 7 ■ Using SQL Server
CHAPTER 8 ■ Using Perl to Extract and Load Data
CHAPTER 9 ■ Building User Interfaces
CHAPTER 10 ■ Creating the University Database Application
CHAPTER 11 ■ PHP Implementation and Use
APPENDIX A: WARNING MESSAGES
APPENDIX B: ERROR MESSAGES
APPENDIX C: UNIVERSITY DDL
APPENDIX D: SEARCH FOR TERMS
APPENDIX E: SQL SERVER LOG CHECK
INDEX

Frequently asked questions

Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription

No, books cannot be downloaded as external files, such as PDFs, for use outside of Perlego. However, you can download books within the Perlego app for offline reading on mobile or tablet. Learn how to download books offline

Perlego offers two plans: Essential and Complete

Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.

Both plans are available with monthly, semester, or annual billing cycles.

We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 990+ topics, we’ve got you covered! Learn about our mission

Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more about Read Aloud

Yes! You can use the Perlego app on both iOS and Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app

Yes, you can access A Practical Guide to Database Design by Rex Hogan in PDF and/or ePUB format, as well as other popular books in Computer Science & Data Mining. We have over one million books available in our catalogue for you to explore.

About this book

Tools to learn more effectively

Information

Table of contents

Frequently asked questions