Relational Databases & Database Design

Database Systems Chapters - This article is part of a series.

Part 2: This Article

Relational Database
#

A database that represents data in tables, where those tables are connected by the same/common values in related columns.

Database components:

Table
Column/attribute
Row/tuple
Domain

flowchart LR
%% DBMS container holding tables
subgraph DBMS [DBMS]
direction TB
T1[Book Table]
T2[Member Table]
T3[Loan Table]
T4[Return Table]
end

    %% Client PC entities
    PC1(Client PC 1)
    PC2(Client PC 2)
    PC3(Client PC 3)

    %% Connection lines between DBMS and PCs
    DBMS --- PC1
    DBMS --- PC2
    DBMS --- PC3

    %% Styling for neatness
    style DBMS fill:none,stroke-width:2px

Schema:

Book Table: book code (3), book title (20)
Member Table: member code (3), member name (25)
Loan Table: loan code (5), loan date (date), member code (3)
Return Table: return code (5), return date (date), member code (3)

Member Table:

Member Code	Name
A01	Surya
A02	Fitri
A03	Syahrur

Book Table:

Book Code	Title	Stock
B01	C++ Programming	10
B02	Building an Application in 30 Minutes	15
B03	Cooking is Easy	15

Schema:

Book Table: book code (3), book title (20)
Member Table: member code (3), member name (25)
Loan Table: loan code (5), loan date (date), member code (3)
Return Table: return code (5), return date (date), member code (3)

1. Table
#

A table has a name and consists of rows and columns. Tables in a database must not have the same name (unique). A table is also called a Relation or File.

In the diagram there are 4 tables: member table, book table, loan table, and return table.

2. Column/Attribute
#

A column has a name. Columns in a table must not have the same name. The order of names may be arbitrary and does not affect the meaning of the table.

Another name for a column is Field or Attribute.

In the table, columns are member name, book title, member code, book code, etc.

3. Domain
#

A domain is a collection of values that can be stored in one or more columns. A domain may belong to one or more columns, but a column has only one domain. Because a domain restricts and controls the values that can be stored, it is called a domain constraint.

In the diagram, the member code column contains only 3 values, namely “A01”.

4. Row
#

A row contains the data of an object. A row in a table must be unique, can be placed in any order, and does not affect the meaning of the table. A row is also called a Record or tuple.

In the member table it can store three objects (that is, three member records) in three tuples.

Relational Keys
#

A relational key is the identification of one or more columns whose values can uniquely distinguish tuples.

Five relational keys:

Superkey
Candidate key
Primary key
Alternate key
Foreign key

Member Table

Member Code	Name
A01	Surya
A02	Fitri
A03	Syahrur

Return Table

Return Code	Loan Code
KM01	PJ01
KM02	PJ02

Book Table

Book Code	Title	Stock
B01	C++ Programming	10
B02	Building an Application in 30 Minutes	15
B03	Cooking is Easy	15

Loan Table

Loan Code	Loan Date	Book Code	Member Code	Quantity	Return Date
PJ01	10-01-2019	B01	A01	1	13-01-2019
PJ01	10-01-2019	B02	A01	1	13-01-2019
PJ01	10-01-2019	B03	A01	1	13-01-2019
PJ02	12-01-2019	B02	A02	1	14-01-2019
PJ02	12-01-2019	B03	A02	1	14-01-2019

1. Superkey
#

A superkey is a single or group of columns whose values uniquely distinguish tuples in a table.

Each table has more than one superkey, namely:

Member table: member code, member name
Book table: book code, title, stock
Loan table: loan code, loan date, book code, member code, quantity, return date
Return table: return code, loan code

Member table:

Column member code,
Column invoice number and column member name

Book table:

Column book code
Combination of book code, title, stock.

2. Candidate Key
#

A candidate key is a superkey such that no proper subset of it is also a superkey. Not all superkeys are candidate keys. A candidate key that consists of two or more columns is called a composite key.

Each table has candidate keys and non-candidate keys, namely:

Member table:

Column member code: candidate key
Column invoice number and column member name: not a candidate key

Book table:

Column book code: candidate key
Combination of book code, title, stock: not a candidate key

Loan table:

Columns loan code, book code, member code: candidate key
Combination of loan code, book code, member code, quantity, etc.: not a candidate key

3. Primary Key
#

A primary key is the selected candidate key (from among other candidate keys) that uniquely distinguishes tuples in a table. If a table has only one candidate key (for example, the member table and the book table), that key becomes the primary key. But if there is more than one candidate key (for example, the sales table and the return table), then one of those candidate keys may be chosen as the primary key.

4. Alternate Key
#

An alternate key is a candidate key that is not chosen as the primary key. For example, in the return table, if we choose the return code as the primary key, then the loan code can be an alternate key.

5. Foreign Key
#

A foreign key is one or more columns whose values are the same as or related to the candidate key in another table or the same table.

For example, in the loan table there is a member code column that connects to the member table, so member code is a foreign key. These related columns are very important in join operations.

Table Schema (Relation Schema)
#

A table schema is the basic information describing a table, consisting of the table name and a set of column-domain pairs.

Example: Member Table Schema
Member Table (member code, name)

Database Schema (Relational Database Schema)
#

A database schema is a collection of table schemas where each table has a different name.

Example: Library Database Schema

Member table (member code, name)
Book table (book code, title, stock)
Loan table (loan code, loan date, book code, etc.)

Integrity Constraints
#

We have discussed domain constraints. There are four other constraints that maintain the integrity of data stored in a database:

Null
Entity integrity
Referential integrity
General constraints

1. Null
#

Null is a value in a column (tuple) that is still unknown. This can mean that the value cannot be applied to that column.

However, null is not the same as the numeric value zero or the string “-”; zero and whitespace are values, while null indicates the absence of a value.

For example, if a table has data that is not yet known, it may be written as null, but this does not apply to the primary key. Because the primary key column must be unique, if the primary key stores null then the uniqueness property of that column is lost because multiple tuples could have null values.

2. Entity Integrity
#

Entity integrity is the constraint or rule that primary key columns must not store null.

As explained earlier, the primary key is used to uniquely define a tuple.

3. Referential Integrity
#

Referential integrity is the constraint that if a table has a foreign key column, then the value in that foreign key must match the value of the candidate key column, and if not, the foreign key may be written as null.

There are two cases where null is not allowed:

When the column is constrained to not allow null.
When the column is also part of the primary key.

4. General Constraints
#

General constraints are additional rules established by users or database administrators according to the policies and restrictions of an organization.

Example:

Book loans are not allowed if only one copy of the book is in stock.
If a member still has a book that has not been returned, they are not allowed to borrow again.

Database Design
#

The database creation process consists of two main stages:

Analysis and design stage
Implementation stage

1. Analysis Stage
#

The analysis stage is the mapping or modeling of the real world using a specific database design notation, as well as creating an implementation description for the database.

flowchart TD
%% Style definitions for step boxes
classDef langkah stroke-width:1px;

    %% Stage 1: Conceptual (Left to Right)
    subgraph Concept [Database Design Conceptually]
        direction LR
        L1["Step 1: Discovery and
Fact Analysis
• Identify organizational units
• Discover facts
• Summarize discovered facts"]:::langkah
        L2["Step 2: Create ERD
• Determine entity types
• Determine entity relationships
• Determine multiplicity
• Determine attributes
• Draw the ERD
• Refine the ERD structure"]:::langkah

        L1 ==> L2
    end

    %% Stage 2: Logical (Top to Bottom)
    subgraph Logic [Database Design Logically]
        direction TB
        L3["Step 3: Map the ERD
• Map entity types
• Map entity relationships
• Map multi-valued attributes"]:::langkah
        L4["Step 4: Normalize
Table Structure
• Check table normality
• Normalize tables
• Validate tables"]:::langkah

        L3 ==> L4
    end

    %% Stage 3: Physical (Right to Left)
    subgraph Physical [Database Design Physically]
        direction RL
        L5["Step 5: Create
Database Schema Definition
• Table encoding and compression
• Choose DBMS
• Define base tables
• Handle derived data"]:::langkah
        L6["Step 6: Design File and
Index Organization
• Analyze user transactions
• Determine indexes
• Create index documentation and DDL"]:::langkah

        L5 ==> L6
    end

    %% Stage 4: Implementation
    subgraph Implementation [Implementation Stage]
        L7["Step 7:
Implement DDL
• Use command line
• Use graphical interface"]:::langkah
    end

    %% Connecting the stages (Snake flow)
    L2 ==> L3
    L4 ==> L5
    L6 ==> L7

    %% Styling group borders for neatness

The analysis and design stages are divided into three parts:

Conceptual database design
The process of creating a data model that does not depend on the physical aspects of the database.
Logical database design
The process of creating a data model based on a specific data model, but not dependent on a particular DBMS or physical implementation.
Physical database design
The process of creating an implementation description for the database on secondary storage (disk). This description explains the base tables, file organization, indexes for efficient data access, all integrity constraints, and security measures.

2. Implementation Stage
#

This stage implements the database design that has been created. The implementation uses client applications provided by the selected DBMS.

Database Systems Chapters - This article is part of a series.

Part 1: Basic Database Concepts

Part 2: This Article

Part 3: Data Model

Relational Database#

1. Table#

2. Column/Attribute#

3. Domain#

4. Row#

Relational Keys#

1. Superkey#

2. Candidate Key#

3. Primary Key#

4. Alternate Key#

5. Foreign Key#

Table Schema (Relation Schema)#

Database Schema (Relational Database Schema)#

Integrity Constraints#

1. Null#

2. Entity Integrity#

3. Referential Integrity#

4. General Constraints#

Database Design#

1. Analysis Stage#

2. Implementation Stage#

Related