The Project | About Us | Contribute | Donations | License

HOME

Normalization is a systematic approach of organizing data in a database to reduce redundancy and improve data integrity. The process involves dividing large tables into smaller, more manageable pieces while ensuring that the relationships between the data are preserved. This section will cover the steps involved in the normalization process, from understanding the basic concepts to applying them in practical scenarios.

Objectives

Understand the goals of normalization.
Learn the steps involved in the normalization process.
Apply normalization techniques to design efficient database schemas.

Goals of Normalization

Normalization aims to:

Eliminate Redundant Data: Reduce data duplication to save storage space and improve data consistency.
Ensure Data Integrity: Maintain the accuracy and consistency of data over its lifecycle.
Simplify Queries: Make the database easier to query and maintain.

Steps in the Normalization Process

Step 1: First Normal Form (1NF)

A table is in 1NF if:

All columns contain atomic (indivisible) values.
Each column contains values of a single type.
Each column has a unique name.
The order in which data is stored does not matter.

Example: Consider a table storing information about students and their courses:

StudentID	StudentName	Courses
1	Alice	Math, Science
2	Bob	English

To convert this table to 1NF, we need to ensure that each column contains atomic values:

StudentID	StudentName	Course
1	Alice	Math
1	Alice	Science
2	Bob	English

Step 2: Second Normal Form (2NF)

A table is in 2NF if:

It is in 1NF.
All non-key attributes are fully functional dependent on the primary key.

Example: Consider the 1NF table:

StudentID	StudentName	Course
1	Alice	Math
1	Alice	Science
2	Bob	English

To convert this table to 2NF, we need to remove partial dependencies:

StudentID	StudentName
1	Alice
2	Bob

StudentID	Course
1	Math
1	Science
2	English

Step 3: Third Normal Form (3NF)

A table is in 3NF if:

It is in 2NF.
All the attributes are functionally dependent only on the primary key.

Example: Consider the 2NF tables:

StudentID	StudentName
1	Alice
2	Bob

StudentID	Course
1	Math
1	Science
2	English

If there are no transitive dependencies, the tables are already in 3NF. If there were additional attributes that depended on non-primary key attributes, we would need to further decompose the tables.

Step 4: Boyce-Codd Normal Form (BCNF)

A table is in BCNF if:

It is in 3NF.
For every functional dependency (X → Y), X should be a super key.

Example: Consider a table with the following structure:

StudentID	Course	Instructor
1	Math	Dr. Smith
1	Science	Dr. Jones
2	English	Dr. Brown

If a course can only be taught by one instructor, we have a dependency Course → Instructor. To satisfy BCNF, we need to decompose the table:

Course	Instructor
Math	Dr. Smith
Science	Dr. Jones
English	Dr. Brown

StudentID	Course
1	Math
1	Science
2	English

Practical Exercise

Exercise 1: Normalize the Following Table to 3NF

OrderID	CustomerName	CustomerAddress	ProductID	ProductName	Quantity
1	John Doe	123 Elm St	101	Laptop	1
2	Jane Smith	456 Oak St	102	Tablet	2
1	John Doe	123 Elm St	103	Mouse	1

Solution:

1NF:

OrderID	CustomerName	CustomerAddress	ProductID	ProductName	Quantity
1	John Doe	123 Elm St	101	Laptop	1
2	Jane Smith	456 Oak St	102	Tablet	2
1	John Doe	123 Elm St	103	Mouse	1

2NF:

OrderID	CustomerID	ProductID	Quantity
1	1	101	1
2	2	102	2
1	1	103	1

CustomerID	CustomerName	CustomerAddress
1	John Doe	123 Elm St
2	Jane Smith	456 Oak St

ProductID	ProductName
101	Laptop
102	Tablet
103	Mouse

3NF: The tables are already in 3NF as there are no transitive dependencies.

Summary

In this section, we covered the normalization process, including the goals of normalization and the steps to achieve different normal forms. By following these steps, you can design efficient and reliable database schemas that minimize redundancy and maintain data integrity.

Normalization Process

Objectives

Goals of Normalization

Steps in the Normalization Process

Step 1: First Normal Form (1NF)

Step 2: Second Normal Form (2NF)

Step 3: Third Normal Form (3NF)

Step 4: Boyce-Codd Normal Form (BCNF)

Practical Exercise

Exercise 1: Normalize the Following Table to 3NF

Summary

Fundamentals of Databases

Module 1: Introduction to Databases

Module 2: Relational Databases

Module 3: Non-Relational Databases

Module 4: Schema Design

Module 5: Normalization

Module 6: Practical Exercises

Module 7: Case Studies

Module 8: Additional Resources