Introduction to Database Management Systems
Introduction to Database Management Systems
Introduction to Database
Management Systems
repository for data. The
modern application, serving as a structured
A database is a crucial component of any maintain information in an organized, efficient, and
manage and
primary objcctive of a database system is to distinguíshes them from
fundamental concepts of databases,
reliable manner. This chapter introduces thecharacteristics and advantages of using a Database Managernent
traditional file systems, and explores the core relational model, it is
most prevalent type of database, the
System (DBMS). While we will focus on the journey
The history of database systems is a fascinating
important to understand that other models exist. effectively.
ever-increasing volumes of data more
of innovation, driven by the need to handle
Systems
6.1.1 A Brief History of Database to complex, highly
database systems is a story of moving from simple, unstructured storage
The evolution of
structured, and scalable systems,
days of computing, data was stored
Systems (1950s - 1960s): In the early its own programs to manage them.
" Early File Pro cessingapplication had its own set of data files and
in simple flat files, Each inconsistency, and difficulty in managing data
access.
This led to nassive data redundancy,
23
Course: Database Management Systemns Total Lectures: 45-50
" The Hierarchical and Network Models (19608 - 1970s): To address the limitations of file sys
tems, the first generation of DBMSS was developed, The hierarchical model, pioneered by TBM's
Information Management System (IMS), organized data in a tree-like structure. The network model,
developed by the Coference on Data Systems Languages (CODASYL), was a more flexible exten
sion that allowed a child record to have multiple parent records. These models provided better data
management but were complex to implement and maintain.
" The Relational Model (1970s): The modern era of database systems began with the publication
of the seminal paper "A Relational Model of Data for Large Shared Data Banks" by Edgar F. Codd
while he was at IBM. Codd's model introduced a simple, yet powerful, concept: organizing data into
two-dimensional tables. This model was based on a sound mathematical theory of relations and was a
major leap forward in data independence and query flexibility.
Acrucial igure in clariflying and formalizing Codd's work was C. J. Date. An influential author
and researcher, Date's books, particularly "An Introduction to Database Systems," became standard
texts for generations of database professionals. He played a key role in explaining the theoretical
underpinnings of the relational model and its practical application. In his later work, he, along with
Hugh Darwen, also proposed "The Third Manifesto," which aimed to correct perceived deviations from
Codd's original vision.
" Rise of Relational DBMS (1980s): The 1980s saw the comnercialization of Codd's ideas. Oracle
Corporation, founded by Larry Ellison, led this charge by ereating one of the first connercial rela
tional database systems. Other companies, including IBM with its DB2 product, followed suit. The
development of the Structured Query Language (SQL) became the standard for interacting with these
databases.
During this period, the theory of relational database design also saw major developments. William W.
Armstrong made a foundational contribution with his 1974 paper, "Dependency Structures of Data
Base Relations." In this work, he formalized the concept of functional dependencies and provided
a set of inference rules, now known as Armstrong's Axioms. These axions are fundamental to the
theory of normalization, which is a process for designing tables to reduce redundancy and improve data
integrity.
" Object-Oriented and Object-Relational Models (1990s): As applications became more complex.
dealing with rich data types like images and video, new models emerged. The Object-Oriented
Model attempted to bridge the gap between databases and object-oriented progranming. The Object
Relational Model was a hybrid approach, adding object-oriented features
to relational databases.
internet and social media,
" NoSQL and Big Data (2000s - Present): With the explosion of the
"Big Data" challenge of massive, unstructured
a new generation of databases emerged to handle the databases, which often
(Not only SQL) was coined to describe these
data volumes. The term NoSQL scalability and
of horizontal
sacrifice some traditional relational features like strong consistency in favor
databases (like MongoDB), and graph
high availability. Examples include key-value stores, document
databases.
Historical Development of Database Systemns
evolving from rudimentary fle-based
The development of database systems spans multiple decades, databases. This evolution has been
managenent to highly sophisticated distributed and analytical
growing complexity of applications
driven by technological advances, increasing data volumes, and the provides a detailed historical
industrial domains, The following
across scientific, commercial, and types of database structures, and
account, highlighting key milestones, contributors, organizations,
underlying mathematical models,
technology initially emerged to
1960s: Early File-Based and Navigational Systems Database Hierarchical and network models were
address limitations of conventional file-processing systems, Organizations such as IBM and
structured records.
developed to efficiently store, organize, and access
CODASYL (Conference on Data Systems Languages) were pioneers in this era.
24
Course: Database Managenent Systems Total Lectures; 45-50
" Hierarchical Databases: IBM's IMS (Information Management System, 1966) implemented
a tree-like structure where each record type had a single parent and multiple children, providing
cfficient navigational access.
" Network Databases: Charles Bachman developed the Integrated Data Store (|DS), formal
ized later as the CODASYL network model, enabling more flexible many-to-many relationships
between records.
" Mathematical Models: Data relationships were represented using basic set theory and
pointer-based structures, forming the foundation for early query traversal methods and record
linking algorithms.
1970s: Relational Model The relational paradigm, introduced by Edgar F. Codd at IBM in 1970,
provided a formal, declarative approach to database management. By modeling data as relations
(tables) and using relational algebra and relational calculus, Codd established a rigorous foundation
for data integrity, consistency, and querying.
"Organizations: IBM Research and the University of California at Berkeley were key centers
for early research and prototype implementations.
" Database Systems: IBM's System R and UC Berkeley's INGRES became seminal projects
demonstrating the feasibility of relational databases.
" Mathematical Models: Relational algebra and relational calculus allowed precise reasoning
about queries, normalization, and constraint enforcement.
. Contributors: Edgar F. Codd, Raymond Boyce, and Donald D. Chamberlin were instrumental
in developing the theoretical and practical frameworks for relational databases, including the
SQL language.
1980s: Commercial Relational DBMS During this decade, relational databases matured into
robust, commercially viable systems. Widespread adoption was facilitated by standardized query
languages and reliable transaction support.
Commercial Systems: Oracle Database, IBM DB2, Informix, and Sybase introduced
enterprise-grade relational database systems.
. Persons: Donald D. Chamberlin and Raymond F. Boyce, architects of SQL at IBM, played a
pivotal role in standardizing query languages.
. Features: Support or ACID transactions, indexing, views, stored procedures, triggers, and
referential integrity became standard components of relational DBMS.
1990s: Object-Oriented and Object-Relational Extensions The need to manage complex data
types such as multimedia, engineering models, and spatial information led to the emergence of object
oriented DBMS (0ODBMS) and object-relational extensions.
"Organizations: Versant, ObjectStore, and Oracle (with its object-relational extensions) were
leaders in this development.
" Mathematical Models: Type theory, object caleulus, and formal inheritance hierarchies un
derpinned the object-relational mapping and the extension of relational algebra to handle com
plex objects.
. Contributors: Pioneers such as Michael Stonebraker (Ingres, Postgres) and David Maier con
tributed significantly toobject-relational and active database concepts.
2000s: Web, XML, and Semi-Structured Data The explosive growth of the internet and web
applications required databases capable of handling semi-structured and hierarchical data formats like
XML and JSON.
25
Total Lectures: 45-60
Course: Database Management Systems
Relational Model
6.1.2 Core Concepts of the its core
understand
databases, it's essential to
foundation for most modern
Since the relational model is the
terminology.
commonly referred to as a table.
It is a set of records with
formal term for what is composed of rows
" Relation: The two-dimensional structure
structure, defined by a schema. A relation is a
the same
and columns.
domains D,, D2,.., Dn,
subset of the Cartesian product of a set of a
Mathematically, a relation R is a
represents the set of all possible values for
D,: Each domain D,
written as RC D, x D x x
specific attribute.
possible names),
domains Dstudent ID (all possible student IDs), DName (all Cartesian prod
For example, consider the be a subset of the
A relation 'STUDENT' would an ordered list of values, such as
and DMajor (all possible majors).
DMajor: A tuple in this
relation is
uct Dstudent| DX DName X
(101, 'Alice', 'Computer Science').
26
Total Lectures: 45-50
Course: Database Management Systems
Attribute
27
Course: Database Management Systems Total Lectures: 45-50
28
Course: Database Management Systems Total Leetores: 45-50
Student Filo)
Application 1 Applieation D)
Course File
DBMS) Databa)
Student File)
Application 2 [Application 2)
(Financial Aid File)
File System Approach Database System Approach
1. Self-Describing Nature: A DBMS not only contains the database itself but also(data a complete def
constraints, This metadata about data)
inition or description of the database structure and
or System Catalog. The catalog contains
is stored in the DBMS catalog, or data dictionary, keys, foreign keys, and access
information about relation definitions, data types of attributes, primary
schema information
privileges. This allows the DBMS toprocess queries without needing to hard-code
into application programs.
of data files is stored in the DBMS catalog
2. Insulation between Programs and Data: The structure
concept known as data independence.
separately from the access programs. This is a fundamental
There are two types:
conceptual schema (the logical view af
" Logical Data Independence: The ability to change the For example, if you add a new attribute
the data) without having to change application programs. should not need to be modified. as it is
attribute
to a relation, a prograrn that doesn't use that
insulated from the change.
internal schema (the physical storage
"Physical Data Independence: The ability to change the
For example, you could nove a database
of data) without having to change the conceptual schema.a heap file to a B+ tree without affecting
to a different disk or change the file organization from
the application programs that access the data.
be used by various user groups, each of
3. Support for Multiple Views of Data: A database can provides a mechanism to define multiple
DBMS
which may need a different view of the same data. The
different users, customizing the data they see. A student might have a view that shows their
views for have a different view that shows all courses,
enrolled courses and grades, while an administrator might
without exposing sensitive information.
enroll1ments, and instructor assignments,
access of data by multiple users, ensuring
4. Sharing of Data: The DBMS manages the concurrent mnlti-user
data inconsistencies. This is essential for
that these interactíons do not cause conflicts or
simultaneously. A transaction is a
environments where multiple transactíons might be running
ensure data integrity. Concurrency
logical unit of work that must be completed in its entirety toanother's work.
corrupting
control nechanis1ns prevent one user's changes from
By centrally managing data, the DBMS
5. Control over Data Redundancy and Inconsistency: data inconsistencies, which occur when redundant
ninimizes data redundancy,. This helps to prevent
data is not updated properly, A centralized database [Link] problem by ensuring that each
piece of data is stored only once, making it the single source of
29
Course: Database Management Systens Total Lect ures: 45-50
Users/Programmers
Database
System
Application Programs/Queries
DBMS
Software Software to Process
Queries/Programs
Software to Access
Stored Data
Stored Database
Stored Database
Definition
(Meta-Data)
30
Course: Database Management
Systems Total Lectures: 45-50
6.5 Questions
6.5.1 Short Answer Questions
and its instance?
Q1. What is the fundarnental difference between a database's schema
Q1l. What are the primary types of database languages, and what is the purpose of cach?
a Database Ad1ninistrator!
Q12. What is the role of a Database Desiguer, aud how do they differ from
and a single file view in a file-processing
Q13. Explain the difference between multiple views of the data
system.
diference between a "relation" and a "relation
scherna"?
Q14. In the relational model, what is the
integrity constraints in the relational model?
Q15. What are the primary types of
32