Structured Query Language (SQL) is the language understood by DBMS for defining and manipulating data in a database. The statements used in SQL are non procedural, i.e., SQL statements require a user to specify what to be done on data without specifying how to get the work done on the data.
The statements provided in SQL can be categorized into four major groups, namely:
- Data Definition Language (DDL)
- Data Manipulation Language (DML) and
- Data Control Language (DCL)
- Transaction Control Language (TCL)
Data Manipulation Language (DML):
- Procedural DMLs require a user to specify what data are needed and how to get those data.
- Declarative DMLs (also referred to as non procedural DMLs) require a user to specify what data are needed without specifying how to get those data.
- Insertion of new information into the database
- Retrieval of information stored in the database
- Modification of information stored in the database
- Deletion of information from the database
A query is one of the DML statements used for requesting the retrieval of information from a database. A query takes as input several tables (possibly only one) and always returns a single table.
Here is an example of an SQL query that finds the names of all instructors in the History department:
select instructor.name from instructor where instructor.dept_name = "History";
The query specifies that those rows from the table instructor where the dept_name is History must be retrieved, and the name attribute of these rows must be displayed. More specifically, the result of executing this query is a table with a single column labeled name, and a set of rows, each of which contains the name of an instructor whose dept_name, is History.
Queries may involve information from more than one table. For instance, the following query finds the instructor ID and department name of all instructors associated with a department with budget of greater than Rs. 95,000:
select instructor.ID, department.dept_name from instructor, department where instructor.dept_name = department.dept_name and department.budget > 95000;
Data Models:
Underlying the structure of a database is the data model: a collection of conceptual tools for describing data, data relationships, data semantics, and consistency constrains. A data model provides a way to describe the design of a database at the physical, logical and view levels. The following are some of the data models used in database systems for describing and organizing data:
- Network Data Model
- Hierarchical Data Model
- Relational Data Model
- Object Data Model
Historically, the network data model and hierarchical data model preceded the relational data model. These models were tied closely to the underlying implementation, and complicated the task of modeling data. The relational data model is the most widely used data model, and a vast majority of current database systems are based on the relational model.
Relational Data Model uses a collection of tables to represent both data and the relationships among those data. Each table has multiple columns, and each column has a unique name. Tables are also known as relations.
The relational model is an example of a record-based model. Each table in the relational database contains records of a particular type. Each record type defines a fixed number of fields, or attributes. The columns of the table correspond to the attributes of the record type.
Object Oriented Programming (OOP), especially programming using C++ and Java has become the dominant software-development methodology. This led to the development of an object-oriented data model that allows data to be stored as objects in the database.