PostgreSQL

From Wikipedia, the free encyclopedia

PostgreSQL
Developer: PostgreSQL Global Development Group
Latest release: 8.2 / December 5, 2006
OS: Cross-platform
Use: RDBMS
License: BSD
Website: www.postgresql.org

PostgreSQL is a free object-relational database server (database management system), released under a flexible BSD-style license. It offers an alternative to other open-source and proprietary database systems. Like other open-source projects such as Apache, Linux, and MediaWiki, PostgreSQL is not controlled by any single company, but relies on a global community of developers and companies to develop it.

PostgreSQL's unusual-looking name makes some readers pause when trying to pronounce it, especially those who pronounce SQL as "sequel". PostgreSQL's developers pronounce it /poːst ɡɹɛs kjuː ɛl/. It is also commonly abbreviated to simply "postgres", which was its original name. The name refers to the project's origins as a "post-Ingres" database, the original authors having also developed the Ingres database.

Features

Functions

Functions allow blocks of code to be executed by the server. Although these blocks can be written in SQL, its lack of basic programming constructs, such as branching and looping, has driven the adoption of other languages inside functions. Some of these languages can even execute inside triggers. Functions in PostgreSQL can be written in several languages, including the native PL/pgSQL and others such as PL/Perl.

PostgreSQL supports row-returning functions, where the output of the function is a set of values which can be treated much like a table within queries.

Functions can be defined to execute with the privileges of either the caller or the user who defined the function. Functions are sometimes referred to as stored procedures, although there is a slight technical distinction between the two.

Indexes

User-defined index methods can be created, or the built-in B-tree, hash, and GiST index types can be used. Indexes in PostgreSQL also support the following features:

  • PostgreSQL can scan indexes backwards when needed, so a separate index is never required to support ORDER BY field DESC.
  • Expression indexes index the result of an expression or function, instead of simply the value of a column.
  • Partial indexes, which index only part of a table, can be created by adding a WHERE clause to the end of the CREATE INDEX statement. This allows a smaller index to be created.
  • Bitmap index scans are supported as of version 8.1. A bitmap scan reads multiple indexes, builds a bitmap of the tuples matching each index condition, and combines the bitmaps to find the tuples that match the full selection criteria. This provides a way of composing indexes together: on a table with 20 columns, the columns can be ordered in 20! ways within a single index covering all of them, which is far too many to actually create. With one index on each column, bitmap scans can compose arbitrary combinations of those indexes at query time, using each column that seems worth considering as a constraint.
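The idea behind composing single-column indexes with bitmaps can be sketched in a few lines of Python. This is only a toy model, not PostgreSQL's implementation: the sample rows and the helper names (build_index, bitmap, fetch) are hypothetical, and a Python integer stands in for the bitmap.

```python
# Toy model of composing single-column indexes with bitmaps.
# NOT PostgreSQL's implementation: rows and helper names are hypothetical.

rows = [
    {"id": 0, "color": "red",  "size": "L", "in_stock": True},
    {"id": 1, "color": "blue", "size": "L", "in_stock": True},
    {"id": 2, "color": "red",  "size": "S", "in_stock": False},
    {"id": 3, "color": "red",  "size": "L", "in_stock": False},
    {"id": 4, "color": "red",  "size": "L", "in_stock": True},
]


def build_index(column):
    """Map each value of `column` to the positions of the rows holding it."""
    index = {}
    for pos, row in enumerate(rows):
        index.setdefault(row[column], []).append(pos)
    return index


def bitmap(index, value):
    """Return an int whose bit `pos` is set when row `pos` holds `value`."""
    bits = 0
    for pos in index.get(value, []):
        bits |= 1 << pos
    return bits


def fetch(bits):
    """Visit only the rows whose bit is set, in physical order."""
    return [row for pos, row in enumerate(rows) if (bits >> pos) & 1]


# One index per column, built once.
color_idx = build_index("color")
size_idx = build_index("size")
stock_idx = build_index("in_stock")

# Roughly: WHERE color = 'red' AND size = 'L' AND in_stock
matches = fetch(
    bitmap(color_idx, "red") & bitmap(size_idx, "L") & bitmap(stock_idx, True)
)

# Roughly: WHERE color = 'blue' OR size = 'S'
either = fetch(bitmap(color_idx, "blue") | bitmap(size_idx, "S"))

print([r["id"] for r in matches])
print([r["id"] for r in either])
```

Three small indexes serve both queries here; no combined index on (color, size, in_stock) is needed, which is the point made about the 20-column table.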

Triggers

Triggers are fully supported and can be attached to tables but not to views; views can have rules, though. Multiple triggers defined for the same event are fired in alphabetical order by name. In addition to calling functions written in the native PL/pgSQL, triggers can also invoke functions written in other languages such as PL/Perl.

MVCC

PostgreSQL manages concurrency through a system known as Multi-Version Concurrency Control (MVCC), which gives each user a "snapshot" of the database, allowing changes to be made without being visible to other users until a transaction is committed. This largely eliminates the need for read locks, and ensures the database maintains the ACID principles in an efficient manner.
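The snapshot idea can be illustrated with a small Python model. It is a sketch under simplifying assumptions, not PostgreSQL's actual data structures: the class names (RowVersion, Transaction, Database) are hypothetical, each transaction keeps one snapshot for its whole lifetime, and conflicts between concurrent writers are ignored.

```python
# Toy sketch of snapshot visibility under MVCC.
# Hypothetical classes; not PostgreSQL's real storage format.
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class RowVersion:
    key: str
    value: str
    created_by: int                    # transaction that wrote this version
    deleted_by: Optional[int] = None   # transaction that superseded it, if any


class Transaction:
    def __init__(self, db, txid, committed_at_start):
        self.db = db
        self.txid = txid
        self.snapshot = committed_at_start  # frozen set of committed txids

    def _sees(self, txid):
        # A transaction sees its own work and whatever had committed
        # before its snapshot was taken.
        return txid == self.txid or txid in self.snapshot

    def _visible(self, version):
        created = self._sees(version.created_by)
        deleted = (version.deleted_by is not None
                   and self._sees(version.deleted_by))
        return created and not deleted

    def read(self, key):
        # No lock is taken: the reader just picks the version it may see.
        for version in self.db.versions:
            if version.key == key and self._visible(version):
                return version.value
        return None

    def write(self, key, value):
        # The old version is marked, not overwritten, so that other
        # snapshots can keep reading it.
        for version in self.db.versions:
            if version.key == key and self._visible(version):
                version.deleted_by = self.txid
        self.db.versions.append(RowVersion(key, value, self.txid))

    def commit(self):
        self.db.committed.add(self.txid)


@dataclass
class Database:
    versions: list = field(default_factory=list)
    committed: set = field(default_factory=set)
    next_txid: int = 1

    def begin(self):
        txid = self.next_txid
        self.next_txid += 1
        return Transaction(self, txid, frozenset(self.committed))


db = Database()
setup = db.begin()
setup.write("balance", "100")
setup.commit()

reader = db.begin()                 # snapshot taken here
writer = db.begin()
writer.write("balance", "250")      # uncommitted change

before_commit = reader.read("balance")
own_write = writer.read("balance")
writer.commit()
after_commit = reader.read("balance")
later = db.begin().read("balance")  # a transaction started after the commit
```

In this model the reader sees "100" both before and after the writer commits, the writer sees its own "250" immediately, and only a transaction that begins after the commit sees "250". Neither reader nor writer ever waits for the other.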

Rules

Rules allow the "query tree" of an incoming query to be rewritten. One common use is to implement updatable views.

Data types

A wide variety of native data types are supported.

In addition, users can create their own data types, which can usually be made fully indexable via PostgreSQL's GiST infrastructure.

Examples of these are the geographic information system (GIS) data types from the PostGIS project for PostgreSQL.

User-defined objects

New types of almost all objects inside the database can be created, including:

  • Indices
  • Operators (and existing ones can be overloaded)
  • Aggregates
  • Domains
  • Casts
  • Conversions

Inheritance

Tables can be set to inherit their characteristics from a "parent" table. Data is shared between "parent" and "child(ren)" tables: tuples inserted into or deleted from a "child" table also appear in or disappear from queries on the "parent" table. Adding a column to the parent table likewise causes that column to appear in the child table. This feature is not fully supported yet; in particular, unique and foreign key constraints apply only to a single table and do not extend to its children. This means that attempting to insert the id of a row from a child table into a table that has a foreign key constraint referencing the parent table will fail, because PostgreSQL does not recognize that the id from the child table is also a valid id in the parent table.
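The behaviour described above, including the foreign key caveat, can be mimicked with a toy Python model. The Table class, its methods, and the cities/capitals sample data are hypothetical and chosen only for illustration; this is not how PostgreSQL stores or checks tables.

```python
# Toy model of table inheritance and the foreign-key caveat.
# Hypothetical class and data; not how PostgreSQL stores tables.

class Table:
    def __init__(self, name, columns, parent=None):
        self.name = name
        self.parent = parent
        self.children = []
        # A child starts with all of its parent's columns, then adds its own.
        inherited = list(parent.columns) if parent else []
        self.columns = inherited + list(columns)
        self.own_rows = []
        if parent:
            parent.children.append(self)

    def insert(self, **row):
        self.own_rows.append(row)

    def add_column(self, column):
        """Adding a column to a parent propagates to every descendant."""
        self.columns.append(column)
        for child in self.children:
            child.add_column(column)

    def select(self, only=False):
        """Rows of this table, plus rows of descendants unless only=True."""
        rows = list(self.own_rows)
        if not only:
            for child in self.children:
                rows.extend(child.select())
        # Project every row onto this table's own column list.
        return [{c: r.get(c) for c in self.columns} for r in rows]

    def fk_target_exists(self, column, value):
        """A foreign key referencing this table checks only its own rows."""
        return any(r.get(column) == value for r in self.own_rows)


cities = Table("cities", ["id", "name"])
capitals = Table("capitals", ["country"], parent=cities)

cities.insert(id=1, name="Lyon")
capitals.insert(id=2, name="Paris", country="France")

all_names = [r["name"] for r in cities.select()]
parent_only = [r["name"] for r in cities.select(only=True)]

cities.add_column("population")
child_columns = capitals.columns

fk_parent_row = cities.fk_target_exists("id", 1)
fk_child_row = cities.fk_target_exists("id", 2)
```

Querying the parent returns both Lyon and Paris, yet a foreign key check against the parent accepts id 1 and rejects id 2, because the row for Paris is stored in the child table.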

Inheritance provides a way to map the features of generalization hierarchies depicted in Entity Relationship Diagrams (ERD) directly into the PostgreSQL database.

Other features

Add-ons

History

PostgreSQL has had a lengthy evolution, starting with the Ingres project at UC Berkeley. The project leader, Michael Stonebraker, had left Berkeley to commercialize Ingres in 1982, but eventually returned to academia. After returning to Berkeley in 1985, Stonebraker started a post-Ingres project to address the problems with contemporary database systems that had become increasingly clear during the early 1980s. The code bases of Postgres and Ingres started out, and remain, completely separate.

The resulting project, named Postgres, aimed to introduce the minimum number of features needed to add complete support for types. These features included the ability to define types, but also the ability to fully describe relationships – something used widely before this time but maintained entirely by the user. In Postgres the database "understood" relationships, and could retrieve information in related tables in a natural way using rules.

Starting in 1986, the team released a number of papers describing the basis of the system, and by 1988 the project had a prototype version up and running. The team released version 1 to a small number of users in June 1989, followed by version 2 with a rewritten rules system in June 1990. Version 3, released in 1991, rewrote the rules system again and added support for multiple storage managers and an improved query engine. By 1993 the user base had grown large enough that requests for support and features began to overwhelm the project. After releasing version 4, primarily as a cleanup, the project ended.

Although the Postgres project had officially ended, the BSD license (under which Berkeley had released Postgres) enabled open-source developers to obtain copies and to develop the system further. In 1994 two UC Berkeley graduate students, Andrew Yu and Jolly Chen, added a SQL language interpreter to replace the earlier Ingres-based QUEL system, creating Postgres95. The code was subsequently released on the web to find its own way in the world.

In July 1996, Marc Fournier at Hub.Org Networking Services provided the first non-university development server for the open-source development effort. Together with Bruce Momjian and Vadim B. Mikheev, he began work to stabilize the code inherited from UC Berkeley, and the first open-source version was released on August 1, 1996.

In 1996 the project was renamed: to reflect the database's new SQL query language, Postgres95 became PostgreSQL. The first PostgreSQL release was version 6.0 in January 1997. Since then, a group of database developers and volunteers from around the world, coordinating via the Internet, has maintained the software.

Although the license allowed for the commercialization of Postgres, the Postgres code did not develop commercially as rapidly as Ingres, which is somewhat surprising considering the advantages Postgres offered. The main offshoot originated when Paula Hawthorn (an original Ingres team member who moved from Ingres) and Michael Stonebraker formed Illustra Information Technologies to commercialize Postgres.

In 2001, Command Prompt, Inc. released Mammoth PostgreSQL, the oldest surviving commercial PostgreSQL distribution. They actively support the PostgreSQL community through developer sponsorships and projects including PL/Perl, PL/php, and hosting of community projects such as the PostgreSQL Build Farm.

In January 2005, PostgreSQL received backing from another database vendor. Pervasive Software, well known for its Btrieve product, which was nearly ubiquitous on the Novell NetWare platform, announced commercial support and community participation. Since then, EnterpriseDB has announced similar involvement, with a particular focus on adding functionality that allows applications written for Oracle to run more readily atop PostgreSQL. Greenplum has been sponsoring efforts to provide enhancements directed at data warehouse and business intelligence applications, notably including the BizGres project.

In October 2005, John Loiacono, executive vice-president of software at Sun Microsystems, commented that "We're not going to OEM Microsoft but we are looking at PostgreSQL right now," although no specifics have yet been released.

In November 2005, Sun Microsystems announced support for PostgreSQL. As of June 2006, Sun Solaris 10 6/06 ships PostgreSQL.

In July 2006, Pervasive left the PostgreSQL support market.[2]

Prominent users

See also

References

External links

