Representational State Transfer
From Wikipedia, the free encyclopedia
Representational state transfer (REST) is a style of software architecture for distributed hypermedia systems such as the World Wide Web. The terms “representational state transfer” and “REST” were introduced in 2000 in the doctoral dissertation of Roy Fielding,[1] one of the principal authors of the Hypertext Transfer Protocol (HTTP) specification. The terms have since come into widespread use in the networking community.
REST strictly refers to a collection of network architecture principles which outline how resources are defined and addressed. The term is often used in a looser sense to describe any simple interface which transmits domain-specific data over HTTP without an additional messaging layer such as SOAP or session tracking via HTTP cookies. These two meanings can conflict as well as overlap. It is possible to design any large software system in accordance with Fielding’s REST architectural style without using HTTP and without interacting with the World Wide Web.[citation needed] It is also possible to design simple XML+HTTP interfaces which do not conform to REST principles, and instead follow a model of remote procedure call. The difference between the uses of the term “REST” therefore causes some confusion in technical discussions.
Systems which follow Fielding’s REST principles are often referred to as “RESTful”.
Contents |
[edit] Principles
REST's proponents argue that the Web's scalability and growth are a direct result of a few key design principles:
- Application state and functionality are divided into resources
- Every resource is uniquely addressable using a universal syntax for use in hypermedia links
- All resources share a uniform interface for the transfer of state between client and resource, consisting of
- A constrained set of well-defined operations
- A constrained set of content types, optionally supporting code on demand
- A protocol which is:
- Client-server
- Stateless
- Cacheable
- Layered
REST’s client-server separation of concerns simplifies component implementation, reduces the complexity of connector semantics, improves the effectiveness of performance tuning, and increases the scalability of pure server components. Layered system constraints allow intermediaries—proxies, gateways, and firewalls—to be introduced at various points in the communication without changing the interfaces between components, thus allowing them to assist in communication translation or improve performance via large-scale, shared caching. REST enables intermediate processing by constraining messages to be self-descriptive: interaction is stateless between requests, standard methods and media types are used to indicate semantics and exchange information, and responses explicitly indicate cacheability.[2]
[edit] REST's central principle: resources
An important concept in REST is the existence of resources (sources of specific information), each of which can be referred to using a global identifier (a URI). In order to manipulate these resources, components of the network (clients and servers) communicate via a standardized interface (e.g. HTTP) and exchange representations of these resources (the actual documents conveying the information). For example, a resource which is a circle may accept and return a representation which specifies a center point and radius, formatted in SVG, but may also accept and return a representation which specifies any three distinct points along the curve as a comma-separated list.
Any number of connectors (e.g., clients, servers, caches, tunnels, etc.) can mediate the request, but each does so without “seeing past” its own request (referred to as “layering”, another constraint of REST and a common principle in many other parts of information and networking architecture). Thus an application can interact with a resource by knowing two things: the identifier of the resource, and the action required—it does not need to know whether there are caches, proxies, gateways, firewalls, tunnels, or anything else between it and the server actually holding the information. The application does, however, need to understand the format of the information (representation) returned, which is typically an HTML, XML or JSON document of some kind, although it may be an image, plain text, or any other content.
[edit] Claimed benefits
Many of the statements below refer to REST in the specific context of Web Services, as opposed to SOAP. REST was originally defined in Fielding’s dissertation in the context of information and media access. Fielding did not originally contrast REST with RPC.
Advocates claim that REST:[citation needed]
- Provides improved response times and server loading characteristics due to support for caching
- Improves server scalability by reducing the need to maintain communication state. This means that different servers can be used to handle initial and subsequent requests
- Requires less client-side software to be written than other approaches, because a single browser can access any application and any resource
- Depends less on vendor software and mechanisms which layer additional messaging frameworks on top of HTTP
- Provides equivalent functionality when compared to alternative approaches to communication
- Does not require a separate resource discovery mechanism, due to the use of hyperlinks in content
- Provides better long-term compatibility and evolvability characteristics than RPC. This is due to:
- The capability of document types such as HTML to evolve without breaking backwards- or forwards-compatibility, and
- The ability of resources to add support for new content types as they are defined without dropping or reducing support for older content types.
REST detractors note the lack of tool support and the scarcity of truly RESTful applications deployed on the web of today.[citation needed] Some claim that REST is applicable to GET, but unproven for other state transfer operations such as PUT.[citation needed] Fielding points out in his thesis that the REST architecture was designed specifically for massive scale hypermedia distribution, and not as a one size fits all architectural style. Indeed, what characterizes REST are the constraints it imposes on any REST-based system. POST is often considered the only necessary client-to-server state transfer operation, and is treated as a mechanism to tunnel arbitrary method invocations across HTTP.
Some REST systems have been deployed and gained tools support such as WebDAV which uses not only GET and POST, but also established HTTP methods like HEAD, DELETE, PUT as well as WebDAV-specific HTTP methods: PROPFIND, PROPPATCH, MKCOL, COPY, MOVE, LOCK, and UNLOCK.
One common stumbling block in the dialog on Claimed Benefits is focusing too much on web browser support for REST. Gateways, caching servers, proxies, and other REST connectors are the critical components for system design and REST.
[edit] RESTful example: the World Wide Web
The World Wide Web is the key example of a RESTful design. Much of it conforms to the REST principles. The Web consists of the Hypertext Transfer Protocol (HTTP), content types including the Hypertext Markup Language (HTML), and other Internet technologies such as the Domain Name System (DNS).
HTML can include JavaScript and applets to support code on demand, and has implicit support for hyperlinks.
HTTP has a uniform interface for accessing resources, which consists of URIs, methods, status codes, headers, and content distinguished by MIME type.
The most important HTTP methods are POST, GET, PUT and DELETE. These are often compared with the CREATE, READ, UPDATE, DELETE (CRUD) operations associated with database technologies:[3]
The following table associates several common HTTP verbs with similar database operations, however the meaning of the HTTP verbs do not correspond directly with a single database operation. For example, an HTTP PUT is used to set the value of a resource and may result in either a creation or update as needed.
HTTP | CRUD |
---|---|
POST | Create, Update, Delete |
GET | Read |
PUT | Create, Update |
DELETE | Delete |
HTTP separates the notions of a web server and a web browser. This allows the implementation of each to vary from the other based on the client-server principle. When used RESTfully, HTTP is stateless. Each message contains all the information necessary to understand the request when combined with state at the resource. As a result, neither the client nor the server needs to remember any communication state between messages. Any state retained by the server must be modeled as a resource.
The statelessness constraint can be violated in HTTP using cookies to maintain sessions. Fielding notes the risks of privacy leaks and security complications which often arise through the use of cookies, and the confusions and bugs which can result from interactions between cookies and the “back” button in a browser.
HTTP provides mechanisms to control caching, and permits a conversation between web browser and web cache to occur using the same mechanisms as between web browser and web server. No layer can access any conversation other than the one it is immediately involved with.
It is important to note that HTML links produce "get" HTTP requests. HTML forms allow "get" and "post" methods (in lower-case). The other HTTP methods mentioned here are not available in HTML 4.1 or XHTML 1.0.[4]
[edit] REST versus RPC
The statements below refer to REST in the context of Web Services, specifically as opposed to SOAP. Note that Fielding’s dissertation presents REST in the context of information and media access, not web services. It does not contrast REST to RPC (remote procedure call).
- REST
- Resources—Commands are defined in simple terms: resources to be retrieved, stored / get, set—difficult to do many joins
- RPC
- Commands—Commands are defined in methods with varying complexity: depending on “standard”—easier (?) to hide complex things behind a method
- REST
- Nouns—Exchanging resources and concepts
- RPC
- Verbs—Exchanging methods
A RESTful web application requires a different design approach from an RPC application. An RPC application is exposed as one or more network objects, each with an often unique set of functions which can be invoked. Before a client communicates with the application it must have knowledge of the object identity in order to locate it and must also have knowledge of the object type in order to communicate with it.
RESTful design constrains the aspects of a resource which define its interface (the verbs and content types). This leads to the definition of fewer types on the network than an RPC-based application but more resource identifiers (nouns). REST design seeks to define a set of resources with which clients can interact uniformly, and to provide hyperlinks between resources which clients can navigate without requiring knowledge of the whole resource set. Server-provided forms can also be used in a RESTful environment to describe how clients should construct a URL in order to navigate to a particular resource.
[edit] Example
An RPC application might define operations such as the following:
getUser() addUser() removeUser() updateUser() getLocation() addLocation() removeLocation() updateLocation() listUsers() listLocations() findLocation() findUser()
Client code to access this application may look something like this:
exampleAppObject = new ExampleApp('example.com:1234') exampleAppObject.removeUser('001')
With REST, on the other hand, the emphasis is on the diversity of resources, or nouns; for example, a REST application might define the following resources
http://example.com/users/ http://example.com/users/{user} (one for each user) http://example.com/findUserForm http://example.com/locations/ http://example.com/locations/{location} (one for each location) http://example.com/findLocationForm
Client code to access this application may look something like this:
userResource = new Resource('http://example.com/users/001') userResource.delete()
Each resource has its own identifier noun. Clients start at a single resource such as the user resource which represents themselves, and navigate to location resources and other user resources. Clients work with each resource through standard operations, such as GET to download a copy of the resource’s representation, PUT to paste a changed copy over the top of the original, or DELETE to remove the data or state associated with the resource. POST is sometimes used interchangeably with PUT, but can also be seen as a “paste after” rather than a “paste over” request. POST is generally used for actions with side-effects, such as requesting the creation of a purchase order, or adding some data to a collection. Note how each object has its own URL and can easily be cached, copied, and bookmarked.
[edit] Uniform interfaces in REST and RPC
The uniform interface allows clients to access data from a range of resources without special code to deal with each one, so long as it is actually uniform. The content returned from a user resource could be the globally standard and RESTful HTML, a less RESTful industry standard representation such as UserML, or an unRESTful application-specific data format. Which content is returned can be negotiated at request time. The content could even be a combination of these representations: HTML can be marked up with microformats which have general or industry-specific appeal, and these microformats can be extended with application-specific information.
Uniform interfaces reduce the cost of client software by ensuring it is only written once, rather than once per application it has to deal with. Both REST and RPC designs may try to maximise the uniformity of the interface they expose by conforming to industry or global standards. In the RPC model these standards are primarily in the form of standard type definitions and standard choreography. In REST it is primarily the choice of standard content types and verbs which controls uniformity.
[edit] Public implementations
It is possible to claim an enormous number of RESTful applications on the Web (just about everything accessible through an HTTP GET request or updateable through HTTP POST). Taken more narrowly, in its sense as an alternative to both Web Services generally and the RPC style specifically, REST can be found in a number of places on the public Web:
- The “blogosphere” — the universe of weblogs — is mostly REST-based, since it involves downloading XML files (in RSS or Atom format) which contain lists of links to other resources;
- The Atom Publishing Protocol for publishing to blogs is considered a canonical RESTful protocol;
- Various websites and web applications offer REST-like developer interfaces to data (e.g. Flickr or Amazon S3).
Note that WSDL version 2.0 now offers support for binding to all the HTTP request methods (not only GET and POST as in version 1.1).[5]
[edit] False or weak forms
Some interfaces referred to as being "RESTful" do not intentionally respect REST’s architectural constraints. REST advocate Mark Baker uses the term "accidentally RESTful"[6] to describe interfaces that partially follow REST's architectural constraints. For example, Flickr's interface can be considered RESTful in its use of standalone GET operations, but it does not attempt to support the full range of a REST interface. Other interfaces that use HTTP to tunnel function calls or which offer a “POX/HTTP” (Plain Old XML over HTTP) endpoint are also sometimes referred to as "REST" interfaces.[citation needed]
[edit] Implementation challenges
Implementation is hampered by limited support for HTTP PUT and DELETE in popular development platforms. For example, in the LAMP platform, support for PUT must be added as a module. Web searches offer few examples of how to implement updating database-driven content using PUT. For example, it is nontrivial to create a PHP script to update http://example.com/thing/1 with a PUT message when /thing.php will serve a GET request with XML generated from a database. Most published patterns for updating entities use the POST method.
[edit] Outside of the Web
Just as much of the web can be seen as RESTful or nearly RESTful, a number of existing protocols and architectures have RESTful characteristics. Software which may interact with a number of different kinds of objects or devices can do so by virtue of a uniform, agreed interface. Many of these uniform interfaces follow document-oriented REST patterns rather than object-oriented patterns [should expand on and thus clarify this distinction]:
[edit] Modbus
Modbus is a protocol which allows memory ranges within PLCs to be addressed. Ranges can be written and read effectively as PUT and GET operations.
[edit] JavaBeans
JavaBeans and other systems which perform property-based editing follow the PUT and GET model of the REST architectural style. Rather than write object-specific editor code, the code is written once and can interact with various object types. Resources in this model are defined by the combination of an object identifier and a property name.
[edit] SNMP
The SNMP protocol and its object model, which predate the Web, share some characteristics with RESTful systems. A strict verb discipline follows from the protocol's small operator set, and the 'resources' are addressed with a uniform global scheme of Object Identifiers. Most interaction occurs in client-server fashion, and the clients and servers (called managers and agents) can be deployed and evolved independently. Each request-response pair can be understood in isolation.
However, movement through the space of Object identifiers is not assisted by hyperlinks, nor is it considered as traversal through states in a state machine. Also, SNMP usage is generally unconcerned with the WWW requirements of Internet scaling and links between independently authored content.
[edit] Footnotes
- ^ Chapter 5 of Fielding’s dissertation is “Representational State Transfer (REST)”.
- ^ (Fielding 2000, §5.3.1)
- ^ IETF RFC 2616 “Hypertext Transfer Protocol – HTTP/1.1”, R. Fielding et al., June 1999
- ^ HTML 4.01 Specification: Forms in HTML Documents: The Form Element. W3C. Retrieved on 2008-03-23.
- ^ Web Services Description Language (WSDL) Version 2.0 Part 2: Adjuncts.
- ^ accidentally RESTful
[edit] References
- Fielding, Roy T. & Taylor, Richard N. (2002-05), “Principled Design of the Modern Web Architecture”, ACM Transactions on Internet Technology (TOIT) (New York: Association for Computing Machinery) 2 (2): 115–150, ISSN 1533-5399, doi:10.1145/514183.514185, <http://www.ics.uci.edu/~taylor/documents/2002-REST-TOIT.pdf>
- Fielding, Roy Thomas (2000), Architectural Styles and the Design of Network-based Software Architectures, University of California, Irvine, <http://www.ics.uci.edu/~fielding/pubs/dissertation/top.htm>
- Pautasso, Cesare; Zimmermann, Olaf & Leymann, Frank (2008-04), “RESTful Web Services vs. Big Web Services: Making the Right Architectural Decision”, 17th International World Wide Web Conference (WWW2008) (Beijing, China), <http://www.jopera.org/docs/publications/2008/restws>
[edit] External links
- “Architectural Styles and the Design of Network-based Software Architectures”: Dissertation for Doctor of Philosophy, by Roy Thomas Fielding
- “RESTful Web Services with JSP”: Tutorial for easy RESTful Web Service with JSP
- RESTwiki: “descriptions of REST, records of the experiences of REST proponents, and resources to help you apply REST [...] to your software or framework”
- “Constructing or Traversing URIs?”: discusses the constraint on components to use “hypermedia as the engine of application state”.
- The REST Dialogues, Part 1: “Getting Data”: one of nine lessons on applying REST to Web-based business, each lesson in the form of dialog between the author and a fictitious senior technical employee of a company conducting Web-based business.
- “REST for the Rest of Us”: “showcases common REST design patterns that can be put to immediate use”.
- “MindTouch: Introduction to REST”: slides and narration explaining REST.
- “RESTify DayTrader”: a tour of a day-trading interface in REST style.
- “Building Web Services the REST Way”
- "InfoQ: A Brief Introduction to REST"
- "How I Explained REST to my Wife"