HTML |
---|
An HTML element is an individual component of an HTML document. HTML documents are composed of a tree of HTML elements and other nodes, such as text nodes. Each element can have attributes specified. Elements can also have content, including other elements and text. HTML elements represent semantics, or meaning. For example, the title
element represents the title of the document.
In the HTML syntax, most elements are written with a start tag and an end tag, with the content in between. Tags are composed of the name of the element, surrounded by angle brackets. An end tag also has a slash after the opening angle bracket, to distinguish it from the start tag. For example, a paragraph, which is represented by the p
element, would be written as
<p>In the HTML syntax, most elements are written ...</p>
However, not all of these elements require the end tag, or even the start tag, to be present. Some elements, the so-called void elements don't have an end tag. A typical example is the br
element, which represents a significant line break, such as in a poem or an address. For example, the address of the dentist in Finding Nemo would be written as
<p>P. Sherman<br>42 Wallaby Way<br>Sydney</p>
Attributes are specified on the start tag. For example, the abbr
element, which represents an abbreviation, expects a title
attribute with its expansion. This would be written as
<abbr title="Hyper Text Markup Language">HTML</abbr>
Contents |
There are multiple kinds of HTML elements: void elements, raw text elements, and normal elements.
Void elements only have a start tag, which contains any attributes. One example is the link
element, for which the syntax is
<link rel=stylesheet href=fancy.css type="text/css">
This link
element points the browser at a stylesheet to use when presenting the HTML document to the user. Note that in the HTML syntax, attributes don't have to be quoted. When using the XML syntax (XHTML), on the other hand, all attributes must be quoted, and a trailing slash is required before the last angle bracket:
<link rel="stylesheet" href="fancy.css" type="text/css" />
Raw text elements are constructed with:
<tag>
) marking the beginning of an element, which may incorporate any number of attributes;</tag>
. In some versions of HTML, the end tag is optional for some elements. The end tag is required in XHTML.Normal elements usually have both a start tag and an end tag, although for some elements the end tag, or both tags, can be omitted. It is constructed in a similar way:
<tag>
) marking the beginning of an element, which may incorporate any number of attributes;</tag>
.Attributes define desired behavior or indicate additional element properties. Most attributes require a value. In HTML, the value can be left unquoted if it doesn't include spaces (name=value
), or it can be quoted with single or double quotes (name='value'
or name="value"
). In XML, those quotes are required. Boolean attributes, on the other hand, don't require a value to be specified. An example is the checked
for checkboxes:
<input type=checkbox checked>
In the XML syntax, though, the name should be repeated as the value:
<input type="checkbox" checked="checked" />
Informally, HTML elements are sometimes referred to as "tags" (an example of synecdoche), though many prefer the term tag strictly in reference to the markup delimiting the start and end of an element.
Element (and attribute) names may be written in any combination of upper or lower case in HTML, but must be in lower case in XHTML.[1] The canonical form was upper-case until HTML 4, and was used in HTML specifications, but in recent years, lower-case has become more common.
HTML elements are defined in a series of freely-available open standards issued since 1995, initially by the IETF and subsequently by the W3C.
Since the early 1990s, developers of user agents (e.g. web browsers) have often developed their own elements, some of which have been adopted in later standards. Other user agents may not recognize non-standard elements, and they may be ignored or displayed improperly.
In 1998, XML (a simplified form of SGML) introduced mechanisms to allow anyone to develop their own elements and incorporate them in XHTML documents, for use with XML-aware user agents.[2]
Subsequently, HTML 4.01 was rewritten in an XML-compatible form, XHTML 1.0 (eXtensible HTML). The elements in each are identical, and in most cases valid XHTML 1.0 documents will be valid or nearly-valid HTML 4.01 documents. This article mainly focuses on real HTML, unless noted otherwise; however, it remains applicable to XHTML. (See HTML for a discussion of the minor differences between the two).
Since the first version of HTML, several elements have become outmoded, and are deprecated in later standards, or do not appear at all, in which case they are invalid (and will be found invalid, and perhaps not displayed, by validating user agents).[3]
At present, the status of elements is complicated by the existence of three types of HTML 4.01 / XHTML 1.0 DTD:
The first Standard (HTML 2.0) contained four deprecated elements, one of which was invalid in HTML 3.2. All four are invalid in HTML 4.01 Transitional, which also deprecated a further ten elements. All of these, plus two others, are invalid in HTML 4.01 Strict. While the frame elements are still current in the sense of being present in the Transitional and Frameset DTDs, there are no plans to preserve them in future standards, as their function has been largely replaced, and they are highly problematic for user accessibility.
(Strictly speaking, the most recent XHTML standard, XHTML 1.1 (2001), does not include frames at all; it is approximately equivalent to XHTML 1.0 Strict, but also includes the Ruby markup module.)[4]
A common source of confusion is the loose use of deprecated to refer to both deprecated and invalid status, and to elements which are expected to be formally deprecated in future.
In keeping with the principle of separation of concerns, the function of HTML is primarily to add structural and semantic information to the raw text of a document. Presentation and behavior are separate functions, which can be added as desired, ideally through links to external documents such as stylesheets, graphics files, and scripts.
This allows the document to be presented by different user agents according to their purposes and abilities; for example, a user agent can select an appropriate stylesheet to present a document by displaying on a monitor, printing on paper, or to determine speech characteristics in an aural user agent. The structural and semantic functions of the markup remain identical in each case.
Historically, user agents did not always support these features. In the 1990s, as a stop-gap, presentational elements were added to HTML, at the cost of creating problems for interoperability and user accessibility. This is now regarded as outmoded and has been superseded by stylesheet-based design; most presentational elements are now deprecated.[5]
External image files are incorporated with the img or object elements. (With XHTML, the SVG language can also be used to write graphics within the document, though linking to external SVG files is generally simpler.)[6] Where an image is not purely decorative, HTML allows replacement content with similar semantic value to be provided for non-visual user agents.
An HTML document can also be extended through the use of scripts to provide additional behaviors beyond the abilities of HTML hyperlinks and forms.
The elements style and script, with related attributes, provide reference points in HTML markup for links to stylesheets and scripts. They can also contain instructions directly.
<html>...</html>
<head>...</head>
<body>...</body>
<base>
href
and other links in the document. Must appear before any element that refers to an external resource. HTML permits only one base
element for each document. The base
element has attributes, but no contents.<basefont> (deprecated)
font
elements. Deprecated in favor of stylesheets.<isindex> (deprecated)
isindex
could either appear in the document head or in the body, but only once in a document. See Forms.<link>
<link rel="stylesheet" type="text/css" href="url" title="description_of_style">
[8]<link rel="next" href="url">
head
element may contain any number of link
elements. The link
element has attributes, but no contents.meta
elements specify associative key-value pairs. In general, a meta element conveys hidden information about the document. Several meta tags can be used, all of which should be nested in the head element. The specific purpose of each meta element is defined by its attributes.meta
elements can specify HTTP headers which should be sent by a web server before the actual content, for example:
<meta http-equiv="foo" content="bar">
foo
that has a value bar
.meta
element specifies name
and associated content
attributes describing aspects of the HTML page. To prevent possible ambiguity, an optional third attribute, scheme
, may be supplied to specify a semantic framework that defines the meaning of the key and its value: for example:
<meta name="foo" content="bar" scheme="DC">
meta
element identifies itself as containing the foo
element, with a value of bar
, from the DC or Dublin Core resource description framework.<object>...</object>
head
element, it could potentially be used to extract foreign data and associate it with the current document.<script>...</script>
src
attribute.[9] Also usable in the document body to dynamically generate either both block or inline content.<style>...</style>
<style type="text/css"> ... </style>
@import
directives of the form:
<style> @import url; </style>
[10]<title>...</title>
title
element must not contain other elements, only text. Only one title
element is permitted in a document.In visual browsers, displayable elements can be rendered as either block or inline. While all elements are part of the document sequence, block elements appear within their parent elements:
Conversely, inline elements are treated as part of the flow of document text; they cannot have margins, width or height set, and do break across lines.
<p>...</p>
P
existed in HTML Tags, and was standardized in HTML 2.0; still current.<h1>...</h1>
<h2>...</h2>
<h3>...</h3>
<h4>...</h4>
<h5>...</h5>
<h6>...</h6>
<h1>
delimits the highest-level heading, <h2>
the next level down (sub-section), <h3>
for a level below that, and so on to <h6>
.<dl>...</dl>
DL
existed in HTML Tags, and was standardized in HTML 2.0; still current.<dt>...</dt>
DT
existed in HTML Tags, and was standardized in HTML 2.0; still current.<dd>...</dd>
DD
existed in HTML Tags, and was standardized in HTML 2.0; still current.<ol>...</ol>
type
attribute can be used to specify the kind of ordering, but stylesheets give more control: {list-style-type: foo}
. The default is Arabic numbering.OL
existed in HTML Internet Draft 1.2, and was standardized in HTML 2.0; still current.<ul>...</ul>
{list-style-type: foo}
. The default marker is a disc.UL
existed in HTML Tags, and was standardized in HTML 2.0; still current.<li>...</li>
LI
existed in HTML Tags, and was standardized in HTML 2.0; still current.<dir>...</dir> (deprecated)
<ul>
.DIR
existed in HTML Tags, and was standardized in HTML 2.0; deprecated in HTML 4.0 Transitional; invalid in HTML 4.0 Strict.<address>...</address>
ADDRESS
existed in HTML Tags, and was standardized in HTML 2.0; still current. <blockquote>...</blockquote>
cite
attribute may give the source, and must be a fully qualified Uniform Resource Identifier.q
) element.BLOCKQUOTE
existed in HTML Internet Draft 1.2, and was standardized in HTML 2.0; still current. See blockquote element for more information.<center>...</center> (deprecated)
<div>
or another element with centering defined using stylesheets.<del>...</del>
<div>...</div>
<hr>
<ins>...</ins>
<noscript>...</noscript>
<pre>...</pre>
pre
, whitespace should be rendered as authored. (With the CSS properties: {white-space: pre; font-family: monospace;}
, other elements can be presented in the same way.) This element can contain any inline element except: image (IMG
), object (OBJECT
), big font size (BIG
), small font size (SMALL
), superscript (SUP
), and subscript (SUB
).PRE
existed in HTML Internet Draft 1.2, and was standardized in HTML 2.0; still current.<script>...</script>
SCRIPT
is not itself either a block or inline element; by itself it should not display at all, but it can contain instructions to dynamically generate either both block or inline content.Inline elements cannot be placed directly inside the body
element; they must be wholly nested within block-level elements.[11]
<a>...</a>
href
(hypertext reference [1]), the anchor becomes a hyperlink to either another part of the document or another resource (e.g. a webpage) using an external URL.name
or id
attributes set, the element becomes a target. A Uniform Resource Locator can link to this target via a fragment identifier. Any element can now be made into an anchor by using the id
attribute,[2] so using <a name="foo">
is not necessary.title
may be set to give brief information about the link:
<a href="URL" title="additional information">link text</a>
title
appears in some way (varies according to browser). Some browsers render alt text the same way, though this is technically incorrect.A
existed in HTML Tags, and was standardized in HTML 2.0; still current.<abbr>...</abbr>
<abbr title="abbreviation">abbr.</abbr>
<acronym>...</acronym> (deprecated)
abbr
element, but marks an acronym:
<acronym title="Hyper-Text Markup Language">HTML</acronym>
<dfn>...</dfn>
DFN
existed in HTML Internet Draft 1.2, and was fully standardized in HTML 3.2; still current.<em>...</em>
EM
existed in HTML Internet Draft 1.2, and was standardized in HTML 2.0; still current.<strong>...</strong>
STRONG
existed in HTML Internet Draft 1.2, and was standardized in HTML 2.0; still current.These elements are useful primarily for documenting computer code development and user interaction through differentiation of source code (<code>), source code variables (<var>), user input (<kbd>), and terminal output (<samp>).
<code>...</code>
Code snippet.
CODE
existed in HTML Internet Draft 1.2, and was standardized in HTML 2.0; still current.<samp>...</samp>
SAMP
existed in HTML Internet Draft 1.2, and was standardized in HTML 2.0; still current.<kbd>...</kbd>
KBD
existed in HTML Internet Draft 1.2, and was standardized in HTML 2.0; still current.<var>...</var>
VAR
existed in HTML Internet Draft 1.2, and was standardized in HTML 2.0; still current.As visual presentational markup only applies directly to visual browsers, its use is discouraged. Stylesheets should be used instead. Several of these elements are deprecated or invalid in HTML 4 / XHTML 1.0, and the remainder are invalid in the current draft of XHTML 2.0. The current draft of HTML 5, however, includes <b>
, <i>
, and <small>
.
<b>...</b>
{font-weight: bold}
.) <strong>...</strong>
usually has the same effect in visual browsers, as well as having more semantic meaning.B
existed in HTML Internet Draft 1.2, and was standardized in HTML 2.0; still current.<i>...</i>
{font-style: italic}
.) <em>...</em>
usually has the same effect in visual browsers, as well as having more semantic meaning.I
existed in HTML Internet Draft 1.2, and was standardized in HTML 2.0; still current.<big>...</big> (deprecated)
{font-size: larger}
<small>...</small>
{font-size: smaller}
and <strike>...</strike> (deprecated)<s>...</s> (deprecated)
{text-decoration: line-through}
)STRIKE
was standardized in HTML 3.2; deprecated in HTML 4.0 Transitional; invalid in HTML 4.0 Strict.S
is deprecated in HTML 4.0 Transitional (having not appeared in any previous standard), and is invalid in HTML 4.0 Strict.<tt>...</tt>
{font-family: monospace;}
)TT
existed in HTML Internet Draft 1.2, and was standardized in HTML 2.0; still current.<u>...</u> (deprecated)
{text-decoration: underline}
)U
existed in HTML Internet Draft 1.2, was standardized in HTML 3.2 but was deprecated in HTML 4.0 Transitional and is invalid in HTML 4.0 Strict.<font>...</font> (deprecated)
<font [color=color] [size=size] [face=face]>...</font>
color
attribute, typeface with the face
attribute, and absolute or relative size with the size
attribute.<font color="green">text</font>
creates green text.<font color="#114499">text</font>
creates text with hexadecimal color #114499.<font size="4">text</font>
creates text with size 4. Sizes are from 1 to 7. The standard size is 3, unless otherwise specified in the <body> or other tags.<font size="+1">text</font>
creates text with size 1 bigger than the standard. <font size="-1">text</font>
is opposite.<font face="Courier">text</font>
makes text with Courier font.<font size="N">
corresponds to {font-size: Yunits}
(the HTML specification does not define the relationship between size N and unit-size Y, nor does it define a unit).<font color="red">
corresponds to {color: red}
<font face="Courier">
corresponds to {font-family: "Courier"}
<span>...</span>
<br>
<bdo>...</bdo>
<cite>...</cite>
<del>...</del>
<ins>...</ins>
<del>
'd text. Typically rendered underlined: Inserted text.<ins>
and <del>
elements may also be used as block elements: containing other block and inline elements. However, these elements must still remain wholly within their parent element to maintain a well-formed HTML document. For example deleting text from the middle of one paragraph across several other paragraphs and ending in a final paragraph would need to use three separate <del>
elements. Two <del>
elements would be required as inline element to indicate the deletion of text in the first and last paragraphs, and a third, used as a block element, to indicate the deletion in the intervening paragraphs.<q>...</q>
BLOCKQUOTE
). Quote elements may be nested.<q>
should automatically generate quotation marks in conjunction with Stylesheets. Practical concerns due to browser non-compliance may force authors to find work-arounds.cite
attribute gives the source, and must be a fully qualified URI.blockquote
) using stylesheets. For example, with a suitable CSS rule associated with q.lengthy
:
<q class="lengthy">
An inline quotation of significant length (say 25 words, for example) goes here...</q>
<script>...</script>
<script>
is not itself either a block or inline element; by itself it should not display at all, but it can contain instructions to dynamically generate either both block or inline content.<sub>...</sub>
and <sup>...</sup>
{vertical-align: sub}
or {vertical-align: super}
.)<wbr>
<applet>...</applet> (deprecated)
<object>
, as it could only be used with Java applets, and had accessibility limitations.<object>
are not consistent between different browsers.<area>
<img>
src
attribute specifies the image URL. The required alt
attribute provides alternative text in case the image cannot be displayed.[13] (Though alt
is intended as alternative text, Microsoft Internet Explorer 7 and below render it as a tooltip if no title
is given.[14] Safari and Google Chrome, on the other hand, do not display the alt attribute at all.)[15] img was proposed by Marc Andreessen and implemented in the NSCA Mosaic web browser.[16]<map>...</map>
<object>...</object>
type
attribute. This may be in any MIME-type the user agent understands, such as an embedded HTML page, a file to be handled by a plug-in such as Flash, a Java applet, a sound file, etc.<param>
These elements can be combined into a form or in some instances used separately as user-interface controls; in the document, they can be simple HTML or used in conjunction with Scripts. HTML markup specifies the elements that make up a form, and the method by which it will be submitted. However, some form of scripts (server-side, client-side, or both) must be used to process the user’s input once it is submitted.
(These elements are either block or inline elements, but are collected here as their use is more restricted than other inline or block elements.)
<form action="url">...</form>
<button>...</button>
<fieldset>...</fieldset>
<input>
<button>
is preferred if possible (i.e. if the client supports it) as it provides richer possibilities.src
attribute.size
attribute specifies the default width of the input in character-widths. maxlength
sets the maximum number of characters the user can enter (which may be greater than size).<isindex> (deprecated)
isindex
could either appear in the document head or in the body, but only once in a document.<label for="id">...</label>
<legend>...</legend>
<option value="x">
select
list.<optgroup>...</optgroup>
<select name="xyz">...</select>
<textarea rows="8">...</textarea>
cols
(where a col is a one-character width of text) and rows
attributes. The content of this element is restricted to plain text, which appears in the text area as default text when the page is loaded.The format of HTML Tables was proposed in the HTML 3.0 Drafts and the later RFC 1942 HTML Tables. They were inspired on the CALS Table Model. Some elements in these proposals were included in HTML 3.2; the present form of HTML Tables was standardized in HTML 4. (Many of the elements used within tables are neither block nor inline elements.)
<table>...</table>
<tr>...</tr>
<th>...</th>
<td>...</td>
<colgroup>...</colgroup>
<col>
or <col/>
<caption>...</caption>
<thead>...</thead>
<tbody>...</tbody>
<tfoot>...</tfoot>
<thead>
, this section may be repeated by the user agent if the table is split across pages (in printing or other paged media).Frames allow a visual HTML Browser window to be split into segments, each of which can show a different document. This can lower bandwidth use, as repeating parts of a layout can be used in one frame, while variable content is displayed in another. This comes at a significant usability cost, especially in non-visual user agents. Because of this cost, frames (excluding the iframe
element) are only allowed in HTML 4.01 Frameset.
In HTML 4.01, a document may contain a head
and a body
or a head
and a frameset
, but not both a body
and a frameset
. However, iframe can be used in a normal document body.
<frameset>...</frameset>
rows
and cols
attributes.<frame>
or <frame/>
frameset
. A separate document linked with the src
attribute appears inside.<noframes>...</noframes>
<iframe>...</iframe>
object
element, an inline frame can be the "target" frame for links defined by other elements and it can be selected by the user agent as the focus for printing, viewing its source, etc.In HTML, longdesc is an attribute used within the image element, frame element, or iframe element. It is used to reference a long description website of the image, frame, or iframe in question.[17]
Longdesc was designed to be used by screen readers to display image information for computer users with accessibility issues, such as the blind or visually impaired, and is widely implemented by both web browsers and screen readers.[18] Some developers object that [19] it is actually seldom used for this purpose, because there are relatively few authors who use the attribute, and most of those authors use it incorrectly, and have used this argument to recommend dropping longdesc.[20] The publishing industry has responded, advocating the retention of longdesc.[21]
<img src="Hello.jpg" longdesc="description.html">
Content of description.html
:
... <p>This is an image of a two-layered birthday cake.</p> ...
Since very few Graphical browsers support making the link available natively (Opera and iCab being the exceptions), it is useful to include a link to the description page near the img
element whenever possible, as this can also aid sighted users.
<img src="Hello.jpg" longdesc="description.html" /> [<a href= "description.html" title="long description of the image">D</a>]
The following elements were part of the early HTML developed by Tim Berners-Lee from 1989–91; they are mentioned in HTML Tags, but deprecated in HTML 2.0 and were never part of HTML standards.
<listing>...</listing>
<plaintext>
<xmp>...</xmp>
pre
.plaintext
cannot have an end tag – it terminates the markup and causes the rest of the document to be parsed as if it were plain text.<nextid>...</nextid>
nextid
existed in HTML Tags (described as obsolete); deprecated in HTML 2.0; invalid in HTML 3.2 and later.This section lists some widely-used obsolete elements, which means they are not allowed to be used. They may not be supported in all user agents.
<blink>...</blink>
{text-decoration: blink}
(This effect may have negative consequences for people with photosensitive epilepsy;[22] its use on the public Internet should follow the appropriate guidelines.)<marquee>...</marquee>
<nobr>...</nobr>
{white-space: nowrap;}
<noembed>...</noembed>
embed
or object
element.<embed>...</embed>
object
tag, but then was added back into the HTML 5 specification[23][24]<menu>...</menu>
<ul>
list.MENU
existed in HTML Tags, and was standardized in HTML 2.0; deprecated in HTML 4.0 Transitional; invalid in HTML 4.0 Strict; but then redefined in HTML 5.<!-- A Comment -->
style
and script
elements are still sometimes surrounded by comment delimiters.<!--Xbegin<!--Y-->Xend-->
will yield the comment Xbegin<!--Y
and the text Xend-->
after it.