HTML element

From Wikipedia, the free encyclopedia

This article is about the HTML elements in general. For information on how to format Wikipedia entries, see Wikipedia:How to edit a page and Help:HTML in wikitext

In computing, an HTML element indicates structure in an HTML document and a way of hierarchically arranging content. More specifically, an HTML element is an SGML element that meets the requirements of one or more of the HTML Document Type Definitions (DTDs). These elements have properties: both attributes and content, as specified (both allowable and required) according to the appropriate HTML DTD (for example, the HTML 4.01 strict DTD). Elements may represent headings, paragraphs, hypertext links, lists, embedded media, and a variety of other structures.

Parts of a HTML element: <p class="foo">This is a paragraph</p> Start tag: <p class="foo">; Attribute name: class; value: foo; Content: This is a paragraph.; End tag: </p>

Syntactically HTML elements are constructed with: 1) a start tag marking the beginning of an element; 2) any number of attributes (and their associated values); 3) some amount of content (characters and other elements); and 4) an end tag. Many HTML elements include attributes in their start tags, defining desired behavior or indicating additional element properties. The end tag is optional for many elements; in a minimal case, an empty element has no content and requires no end tag. There are a few elements that are not part of any official DTDs, yet are supported by some browsers and used by some web pages. Such elements may be ignored or displayed improperly on browsers not supporting them.

Informally, HTML elements are sometimes referred to as "tags" (an example of synecdoche), though many prefer the term tag strictly in reference to the semantic structures delimiting the start and end of an element.

XHTML is the successor to HTML 4.01; XHTML 1.0 supports the same elements as HTML 4 and in most cases valid XHTML 1.0 documents will be valid or nearly valid HTML 4 documents. XHTML 1.0 migrates HTML from its SGML underpinnings to an XML foundation. Accordingly, the discussion of elements within this article focuses on the final SGML based HTML, version 4.01 (unless noted otherwise). However, to the extent that XHTML 1.0 elements remain identical to the HTML 4.01 elements, the discussion remains applicable (see HTML for a discussion of the minor differences in content between XHTML 1.0 and HTML 4.01).

HTML
This box: view  talk  edit

Contents

[edit] Block-level and inline elements

HTML elements are classified as either block-level or inline (text-level).

Block-level elements
Headings, paragraphs, lists, or tables are "large" structures containing other blocks, inline elements, or text (but see nesting rules, below). They are usually displayed as independent "blocks" separated from other blocks by vertical space (margins).
Inline or text-level elements
Hyperlinks, citations, quotations, or images are "small" structures that represent or describe small pieces of text or data. They may contain only text or other inline elements, and are usually displayed one after another on a line within the block that contains them.

(See "The global structure of an HTML document")

[edit] Nesting of elements

Many HTML elements can be nested, or contained within other HTML elements:

<p>You <em>rock</em></p>

Nesting can be more complex:

<p>King Louis said, <q lang="fr">L'état, c'est <em>moi!</em></q>
(<cite><a href="/wiki/Louis_XIV_of_France">Wikipedia</a></cite>).</p>

Nesting may be arbitrarily deep, but for the code to remain valid, the elements must be closed in the reverse order that they were opened. (This pattern is known as Last in, first out.)

Incorrect
<p>Mary kissed <em>Jimmy</p></em>
Correct
<p>Mary kissed <em>Jimmy</em></p>

Some block-level elements (e.g. paragraphs) may contain only inline elements, and some (e.g. forms, lists) must contain only block-level child elements, but most may contain either block-level or inline elements.

[edit] Document elements

Root elements provide the containers enclosing all other HTML elements. Every HTML page has these elements. The surrounding tags may be omitted. However, some utilities may not recognize or correctly handle the document if this is done.

<html></html>

Delimit an HTML document (as opposed to, for example, an XML or other class of document). The HTML element takes the attributes lang for the primary language of the document (such as lang=en for English) and profile (rarely used) for a uniform resource identifier (URI) specifying metadata for the document. The only contents allowed in an HTML element are one head element and one body element. (See Frames, below, for an exception where body is not used.)

<head></head>

Delimit the header section of the document, which contains information about the page. The head element basically contains the metadata for the document. There are seven possible head elements including a meta element for extensible specification of metadata.

<body></body>

Delimit the body section of the document, which contains the content to be displayed on the page.

These root elements are arranged as follows:

<html>
<head>
Any of the various head elements, arranged in any order and occurring any number of times (except title and base, which can occur only once each). The only required head element is title.
</head>
<body>
One or more block elements and/or script elements, arranged in any order to suit the meaning and also typically the presentation of the document. The block elements are:
  • Paragraph (p)
  • Heading (h1...h6)
  • Block quotation (blockquote)
  • Ordered list (ol)
  • Unordered list (ul)
  • Definition list (dl)
  • Division (div)
  • No script support (noscript)
  • Form (form)
  • Table (table)
  • Fieldset (fieldset)
  • Address (address)
The ins and del elements are also permissible within the body element when used as block-level elements. No other HTML elements are valid within the body element.
</body>
</html>

[edit] Head elements

<title></title>

Define a document title. This element is required in every HTML and XHTML document. Different user agents may make use of the title in different ways. For example:
  • Web browsers usually display it in a window's title bar when the window is open, and in the task bar when the window is minimized.
  • It may become the default filename when saving the page.
  • Search engines' Web crawlers may pay particular attention to the words used in the title.
The title element must not contain any nested tags (that is, it cannot contain any other elements). Only one title element is permitted in a document.

<base>

Specifies a base URL for all relative href and other links in the document. Must appear before any element that refers to an external resource. HTML permits only one base element for each document. The base element has attributes, but no contents.

<link>

Specifies links to other documents, such as "previous" and "next" links, or alternate versions [1]. A common use is to link to external stylesheets, using the form
<link rel="stylesheet" type="text/css" href=url title=description_of_style>[2]
A document's head element may contain any number of link elements. The link element has attributes, but no contents.

<basefont> (deprecated)

Specifies a base font size, typeface, and color for the document. Used together with font elements. Deprecated in favor of stylesheets.

<script></script>

Used to add JavaScript or other scripts to the document. The script code may be typed literally between the script tags or may be given in a separate resource whose URL is specified with the script element's optional src attribute. [3]

<style></style>

Specifies a style for the document, usually in the form
<style type="text/css">…</style>
The CSS (Cascading Style Sheets) statements may be typed literally between the style tags or may be given in separate resources whose URLs are specified with @import directives of the form
<style> @import url; </style>. [4]

<object></object>

Used for including generic objects within the document header. Though rarely used within a head element, it could potentially be used to extract foreign data and associate it with the current document.

<meta>

Can be used to specify additional metadata about a document, such as its author, publication date, expiration date, page description, keywords, or other information not provided through the other header elements and attributes. Because of their generic nature, meta elements specify associative key-value pairs.
In one form, meta elements can specify HTTP headers which should be sent before the actual content when the HTML page is served from Web server to client: for example,
<meta http-equiv="foo" content="bar">
This specifies that the page should be served with an HTTP header called foo that has a value bar.
In the general form, a meta element specifies name and associated content attributes describing aspects of the HTML page. To prevent possible ambiguity, an optional third attribute, scheme, may be supplied to specify a semantic framework that defines the meaning of the key and its value: for example,
<meta name="foo" content="bar" scheme="DC">
In this example, the meta element identifies itself as containing the foo element, with a value of bar, from the DC or Dublin Core resource description framework.

[edit] Inline elements

Inline elements cannot be placed directly inside the body element; they must be wholly nested within block-level elements (see Block elements, below).

[edit] General phrase elements

<em></em>

Emphasis (conventionally displayed in italics)

<strong></strong>

strong emphasis (conventionally displayed bold). An oral user agent may use different voices for emphasis.

<q></q>

A quotation containing only inline elements (for quotations containing block level elements see blockquote below). Quote elements may be nested. By the specification, the author should not include quotation marks. Rather, quotation marks — including nested quotation marks — should be rendered through stylesheet properties or the browser's default stylesheet. Practical concerns due to browser non-compliance may force authors to find work-arounds. The cite attribute gives the source, and must be a fully qualified URI.
Note: within semantic HTML, the display of a lengthy inline quotation as an indented block should be handled through stylesheet presentation. One method is to use an XSLT to select quotation elements exceeding a certain length (in terms of characters or words, etc.) for indented block presentation. An alternative, using Cascading Stylesheets, requires some presentational markup to manually classify the element as a lengthy quotation. For example:
<q class='lengthy'>An inline quotation of significant length (say 25 words, for example) goes here...</q>.

<cite></cite>

A citation. Reference for a quote or statement in the document.

<dfn></dfn>

Defining first instance of a term

<abbr></abbr>

Contains an abbreviation, like abbr.

<acronym></acronym>

Similar to the abbr element, but contains an acronym, like HTML.

[edit] Computer code phrase elements

These elements are useful primarily for documenting computer code development and user interaction through differentiation of source code (<code>), source code variables (<var>), user input (<kbd>), and terminal output (<samp>).

<code></code>

A code snippet. Conventionally rendered in a monospace font: Code snippet.

<samp></samp>

Sample output (from a program or script)

<kbd></kbd>

Keyboard - text to be entered by the user

<var></var>

Variable

[edit] Special inline elements

<sub></sub> <sup></sup>

Create subscript or superscript text: Equivalent CSS: {vertical-align: sub} or {vertical-align: super}

<del></del>

Deleted text. Typically rendered as a strikethrough: Deleted text.

<ins></ins>

Inserted text. Often used to markup replacement text for <del>'d text. Typically rendered underlined: Inserted text.
Note, both ins and del elements may be used as block elements: containing other block and inline elements. However, these elements must still remain wholly within their parent element to maintain a well-formed HTML document. For example deleting text from the middle of one paragraph across several other paragraphs and ending in a final paragraph would need to use three separate del elements. Two del elements would be required as inline element to indicate the deletion of text in the first and last paragraphs, and a third, used as a block element, to indicate the deletion in the intervening paragraphs.

<isindex> (deprecated)

The :isindex element requires server side support for indexing documents. Visually presents a one-line text input for keyword entry. When submitted, the query string is appended to the current URL and the document is displayed with these keywords highlighted. Generally if the server supports this feature it will add the isindex elements to documents without author intervention.

[edit] Links and anchors

<a></a>

Creates an element that becomes a hyperlink with the href (hypertext reference[5]) attribute set to a URL; additionally the attribute title may be set to a hover box text, some informative text about the link:
<a href="URL" title="additional information">link text</a>
In most graphical browsers, when the cursor hovers over the link, the cursor typically changes into a hand with a stretched index finger and the additional information pops up, not in a regular window, but in a special "hover box", usually similar to a Tooltip, which disappears when the cursor is moved away. Some browsers render alt text the same way, though this is technically incorrect.
Alternatively (and sometimes concurrently), the element becomes an anchor with the name attribute set, which preceded by a number sign ' # ', and appended to the URL, acts as a link target in a URI scheme (a "document fragment"), typically causing a Web browser to scroll directly to that point of the page. Any element can be made into an anchor by using the id attribute,[6] so using <a name="foo"> is not necessary.
See also:link

[edit] Images and objects

<img>

Includes an image with the src attribute. The required alt attribute provides alternative text in case the image cannot be displayed. Alt is intended as alternative text, although Microsoft Internet Explorer renders it as a tooltip if no title is given; the title attribute is the tooltip text. It was proposed by Marc Andreessen.[7]

<br>

Specifies a line-break.
<map></map>
Specifies a client-side image map.

<area>

Specifies an area in the map.
<object></object>
Includes an object in the page of the type specified by the type attribute. This may be in any MIME-type the Web browser understands, such as an embedded page, code to be handled by a plug-in such as Flash, a Java applet, a sound file, etc.
<param>
This element may only appear inside an object element. Using attributes such as name and value, each <param> sets a parameter for the object. Examples include width, height, font, background colour, etc, depending on what has been exposed in this way by the object's developers.
<embed>…</embed> (proprietary)
Calls a plug-in handler for the type specified by the type attribute. Used for embedding Flash files, sound files, etc. This is a proprietary Netscape extension to HTML; <object> is the W3C standard method.
<noembed>…</noembed> (proprietary)
Specifies alternative content, if the embed cannot be rendered.
<applet></applet> (deprecated)
Includes a Java applet in the page. Deprecated; <object> is now preferred.

[edit] Span element

<span></span>
Creates an inline logical division. This may be used to identify a part of the HTML page, for example so as to apply an id or class attribute, which can then be referenced from CSS or DOM call. Like most HTML elements, span also supports inline CSS in its optional style attribute.

[edit] Block elements

Many HTML elements are designed for altering the semantic structure or meaning of a document. Some are block-level, but most are inline and can be included in the normal flow of text.

[edit] General block elements

<p></p>

Creates a paragraph, perhaps the most common block level element. The closing tag is not required in HTML.

<blockquote></blockquote>

Contains quoted material when the quotation itself includes block level elements (for instance, quoting several paragraphs). The cite attribute may give the source, and must be a fully qualified Uniform Resource Identifier. The blockquote element is often misunderstood. It is an element meant to contain quotations that are themselves block level. In other words, it contains a complete paragraph or many paragraphs. In HTML strict DTDs inline elements are prohibited from blockquote elements. For quotations not containing block level elements see the quote (q) element.

<hr>

Inserts a horizontal rule. Horizontal rules can also be handled through the CSS properties.

[edit] Headings

<h1></h1> <h2></h2> <h3></h3> <h4></h4> <h5></h5> <h6></h6>

Section headings at different levels. Use <h1> for the highest-level heading (the major sections), <h2> for the next level down (sub-section), <h3> for a level below that, and so on. The lowest level heading is <h6>.
Most web browsers will show <h1> as large text in a different font, and <h6> as small bold-faced text, but this can be overridden with CSS. The heading elements are not intended merely for creating large or bold text: they describe something about the document's structure and organization. Some programs use them to generate outlines and tables of contents.

[edit] Lists

<dl></dl>

Creates a definition list (consisting of definition terms paired with definitions). Can also be used to specify speakers and quoted text.

<dt></dt>

Creates a definition term.

<dd></dd>

Creates a definition.

<ol></ol> and <ul></ul>

Main article: ordered list
Creates an ordered (enumerated) or unordered (bulleted) list. With ol, the type attribute can be used to specify the kind of ordering, but CSS gives more control: {list-style-type: foo}. The default is Arabic numbering. For ul, CSS can be used to specify the list marker: {list-style-type: foo}. The default marker is a disc.

<li></li>

Creates a list item in ordered and unordered lists.

<dir></dir> (deprecated)

Delimits a directory listing. Deprecated in favor of <ul>.

<menu></menu> (deprecated)

Creates a menu listing. Should be more compact than an <ul> list, but badly supported. Deprecated in favor of <ul>.

[edit] Tables

Wikibooks
Wikibooks HTML Programming has a page on the topic of
<table></table>
Creates a table
<tr></tr>
Creates a row in the table.
<th></th>
Creates a table header cell within a row or a column; contents are conventionally displayed bold and centered. An aural user agent may use a louder voice for these items.
<td></td>
Creates a table data cell within a row.
<colgroup></colgroup>
Specifies a column group in a table.
<col>
Specifies attributes for an entire column in a table.
<caption></caption>
Specifies a caption for the entire table.
<thead></thead>
Specifies the header part of a table. This section may be repeated by the user agent if the table is split across pages (in printing or other paged media).
<tbody></tbody>
Specifies the main part of a table.
<tfoot></tfoot>
Specifies the footer part of a table. Like <thead>, this section may be repeated by the user agent if the table is split across pages (in printing or other paged media)

[edit] Forms

These elements can be combined into a form or used separately as user-interface controls. Combined with a first-class javascript engine, these controls provide support for rich user interfaces.

Main article: HTML form

HTML specifies the elements that make up a form, and the method by which it will be submitted. However, some form of script either server-side or client side must be used to process the user's input once it is submitted.

<form action="url"></form>

Creates a form.

<select name="xyz"></select>

Create a selection list, from which the user can select a single option. May be rendered as a dropdown list.

<option value="x">

Creates an item in a select list.

<input type="checkbox">

Creates a checkbox. Can be checked or unchecked.

<input type="radio">

Creates a radio button. If multiple radio buttons are given the same name, the user will only be able to select one of them from this group.

<input type="button">

Creates a general-purpose button. But it's advised to use <button> if possible (i.e. if the client supports it) which provides richer possibilities.

<input type="submit">

Creates a submit button.

<input type="image">

Creates a button using an image. The image URL may be specified with the src tag.

<input type="reset">

Creates a reset button for resetting the form to default values.

<input type="text">

Creates a one-line text input. The size attribute specifies the default width of the input in character-widths. Maxlength sets the maximum number of characters the user can enter (which may be greater than size).

<input type="password">

Creates a variation of one-line text input. The only difference is that inputted text is masked: usually every character is shown as an asterisk or a dot. It should be noted, however, that the value is submitted to the server as clear text. So an underlying secure transport layer like HTTPS is needed if confidentiality is a concern.

<input type="file">

creates a file select

<input type="hidden">

Is not visible in the rendered page, but allows a designer to maintain a copy of something that needs to be submitted to the server as part of the form. This may, for example, be data that this web user entered or selected on a previous form that needs to be processed in conjunction with the current form.

<label for="id"></label>

Creates a label for a form input (e.g. radio button). Clicking on the label fires a click on the matching input.

<textarea rows="8"></textarea>

Create a multiple-line text area, the size of which is specified by cols and rows attributes. Text in between the tags appears in the text area when the page is loaded.

[edit] Other containers

<div></div>
Creates a block logical division. This may be used to identify a part of the HTML document, for example so as to apply an id or class attribute, which can then be referenced from CSS or DOM calls. Like most HTML elements, div also supports inline CSS in its optional style attribute.

<pre></pre>

Creates pre-formatted text. Text within this element is typically displayed in a non-proportional font exactly as it is laid out in the file (see ASCII art). Whereas browsers ignore white space (for example, hard returns, carriage-returns, line-feeds, tabs, and repeated-spaces) for all of the other HTML elements, the pre element signals that this white space should be rendered as authored. With the CSS properties: {white-space: pre; font-family: monospace;}, other elements can be presented in the same way. This element can contain any inline element except: image (img), object (object), big font size (big), small font size (small), superscript (sup), and subscript (sub).

<address></address>

Used to markup contact information like address for the document or a section of it. Inline elements are the only permitted. Authors can use the break element br to insert a new line within this element.
<iframe></iframe>
Includes another HTML document in the page. Although not formally deprecated in HTML 4.01, "The object element allows HTML authors to specify everything required by an object for its presentation by a user agent [and] thus subsumes some of the tasks carried out by" iframe [8]
Unlike an object element, an iframe (inline frame) element may be the "target" frame for links defined in other elements and it may be "selected" by a browser as the focus for printing, viewing HTML source etc.[9]
The <iframe> tag must be closed by </iframe>. Otherwise the content after the <iframe> tag will be taken as alternative text to be displayed when the browser has no iframe support.
The iframe element was not included in the version 1.1 XHTML specification, despite having never been formally deprecated. Those serving their web content as XHTML 1.1 must use the object element.
(Also see Frames below.)

[edit] Frames

An HTML document may contain a header and a body or a header and a frameset, but not both. For frames the Frames DTD must be used.

<frameset></frameset>
Delimit the frameset. The frames layout is given by comma separated lists in the rows and cols attributes.
<frame></frame>
Delimit a single frame, or region, within the frameset. A different document linked with the src attribute appears inside.
<noframes></noframes>
Contains a normal <body> element with child elements that will appear in web browsers that don't support frames.
<iframe></iframe>
An inline frame inside a normal HTML <body>, which embeds another HTML document. A similar effect can also be achieved using the object element. These approaches differ in some ways (World Wide Web Consortium [10]).

See also Framing (World Wide Web).

[edit] Presentational markup

[edit] Official presentational markup

The use of presentational markup is discouraged. The equivalent CSS should be used instead. Deprecated elements are only valid in the Transitional and Frameset variants of HTML 4.01 and XHTML1.0. They are invalid in the Strict variants of HTML 4.01, XHTML1.0 and XHTML 1.1. <b>…</b> and <i>…</i> are invalid in the current draft of XHTML2.0.

<center></center> (deprecated)
Creates a block-level centered division. May also center-align all text. Deprecated in favor of <div> or another element with centering defined using CSS.
<b></b>
Use boldface type. Equivalent CSS: {font-weight: bold}
<i></i>
Use italic type. Equivalent CSS: {font-style: italic}
<big></big>
Creates bigger text. Equivalent CSS: {font-size: larger}.
<small></small>
Creates smaller text. Equivalent CSS: {font-size: smaller}
<s></s> (deprecated)
<strike></strike> (deprecated)
Create strike-through text: Strikethrough Equivalent CSS: {text-decoration: line-through}
<tt></tt>
Use a typewriter-like, also known as teletype, (fixed-width) font. Equivalent CSS: {font-family: monospace}
<u></u> (deprecated)
Use an underlined font. Equivalent CSS: {text-decoration: underline}
<font></font> (deprecated)

Use <span> instead.

<font [color=color] [size=size] [face=face]>…</font>
Can specify the font color with the color attribute, typeface with the face attribute, and absolute or relative size with the size attribute.

Examples (all the examples are deprecated, use CSS equivalents if possible):

  1. <font color="green">text</font> creates green text.
  2. <font color="#1f4099">text</font> creates text with hexadecimal color #1f4099.
  3. <font size="4">text</font> creates text with size 4. Sizes are from 1 to 7. The standard size is 3, unless otherwise specified in the <body> or other tags.
  4. <font size="+1">text</font> creates text with size 1 bigger than the standard.. <font size="-1">text</font> is opposite.
  5. <font face="Courier">text</font> makes text with Courier font.

Equivalent CSS for font attributes:

  • <font size="N"> corresponds to {font-size: Yunits} (the HTML specification does not define the relationship between size N and unit-size Y, nor does it define a unit).
  • <font color="red"> corresponds to {color: red}
  • <font face="Courier"> corresponds to {font-family: "Courier"}

[edit] Unofficial presentational markup

These are unofficial presentational elements that may not be supported in all browsers.

<blink>…</blink> (unofficial)
Causes text to blink. Can be done with CSS where supported: {text-decoration: blink}
<marquee>…</marquee> (unofficial)
Creates scrolling text. No equivalent with CSS; use scripting instead.
<blackface>…</blackface> (unofficial)
Supported only in WebTV browsers, generates extra bold lettering. Can be done with CSS where supported: {font-weight: 900}; with some fonts (such as Arial or Gill Sans), using "black" or "ultra bold" variants are more effective.
<shadow>…</shadow> (unofficial)
Supported only in WebTV browsers, generates a shadow underneath text. It can be simulated with CSS but the process is complicated.
<nobr>…</nobr> (unofficial)
Causes text to not return at end of line. Can be done with CSS: {white-space: nowrap}
<wbr> (unofficial)
Adds an optional line break the browser may use if required for wrapping. It performs the same function as zero width space (HTML entity &#8203;).

HTML comment

<!-- A Comment -->
Encloses a comment. This is an SGML construct and not limited to HTML, so it may appear anywhere in the document, even before the DTD or after </html>. None of its enclosed contents are rendered. For compatibility with some pre-1995 browsers, the contents of <style> and <script> elements are still sometimes surrounded by comment delimiters.

[edit] External links