CARVIEW |
HTML5
A vocabulary and associated APIs for HTML and XHTML
W3C Last Call Working Draft 17 June 2014
- This Version:
- https://www.w3.org/TR/2014/WD-html5-20140617/
- Latest Published Version:
- https://www.w3.org/TR/html5/
- Latest Editor's Draft:
- https://www.w3.org/html/wg/drafts/html/CR/
- Previous Version:
- https://www.w3.org/TR/2014/CR-html5-20140429/
- Editors:
- W3C:
- Robin Berjon, W3C
- Steve Faulkner, The Paciello Group
- Travis Leithead, Microsoft
- Erika Doyle Navara, Microsoft
- Edward O'Connor, Apple Inc.
- Silvia Pfeiffer
- WHATWG:
- Ian Hickson, Google, Inc.
This specification is also available as a single page HTML document.
Copyright © 2014 W3C® (MIT, ERCIM, Keio, Beihang), All Rights Reserved. W3C liability, trademark and document use rules apply.
Abstract
This specification defines the 5th major revision of the core language of the World Wide Web: the Hypertext Markup Language (HTML). In this version, new features are introduced to help Web application authors, new elements are introduced based on research into prevailing authoring practices, and special attention has been given to defining clear conformance criteria for user agents in an effort to improve interoperability.
Status of This document
This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at https://www.w3.org/TR/.
The W3C HTML Working Group is the W3C working group responsible for this specification's progress. This specification is the 17 June 2014 Last Call Working Draft. This specification is intended to become a W3C Recommendation.
The scope of Last Call feedback on this specification is limited to changes that have taken place during the Candidate Recommendation phase. The Last Call period is expected to finish on 15 July 2014.
If you wish to make comments regarding this document in a manner that is tracked by the W3C, please submit them via using our our public bug database. If you cannot do this then you can also e-mail feedback to public-html-comments@w3.org (subscribe, archives), and arrangements will be made to transpose the comments to our public bug database. All feedback is welcome.
Work on extending this specification typically proceeds through extension specifications which should be consulted to see what new features are being reviewed.
The bulk of the text of this specification is also available in the WHATWG HTML Living Standard, under a license that permits reuse of the specification text.
This is a work in progress! For the latest updates from the HTML WG, possibly including important bug fixes, please look at the editor's draft instead.
Publication as a Last Call Working Draft does not imply endorsement by the W3C Membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.
The latest version of the editor's draft of this specification is always available in the W3C HTML git repository.
Work on this specification is also done at the WHATWG. The W3C HTML working group actively pursues convergence of the HTML specification with the WHATWG living standard, within the bounds of the W3C HTML working group charter. There are various ways to follow this work at the WHATWG:
- Commit-Watchers mailing list (complete source diffs): https://lists.whatwg.org/listinfo.cgi/commit-watchers-whatwg.org
- Annotated summary with unified diffs: https://html5.org/tools/web-apps-tracker
- Raw Subversion interface:
svn checkout https://svn.whatwg.org/webapps/
The following features are at risk and may be removed due to lack of implementation.
This document was produced by a group operating under the 5 February 2004 W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.
Table of Contents
- 1 Introduction
- 2 Common infrastructure
- 2.1 Terminology
- 2.2 Conformance requirements
- 2.3 Case-sensitivity and string comparison
- 2.4 Common microsyntaxes
- 2.5 URLs
- 2.6 Fetching resources
- 2.7 Common DOM interfaces
- 2.8 Namespaces
- 3 Semantics, structure, and APIs of HTML documents
- 3.1 Documents
- 3.2 Elements
- 3.2.1 Semantics
- 3.2.2 Elements in the DOM
- 3.2.3 Element definitions
- 3.2.4 Content models
- 3.2.5 Global attributes
- 3.2.5.1 The
id
attribute - 3.2.5.2 The
title
attribute - 3.2.5.3 The
lang
andxml:lang
attributes - 3.2.5.4 The
translate
attribute - 3.2.5.5 The
xml:base
attribute (XML only) - 3.2.5.6 The
dir
attribute - 3.2.5.7 The
class
attribute - 3.2.5.8 The
style
attribute - 3.2.5.9 Embedding custom non-visible data with the
data-*
attributes
- 3.2.5.1 The
- 3.2.6 Requirements relating to the bidirectional algorithm
- 3.2.7 WAI-ARIA
- 4 The elements of HTML
- 4.1 The root element
- 4.2 Document metadata
- 4.3 Sections
- 4.4 Grouping content
- 4.4.1 The
p
element - 4.4.2 The
hr
element - 4.4.3 The
pre
element - 4.4.4 The
blockquote
element - 4.4.5 The
ol
element - 4.4.6 The
ul
element - 4.4.7 The
li
element - 4.4.8 The
dl
element - 4.4.9 The
dt
element - 4.4.10 The
dd
element - 4.4.11 The
figure
element - 4.4.12 The
figcaption
element - 4.4.13 The
div
element - 4.4.14 The
main
element
- 4.4.1 The
- 4.5 Text-level semantics
- 4.5.1 The
a
element - 4.5.2 The
em
element - 4.5.3 The
strong
element - 4.5.4 The
small
element - 4.5.5 The
s
element - 4.5.6 The
cite
element - 4.5.7 The
q
element - 4.5.8 The
dfn
element - 4.5.9 The
abbr
element - 4.5.10 The
data
element - 4.5.11 The
time
element - 4.5.12 The
code
element - 4.5.13 The
var
element - 4.5.14 The
samp
element - 4.5.15 The
kbd
element - 4.5.16 The
sub
andsup
elements - 4.5.17 The
i
element - 4.5.18 The
b
element - 4.5.19 The
u
element - 4.5.20 The
mark
element - 4.5.21 The
ruby
element - 4.5.22 The
rb
element - 4.5.23 The
rt
element - 4.5.24 The
rtc
element - 4.5.25 The
rp
element - 4.5.26 The
bdi
element - 4.5.27 The
bdo
element - 4.5.28 The
span
element - 4.5.29 The
br
element - 4.5.30 The
wbr
element - 4.5.31 Usage summary
- 4.5.1 The
- 4.6 Edits
- 4.7 Embedded content
- 4.7.1 The
img
element- 4.7.1.1 Requirements for providing text to act as an alternative for images
- 4.7.1.1.1 Examples of scenarios where users benefit from text alternatives for images
- 4.7.1.1.2 General guidelines
- 4.7.1.1.3 A link or button containing nothing but an image
- 4.7.1.1.4 Graphical Representations: Charts, diagrams, graphs, maps, illustrations
- 4.7.1.1.5 Images of text
- 4.7.1.1.6 Images that include text
- 4.7.1.1.7 Images that enhance the themes or subject matter of the page content
- 4.7.1.1.8 A graphical representation of some of the surrounding text
- 4.7.1.1.9 A purely decorative image that doesn't add any information
- 4.7.1.1.10 Inline images
- 4.7.1.1.11 A group of images that form a single larger picture with no links
- 4.7.1.1.12 Image maps
- 4.7.1.1.13 A group of images that form a single larger picture with links
- 4.7.1.1.14 Images of Pictures
- 4.7.1.1.15 Webcam images
- 4.7.1.1.16 When a text alternative is not available at the time of publication
- 4.7.1.1.17 An image not intended for the user
- 4.7.1.1.18 Icon Images
- 4.7.1.1.19 Logos, insignia, flags, or emblems
- 4.7.1.1.20 CAPTCHA Images
- 4.7.1.1.21 Guidance for markup generators
- 4.7.1.1.22 Guidance for conformance checkers
- 4.7.1.1 Requirements for providing text to act as an alternative for images
- 4.7.2 The
iframe
element - 4.7.3 The
embed
element - 4.7.4 The
object
element - 4.7.5 The
param
element - 4.7.6 The
video
element - 4.7.7 The
audio
element - 4.7.8 The
source
element - 4.7.9 The
track
element - 4.7.10 Media elements
- 4.7.10.1 Error codes
- 4.7.10.2 Location of the media resource
- 4.7.10.3 MIME types
- 4.7.10.4 Network states
- 4.7.10.5 Loading the media resource
- 4.7.10.6 Offsets into the media resource
- 4.7.10.7 Ready states
- 4.7.10.8 Playing the media resource
- 4.7.10.9 Seeking
- 4.7.10.10 Media resources with multiple media tracks
- 4.7.10.11 Synchronising multiple media elements
- 4.7.10.12 Timed text tracks
- 4.7.10.12.1 Text track model
- 4.7.10.12.2 Sourcing in-band text tracks
- 4.7.10.12.3 Sourcing out-of-band text tracks
- 4.7.10.12.4 Guidelines for exposing cues in various formats as text track cues
- 4.7.10.12.5 Text track API
- 4.7.10.12.6 Text tracks exposing in-band metadata
- 4.7.10.12.7 Text tracks describing chapters
- 4.7.10.12.8 Event definitions
- 4.7.10.13 User interface
- 4.7.10.14 Time ranges
- 4.7.10.15 Event definitions
- 4.7.10.16 Event summary
- 4.7.10.17 Security and privacy considerations
- 4.7.10.18 Best practices for authors using media elements
- 4.7.10.19 Best practices for implementors of media elements
- 4.7.11 The
map
element - 4.7.12 The
area
element - 4.7.13 Image maps
- 4.7.14 MathML
- 4.7.15 SVG
- 4.7.16 Dimension attributes
- 4.7.1 The
- 4.8 Links
- 4.8.1 Links created by
a
andarea
elements - 4.8.2 Following hyperlinks
- 4.8.3 Downloading resources
- 4.8.4 Link types
- 4.8.4.1 Link type "
alternate
" - 4.8.4.2 Link type "
author
" - 4.8.4.3 Link type "
bookmark
" - 4.8.4.4 Link type "
help
" - 4.8.4.5 Link type "
icon
" - 4.8.4.6 Link type "
license
" - 4.8.4.7 Link type "
nofollow
" - 4.8.4.8 Link type "
noreferrer
" - 4.8.4.9 Link type "
prefetch
" - 4.8.4.10 Link type "
search
" - 4.8.4.11 Link type "
stylesheet
" - 4.8.4.12 Link type "
tag
" - 4.8.4.13 Sequential link types
- 4.8.4.14 Other link types
- 4.8.4.1 Link type "
- 4.8.1 Links created by
- 4.9 Tabular data
- 4.9.1 The
table
element - 4.9.2 The
caption
element - 4.9.3 The
colgroup
element - 4.9.4 The
col
element - 4.9.5 The
tbody
element - 4.9.6 The
thead
element - 4.9.7 The
tfoot
element - 4.9.8 The
tr
element - 4.9.9 The
td
element - 4.9.10 The
th
element - 4.9.11 Attributes common to
td
andth
elements - 4.9.12 Processing model
- 4.9.1 The
- 4.10 Forms
- 4.10.1 Introduction
- 4.10.2 Categories
- 4.10.3 The
form
element - 4.10.4 The
label
element - 4.10.5 The
input
element- 4.10.5.1 States of the
type
attribute- 4.10.5.1.1 Hidden state (
type=hidden
) - 4.10.5.1.2 Text (
type=text
) state and Search state (type=search
) - 4.10.5.1.3 Telephone state (
type=tel
) - 4.10.5.1.4 URL state (
type=url
) - 4.10.5.1.5 E-mail state (
type=email
) - 4.10.5.1.6 Password state (
type=password
) - 4.10.5.1.7 Date state (
type=date
) - 4.10.5.1.8 Time state (
type=time
) - 4.10.5.1.9 Number state (
type=number
) - 4.10.5.1.10 Range state (
type=range
) - 4.10.5.1.11 Color state (
type=color
) - 4.10.5.1.12 Checkbox state (
type=checkbox
) - 4.10.5.1.13 Radio Button state (
type=radio
) - 4.10.5.1.14 File Upload state (
type=file
) - 4.10.5.1.15 Submit Button state (
type=submit
) - 4.10.5.1.16 Image Button state (
type=image
) - 4.10.5.1.17 Reset Button state (
type=reset
) - 4.10.5.1.18 Button state (
type=button
)
- 4.10.5.1.1 Hidden state (
- 4.10.5.2 Implemention notes regarding localization of form controls
- 4.10.5.3 Common
input
element attributes- 4.10.5.3.1 The
maxlength
andminlength
attributes - 4.10.5.3.2 The
size
attribute - 4.10.5.3.3 The
readonly
attribute - 4.10.5.3.4 The
required
attribute - 4.10.5.3.5 The
multiple
attribute - 4.10.5.3.6 The
pattern
attribute - 4.10.5.3.7 The
min
andmax
attributes - 4.10.5.3.8 The
step
attribute - 4.10.5.3.9 The
list
attribute - 4.10.5.3.10 The
placeholder
attribute
- 4.10.5.3.1 The
- 4.10.5.4 Common
input
element APIs - 4.10.5.5 Common event behaviors
- 4.10.5.1 States of the
- 4.10.6 The
button
element - 4.10.7 The
select
element - 4.10.8 The
datalist
element - 4.10.9 The
optgroup
element - 4.10.10 The
option
element - 4.10.11 The
textarea
element - 4.10.12 The
keygen
element - 4.10.13 The
output
element - 4.10.14 The
progress
element - 4.10.15 The
meter
element - 4.10.16 The
fieldset
element - 4.10.17 The
legend
element - 4.10.18 Form control infrastructure
- 4.10.19 Attributes common to form controls
- 4.10.19.1 Naming form controls: the
name
attribute - 4.10.19.2 Submitting element directionality: the
dirname
attribute - 4.10.19.3 Limiting user input length: the
maxlength
attribute - 4.10.19.4 Setting minimum input length requirements: the
minlength
attribute - 4.10.19.5 Enabling and disabling form controls: the
disabled
attribute - 4.10.19.6 Form submission
- 4.10.19.7 Autofocusing a form control: the
autofocus
attribute - 4.10.19.8 Autofilling form controls: the
autocomplete
attribute
- 4.10.19.1 Naming form controls: the
- 4.10.20 APIs for the text field selections
- 4.10.21 Constraints
- 4.10.22 Form submission
- 4.10.23 Resetting a form
- 4.11 Scripting
- 4.12 Common idioms without dedicated elements
- 4.13 Disabled elements
- 4.14 Matching HTML elements using selectors
- 5 Loading Web pages
- 5.1 Browsing contexts
- 5.2 The
Window
object- 5.2.1 Security
- 5.2.2 APIs for creating and navigating browsing contexts by name
- 5.2.3 Accessing other browsing contexts
- 5.2.4 Named access on the
Window
object - 5.2.5 Garbage collection and browsing contexts
- 5.2.6 Closing browsing contexts
- 5.2.7 Browser interface elements
- 5.2.8 The
WindowProxy
object
- 5.3 Origin
- 5.4 Sandboxing
- 5.5 Session history and navigation
- 5.6 Browsing the Web
- 5.6.1 Navigating across documents
- 5.6.2 Page load processing model for HTML files
- 5.6.3 Page load processing model for XML files
- 5.6.4 Page load processing model for text files
- 5.6.5 Page load processing model for
multipart/x-mixed-replace
resources - 5.6.6 Page load processing model for media
- 5.6.7 Page load processing model for content that uses plugins
- 5.6.8 Page load processing model for inline content that doesn't have a DOM
- 5.6.9 Navigating to a fragment identifier
- 5.6.10 History traversal
- 5.6.11 Unloading documents
- 5.6.12 Aborting a document load
- 5.7 Offline Web applications
- 5.7.1 Introduction
- 5.7.2 Application caches
- 5.7.3 The cache manifest syntax
- 5.7.4 Downloading or updating an application cache
- 5.7.5 The application cache selection algorithm
- 5.7.6 Changes to the networking model
- 5.7.7 Expiring application caches
- 5.7.8 Disk space
- 5.7.9 Application cache API
- 5.7.10 Browser state
- 6 Web application APIs
- 7 User interaction
- 7.1 The
hidden
attribute - 7.2 Inert subtrees
- 7.3 Activation
- 7.4 Focus
- 7.5 Assigning keyboard shortcuts
- 7.6 Editing
- 7.7 Drag and drop
- 7.1 The
- 8 The HTML syntax
- 8.1 Writing HTML documents
- 8.2 Parsing HTML documents
- 8.2.1 Overview of the parsing model
- 8.2.2 The input byte stream
- 8.2.3 Parse state
- 8.2.4 Tokenization
- 8.2.4.1 Data state
- 8.2.4.2 Character reference in data state
- 8.2.4.3 RCDATA state
- 8.2.4.4 Character reference in RCDATA state
- 8.2.4.5 RAWTEXT state
- 8.2.4.6 Script data state
- 8.2.4.7 PLAINTEXT state
- 8.2.4.8 Tag open state
- 8.2.4.9 End tag open state
- 8.2.4.10 Tag name state
- 8.2.4.11 RCDATA less-than sign state
- 8.2.4.12 RCDATA end tag open state
- 8.2.4.13 RCDATA end tag name state
- 8.2.4.14 RAWTEXT less-than sign state
- 8.2.4.15 RAWTEXT end tag open state
- 8.2.4.16 RAWTEXT end tag name state
- 8.2.4.17 Script data less-than sign state
- 8.2.4.18 Script data end tag open state
- 8.2.4.19 Script data end tag name state
- 8.2.4.20 Script data escape start state
- 8.2.4.21 Script data escape start dash state
- 8.2.4.22 Script data escaped state
- 8.2.4.23 Script data escaped dash state
- 8.2.4.24 Script data escaped dash dash state
- 8.2.4.25 Script data escaped less-than sign state
- 8.2.4.26 Script data escaped end tag open state
- 8.2.4.27 Script data escaped end tag name state
- 8.2.4.28 Script data double escape start state
- 8.2.4.29 Script data double escaped state
- 8.2.4.30 Script data double escaped dash state
- 8.2.4.31 Script data double escaped dash dash state
- 8.2.4.32 Script data double escaped less-than sign state
- 8.2.4.33 Script data double escape end state
- 8.2.4.34 Before attribute name state
- 8.2.4.35 Attribute name state
- 8.2.4.36 After attribute name state
- 8.2.4.37 Before attribute value state
- 8.2.4.38 Attribute value (double-quoted) state
- 8.2.4.39 Attribute value (single-quoted) state
- 8.2.4.40 Attribute value (unquoted) state
- 8.2.4.41 Character reference in attribute value state
- 8.2.4.42 After attribute value (quoted) state
- 8.2.4.43 Self-closing start tag state
- 8.2.4.44 Bogus comment state
- 8.2.4.45 Markup declaration open state
- 8.2.4.46 Comment start state
- 8.2.4.47 Comment start dash state
- 8.2.4.48 Comment state
- 8.2.4.49 Comment end dash state
- 8.2.4.50 Comment end state
- 8.2.4.51 Comment end bang state
- 8.2.4.52 DOCTYPE state
- 8.2.4.53 Before DOCTYPE name state
- 8.2.4.54 DOCTYPE name state
- 8.2.4.55 After DOCTYPE name state
- 8.2.4.56 After DOCTYPE public keyword state
- 8.2.4.57 Before DOCTYPE public identifier state
- 8.2.4.58 DOCTYPE public identifier (double-quoted) state
- 8.2.4.59 DOCTYPE public identifier (single-quoted) state
- 8.2.4.60 After DOCTYPE public identifier state
- 8.2.4.61 Between DOCTYPE public and system identifiers state
- 8.2.4.62 After DOCTYPE system keyword state
- 8.2.4.63 Before DOCTYPE system identifier state
- 8.2.4.64 DOCTYPE system identifier (double-quoted) state
- 8.2.4.65 DOCTYPE system identifier (single-quoted) state
- 8.2.4.66 After DOCTYPE system identifier state
- 8.2.4.67 Bogus DOCTYPE state
- 8.2.4.68 CDATA section state
- 8.2.4.69 Tokenizing character references
- 8.2.5 Tree construction
- 8.2.5.1 Creating and inserting nodes
- 8.2.5.2 Parsing elements that contain only text
- 8.2.5.3 Closing elements that have implied end tags
- 8.2.5.4 The rules for parsing tokens in HTML content
- 8.2.5.4.1 The "initial" insertion mode
- 8.2.5.4.2 The "before html" insertion mode
- 8.2.5.4.3 The "before head" insertion mode
- 8.2.5.4.4 The "in head" insertion mode
- 8.2.5.4.5 The "in head noscript" insertion mode
- 8.2.5.4.6 The "after head" insertion mode
- 8.2.5.4.7 The "in body" insertion mode
- 8.2.5.4.8 The "text" insertion mode
- 8.2.5.4.9 The "in table" insertion mode
- 8.2.5.4.10 The "in table text" insertion mode
- 8.2.5.4.11 The "in caption" insertion mode
- 8.2.5.4.12 The "in column group" insertion mode
- 8.2.5.4.13 The "in table body" insertion mode
- 8.2.5.4.14 The "in row" insertion mode
- 8.2.5.4.15 The "in cell" insertion mode
- 8.2.5.4.16 The "in select" insertion mode
- 8.2.5.4.17 The "in select in table" insertion mode
- 8.2.5.4.18 The "in template" insertion mode
- 8.2.5.4.19 The "after body" insertion mode
- 8.2.5.4.20 The "in frameset" insertion mode
- 8.2.5.4.21 The "after frameset" insertion mode
- 8.2.5.4.22 The "after after body" insertion mode
- 8.2.5.4.23 The "after after frameset" insertion mode
- 8.2.5.5 The rules for parsing tokens in foreign content
- 8.2.6 The end
- 8.2.7 Coercing an HTML DOM into an infoset
- 8.2.8 An introduction to error handling and strange cases in the parser
- 8.3 Serializing HTML fragments
- 8.4 Parsing HTML fragments
- 8.5 Named character references
- 9 The XHTML syntax
- 10 Rendering
- 10.1 Introduction
- 10.2 The CSS user agent style sheet and presentational hints
- 10.3 Non-replaced elements
- 10.4 Replaced elements
- 10.5 Bindings
- 10.5.1 Introduction
- 10.5.2 The
button
element - 10.5.3 The
input
element as a text entry widget - 10.5.4 The
input
element as domain-specific widgets - 10.5.5 The
input
element as a range control - 10.5.6 The
input
element as a color well - 10.5.7 The
input
element as a checkbox and radio button widgets - 10.5.8 The
input
element as a file upload control - 10.5.9 The
input
element as a button - 10.5.10 The
marquee
element - 10.5.11 The
meter
element - 10.5.12 The
progress
element - 10.5.13 The
select
element - 10.5.14 The
textarea
element - 10.5.15 The
keygen
element
- 10.6 Frames and framesets
- 10.7 Interactive media
- 10.8 Print media
- 10.9 Unstyled XML documents
- 11 Obsolete features
- 12 IANA considerations
- Index
- References
- Acknowledgments