CARVIEW

MOTORHOMES

Select Language

HTTP/2 200 date: Fri, 10 Oct 2025 11:50:05 GMT content-type: text/html; charset=utf-8 content-encoding: gzip last-modified: Fri, 02 Dec 2011 15:44:39 GMT cache-control: max-age=21600 expires: Fri, 10 Oct 2025 17:50:05 GMT vary: Accept-Encoding x-backend: www-mirrors x-request-id: 98c5e5e80a88c1bf strict-transport-security: max-age=15552000; includeSubdomains; preload content-security-policy: frame-ancestors 'self' https://cms.w3.org/ https://cms-dev.w3.org/; upgrade-insecure-requests cf-cache-status: BYPASS set-cookie: __cf_bm=KemkmkFvicIyMdejyPKhZp6TInmr9OQPIG7LNLZstbU-1760097005-1.0.1.1-qocW4blJbKcGaW.vZZ4jpTg5d55cvTZCEV7oBXVy.QU307kerI1mCg6mtY2KIm_olgNs.74v1yWYBl9UxSIC0_mUhlB4GHjM07uya7OT5U4; path=/; expires=Fri, 10-Oct-25 12:20:05 GMT; domain=.w3.org; HttpOnly; Secure; SameSite=None server: cloudflare cf-ray: 98c5e5e80a88c1bf-BLR alt-svc: h3=":443"; ma=86400 Towards a score function for WCAG 2.0 benchmarking

Towards a score function for WCAG 2.0 benchmarking

A contribution of the eGovernment Monitoring (eGovMon) project.

Presented by

Annika Nietzio
Forschungsinstitut Technologie und Behinderung (FTB)
email: egovmon@ftb-volmarstein.de

How to develop a Web Accessibility Metric

Experiences from the development of the Unified Web Evaluation Methodology (UWEM) for WCAG 1.0

UWEM indicator refinement process

collection of requirements (crawling and sampling, mathematical and statistical properties, influence of features of the web content)
theoretical analysis (dependencies, potentially conflicting requirements)
experimental evaluation (comparison of result on real and synthetic data)
selection of score function

Lessons learnt

Experimental evaluation is vital.
The score function should be tailored to the structure of the test set.

Differences between WCAG 1.0 and WCAG 2.0

WCAG 1.0: test are independent.
WCAG 2.0: dependencies between Techniques (and the derived tests)
Logical combinations must be taken into account.

Example: 3.3.2 Labels or Instructions (Level A)

Test: Check that the purpose of a form field can be identified?
- H44: label elements
- H65: title attribute
- G167: adjacent button
Logical combination:

if ((H44:cause=no_label | H44:cause=label_empty) 
   & (H65:cause=title_missing | H65:cause=title_empty) 
   & G167:cause=empty_button_as_label)
      return error

Suggested score function and next steps

Web page score

Erratic number of tests per Success Criterion (or Checkpoint)
Accessibility score should not dependent on number of tests.
Solution: use Success Criteria as intermediary aggregation level.
The page score is calculated as the average of the SC-level page results.

Web site score

Aspects to be taken into account in web site score development:

How to accommodate results of tests that are applied on site level?
How to deal with conforming alternate versions?

Future work: A unified WCAG 2.0 score

A generally accepted practice for reporting WCAG 2.0 results does not yet exist.
Scores from WCAG 2.0 tools are not comparable. Differences are:
- granularity of tests
- counting the instances
- result categories (error, potential error, warning)
- reports: absolute numbers, percentages, more sophisticated scores

A unified WCAG 2.0 score

inter-tool reliability
collaboration between tool developers and researchers
- Define comparable tests on atomic level.
- Follow the logical combinations defined in "How to meet WCAG 2.0".
- Agree on indicator requirements.
- Define score function.

Original Source | Taken Source