CARVIEW |
RTC Accessibility User Requirements
W3C First Public Working Draft
- This version:
- https://www.w3.org/TR/2020/WD-raur-20200319/
- Latest published version:
- https://www.w3.org/TR/raur/
- Latest editor's draft:
- https://w3c.github.io/apa/raur/
- Editors:
- Joshue O Connor (W3C)
- Janina Sajka
- Jason White
- Michael Cooper (W3C)
Copyright © 2020 W3C® (MIT, ERCIM, Keio, Beihang). W3C liability, trademark and permissive document license rules apply.
Abstract
This document outlines various accessibility related user needs, requirements and scenarios for Real-time communication (RTC). These user needs should drive accessibility requirements in various related specifications and the overall architecture that enables it. It first introduces a definition of RTC as used throughout the document and outlines how RTC accessibility can support the needs of people with disabilities. It defines the term user needs as used throughout the document and then goes on to list a range of these user needs and their related requirements. Following that some quality related scenarios are outlined and finally a data table that maps the user needs contained in this document to related use case requirements found in other technical specifications.
This document is most explicitly not a collection of baseline requirements. It is also important to note that some of the requirements may be implemented at a system or platform level, and some may be authoring requirements.
Status of This Document
This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at https://www.w3.org/TR/.
This is a First Public Working Draft of RTC Accessibility User Requirements by the Accessible Platform Architectures Working Group. It is developed by the Research Questions Task Force (RQTF) who work to identify accessibility knowledge gaps and barriers in emerging and future web technologies. The requirements outlined here come from research into user needs that then provide the basis for any technical requirements. This version is published to collect public feedback on the requirements prior to finalization as a Working Group Note.
To comment, file an issue in the W3C APA GitHub repository adding the RAUR label. If this is not feasible, send email to public-apa@w3.org (archives). Comments are requested by 21 April 2020. In-progress updates to the document may be viewed in the publicly visible editors' draft.
This document was published by the Accessible Platform Architectures Working Group as a First Public Working Draft.
Publication as a First Public Working Draft does not imply endorsement by the W3C Membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.
This document was produced by a group operating under the W3C Patent Policy. The group does not expect this document to become a W3C Recommendation. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.
This document is governed by the 1 March 2019 W3C Process Document.
1. Introduction
What is Real-time communication (RTC)?
The traditional data exchange model is client to server. Real-time communication (RTC) is game-changing as it is enabled in part by specifications like WebRTC that provides real-time peer to peer audio, video and data exchange directly between supported user agents. This enables instantaneous applications for video and audio calls, text chat, file exchange, screen sharing and gaming, all without the need for browser plugins. However, WebRTC is not the sole specification with responsibility to enable accessible real-time communications, as use cases and requirements are broad - as outlined in the IETF RFC 7478 'Web Real-Time Communication Use Cases and Requirements' document. [ietf-rtc]
1.1 Real-time communication and accessibility
RTC has the potential to allow improved accessibility features that will support a broad range of user needs for people with a wide range of disabilities. These needs can be met through improved audio and video quality, audio routing, captioning, improved live transcription, transfer of alternate formats such as sign-language, text-messaging / chat, real time user support and status polling.
Accessible RTC is enabled by a combination of technologies and specifications such as those from the Media Working Group, Web and Networks IG, Second Screen, and Web Audio Working group as well as AGWG and ARIA. APA hopes this document will inform how these groups meet various responsibilities for enabling accessible RTC, as well updating related use cases in various groups. For examples, view the current work on WebRTC Next Version Use Cases First Public Working Draft. [webrtc-use-cases]
1.2 User needs definition
This document outlines various accessibility related user needs for Accessible RTC. These user needs should drive accessibility requirements for Accessible RTC and its related architecture.
User needs are presented here with their related requirements; some in a range of scenarios (which can be thought of as similar to user stories). User needs and requirements are being actively reviewing by RQTF/APA.
2. User needs and requirements
The following outlines a range of user needs and requirements. The user needs have also been compared to existing use cases for Real-time text (RTT) such as the IETF Framework for Real-time text over IP Using the IETF Session Initiation Protocol RFC 5194 and the European Procurement Standard EN 301 549. [rtt-sip] [EN301-549]
2.1 Incoming calls and caller ID
- User Need 1: A screen-reader user or user with a cognitive impairment needs to know a call is incoming and needs to recognise the ID of the caller.
- REQ 1a: Provide indicatation of incoming calls in an unobtrusive way via a symbol set or other browser notification.
- REQ 1b: Alert assistive technologies via relevant APIs.
2.2 Routing and communication channel control
- User Need 2: A blind user of both screen reader and braille output devices may need to manage audio and text output differently e.g. once they have accepted the call a user may wish to route notifications to a separate braille device while continuing the call on bluetooth headphones.
- REQ 2a: Provide or support a range of browser level audio output and routing options.
- REQ 2b: Allow controlled routing of alerts and other browser output to a braille device or other hardware.
- User Need 3: A deaf user needs to move parts of a live teleconference session (as separate streams) to one or more devices for greater control.
- REQ 3a: Allow the separate routing of video streams such as a sign language interpreter to a separate high resolution display.
- User Need 4: Users with cognitive disabilities or blind users may have relative volume levels set as preferences that relate to importance, urgency or meaning.
- REQ 4a: Allow the panning or setting of relative levels of different audio output.
- REQ 4b: Support multichannel audio in the browser.
2.3 Dynamic audio description values in live conferencing
- User Need 5: A user may struggle to hear audio description in a live teleconferencing situation.
- REQ 5a: Ensure Audio Description (AD) recommended sound values are dynamic.
2.4 Quality synchronisation and playback
- User Need 6: Any deaf or hard of hearing user watching captioning or audio description needs to be confident it is synchronised and accurate.
- REQ 6a: Ensure that any outages or loss to captioning or audio description will be repaired while preserving context and meaning.
- REQ 6b: Ensure that the integrity of related alternate supporting tracks or streams such as transcriptions, are in sync with any repairs.
2.5 Simultaneous voice, text & signing
- User Need 7: A deaf user needs to simultaneously talk on a call, send and receive instant messages via a text interface and watch sign language using a video stream.
- REQ 7a: Ensure support for multiple simultaneous streams.
2.6 Emergency calls: Support for Real-time text (RTT)
- User Need 8: In an emergency situation a deaf, speech impaired, hard of hearing or deaf blind user needs to make an emergency call, instantly send and receive related text messages and/or sign via a video stream.
- REQ 8a: Provide or ensure support for RTT in WebRTC.
2.7 Video relay services (VRS) and remote interpretation (VRI)
- User Need 9: A deaf, speech impaired, or hard of hearing user, needs to communicate on a call using a remote video interpretation service to access sign language and interpreter services.
- REQ 9a: Provide or ensure support for video relay and remote interpretation services. This user need may relate to interoperability with third-party services; IETF has looked at standardizing a way to use Session Initiation Protocol (SIP) with VRS services. [ietf-relay] VRS calls may be made between ASL (American Sign Language) users and hearing persons speaking either English or Spanish.
2.8 Distinguishing sent and received text with RTT
- User Need 10: A deaf or deaf blind user needs to tell the difference between incoming text and outgoing text.
- REQ 10a: Ensure when used with RTT functionality, WebRTC handles the routing of this information to a format or output of the users choosing.
2.9 Call participants and status
- User Need 11: In a teleconference a blind screen-reader user needs to know what participants are on the call, as well as their status.
- REQ 11a: Ensure participant details such as name and status; whether the person is muted or talking is accessible to users of Assistive Technologies.
2.10 Live transcription and captioning support
- User Need 12: A deaf user or user with a cognitive disability needs to access a channel containing live transcriptions or captioning during a conference call or broadcast.
- REQ 12a: Honor user preferences relating to live transcription and captioning e.g. provide support for signing or a related symbol set.
2.11 Assistance for users with cognitive disabilities
- User Need 13: Users with cognitive disabilities may need assistance when using audio or video communication.
- REQ 13a: Ensure a WebRTC video call can host a technical or user support channel.
- REQ 13b: Provide support that is customised to the needs of the user.
2.12 Personalized symbol sets for users with cognitive disabilities
- User Need 14: Users with cognitive disabilities may need to use symbol sets for identifying functions available in a WebRTC enabled client for voice, file or data transfer.
- REQ 14a: Provide personalization support for symbols set replacements of existing user interface rendering of current functions or controls.
2.13 Internet relay chat (IRC) style interfaces required by blind users
- User Need 15: To translate text to speech interactions into comprehensible speech; a blind screen reader user depending on TTS to interact with their computers and smart devices needs a traditional Internet Relay Chat (IRC) style interface.
- REQ 15a: Preserve IRC as a configuration option in agents that implement WebRTC as opposed to having only the Real-time text (RTT) type interface. While (RTT) is very imporant it is favoured by users who are deaf or hearing impaired. For screen reader users, TTS cannot reasonably translate text into comprehensible speech unless the characters to be pronounced are transmitted in very close timing to one another. Typical gaps will result in stuttering and highly unintelligible output from the TTS engine.
Braille users and RTT
Some braille users will also prefer the RTT model. However, braille users desiring text displayed with standard contracted braille might better be served in the manner users relying on text to speech (TTS) engines are served, by buffering the data to be transmitted until an end of line character is reached.
2.14 Deaf users: Video resolution and frame rates
- User Need 15: A deaf user watching a signed broadcast needs a high-quality frame rate to maintain legibility and clarity in order to understand what is being signed.
NOTE: EN 301 549 Section 6, recommends WebRTC applications should support a frame rate of at least 20 frames per second (FPS). More details can be found at Accessible Procurement standard for ICT products and services EN 301 549 (PDF)
3. Quality of service scenarios
3.1 Bandwidth for audio
Scenario: A hard of hearing user needs better stereo sound to have a quality experience in work calls or meetings with friends or family. Transmission aspects, such as decibel range for audio needs to be of high-quality. For calls, industry allows higher audio resolution but still mostly in mono only.
3.2 Bandwidth for video
Scenario: A hard of hearing user needs better stereo sound so they can have a quality experience in watching HD video or having a HD meeting with friends or family. Transmission aspects, such as frames per minute for video quality needs to be of high-quality.
NOTE: EN 301 549 Section 6, recommends for WebRTC enabled conferencing and communication the application shall be able to encode and decode communication with a frequency range with an upper limit of at least 7KHz. More details can be found at Accessible Procurement standard for ICT products and services EN 301 549 (PDF)
NOTE: WebRTC lets applications prioritise bandwidth dedicated to audio / video / data streams; there is also some experimental work in signalling these needs to the network layer as well as support for prioritising frame rate over resolution in case of congestion. [webrtc-priority]
4. Data table mapping user needs with related specifications
The following table maps the user needs and requirements presented in this document with other related specifications such as those defined in RFC 5194 - Framework for Real-time text over IP Using SIP and EN 301 549 - the EU Procurement Standard. [rtt-sip] [EN301-549]
Related specs or groups | Mapping to RFC 5194 - Framework for Real-time text over IP Using SIP: | Mapping to EN 301 549 - EU procurement standard | |
---|---|---|---|
Incoming calls | WCAG/AGWG, ARIA. | Similar to 6.2.4.2 Alerting - RFC 5194/ pre-session set up with RTT 6.2.1 | Maps to 6.2.2.2: Programmatically determinable send and receive direction |
Accessible call setup | WCAG/AGWG, ARIA. | Under 'General Requirements for Text over IP (ToIP) ' | No Mapping |
Routing | Media Working Group, Web and Networks IG, Second Screen. Audio Device Client Proposal may fulfil this need and allow complex routing and management of multiple audio input and output devices. | No Mapping | No Mapping |
Dynamic audio description values | Media Working Group, Web and Networks IG, Second Screen. | No Mapping | No Mapping |
Audio-subtitling/spoken subtitles | Media Working Group, Web and Networks IG, Second Screen. | No Mapping | No Mapping |
Communications control | Media Working Group, Web and Networks IG. Second Screen API may fulfil this user need. HTML5 supports exists and the streams need to be separable. Could be managed via a status bar. | Similar to R26 in 5.2.4. Presentation and User Requirements. | Maps to 6.2.1.2: Concurrent voice and text |
Text communication data channel | Media Working Group | Similar to R26 in RFC 5194 5.2.4. Presentation and User Requirements. NOTE: Very similar user requirement to 'Audio Routing and Communication channel control' | No Mapping |
Control relative volume and panning position for multiple audio | Web Audio Working Group. Multichannel may be covered by the Web Audio group space, and in audio device proposal, with some WebRTC requirements. | No Mapping | Maps to 6.2.1.2: Concurrent voice and text NOTE: Very similar user requirement to 'Audio Routing and Communication channel control' |
Support for Real-time text | WebRTC | Similar to R26 in RFC 5194 5.2.4. Presentation and User Requirements. NOTE: Very similar user requirement to 'Audio Routing and Communication channel control' | No Mapping |
Simultaneous voice, text & signing | Could be partially enabled via RTT in WebRTC. | Relates to RFC 5194 - under R2-R8 | No Mapping |
Support for video relay services (VRS) and remote interpretation (VRI) | May relate to interoperability with third-party services. | Relates to RFC 5194 - under R21-R23 | No Mapping |
Distinguishing sent and received text | May relate to interoperability with third-party services. This is not WebRTC specific and may be a user interface accessibility issue. | Relates to RFC 5194 - under R16 - but this does NOT fully address our use case requirement. | Maps to 6.2.2.1: Visually distinguishable display |
Warning and recovery of lost data | This is not WebRTC specific and may be a user interface accessibility issue. | Relates to RFC 5194 - under R14-R15 | No Mapping |
Quality of video resolution and frame rate | No Mapping | No Mapping | EN 301 549 Section 6, recommends WebRTC applications should support a frame rate of at least 20 frames per second. |
Assistance for older users or users with cognitive disabilities | Needs further clarification/review may be an accessible user interface or personalization issue. | Relates to RFC 5149 - Transport Requirements/Text over IP (ToIP) and Relay Services. | No Mapping |
Identify caller | WCAG/AGWG, ARIA. Needs further clarification/review may be an accessible user interface issue. Identity may be handled by the browser via Identity for WebRTC 1.0. | Similar to R27 in RFC 5194 5.2.4. Presentation and User Requirements | Maps to 6.3 Caller ID |
Live transcription and captioning | Browser APIs needed to implement this are available; needs better integration with third-party services (e.g. for sign language translation). Possibly covered by general requirements for Text over IP (ToIP) contained in RFC 5194. | Covered under 5.2.3 (transcoding service requirements). Referring to relay services that provide conversion from speech to text, or text to speech, to enable communication. | No Mapping |
A. Acknowledgments
A.1 Participants of the APA working group active in the development of this document:
- Judy Brewer, W3C
- Michael Cooper, W3C
- Dominique Hazael-Massieux, W3C
- Scott Hollier, Edith Cowan University & Centre For Accessibility
- Steve Lee, W3C
- Stephen Noble, Pearson Plc
- Estella Oncins Noguer, TransMedia UAB
- Joshue O Connor, W3C
- Janina Sajka, Invited Expert
- Jason White, Educational Testing Service
A.2 Enabling funders
This work is supported by the EC-funded WAI-Guide Project.
B. References
B.1 Informative references
- [EN301-549]
- Accessibility requirements suitable for public procurement of ICT products and services in Europe. CEN/CENELEC/ETSI. August 2018. URL: https://mandate376.standards.eu/standard
- [ietf-relay]
- Interoperability Profile for Relay User Equipment. IETF. August 2020. URL: https://tools.ietf.org/html/draft-ietf-rum-rue-02.html
- [ietf-rtc]
- Web Real-Time Communication Use Cases and Requirements. IETF. March 2015. URL: https://tools.ietf.org/html/rfc7478
- [rtt-sip]
- Framework for Real-Time Text over IP Using the Session Initiation Protocol (SIP). IETF, Network Working Group. June 2008. URL: https://tools.ietf.org/html/rfc5194
- [webrtc-priority]
- WebRTC DSCP Control API. W3C. 12 February 2020. URL: https://w3c.github.io/webrtc-priority/
- [webrtc-use-cases]
- WebRTC Next Version Use Cases. W3C. 11 December 2018. URL: https://www.w3.org/TR/webrtc-nv-use-cases/