| CARVIEW |
Select Language
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Fri, 25 Jul 2025 03:04:32 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"6882f440-13b8"
expires: Sun, 28 Dec 2025 11:25:22 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: B628:234FE9:79B3D4:884A05:6951114A
accept-ranges: bytes
age: 0
date: Sun, 28 Dec 2025 11:15:22 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210098-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766920522.315727,VS0,VE206
vary: Accept-Encoding
x-fastly-request-id: cc5a38b5e4b6d0d5a80e375bbae1ac145808e050
content-length: 2013
GEM
GEM is a benchmark environment for Natural Language Generation with a focus on its Evaluation, both through human annotations and automated Metrics.
GEM aims to:
- measure NLG progress across many NLG tasks across languages.
- audit data and models and present results via data cards and model robustness reports.
- develop standards for evaluation of generated text using both automated and human metrics.
We will regularly update GEM and to encourage more inclusive practices in evaluation by extending existing data or developing datasets for additional languages.