Common Crawl - Open Repository of Web Crawl Data
Common Crawl is a 501(c)(3) non–profit founded in 2007.
Overview
Common Crawl maintains a free, open repository of web crawl data that can be used by anyone.
We make wholesale extraction, transformation and analysis of open web data accessible to researchers.
Over 300 billion pages spanning 15 years.
Free and open corpus since 2007.
Cited in over 10,000 research papers.
3–5 billion new pages added each month.
Latest Blog Post:

Web Graphs
Host- and Domain-Level Web Graphs October, November, December 2025
We are pleased to announce a new release of host-level and domain-level web graphs based on the crawls of October, November, and December 2025.

Thom Vaughan
Thom is a Principal Engineer at the Common Crawl Foundation.