HTTP/2 301
date: Sat, 11 Oct 2025 00:39:09 GMT
content-type: text/html; charset=utf-8
content-length: 0
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
location: https://github.blog/2017-01-19-github-data-ready-for-you-to-explore-with-bigquery/
cache-control: no-cache
strict-transport-security: max-age=31536000; includeSubdomains; preload
x-frame-options: deny
x-content-type-options: nosniff
x-xss-protection: 0
referrer-policy: origin-when-cross-origin, strict-origin-when-cross-origin
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com github.githubassets.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com wss://alive-staging.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com marketplace-screenshots.githubusercontent.com/ copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
server: github.com
set-cookie: _gh_sess=p%2F5ScmiMedeCE4ud9B8AAWl%2BjzdSkHy0CJJb8dRT07FrU4jWJCto%2FL1wJfCuzojmo8Fu4cPTSEF%2Fl3dBUi6CDud2cUe8XicsLq%2Bse70eo51Y5tpr7B1e6smCyFjnnI%2Fghh6XMY1LJ8OIOe51kkX7oF89BkEuvZPUNdxyBYyky6v4VpbYDNmPvCb%2FtNl%2FEHlUUo%2BVHIJwsMjNKyg6SWpst%2F11wi%2BHhS1LvtAxGS6HAjXL2JrGTu7%2BJXreejKOXwVbKezekg2o28e0bVYbmxQKTQ%3D%3D--Jo5liN313Mxe8ZUA--faSVc7NeFJk5TtH%2B10lQ6Q%3D%3D; Path=/; HttpOnly; Secure; SameSite=Lax
set-cookie: _octo=GH1.1.759121918.1760143149; Path=/; Domain=github.com; Expires=Sun, 11 Oct 2026 00:39:09 GMT; Secure; SameSite=Lax
set-cookie: logged_in=no; Path=/; Domain=github.com; Expires=Sun, 11 Oct 2026 00:39:09 GMT; HttpOnly; Secure; SameSite=Lax
x-github-request-id: B242:1D4E10:1A4DB7:265D2B:68E9A72D
HTTP/2 301
server: nginx
date: Sat, 11 Oct 2025 00:39:10 GMT
content-type: text/html; charset=utf-8
location: https://github.blog/engineering/github-data-ready-for-you-to-explore-with-bigquery/
x-redirect-by: Yoast SEO Premium
x-cache: MISS
x-rq: bom2 177 249 80
strict-transport-security: max-age=31536000;includeSubdomains;preload
HTTP/2 301
server: nginx
date: Sat, 11 Oct 2025 00:39:10 GMT
content-type: text/html; charset=UTF-8
location: https://github.blog/news-insights/research/github-data-ready-for-you-to-explore-with-bigquery/
x-hacker: If you're reading this, you should visit https://join.a8c.com/viphacker and apply to join the fun, mention this header.
x-powered-by: WordPress VIP
host-header: a9130478a60e5f9135f765b23f26593b
x-frame-options: SAMEORIGIN
x-redirect-by: WordPress
x-cache: MISS
x-rq: bom2 177 253 80
strict-transport-security: max-age=31536000;includeSubdomains;preload
HTTP/2 200
server: nginx
date: Sat, 11 Oct 2025 00:39:11 GMT
content-type: text/html; charset=UTF-8
vary: Accept-Encoding
x-hacker: If you're reading this, you should visit https://join.a8c.com/viphacker and apply to join the fun, mention this header.
x-powered-by: WordPress VIP
host-header: a9130478a60e5f9135f765b23f26593b
x-frame-options: SAMEORIGIN
link: ; rel="https://api.w.org/"
link: ; rel="alternate"; title="JSON"; type="application/json"
link: ; rel=shortlink
content-encoding: gzip
x-rq: bom2 177 253 80
cache-control: max-age=300, must-revalidate
accept-ranges: bytes
x-cache: MISS
strict-transport-security: max-age=31536000;includeSubdomains;preload
GitHub data, ready for you to explore with BigQuery - The GitHub Blog
GitHub data, ready for you to explore with BigQuery
GitHub data is available for public analysis using Google BigQuery, and we’d like to help you take it for a spin. If you’d like to find out more about what…
January 19, 2017
|
Updated May 7, 2021
GitHub data is available for public analysis using Google BigQuery , and we’d like to help you take it for a spin.
If you’d like to find out more about what data is available and how it’s been used so far, watch this conversation between GitHub Data Analyst Alyson La and Google Developer Advocate Felipe Hoffa . You’ll learn the story behind the datasets and what types of analysis they make possible. You’ll also see how we’ve visualized data with Tableau and Looker .
VIDEO
There’s a lot of data out there, but it’s all available through BigQuery in two large data sets. The original, community-led GitHub Archive project launched in 2012 and captures almost 30 million events monthly, including issues, commits, and pushes. Last year, we worked with Google to release The GitHub Public Data Set , separate tables with information on all projects that have open source licenses, including commits, file contents, and file paths.
You can also use the GH torrent project to complement the existing datasets with additional metadata.
We ran a list of queries on the datasets above to create the open source section of our Octoverse report, but anyone can run an analysis. Here are the results of some of the queries run so far.
“This should never have happened” has appeared in code comments more than a million times (hear this data point for yourself in this Changelog episode )
Where does open source happen? GitHub top countries shares which countries have the most open source developers per capita
How reliable is GitHub? Felipe runs a query to find out in GitHub reliability with BigQuery
There are a lot of feels in open source. Geeksta examines how emotions are expressed in GitHub commit messages
Are bigger pull requests better? Jessie Frazelle analyzed the top 15 projects on GitHub in terms of pull requests opened vs. pull requests closed
Happy exploring!
Related posts
In September, we experienced three incidents that resulted in degraded performance across GitHub services.
AI is changing how software gets built. Explore the skills you need to keep up and stand out.
Why the U.S. Supreme Court case Cox v. Sony matters for developers and sharing updates to our Transparency Center and Acceptable Use Policies.
Explore more from GitHub
Docs
Everything you need to master GitHub, all in one place.
GitHub
Build what’s next on GitHub, the place for anyone from anywhere to build anything.
Customer stories
Meet the companies and engineering teams that build with GitHub.
GitHub Universe 2025
Last chance: Save $700 on your IRL pass to Universe and join us on Oct. 28-29 in San Francisco.
We do newsletters, too Discover tips, technical guides, and best practices in our biweekly newsletter just for devs.