CARVIEW |
Select Language
HTTP/2 200
date: Fri, 25 Jul 2025 08:32:08 GMT
content-type: text/html; charset=utf-8
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
etag: W/"9418d08a1f64e2930076ad78d83968a7"
cache-control: max-age=0, private, must-revalidate
strict-transport-security: max-age=31536000; includeSubdomains; preload
x-frame-options: deny
x-content-type-options: nosniff
x-xss-protection: 0
referrer-policy: no-referrer-when-downgrade
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
server: github.com
content-encoding: gzip
accept-ranges: bytes
set-cookie: _gh_sess=OBav%2BUsC6vOv%2FqjptSL0Aj8%2BwiwCMgPEPYhcHq2HaYLgOs1FbHfBL3o%2BY%2FKessBKAg70my3gfVVUpodTvS56vzaEMATZUZ7qGEnDXEf9PeX7EuU3WbM6vkdvuhNqU5LLOfvcVlt2Nv%2FIbYzzR2s%2B3NXPaeAbUuvRArmknqW7CMLoFMhkvFiRXinjXn8iRKvCkskvtbuorodDt20JWya%2FIly3FWHoUjuibV87bo%2Bwcg0NM%2BRULPmPqu1RlEXSgOzWoQFEfOEFy8DDaj%2Bh4jBgWg%3D%3D--cwsihNo11yVWuKlb--NVULaYeBT4IWOP1gbHhRiw%3D%3D; Path=/; HttpOnly; Secure; SameSite=Lax
set-cookie: _octo=GH1.1.1551641849.1753432328; Path=/; Domain=github.com; Expires=Sat, 25 Jul 2026 08:32:08 GMT; Secure; SameSite=Lax
set-cookie: logged_in=no; Path=/; Domain=github.com; Expires=Sat, 25 Jul 2026 08:32:08 GMT; HttpOnly; Secure; SameSite=Lax
x-github-request-id: BDB0:15F96:55CADE:677762:68834108
Release Data Formulator 0.2 · microsoft/data-formulator · GitHub
Loading
Skip to content
Navigation Menu
{{ message }}
-
Notifications
You must be signed in to change notification settings - Fork 1k
Compare
·
70 commits
to main
since this release
2fb5016
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Data Formulator 0.2 now supports working with large datasets, powered by the backend database!
Demonstration: Exploration of Metacritic's Best Games and Reviews - 2025
- This Kaggle dataset contains 13k+ games and 1.6M+ reviews of best games based on Metacritics reviews.
- Data source: https://www.kaggle.com/datasets/davutb/metacritic-games
- Exploration:
- What's the relation between user scores and critic scores?
- What are games where user reviews are really high but critic's scores are really low?
- How does the score distribution compare between critics and users?
df-demo-game-reviews.mp4
Release details: data visualization with large sized data.
Data Formulator integrates DuckDB as the backend local database to support data exploration with large datasets (million rows). It is also possible to connect external database with DuckDB, not all connection are supported at the moment, but that's the beginning!
- Upload large sized data to the local database, or connect to existing databases (mysql or postgres) to work with large data.
- A subset of sample data will be pulled to the frontend to explore, you can roll the dice 🎲 or sort the data by different columns to view different samples.
- Manage local database with the Database manager.
- Interaction with Data Formulator as usual:
- Use drag and drop to specify a chart, and Data Formulator can dynamically generate SQL query to fetch data to instantiate data. This process is quite fast!
- Specify new visualization fields / provide NL instructions as usual, and the newly introduced NL2SQL agents can generate SQL queries based on your instruction to prepare the data, and create visualizations.
- Anchor a dataset, followup, join some tables, can you can dive deep pretty fast into insights!
- (Minor feature updates)
- Updated how derived concept works in Data Formulator -- data transformation is executed in the backend and updated data is appended to the new dataset. New concepts can be applied directly to new dataset in one click.
- Improved system performance with configurable sandboxing options (main process versus subprocess) for LLM generated code (~3s interaction time reduction).
- Configurable default visualization size in the main panel.
More explorations on the demo dataset:
- What's your favorite games and how their review change over time?
- What's the franchise that consistently improved reviews?
- What are games that have most different reviews in different platforms?
- What are games with many positive critic reviews but no user bother to play?
- What about reviews trends for the No Man's sky?
Well, it is time to upgrade Data Formulator and play with it! Let us know what you come up with :)
Assets 2
16 people reacted
You can’t perform that action at this time.