I wrote 98 posts on my blog in July (that archive page was recently enhanced using OpenAI Codex). Here's your sponsors-only summary of the most important trends and highlights from the past month.
I've been spending a lot of time with Claude Code this month. I published a video showing how I used `claude --dangerously-skip-permissions` to add an automated table of contents to a README. I also wrote about using Claude Code to write, compile and run Mandelbrot in x86 assembly in a Docker container.
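For anyone who hasn't tried that pattern, here's a rough sketch of the kind of non-interactive invocation involved (the prompt is illustrative, not the exact one from the video). Since that flag skips every permission prompt, it's safest to run it in a fresh checkout or a disposable container:

```bash
# -p runs Claude Code non-interactively against a prompt;
# --dangerously-skip-permissions lets it edit files and run
# commands without asking for confirmation each time
claude --dangerously-skip-permissions \
  -p "Add a table of contents to README.md, derived from its headings"
```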
Working with Claude Code led me to the following idea:
Something I've realized about LLM tool use is that if you can reduce a problem to something that can be solved by an LLM in a sandbox using tools in a loop, you can brute force that problem.
The challenge then becomes identifying those problems and figuring out how to configure a sandbox for them, what tools to provide and how to define the success criteria for the model.
That still takes significant skill and experience, but it's at a higher level than chewing through that problem using trial and error by hand.
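Here's an entirely hypothetical sketch of that loop - the agent-sandbox Docker image is imaginary, and a test suite stands in for the success criteria:

```bash
# Hypothetical: "agent-sandbox" is an imaginary image with the project's
# toolchain installed. The test suite is the success criterion; the loop
# brute-forces the problem by re-running the agent until the tests pass.
while ! pytest -q; do
  docker run --rm -v "$PWD":/work -w /work agent-sandbox \
    claude --dangerously-skip-permissions \
    -p "Run the tests, read the failures and edit the code until they pass"
done
```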
I've also been experimenting a lot with OpenAI Codex - the tool that runs online (via the ChatGPT app) and files PRs against your code, as opposed to their Codex CLI tool, which is their version of Claude Code. I wrote about my most substantial experiment with that in Vibe scraping and vibe coding a schedule app for Open Sauce 2025 entirely on my phone.
There were so many new models released this month!
Grok 4 came out, followed by some embarrassing revelations - most notably that Grok would run a search for tweets `from:elonmusk` when asked for its opinion on controversial topics! This was fixed shortly afterwards by an update to the system prompt.
Google released Gemini 2.5 Flash-Lite, the least expensive model in their Gemini 2.5 family.
Mistral released their first audio-input models, Voxtral Small and Voxtral Mini. They also published detailed figures on their environmental impact and released an updated Codestral code autocomplete model.
It was a huge month for open weight models from Chinese AI labs. I wrote about the following:
- Moonshot Kimi-K2-Instruct - 11th July, 1 trillion parameters
- Qwen Qwen3-235B-A22B-Instruct-2507 - 21st July, 235 billion
- Qwen Qwen3-Coder-480B-A35B-Instruct - 22nd July, 480 billion
- Qwen Qwen3-235B-A22B-Thinking-2507 - 25th July, 235 billion
- Z.ai GLM-4.5 and GLM-4.5 Air - 28th July, 355 and 106 billion
- Qwen Qwen3-30B-A3B-Instruct-2507 - 29th July, 30 billion
- Qwen Qwen3-30B-A3B-Thinking-2507 - 30th July, 30 billion
- Qwen Qwen3-Coder-30B-A3B-Instruct - 31st July, 30 billion
These are all excellent models. I've been able to run the GLM-4.5 Air and Qwen-30B models on my 64GB M2 MacBook Pro laptop and I have been astonished at how useful they are. I started using a new benchmark, "Write an HTML and JavaScript page implementing space invaders", and got working games in a single shot from both GLM-4.5 Air and Qwen3-Coder-30B running directly on my own machine.
I wrote about those two in more detail, with extensive notes on how I ran them (a sketch of that kind of invocation follows the links below):
- My 2.5 year old laptop can write Space Invaders in JavaScript now, using GLM-4.5 Air and MLX
- Trying out Qwen3 Coder Flash using LM Studio and Open WebUI and LLM
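For a flavour of what running that benchmark locally looks like, here's a sketch using LLM with the llm-mlx plugin. The mlx-community model identifier here is an assumption - see the posts above for the exact quantizations I used:

```bash
# Install the MLX plugin, fetch a local model, then run the benchmark prompt.
# -x extracts the first fenced code block from the model's response.
llm install llm-mlx
llm mlx download-model mlx-community/GLM-4.5-Air-3bit
llm -m mlx-community/GLM-4.5-Air-3bit -x \
  'Write an HTML and JavaScript page implementing space invaders' \
  > invaders.html
```

Open invaders.html in a browser to see how the model did.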
There are two interesting trends here. First, it's now possible to run genuinely useful coding models directly on a high-end (32GB or 64GB) developer laptop. Second, the Chinese AI labs are now undeniably producing the best available open weight models.
OpenAI's open weight model is rumored to show up any day now. It has some substantial competition!
The IMO is the International Mathematical Olympiad, an annual mathematical competition for high school students that's been held since 1959. It's long been a goal of AI labs to produce a model that can compete in this contest at a high level.
This year two teams achieved gold medal performance at the IMO: one from OpenAI and one from Google DeepMind.
OpenAI announced first and got a lot of press coverage. The Gemini team announced later and there was then some dispute between the two teams about whether their announcement timings were compatible with the guidelines set out by the IMO themselves.
Both models scored the same, solving 5 of the 6 problems (the unsolved 6th was also the hardest for the human contestants). Notably, neither of the gold medal models had access to tools or internet search - they were able to reason through the problems using their model weights alone.
Google just released Gemini Deep Think for their $249.99/month Google AI Ultra subscribers - a close relative of the model that they used for the IMO.
One of the best ways I know of to level up as a prompt engineer is to reverse engineer the system prompts of other products and see how they work.
I wrote up three of those explorations in detail this month:
- Using GitHub Spark to reverse engineer GitHub Spark - GitHub Spark is GitHub's new prompt-to-app platform, and it has a fascinating system prompt which includes multiple paragraphs of instructions on how to implement good design.
- OpenAI's new study mode is a mode of ChatGPT designed to help you study without doing your homework for you - and it's implemented entirely as a system prompt.
- Reverse engineering some updates to Claude looks at two new Claude features - "create calendar event/create message" and "Upload PDFs, images, code files, and more to AI-powered apps" - and uses the system prompt to help explain what they are and how they work.
- My daily drivers have remained the same as last month: Claude Sonnet 4 for most things, OpenAI o3 for search and research tasks, both through their respective apps and websites.
- I've fully switched to Zed as my editor, because it uses so much less memory than VS Code.
- I'm running Claude Code a lot. I've also started tinkering with OpenAI's equivalent, codex-cli, to run Claude Code style tasks with their models.
- I continue to use my own LLM tool for other command-line tasks, defaulting to GPT-4.1 but often reaching for Gemini 2.5 Pro and o3 for harder tasks (see the short example after this list).
- The only time I use GPT-4o is for advanced voice mode. I wish they'd upgrade that to use a more powerful model!
- For local models I've been leaning more on LM Studio, especially now that they've changed their policy to allow commercial use of their free desktop app. I also still run Ollama for those, and frequently dabble with mlx-lm as well.
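To illustrate that LLM command-line workflow from the list above: assuming the relevant plugins and API keys are already configured (model IDs can vary with plugin versions), day-to-day usage looks roughly like this:

```bash
# Set the everyday default model, then override it per-invocation
llm models default gpt-4.1
llm 'Undocumented features of the sqlite3 command-line tool'
llm -m gemini-2.5-pro 'A harder reasoning task goes here'
llm -m o3 'Another hard task, this time using OpenAI o3'
```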
- If you're interested in LLM evals, Frequently Asked Questions (And Answers) About AI Evals by Hamel Husain and Shreya Shankar is essential reading.
- Another example of the lethal trifecta: Supabase MCP can leak your entire SQL database.
- Christopher Smith put together a delightful video introduction to my LLM tool: Become a command-line superhero with Simon Willison's llm tool.
- A paper on LLM programming productivity came out that got a lot of coverage: Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity. They found that developers using LLMs frequently over-estimated their productivity gains and often worked slower, not faster. Here are my own notes on that paper.
- Django celebrated its 20th birthday! I published an annotated version of a talk I gave on the 10th birthday about Django Origins.
If this newsletter was useful, feel free to forward it to friends who might find it useful too - especially if they might be convinced to sign up to sponsor me for the next one!
Thanks for your support,
Simon Willison https://simonwillison.net/
I'm available for consulting calls over Zoom or similar, you can contact me at contact@simonwillison.net
I also offer private remote workshops for teams, of both my Building software on top of Large Language Models workshop and a new workshop on Writing code with LLMs.