Updating LLMs: Fine-Tuning vs. Retraining, or Why ChatGPT’s Cut-Off Date Doesn’t Change
Consider this analogy: If a history book written in 2000 is to remain relevant in 2023, it either needs new chapters added or a complete rewrite. Similarly, large language models need either fine-tuning (adding new chapters) or retraining (complete rewrite) to remain current.
- Fine-tuning: This is essentially a continuation of training. By using saved weights and introducing new data, the model adjusts its previous knowledge. However, this process might result in the model leaning heavily towards its original training. It’s like adding a chapter to our history book — the core doesn’t change, but there’s an additional perspective.
- Retraining: This means starting from scratch. Every piece of data, old and new, is given equal weight. In terms of our book analogy, it’s a complete rewrite, ensuring that newer events are interwoven seamlessly into the story. But it’s resource-intensive — a long, taxing rewrite.
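The difference can be sketched in code. This is a deliberately tiny toy, a one-parameter model fit by gradient descent, not anything resembling how an actual LLM is trained; the point is only the shape of the two processes: fine-tuning resumes from saved weights and sees only the new data, while retraining reinitializes and weighs old and new data equally.

```python
def train(w, data, lr=0.1, steps=50):
    """Least-squares fit of a one-parameter model y = w * x by gradient descent."""
    for _ in range(steps):
        grad = sum(2 * x * (w * x - y) for x, y in data) / len(data)
        w -= lr * grad
    return w

old_data = [(1.0, 2.0), (2.0, 4.0)]   # the "pre-cutoff" world, where y = 2x
new_data = [(1.0, 3.0), (2.0, 6.0)]   # the "post-cutoff" world, where y = 3x

# Original training run: learns w ≈ 2 from the old data.
w_original = train(0.0, old_data)

# Fine-tuning: resume from the saved weight, briefly train on new data only.
# The result drifts toward the new optimum (3) but stays biased toward 2.
w_finetuned = train(w_original, new_data, steps=3)

# Retraining: start from scratch on old + new data, weighted equally.
# The result lands between the two worlds (here w ≈ 2.5).
w_retrained = train(0.0, old_data + new_data)
```

In the book analogy, `w_finetuned` is the old edition with a new chapter bolted on (it still mostly "believes" y = 2x), while `w_retrained` is the full rewrite.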
ChatGPT: Is OpenAI silently fine-tuning their model?
While ChatGPT often cites a 2022 knowledge cutoff, some users have noticed information from 2023 seeping in.
I think so. Fine-tuning is the economical way to add new information. But a fine-tuned model stays biased towards its original data: it might know of events from 2023, yet its core understanding would still reflect the worldview of 2022.
The outcome isn’t great. Ask ChatGPT to write code against a recent version of a framework, and it seems aware of some of the new features — but it mixes them with the old syntax.
So OpenAI doesn’t advertise a new cut-off date for the fine-tuned model, because it isn’t equivalent to a model retrained with that cut-off.