Foundations

Training

Also called: pretraining

Training is the process that produces a model: showing it enormous amounts of text and adjusting its parameters until it gets good at predicting what comes next. It happens once, before you ever use the model.

James Phoenix
Understanding Data Updated July 2, 2026

Training is where a model comes from. You take an untrained network, feed it a vast corpus of text and code, and repeatedly nudge its parameters so its next-token predictions get less wrong. Do that at enormous scale and the model absorbs grammar, facts, coding patterns, and a surprising amount of reasoning ability. The output is the finished set of weights.

A one-time, frozen event

The two things worth remembering about training:

  • It is expensive and rare. Training a frontier model costs a fortune in compute and happens on a schedule set by the provider, not by you. You consume the result via inference.
  • It freezes knowledge in time. Whatever the model learned is fixed at the moment training ended. It has no awareness of anything that happened afterwards, which is the root of the knowledge-cutoff problem and a common source of confident-but-outdated answers.

Training vs. giving context

This is the single most useful implication for daily work: you do not "train" a coding agent by talking to it. Correcting it in a conversation changes nothing about its parameters. What you are actually doing is adjusting its context, the text in front of it right now. If you want it to know your codebase or your conventions, you supply that as context on each request, because the training door is closed.

Note
Fine-tuning is a smaller follow-on kind of training that specialises an existing model on extra data. It still produces a new frozen model; it is not the model learning live from your session.

Related terms

Building with AI agents?

This dictionary is part of how I think about agentic engineering. If you want the same thinking applied to your codebase, that is what I do.

See how I can help