2026-06-01 / ai, language models, representation learning

If LLMs Are Just Predicting the Next Word, How Can They Do Amazing Things?

People often say that large language models are "just predicting the next word." In a narrow technical sense, this is true. The model receives a sequence of text and predicts what token should come next. Then it predicts the next token after that, and the next one after that.

But this explanation is incomplete in the same way that saying "a plane is just moving air over wings" is incomplete. It describes the outer mechanism, but it does not explain the structure that makes the mechanism powerful.

Large language models do not usually predict full words. They predict tokens, which are pieces of text. Sometimes a token is a word. Sometimes it is part of a word, punctuation, or spacing. Still, the basic idea is simple: given the text so far, predict what should come next.

The important point is that some next tokens are much harder to predict than others.

If the sentence is "The cat sat on the," the next word might be "mat." That is mostly a language pattern. But if the prompt is "347 + 829 =", the next token depends on arithmetic. If the prompt asks for the meaning of a phrase in a contract, the answer depends on legal and social context. If the prompt says, "Rewrite this apology so it sounds sincere but not desperate," the continuation depends on emotional tone, intention, and judgment.

So yes, the model is predicting the next token. But to predict difficult tokens well, it has to learn much more than word order. It has to learn patterns about grammar, facts, code, numbers, people, objects, cause and effect, social expectations, and the way ideas relate to each other.

The Training Goal Is Simple. The Internal Work Is Not.

During training, the model has one main external objective: reduce prediction error. It is rewarded, mathematically speaking, for assigning higher probability to the actual next token in its training examples.

That objective sounds small. But the world represented in language is not small. Language contains mathematics, stories, software, instructions, jokes, arguments, plans, memories, opinions, recipes, scientific papers, contracts, emotions, and misunderstandings. A model that becomes good at predicting language has to compress many of those structures into its internal state.

This is where intermediate goals appear. The model is not explicitly told, "First understand the user's goal, then identify the relevant concepts, then reason through the answer." But those kinds of internal operations can become useful because they help with the prediction task.

To predict what comes after an equal sign, the model benefits from learning arithmetic structure. To predict the end of a story, it benefits from tracking characters and motivations. To predict a useful answer to a question, it benefits from representing what the question is really asking. To write in a specific tone, it benefits from representing sentiment, formality, confidence, and emotional state.

The outer goal is next-token prediction. The internal machinery may include concept tracking, planning, uncertainty estimation, style control, memory-like retrieval from learned patterns, and step-by-step transformation of ideas.

Humans Also Build Intermediate Models.

A useful comparison is the human brain. A human being has broad biological and social goals: survive, communicate, belong, avoid danger, find meaning, solve problems, and act in the world. But the brain does not store only one giant goal. It builds intermediate models.

We have concepts for people, places, tools, emotions, numbers, stories, danger, fairness, status, trust, and many other things. What stays in the human brain is not raw sensory input. It is compressed into patterns that are useful for prediction and action.

Large language models are different from humans in many important ways. They do not have bodies, biological needs, lived experience, or consciousness in the human sense. But they also compress experience into useful internal representations. Their experience is text and other training data. Their pressure is prediction. The result is a system that builds representations because representations help it predict.

From A to B Is a Series of Steps.

When a model answers a question, it does not jump magically from prompt to final answer. It moves through a chain of internal transformations.

First, the prompt is converted into internal vectors. These vectors carry information about words, phrases, structure, and context. Then many layers of the model update those representations. Each layer can add, remove, combine, or emphasize information. By the time the model predicts the next token, the original text has been transformed into a rich internal state.

That internal state may represent the topic, the user's likely intent, relevant facts, possible answer styles, constraints from the instruction, uncertainty about the answer, and the recent sequence of generated tokens. After the model emits one token, the context changes. The model then repeats the process for the next token.

This is why the phrase "just predicting the next word" hides the interesting part. The next token is the output. The question is how the model decides which token should come next. That decision can depend on a deep stack of learned structure.

What About Doubt and Internal State?

Models often produce language that sounds uncertain: "I am not sure," "it depends," or "there are several possibilities." This does not mean the model has human doubt or inner experience. But it can represent uncertainty in a functional sense.

If the prompt is ambiguous, or if the training data contains many different possible continuations for similar contexts, the model's probabilities may spread across multiple possible answers. The model may also have learned that careful, uncertain wording is appropriate in situations where evidence is incomplete.

So there is an internal state, but we should be precise about what that means. It is not necessarily a private conscious feeling. It is a computational state that can encode competing possibilities, confidence, contradictions, tone, and context.

How Do We Know There Are Concepts Inside?

There are several reasons to think these models form concepts rather than only memorizing strings.

First, similar meanings tend to have related internal representations. Ideas about countries, programming languages, animals, emotions, or mathematical operations are not scattered randomly. They often form patterns in the model's internal space.

Second, models can generalize. They can apply an idea in a new sentence, combine two concepts that appeared separately during training, or solve a problem that is phrased differently from examples they have seen before. Perfect generalization is not guaranteed, but the ability exists often enough that memorization alone is not a satisfying explanation.

Third, researchers can probe and intervene on internal activations. In some cases, changing a representation can change the model's behavior in a targeted way. If adjusting an internal direction changes sentiment, truthfulness, refusal behavior, or the representation of an entity, that suggests the model is using meaningful internal features.

None of this proves that models understand exactly as humans do. "Concept" does not have to mean human-like consciousness. It can mean a reusable internal representation that helps the system organize information and produce behavior across contexts.

The Better Explanation

The better explanation is this: next-token prediction is the training game, but the model wins that game by learning useful internal models of language and the world described by language.

Those internal models can include words, concepts, goals, sentiments, emotional states, logical relations, code structures, mathematical patterns, and social expectations. The model does not need to be explicitly programmed with all of these. They can emerge because they are useful for reducing prediction error.

So when someone says, "It is just predicting the next word," the right response is not to deny it. The right response is to ask: what does a system need to learn in order to predict the next word extremely well?

The answer is: a lot.

Predicting language at a high level requires modeling much of what language is about. It requires structure. It requires compression. It requires concepts. It requires tracking what has been said, what is likely intended, and what would make sense next.

That is why a simple training objective can produce surprisingly rich behavior. The objective is simple. The solution that the model discovers is not.

← Back to blog Back to home