“Write about data analysis.”
A generic, unfocused paragraph that could apply to anyone.
No audience, goal, or format — the model guesses.
Tokenization
Language models don't see words — they see tokens, the chunks text is broken into. Type below and watch the split happen; token count is what you actually pay for and what fills the context window.
Common words are one token; rare or long words split into several — roughly ¾ of a word each.
Next-token prediction
At its core a model just predicts the next token from a probability distribution. Temperature controls the gamble: low is focused and repetitive, high is creative and risky. Sample to see it choose.
Prompt: “The future of AI is ▮”
Higher temperature flattens the odds — unlikely tokens get a real chance, so output gets more varied.
Prompt engineering
The same question gives wildly different answers depending on how you ask. Compare three levels of the same prompt and see what context, role, and constraints buy you.
“Write about data analysis.”
A generic, unfocused paragraph that could apply to anyone.
No audience, goal, or format — the model guesses.
“Explain data analysis to a marketing manager in 3 bullet points.”
Three focused bullets aimed at a marketer's concerns.
Adds audience + format, so it's on-target.
“You are a data strategist. In 3 bullets, tell a B2B marketing manager how to use data analysis to cut CAC, with one metric each and a concrete first step.”
Role-grounded, metric-driven, actionable advice ready to use.
Role + constraints + metrics + a first step = expert output.
Specificity, role, and constraints turn a vague reply into a usable one.