At the core of most AI tools you’re using (ChatGPT, Claude, Gumloop) there’s a model. This is the engine that processes the text, image, or audio you send and generates a response.

How Large Language Models (LLMs) Actually Work

AI models are machines that try to predict the next word in a sequence based on the previous words. They’re called “large” because they’re trained on massive datasets—billions of web pages, books, and articles—and contain billions or even trillions of parameters that help them understand language patterns.
  1. A user provides input such as: “Who was the first president of the United States?”
  2. The model processes the input by mapping it against the vast data from its training and predicts the most likely next word
  3. After selecting a word, the model considers both the original prompt and its generated text to predict the subsequent word
  4. This process repeats iteratively until a complete answer is formed
[Figure: How models work — the model iteratively predicts the next word until a complete answer is formed.]
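The four steps above can be sketched as a simple loop. This is a minimal illustration, not a real model: `predict_next_word` is a hypothetical stand-in that returns canned continuations for the example prompt.

```python
# A toy sketch of the autoregressive loop described above.
# `predict_next_word` is a hypothetical stand-in for the model itself.

def predict_next_word(text: str) -> str:
    # Canned continuations for the example prompt (a real model would
    # predict these from patterns learned during training).
    answers = {
        "Who was the first president of the United States?": "George",
        "Who was the first president of the United States? George": "Washington",
    }
    return answers.get(text, "<end>")

def generate(prompt: str) -> str:
    text = prompt
    while True:
        word = predict_next_word(text)   # step 2: predict the most likely next word
        if word == "<end>":              # step 4: stop when the answer is complete
            break
        text = text + " " + word         # step 3: prompt + generated text so far
    return text[len(prompt):].strip()

print(generate("Who was the first president of the United States?"))
# prints: George Washington
```

Notice that each prediction takes the *entire* text so far as input; the loop itself carries no other state.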

From Single Response to Conversation

To go from a single response to a conversation, it’s more of the same process. When you send a follow-up message, the chatbot feeds the entire conversation—every message exchanged so far—back to the model. The model then predicts the next word based on all that context.
Models have no persistent memory. When you start a new conversation, the model starts completely fresh. It has no memory of previous chats. Each conversation is independent—the model only knows what’s in the current thread. In fact, the model doesn’t even have memory between words; it’s starting fresh with every word!
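Here’s a sketch of how a chatbot creates the illusion of memory by re-sending the full history on every turn. `call_model` is a hypothetical stand-in for an LLM API call, not a real client library.

```python
# Sketch: the model has no memory, so the chatbot re-sends the whole
# conversation history on every turn. `call_model` is a hypothetical
# stand-in for a real LLM API call.

def call_model(messages: list[dict]) -> str:
    # Toy model: just reports how much context it was handed.
    return f"(reply based on {len(messages)} messages of context)"

history: list[dict] = []

def chat(user_message: str) -> str:
    history.append({"role": "user", "content": user_message})
    reply = call_model(history)          # the ENTIRE history goes in, every time
    history.append({"role": "assistant", "content": reply})
    return reply

print(chat("Who was the first president?"))   # model sees 1 message
print(chat("When was he born?"))              # model sees 3 messages
```

Starting a new conversation is just `history = []`: with an empty list, the model has nothing to go on from earlier chats.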

The Intelligence vs. Speed Tradeoff

With new AI models being released regularly, and even multiple models from the same provider, how should you pick from the different options in Gumloop? Models sit on a spectrum where intelligence and speed are inversely related:
Model intelligence vs speed tradeoff
  • More capable models → Slower responses, fewer mistakes, higher cost
  • Faster models → Quicker responses, more potential for errors, lower cost

Meet the Model Families

| Best For | Anthropic | OpenAI | Google |
| --- | --- | --- | --- |
| Complex reasoning, nuanced tasks | Claude Opus 4.5 | GPT-5.2 | Gemini 3.0 |
| Most business use cases | Claude Sonnet 4.5 | GPT-5 | Gemini 2.5 Pro |
| Simple tasks, high volume | Claude Haiku | GPT-4.1 Mini | Gemini 2.5 Flash |
Each provider offers models across the spectrum. The naming varies, but the tradeoff is the same: more capable models are slower and cost more, faster models are cheaper but less reliable on complex tasks.

How to Choose the Right Model in Gumloop

Here’s a recommended strategy for choosing the right model in Gumloop:
  1. Start with an advanced model. Begin with a more capable model (like Claude Sonnet or GPT-5.2) to establish a quality baseline.
  2. Test your workflow and evaluate the results. Are they good enough?
  3. If yes, move down. Try a faster, cheaper model and test again.
  4. Keep iterating until you find the perfect balance: the fastest, most affordable model that still delivers quality you’re satisfied with.
When in doubt, start capable. It’s much easier to identify when a simpler model is “good enough” than to debug why your automation is producing mediocre results. Start smart, then optimize for speed and cost.
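The strategy can be sketched as a loop that walks down a list of models, from most capable to cheapest, and keeps the last one that still passes your quality check. The model names and the `run_workflow`/`is_good_enough` helpers are illustrative assumptions, not a real Gumloop API; in practice the quality check is you (or an eval) reviewing outputs.

```python
# Sketch of "start capable, then optimize": try models from most to least
# capable and keep the cheapest one whose output still passes the quality bar.
# Model names and both helper functions are hypothetical placeholders.

MODELS = ["claude-sonnet", "gpt-5-mini", "gemini-flash"]  # capable -> fast/cheap

def run_workflow(model: str) -> str:
    return f"output from {model}"

def is_good_enough(output: str) -> bool:
    # In practice: compare against the baseline you established with the
    # capable model. Toy check: pretend only the fastest model falls short.
    return "gemini" not in output

def pick_model() -> str:
    chosen = MODELS[0]                   # quality baseline from the capable model
    for model in MODELS:
        if is_good_enough(run_workflow(model)):
            chosen = model               # cheaper model still passes; keep going
        else:
            break                        # quality dropped; stop at the previous one
    return chosen

print(pick_model())
# prints: gpt-5-mini
```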

Key Takeaways

  • Models are next-word predictors — They generate responses by predicting one word at a time based on patterns in their training data
  • Chatbots are LLMs with context — They maintain conversations by feeding the entire chat history back to the model
  • No persistent memory — Each new conversation starts fresh; the model doesn’t remember previous chats
  • Intelligence vs. speed tradeoff — More capable models are slower and costlier; faster models may make more mistakes
  • Start advanced, then optimize — Begin with a capable model and work your way down to find the best balance for your use case
Now that you understand what models are and how they work, the next question is: how do we give AI access to our tools so it can actually do things for us? That’s what we’ll cover in the next lesson.