AI-101

Lesson 17

Context and Memory Management

AI-generated

Learning Objectives
  • Understand how AI context windows work
  • Know when conversations get too long (and what happens)
  • Use effective context-setting at conversation start
  • Manage information across long conversations
  • Know when to start fresh vs. continue
Introduction

One of the most frustrating things about AI: it forgets. You have a great conversation, come back the next day, and AI has no idea what you were talking about.

But even within a single conversation, strange things happen. AI might "forget" something you said earlier, or start contradicting itself, or lose track of your requirements.

Understanding how AI memory works is the key to working around these limitations. This lesson explains what is happening behind the scenes and gives you practical strategies for managing context effectively.

What Is a Context Window? (The AI's Working Memory)

Think of AI's context window like short-term memory. It can hold a certain amount of information, and everything it says is based on what is in that window.

The Technical Reality (Simplified)

When you chat with AI, every message in the conversation (yours and AI's) is fed into the model together. The model can only process a limited amount of text at once. This limit is the "context window."

ModelContext Window (approx)
Claude 3.5 Sonnet200,000 tokens (~150,000 words)
GPT-4128,000 tokens (~96,000 words)
GPT-3.516,000 tokens (~12,000 words)

A "token" is roughly 3/4 of a word. So 100,000 tokens is about 75,000 words.

What This Means Practically

  • Your entire conversation must fit in the context window
  • Older messages might get trimmed as conversations get very long
  • AI has no memory of previous conversations (each starts fresh)
  • Very long documents can fill up the window, leaving less room for conversation

The "It's All Text" Insight

AI does not actually remember anything. Every time you send a message, the entire conversation history is processed fresh. What feels like memory is just the conversation text being included in each new response.

Conversation Length: When Things Start to Drift

Long conversations can degrade in quality. Here is what happens and why.

Signs Your Conversation Is Too Long

  • AI starts forgetting requirements you stated earlier
  • Responses become inconsistent with earlier parts of the conversation
  • AI gets confused about what it already said vs. what you said
  • Quality or coherence noticeably drops
  • AI starts repeating itself or going in circles

Why This Happens

  1. Context trimming: Very long conversations may get truncated, losing early messages
  2. Attention dilution: AI has to process more text, potentially missing important details
  3. Accumulated confusion: Small misunderstandings compound over many exchanges
  4. Goal drift: The conversation's original purpose gets buried under new topics

The 15-20 Exchange Rule of Thumb

Conversations typically work well for 15-20 back-and-forth exchanges. Beyond that, consider whether a fresh start might serve you better.

This is not a hard rule. Complex projects can run longer. Simple Q&A can go much longer. But if quality is dropping, conversation length is often the cause.

Front-Loading Context: The Art of the First Message

Your first message in a conversation is critical. It sets the foundation for everything that follows.

What to Include in Opening Context

  • Who you are (relevant background)
  • What you need (the goal)
  • Important constraints (what to avoid, requirements)
  • Format preferences (how you want responses)

A Strong First Message Template

"I need help with [goal].

Background: I'm a [role/context] working on [project/situation].

Constraints:

  • [Constraint 1]
  • [Constraint 2]

For this conversation, please [communication preference].

Let's start with [specific first step]."

Example

"I need help planning my wedding.

Background: I'm getting married in 6 months. Budget is $20,000. We want something small (50 guests) but memorable.

Constraints:

  • Outdoor venue strongly preferred
  • Several guests have dietary restrictions (vegetarian, gluten-free)
  • I'm handling most planning myself (no wedding planner)

For this conversation, please be direct and give me actionable steps, not vague suggestions.

Let's start with: What should I book first, and how far in advance?"

Why Front-Loading Works

AI gives disproportionate attention to the beginning of conversations. By putting key information upfront, you ensure it shapes all subsequent responses.

Summarization Strategies: Keeping Long Conversations Useful

For conversations that need to go long, summarization keeps things on track.

Periodic Summarization

Every 10-15 exchanges, pause and consolidate:

"Before we continue, let's summarize where we are. What have we decided so far, and what's still open?"

This creates a fresh checkpoint that AI can reference.

The Context Reset

When conversations get unwieldy, you can reset without starting over:

"Let me recap our conversation so far: [your summary]. Going forward, use this recap as our starting point. What should we tackle next?"

This gives AI a clean, compressed version of the important context.

Document Building with Summaries

For long projects, maintain a running document:

  1. End each session: "Summarize what we accomplished and what's next"
  2. Start next session: "Here's where we left off: [paste summary]. Let's continue."

Checking AI's Understanding

Periodically verify AI is tracking correctly:

"Quick check: What do you understand my main requirements to be?"

If AI's answer reveals gaps or misunderstandings, correct them immediately.

Fresh Starts: When to Reset and Rebuild

Sometimes the best strategy is starting over. Know when to make that call.

Start Fresh When:

  • The conversation has accumulated too many confusing tangents
  • AI keeps making the same mistakes despite corrections
  • You have learned new information that changes everything
  • The original goal has significantly shifted
  • Quality has noticeably degraded

Making Fresh Starts Efficient

Do not just re-type your original question. Use what you learned:

  1. Summarize key context and decisions from the old conversation
  2. Identify what went wrong (so you can avoid repeating it)
  3. Write a strong first message with refined requirements
  4. Reference any outputs from the old conversation you want to build on

The "Paste and Continue" Technique

If you have useful output from a previous conversation:

"I'm starting a new conversation but want to build on previous work. Here's what I had: [paste]. Let's continue from here. Specifically, I want to [next step]."

Key Takeaways
  • Context window is finite: Everything in your conversation must fit; older content may be lost
  • Front-load important context: Your first message shapes everything after
  • Summarize periodically: Keep long conversations on track with checkpoints
  • Watch for drift: If quality drops or AI forgets things, the conversation may be too long
  • Fresh starts are strategic: Sometimes starting over with better context beats continuing
Try It Yourself

Test context management with this exercise:

  1. Start a conversation about planning something complex (a trip, event, or project)
  2. Have 10+ back-and-forth messages, adding details and making decisions
  3. Ask: "Summarize what you know about my preferences and requirements so far"
  4. Note what AI remembered and what it missed
  5. Start a new conversation with this opening: "I'm planning [same thing]. Here's what I've already decided: [paste AI's summary plus any corrections]. Let's continue from here."
  6. Compare the quality of responses in the new conversation vs. the end of the old one
Sources