LangGraph from Scratch, Part 7: Conversation Memory

Tell your bot from Part 6 your name. It will say something friendly back. Now ask it what your name is. It has no idea. Not because it's broken, but because it never had a chance: every message you send starts a brand-new conversation with no past in it. Your bot is sharp, it has tools, and it forgets you the instant you hit Send.

By the end of this page, that's fixed. Your bot holds a conversation across turns, a "New conversation" button starts a clean one, and a sidebar lets you flip between past chats, all with no database and about thirty lines of code.

The Lattice assistant with a Memory active status in its header. The conversation shows Alex introducing themself and the assistant correctly recalling the name later in the same thread. — Today's destination. You tell it your name, change the subject, ask again, and it remembers. Same graph, same stream; it stopped starting from zero every time.

This part is almost all about one idea with a friendly name: a checkpointer. It's the piece that lets the graph save where it was and pick back up. You'll add it in one line, hit a very loud error on purpose, fix it by teaching your app to label each conversation, and then build the UI on top. Let's cure the amnesia.

Your bot has the memory of a goldfish

Watch the failure first, because naming it is half the fix. You say your name, the bot greets you, you ask it to recall the name, and it draws a complete blank.

The Lattice interface before memory is added. Alex introduces themself, then the assistant says it does not know their name two messages later. — The bug, met on purpose. The bot greeted Alex one message ago and now swears they've never met. Nothing is broken; nothing is being remembered.

Here's why. Every time the frontend calls /chat, the backend runs graph.astream_events with exactly one message in it: the one you just typed. The graph runs, the model answers, the request ends, and everything the graph held is thrown away. The next message starts a fresh run with no trace of the last one. The model isn't forgetting; there's nothing to forget from. Each turn is the bot's first day on the job.

Comic in two panels. Panel one: Yad, a bearded developer with headphones, leans in warmly next to a small white robot, hand on his chest, saying 'AND THAT'S MY WHOLE STORY.' A goldfish bowl labeled PROD sits beside the robot. Panel two: a moment later the same robot waves at Yad with a blank, polite smile and says 'NICE TO MEET YOU!', while Yad slumps in exasperation and the goldfish swims in an oblivious circle. — In-memory state, taken literally. A goldfish has a famous nine-second memory; a bot whose only memory is one request is worse. The PROD label is the foreshadowing for Part 8.

Threads: one notebook per conversation

To remember a conversation, two things have to be true. The history has to be saved somewhere between requests, and your app has to know which saved history belongs to this chat. The second one is the part people skip, and it's where the whole design hinges.

Picture a drawer of notebooks, one per conversation. Each notebook has a label on its spine. When a message comes in carrying the label a1b2, the graph pulls that exact notebook, reads everything written in it so far, adds the new turn, and puts it back. A message labeled 9f3c opens a different notebook entirely. The label is the only thing that decides which memory you get.

A diagram titled 'Threads: the id is the label on the notebook'. On the left, a request box shows POST /chat with a JSON body containing a message and a thread_id of 'a1b2' highlighted in accent. An arrow labeled 'carries the id' points to a small box, THE GRAPH (llm plus tools, runs as before). From the graph, an arrow labeled 'opens a1b2, reads and writes' points into a panel labeled THE CHECKPOINTER, one notebook per conversation. Inside are three notebook cards: the active one tabbed 'a1b2' showing the Alex conversation, and two muted ones tabbed '9f3c' and 'c7d0' holding other conversations kept apart. — The whole mental model. The thread_id rides in with the request, the checkpointer opens the matching notebook, and the rest of the graph never has to think about it.

Giving the graph a memory

The thing that reads and writes those notebooks is a checkpointer. LangGraph ships one that keeps everything in your computer's memory, InMemorySaver, and wiring it in is genuinely one line of import and one line of use. Open graph.py and find the spot at the bottom where you compile the graph. Right now it reads graph = builder.compile(). Give it a checkpointer:

backend/app/graph.py

from langgraph.checkpoint.memory import InMemorySaver

checkpointer = InMemorySaver()
graph = builder.compile(checkpointer=checkpointer)

That's the whole backend change to enable memory. Everything above it, the state, the llm node, the ToolNode, the conditional edge, stays exactly as you left it in Part 6. compile(checkpointer=...) hands the graph a place to save its state after every step and to reload it before the next run.

The graph now demands to know who it's talking to

Save graph.py, restart the server, and send a message from the UI. Instead of a reply, the backend falls over. Read the error; it's about to tell you exactly what's missing.

A dark backend terminal showing a traceback. The call that failed is 'async for event in graph.astream_events(inputs, version=v2)' in token_stream, with frames descending into langgraph's pregel and checkpoint modules. The exception reads 'ValueError: Checkpointer requires one or more of the following configurable keys: thread_id, checkpoint_ns, checkpoint_id'. Comments note that the graph now demands to know which conversation this is, and that you gave it a memory but never told it the thread_id. A final line shows INFO: 500 Internal Server Error. — The deliberate break. The moment the graph has a memory, it refuses to run without knowing which conversation to use. This is LangGraph protecting you from silently mixing everyone's chats into one notebook.

ValueError: Checkpointer requires one or more of the following 'configurable' keys: thread_id, checkpoint_ns, checkpoint_id. You saw this family of error in Part 3 (missing API key) and Part 6 (missing Tavily key): the library checks a precondition up front and fails loudly instead of doing something quietly wrong. A graph with a checkpointer must be told which thread it's working on, every single call. You added the memory; now you have to hand it the notebook label. Right now you're passing none, so it stops you.

Telling it which conversation

Two small changes wire the thread through. First, the request needs to carry a thread_id, so add it to your ChatRequest model in main.py:

backend/app/main.py

class ChatRequest(BaseModel):
    message: str
    thread_id: str

Second, token_stream has to pass that id into the graph as config. LangGraph reads the thread from a nested dict under the configurable key, and that dict goes in as the second argument to astream_events. Update the generator and the endpoint that calls it:

backend/app/main.py

async def token_stream(message: str, thread_id: str):
    inputs = {"messages": [HumanMessage(content=message)]}
    config = {"configurable": {"thread_id": thread_id}}
    async for event in graph.astream_events(inputs, config, version="v2"):
        ...  # the token / tool_start / tool_end branches from Part 6, unchanged
    yield sse({"type": "done"})


@app.post("/chat")
async def chat(request: ChatRequest):
    return StreamingResponse(
        token_stream(request.message, request.thread_id),
        media_type="text/event-stream",
    )

The body of the event loop doesn't change at all; the three branches you wrote in Part 6 still handle tokens and tool calls exactly as before. The only new thing is config, slotted in as the second argument, carrying the one fact the checkpointer was asking for.

Now look at what inputs still is: a single message, the one the user just typed. Not the whole history. That feels wrong the first time, so here's the picture that makes it click.

A three-stage diagram titled 'Every turn: load, run, save'. Stage 1, LOAD: the checkpointer fills the tray with the saved history (you 'My name is Alex.', bot 'Nice to meet you, Alex.') plus your new line (you 'What's my name?' in accent). An arrow points to stage 2, RUN: the llm node reads the whole tray so it can answer in context. An arrow points to stage 3, SAVE: the reply is appended (bot 'Your name is Alex.' in green) and the notebook is written back, growing by two lines, ready for the next turn. — Why you only send the newest message. The checkpointer loads the saved history onto the tray before the node runs, and saves the grown tray after. You supply one line; it supplies the rest.

Remember the add_messages reducer from Part 3, the rule that appends to the message list instead of replacing it? This is its payoff. The checkpointer loads the saved messages onto the tray, your one new message gets appended by that reducer, the model sees the full conversation, and its reply gets appended and saved. You send one message; the graph remembers the rest. That's the trade the checkpointer makes for you on every turn.

The frontend picks a name for the chat

The backend is ready to remember, but the frontend isn't sending a thread_id yet. Open frontend/app/page.tsx. The plan: on first load, mint a random id and stash it in localStorage so it survives refreshes, then send it with every message.

Add a piece of state for the current thread and an effect that sets it up once:

frontend/app/page.tsx

const [threadId, setThreadId] = useState("");

useEffect(() => {
  let id = localStorage.getItem("thread_id");
  if (!id) {
    id = crypto.randomUUID();
    localStorage.setItem("thread_id", id);
  }
  setThreadId(id);
}, []);

On the first visit there's nothing stored, so crypto.randomUUID() mints a fresh id like a1b2c3d4-... and saves it. On every visit after, the stored id comes straight back, so the same browser keeps talking to the same notebook. Now send it: add thread_id to the body of your fetch:

frontend/app/page.tsx

body: JSON.stringify({ message: text, thread_id: threadId }),

One wrinkle to know about, because it explains a small change to the Send button. Effects run only in the browser, and there's no localStorage during server rendering, so threadId is an empty string on the very first render. That's why Send now reads disabled={!input.trim() || !threadId}: no id yet, no send, so a message can never leave without a notebook label attached to it.

Save both files, send your name, then ask for it back. The bot remembers. Same streaming, same tool bubbles, but now the conversation has a spine.

A button that starts fresh

One thread is a great start, but you'll want to begin a clean conversation without the old one bleeding in. That's a "New conversation" button, and it does four things: mint a new id, point localStorage at it, make it the active thread, and clear the screen along with any error still sitting there.

frontend/app/page.tsx

function newChat() {
  const id = crypto.randomUUID();
  localStorage.setItem("thread_id", id);
  setThreadId(id);
  setMessages([]);
  setError(null);
}

The old conversation isn't deleted; its notebook still sits in the checkpointer under its old id. You've just opened a brand-new blank one and pointed the UI at it. Add a prominent action beneath the brand in the desktop sidebar (the complete file below includes the matching compact mobile action):

frontend/app/page.tsx

<Button
  onClick={newChat}
  className="mt-5 h-11 w-full justify-start rounded-xl px-3 shadow-md shadow-primary/15"
>
  <Plus className="size-4" aria-hidden="true" />
  New conversation
</Button>

Click it and the screen wipes clean. Tell the new chat a different name, ask it back: it knows the new name and has no memory of the old thread. Two separate notebooks, exactly as the diagram promised.

The complete file at the end also refreshes the copy so the UI stops describing the Part 6 app: the status pill reads "Memory active", the desktop header says "Active conversation" over "Messages stay isolated inside this thread", the composer placeholder becomes "Message this conversation…" above the footnote "Lattice can remember earlier messages in this conversation.", and the empty state now opens with "Memory is ready" over "Start a conversation worth remembering.", its first starter prompt being "Remember that my name is Alex". Pure wording, zero behaviour, but it's the difference between an app that has memory and an app that says so.

Right now you have a bot that holds a real conversation, survives a page refresh, and can start over on demand. For most readers that's a perfect place to stop, and the part is complete. The rest is a stretch goal that's genuinely fun: a sidebar of every chat you've had, click to jump back in.

Stretch: a shelf of past conversations

Here's the honest scope before you start, because this is the one section in the series that adds real surface area. To list past chats you need to remember their ids on the frontend; to reopen one you need to rebuild its messages, which means a new backend endpoint that reads a thread's history out of the checkpointer. None of it is hard, but it's more moving parts than a one-line button. If you'd rather ship what you have, skip to the recap; nothing below changes what you already built.

Reading a thread back out of memory

The checkpointer already holds every conversation. You need a way to ask it for one. LangGraph exposes the saved state through get_state, and there's an async version, aget_state, that fits a async def endpoint cleanly. Add this to main.py:

backend/app/main.py

class StoredMessage(BaseModel):
    role: str
    content: str


@app.get("/threads/{thread_id}/messages")
async def thread_messages(thread_id: str) -> list[StoredMessage]:
    config = {"configurable": {"thread_id": thread_id}}
    snapshot = await graph.aget_state(config)
    messages = snapshot.values.get("messages", [])
    return [
        StoredMessage(role="user" if m.type == "human" else "assistant", content=m.content)
        for m in messages
        if m.type in ("human", "ai") and m.content
    ]

aget_state hands back a snapshot of the thread; snapshot.values is the saved state dict, and .get("messages", []) pulls the list out, falling back to empty for a thread that was never used. Each stored message carries a .type, one of human, ai, or tool, and we keep the human and assistant text, mapping it to the {role, content} shape the frontend already speaks. The and m.content filter drops the empty-bodied turns the model emits while it's calling a tool, so the rebuilt history reads like a clean conversation.

Prove the read works before touching the UI. Open a Python shell in your backend, import the graph, run two turns through it yourself under a thread_id of a1b2, then read that thread straight back out of the checkpointer with get_state:

A dark Python REPL in the backend. The session imports graph from app.graph and HumanMessage from langchain_core.messages, builds a config with thread_id 'a1b2', then runs two turns through the graph with graph.invoke: 'My name is Alex.' followed by 'What's my name?'. It then calls snapshot = graph.get_state(config) and loops printing m.type and m.content for every message in snapshot.values['messages']. The output lists the four stored messages: human 'My name is Alex.', ai 'Nice to meet you, Alex.', human 'What's my name?', ai 'Your name is Alex.' A comment notes this shell's checkpointer now holds the thread, read straight back out. — The conversation, read back out of the checkpointer. This is exactly what the new endpoint serves: the saved history of one thread, addressed by its id.

Why seed the conversation in the shell instead of asking it for the thread your browser just made? Because InMemorySaver lives inside one process, and this shell is a different process from the one running uvicorn. It has its own drawer of notebooks, so it can only read the threads it wrote itself; the server keeps its own drawer, and neither can see the other's. Harmless here, since the endpoint you just wrote runs inside the server and reads the server's drawer. Worth filing away, though: it's the same fact that comes for you at the end of this part and again in Part 8.

Listing and switching threads in the UI

Back in page.tsx. The frontend needs to remember which threads exist and let you click between them. Keep a small list of { id, title } objects in state and in localStorage, and load it by adding one line to the effect you already wrote for the thread id. No second effect; it reads from the same box at the same moment:

frontend/app/page.tsx

type Thread = { id: string; title: string };
const [threads, setThreads] = useState<Thread[]>([]);

useEffect(() => {
  // ...the thread_id lines from earlier, then:
  setThreadId(id);
  setThreads(JSON.parse(localStorage.getItem("threads") ?? "[]"));
}, []);

When a brand-new thread sends its first message, record it with that message as its title. Add this at the top of sendMessage, right where you append the user's message:

frontend/app/page.tsx

if (messages.length === 0) {
  const updated = [{ id: threadId, title: text }, ...threads];
  setThreads(updated);
  localStorage.setItem("threads", JSON.stringify(updated));
}

To reopen a thread, point at its id and pull its messages from the new endpoint. Two small functions handle it:

frontend/app/page.tsx

async function loadThread(id: string) {
  const res = await fetch(`${API_BASE}/threads/${id}/messages`);
  if (res.ok) setMessages(await res.json());
}

function switchThread(id: string) {
  localStorage.setItem("thread_id", id);
  setThreadId(id);
  loadThread(id);
}

loadThread asks the backend for a thread's history and drops it into messages; the {role, content} objects it returns already match your assistant and user bubbles, so they render with no extra work. switchThread makes that thread the active one so your next message continues it. Finally, render the list as a sidebar, which takes two layout changes first: the Card swaps flex-col max-w-6xl for flex-row max-w-7xl so the page splits left to right, and the header, message log and composer you already have get wrapped in <section className="flex min-w-0 flex-1 flex-col" aria-label="Lattice chat"> so they become the right-hand column. The sidebar then goes in above that section:

frontend/app/page.tsx

<aside className="hidden w-72 shrink-0 flex-col border-r bg-sidebar/75 p-4 lg:flex">
  <p className="mb-2 mt-7 px-2 text-[11px] font-semibold uppercase tracking-[0.18em] text-muted-foreground">
    Recent conversations
  </p>
  <nav className="chat-scrollbar min-h-0 flex-1 space-y-1 overflow-y-auto" aria-label="Conversation history">
    {threads.map((thread) => (
      <button
        key={thread.id}
        type="button"
        onClick={() => switchThread(thread.id)}
        aria-current={thread.id === threadId ? "page" : undefined}
        className={`flex w-full items-center gap-2.5 rounded-xl px-3 py-2.5 text-left text-sm transition focus-visible:outline-none focus-visible:ring-2 focus-visible:ring-sidebar-ring ${
          thread.id === threadId
            ? "bg-sidebar-accent font-medium text-sidebar-accent-foreground shadow-sm"
            : "text-muted-foreground hover:bg-sidebar-accent/60 hover:text-foreground"
        }`}
      >
        <MessageSquare className="size-4 shrink-0" aria-hidden="true" />
        <span className="truncate">{thread.title}</span>
      </button>
    ))}
  </nav>
</aside>

Each button is one saved thread; clicking it swaps the conversation under you. The active one gets the accent treatment so you always know which notebook you're writing in, and the focus-visible ring matters more than it looks: these buttons are the keyboard path between threads. The complete file adds two small comforts on top: a dashed line reading "Your conversation history will appear here." for when the list is still empty, and a "Thread memory / Stored by LangGraph" card pinned to the bottom of the sidebar, which you can see in the shot below.

The finished Lattice assistant with a Recent conversations sidebar. Three threads are listed, the Alex conversation is highlighted, a Thread memory card sits below, and the main panel shows the assistant recalling Alex's name. — The finished thing. Every conversation is a notebook on the shelf; click one to open it. Each lives under its own thread_id in the same checkpointer the bot reads on every turn.

The catch hiding in InMemorySaver

You built real memory without a database, and that's the magic trick, but it's worth knowing exactly how the trick works. InMemorySaver keeps every thread in a plain dict inside the running Python process. It's fast, it's zero-setup, and it vanishes the instant that process stops. Restart the server and every conversation is gone, every notebook blank. This is the goldfish bowl labeled PROD from the comic: it works beautifully right up until something restarts.

Here's the full frontend/app/page.tsx after this part, sidebar and all, in case a piece drifted while you wired it up:

frontend/app/page.tsx

"use client";

import { useEffect, useRef, useState, type FormEvent } from "react";
import {
  Bot,
  BrainCircuit,
  CircleAlert,
  MessageSquare,
  Plus,
  Send,
  Sparkles,
  Square,
  UserRound,
  Wrench,
} from "lucide-react";
import { Button } from "@/components/ui/button";
import { Input } from "@/components/ui/input";
import { Card } from "@/components/ui/card";

type Message =
  | { role: "user" | "assistant"; content: string }
  | { role: "tool"; name: string; args: Record<string, unknown>; result: string | null };

type ChatMessage = Extract<Message, { role: "user" | "assistant" }>;
type Thread = { id: string; title: string };

const API_BASE = process.env.NEXT_PUBLIC_API_BASE_URL;

const STARTER_PROMPTS = [
  "Remember that my name is Alex",
  "What is 23 × 17?",
  "Search for the latest LangGraph release",
];

function Brand() {
  return (
    <div className="flex min-w-0 items-center gap-3">
      <div className="grid size-10 shrink-0 place-items-center rounded-2xl bg-primary text-primary-foreground shadow-lg shadow-primary/20">
        <Sparkles className="size-5" aria-hidden="true" />
      </div>
      <div className="min-w-0">
        <p className="truncate text-base font-semibold tracking-tight">Lattice</p>
        <p className="truncate text-xs text-muted-foreground">A LangGraph assistant</p>
      </div>
    </div>
  );
}

function EmptyState({ onPick }: { onPick: (prompt: string) => void }) {
  return (
    <div className="mx-auto flex h-full w-full max-w-2xl flex-col items-center justify-center px-2 py-10 text-center">
      <div className="relative mb-6">
        <div className="absolute inset-0 scale-150 rounded-full bg-primary/15 blur-2xl" />
        <div className="relative grid size-16 place-items-center rounded-3xl border border-primary/20 bg-card shadow-xl shadow-primary/10">
          <BrainCircuit className="size-7 text-primary" aria-hidden="true" />
        </div>
      </div>
      <p className="mb-2 text-xs font-semibold uppercase tracking-[0.22em] text-primary">
        Memory is ready
      </p>
      <h1 className="text-balance text-3xl font-semibold tracking-tight sm:text-4xl">
        Start a conversation worth remembering.
      </h1>
      <p className="mt-3 max-w-lg text-pretty text-sm leading-6 text-muted-foreground sm:text-base">
        Each conversation has its own LangGraph thread. Return later and Lattice can pick up where you left off.
      </p>
      <div className="mt-8 grid w-full gap-2 sm:grid-cols-3">
        {STARTER_PROMPTS.map((prompt) => (
          <button
            key={prompt}
            type="button"
            onClick={() => onPick(prompt)}
            className="rounded-2xl border bg-card/70 px-4 py-3 text-left text-sm leading-5 text-foreground shadow-sm transition hover:-translate-y-0.5 hover:border-primary/35 hover:shadow-md focus-visible:outline-none focus-visible:ring-2 focus-visible:ring-ring"
          >
            {prompt}
          </button>
        ))}
      </div>
    </div>
  );
}

function ChatBubble({ message, streaming }: { message: ChatMessage; streaming: boolean }) {
  const isUser = message.role === "user";

  return (
    <article className={`flex items-end gap-3 ${isUser ? "justify-end" : "justify-start"}`}>
      {!isUser ? (
        <div className="grid size-8 shrink-0 place-items-center rounded-xl bg-primary/10 text-primary">
          <Bot className="size-4" aria-hidden="true" />
        </div>
      ) : null}
      <div className={`flex max-w-[82%] flex-col sm:max-w-[72%] ${isUser ? "items-end" : "items-start"}`}>
        <p className={`mb-1.5 px-1 text-[11px] font-medium uppercase tracking-[0.16em] text-muted-foreground ${isUser ? "text-right" : "text-left"}`}>
          {isUser ? "You" : "Lattice"}
        </p>
        <div
          className={
            isUser
              ? "rounded-[1.35rem] rounded-br-md bg-primary px-4 py-3 text-sm leading-6 text-primary-foreground shadow-lg shadow-primary/15"
              : "min-h-12 rounded-[1.35rem] rounded-bl-md border bg-card px-4 py-3 text-sm leading-6 shadow-sm"
          }
        >
          {message.content}
          {streaming ? (
            <span className="ml-1 inline-block h-4 w-0.5 animate-pulse rounded-full bg-primary align-middle" aria-hidden="true" />
          ) : null}
        </div>
      </div>
      {isUser ? (
        <div className="grid size-8 shrink-0 place-items-center rounded-xl bg-secondary text-secondary-foreground">
          <UserRound className="size-4" aria-hidden="true" />
        </div>
      ) : null}
    </article>
  );
}

function ToolCallBubble({ name, args, result }: {
  name: string;
  args: Record<string, unknown>;
  result: string | null;
}) {
  const argText = Object.values(args).join(", ");
  const running = result === null;

  return (
    <article className="flex items-start gap-3">
      <div className="grid size-8 shrink-0 place-items-center rounded-xl bg-amber-500/[0.12] text-amber-600">
        <Wrench className="size-4" aria-hidden="true" />
      </div>
      <div className="w-full max-w-[82%] rounded-2xl border border-amber-500/25 bg-amber-500/[0.06] px-4 py-3 shadow-sm sm:max-w-[72%]">
        <div className="mb-2 flex items-center justify-between gap-3">
          <p className="text-[11px] font-semibold uppercase tracking-[0.16em] text-amber-600">
            Tool activity
          </p>
          <span className="inline-flex items-center gap-1.5 text-[11px] text-muted-foreground">
            <span className={`size-1.5 rounded-full ${running ? "animate-pulse bg-amber-500" : "bg-emerald-500"}`} />
            {running ? "Running" : "Complete"}
          </span>
        </div>
        <code className="block break-all text-xs font-semibold text-foreground">
          {name}({argText})
        </code>
        <p className="mt-2 line-clamp-3 break-all text-xs leading-5 text-muted-foreground" aria-live="polite">
          {running ? "Waiting for the tool result…" : result}
        </p>
      </div>
    </article>
  );
}

export default function Chat() {
  const [messages, setMessages] = useState<Message[]>([]);
  const [input, setInput] = useState("");
  const [loading, setLoading] = useState(false);
  const [error, setError] = useState<string | null>(null);
  const [controller, setController] = useState<AbortController | null>(null);
  const [threadId, setThreadId] = useState("");
  const [threads, setThreads] = useState<Thread[]>([]);
  const bottomRef = useRef<HTMLDivElement>(null);

  useEffect(() => {
    let id = localStorage.getItem("thread_id");
    if (!id) {
      id = crypto.randomUUID();
      localStorage.setItem("thread_id", id);
    }
    setThreadId(id);
    setThreads(JSON.parse(localStorage.getItem("threads") ?? "[]"));
  }, []);

  useEffect(() => {
    bottomRef.current?.scrollIntoView({ behavior: "smooth" });
  }, [messages]);

  function appendToken(token: string) {
    setMessages((prev) => {
      const last = prev[prev.length - 1];
      if (last && last.role === "assistant") {
        const next = [...prev];
        next[next.length - 1] = { ...last, content: last.content + token };
        return next;
      }
      return [...prev, { role: "assistant", content: token }];
    });
  }

  function startTool(name: string, args: Record<string, unknown>) {
    setMessages((prev) => [...prev, { role: "tool", name, args, result: null }]);
  }

  function endTool(result: string) {
    setMessages((prev) => {
      const next = [...prev];
      for (let i = next.length - 1; i >= 0; i--) {
        const message = next[i];
        if (message.role === "tool" && message.result === null) {
          next[i] = { ...message, result };
          break;
        }
      }
      return next;
    });
  }

  function newChat() {
    const id = crypto.randomUUID();
    localStorage.setItem("thread_id", id);
    setThreadId(id);
    setMessages([]);
    setError(null);
  }

  async function loadThread(id: string) {
    const res = await fetch(`${API_BASE}/threads/${id}/messages`);
    if (res.ok) setMessages(await res.json());
  }

  function switchThread(id: string) {
    localStorage.setItem("thread_id", id);
    setThreadId(id);
    loadThread(id);
  }

  function stop() {
    controller?.abort();
  }

  async function sendMessage(e: FormEvent) {
    e.preventDefault();
    const text = input.trim();
    if (!text || loading) return;

    if (messages.length === 0) {
      const updated = [{ id: threadId, title: text }, ...threads];
      setThreads(updated);
      localStorage.setItem("threads", JSON.stringify(updated));
    }

    setMessages((prev) => [...prev, { role: "user", content: text }]);
    setInput("");
    setLoading(true);
    setError(null);

    const controller = new AbortController();
    setController(controller);

    try {
      const res = await fetch(`${API_BASE}/chat`, {
        method: "POST",
        headers: { "Content-Type": "application/json" },
        body: JSON.stringify({ message: text, thread_id: threadId }),
        signal: controller.signal,
      });
      if (!res.ok || !res.body) throw new Error();

      const reader = res.body.getReader();
      const decoder = new TextDecoder();
      let buffer = "";

      while (true) {
        const { value, done } = await reader.read();
        if (done) break;
        buffer += decoder.decode(value, { stream: true });
        const parts = buffer.split("\n\n");
        buffer = parts.pop() ?? "";
        for (const part of parts) {
          if (!part.startsWith("data: ")) continue;
          const envelope = JSON.parse(part.slice(6));
          if (envelope.type === "token") appendToken(envelope.content);
          else if (envelope.type === "tool_start") startTool(envelope.name, envelope.args);
          else if (envelope.type === "tool_end") endTool(envelope.result);
        }
      }
    } catch (err) {
      if ((err as Error).name !== "AbortError") {
        setError("Could not reach the backend. Is it running on :8000?");
      }
    } finally {
      setLoading(false);
      setController(null);
    }
  }

  const hasCurrentThread = threads.some((thread) => thread.id === threadId);

  return (
    <main className="relative min-h-dvh overflow-hidden p-3 sm:p-5 lg:p-7">
      <Card className="relative mx-auto flex h-[calc(100dvh-1.5rem)] min-h-[34rem] max-w-7xl flex-row gap-0 overflow-hidden rounded-[1.75rem] border-foreground/10 bg-card/95 py-0 shadow-2xl shadow-slate-950/10 ring-1 ring-foreground/10 backdrop-blur-xl sm:h-[calc(100dvh-2.5rem)] lg:h-[calc(100dvh-3.5rem)]">
        <aside className="hidden w-72 shrink-0 flex-col border-r bg-sidebar/75 p-4 lg:flex">
          <div className="px-1 py-2">
            <Brand />
          </div>
          <Button onClick={newChat} className="mt-5 h-11 w-full justify-start rounded-xl px-3 shadow-md shadow-primary/15">
            <Plus className="size-4" aria-hidden="true" />
            New conversation
          </Button>

          <p className="mb-2 mt-7 px-2 text-[11px] font-semibold uppercase tracking-[0.18em] text-muted-foreground">
            Recent conversations
          </p>
          <nav className="chat-scrollbar min-h-0 flex-1 space-y-1 overflow-y-auto" aria-label="Conversation history">
            {threads.length === 0 ? (
              <p className="rounded-xl border border-dashed px-3 py-4 text-xs leading-5 text-muted-foreground">
                Your conversation history will appear here.
              </p>
            ) : (
              threads.map((thread) => (
                <button
                  key={thread.id}
                  type="button"
                  onClick={() => switchThread(thread.id)}
                  aria-current={thread.id === threadId ? "page" : undefined}
                  className={`flex w-full items-center gap-2.5 rounded-xl px-3 py-2.5 text-left text-sm transition focus-visible:outline-none focus-visible:ring-2 focus-visible:ring-sidebar-ring ${
                    thread.id === threadId
                      ? "bg-sidebar-accent font-medium text-sidebar-accent-foreground shadow-sm"
                      : "text-muted-foreground hover:bg-sidebar-accent/60 hover:text-foreground"
                  }`}
                >
                  <MessageSquare className="size-4 shrink-0" aria-hidden="true" />
                  <span className="truncate">{thread.title}</span>
                </button>
              ))
            )}
          </nav>

          <div className="mt-4 flex items-center gap-3 rounded-2xl border bg-background/55 p-3">
            <div className="grid size-9 place-items-center rounded-xl bg-primary/10 text-primary">
              <BrainCircuit className="size-4" aria-hidden="true" />
            </div>
            <div>
              <p className="text-xs font-semibold">Thread memory</p>
              <p className="text-[11px] text-muted-foreground">Stored by LangGraph</p>
            </div>
          </div>
        </aside>

        <section className="flex min-w-0 flex-1 flex-col" aria-label="Lattice chat">
          <header className="flex h-20 shrink-0 items-center justify-between border-b px-4 sm:px-6">
            <div className="lg:hidden">
              <Brand />
            </div>
            <div className="hidden lg:block">
              <p className="text-sm font-semibold">Active conversation</p>
              <p className="mt-0.5 text-xs text-muted-foreground">Messages stay isolated inside this thread</p>
            </div>
            <div className="flex items-center gap-2 rounded-full border bg-background/70 px-3 py-1.5 text-xs font-medium text-muted-foreground shadow-sm">
              <span className="size-2 rounded-full bg-emerald-500 ring-4 ring-emerald-500/15" />
              <span className="hidden sm:inline">Memory active</span>
              <span className="sm:hidden">Memory</span>
            </div>
          </header>

          <div className="flex gap-2 border-b bg-sidebar/40 p-2 lg:hidden">
            <label htmlFor="mobile-thread" className="sr-only">
              Open a conversation
            </label>
            <select
              id="mobile-thread"
              value={threadId}
              onChange={(event) => switchThread(event.target.value)}
              className="h-10 min-w-0 flex-1 rounded-xl border bg-background px-3 text-sm text-foreground outline-none focus-visible:ring-2 focus-visible:ring-ring"
            >
              {!hasCurrentThread ? <option value={threadId}>New conversation</option> : null}
              {threads.map((thread) => (
                <option key={thread.id} value={thread.id}>
                  {thread.title}
                </option>
              ))}
            </select>
            <Button type="button" size="icon-lg" variant="outline" onClick={newChat} aria-label="Start a new conversation">
              <Plus className="size-4" aria-hidden="true" />
            </Button>
          </div>

          <div
            className="chat-scrollbar flex-1 overflow-y-auto px-4 py-5 sm:px-7 sm:py-7"
            role="log"
            aria-live="polite"
            aria-relevant="additions text"
          >
            {messages.length === 0 ? (
              <EmptyState onPick={setInput} />
            ) : (
              <div className="mx-auto w-full max-w-3xl space-y-6">
                {messages.map((message, index) =>
                  message.role === "tool" ? (
                    <ToolCallBubble
                      key={`tool-${index}`}
                      name={message.name}
                      args={message.args}
                      result={message.result}
                    />
                  ) : (
                    <ChatBubble
                      key={`${message.role}-${index}`}
                      message={message}
                      streaming={loading && message.role === "assistant" && index === messages.length - 1}
                    />
                  )
                )}
                <div ref={bottomRef} />
              </div>
            )}
          </div>

          <div className="shrink-0 border-t bg-card/80 p-3 sm:p-5">
            {error ? (
              <div
                role="alert"
                className="mx-auto mb-3 flex max-w-3xl items-start gap-2 rounded-xl border border-destructive/25 bg-destructive/10 px-3 py-2.5 text-sm text-destructive"
              >
                <CircleAlert className="mt-0.5 size-4 shrink-0" aria-hidden="true" />
                <span>{error}</span>
              </div>
            ) : null}
            <form
              onSubmit={sendMessage}
              className="mx-auto flex max-w-3xl items-center gap-2 rounded-2xl border bg-background/85 p-2 shadow-lg shadow-slate-950/5 transition focus-within:border-primary/45 focus-within:ring-4 focus-within:ring-primary/10"
            >
              <label htmlFor="chat-message" className="sr-only">
                Message Lattice
              </label>
              <Input
                id="chat-message"
                value={input}
                onChange={(e) => setInput(e.target.value)}
                placeholder="Message this conversation…"
                autoComplete="off"
                disabled={loading}
                className="h-11 flex-1 border-0 bg-transparent px-3 text-sm shadow-none focus-visible:ring-0"
              />
              {loading ? (
                <Button type="button" variant="outline" onClick={stop} aria-label="Stop response" className="h-11 rounded-xl px-4">
                  <Square className="size-3.5 fill-current" aria-hidden="true" />
                  <span className="hidden sm:inline">Stop</span>
                </Button>
              ) : (
                <Button
                  type="submit"
                  aria-label="Send message"
                  disabled={!input.trim() || !threadId}
                  className="h-11 rounded-xl px-4 shadow-md shadow-primary/20"
                >
                  <span className="hidden sm:inline">Send</span>
                  <Send className="size-4" aria-hidden="true" />
                </Button>
              )}
            </form>
            <p className="mt-2 text-center text-[11px] text-muted-foreground">
              Lattice can remember earlier messages in this conversation.
            </p>
          </div>
        </section>
      </Card>
    </main>
  );
}

The same thread remembers Alex, a brand-new chat starts blank, and clicking the old thread restores the whole conversation.

What you built

Part 7

A checkpointer wired into the graph with one line, builder.compile(checkpointer=InMemorySaver()), so the graph saves and reloads its own state.
Threads: every conversation is keyed by a thread_id, generated with crypto.randomUUID() and kept in localStorage so it survives refreshes.
The thread wired end to end: the frontend sends thread_id, the backend passes it as the graph's config, and the bot reads its own history before each reply.
A New conversation button that mints a fresh thread, and a sidebar that lists past chats and reopens them via a GET /threads/{id}/messages endpoint reading from the checkpointer.
A clear-eyed view of the limit: InMemorySaver lives in RAM, so every conversation is wiped on restart, the problem Part 8 leaves you ready to solve.

Test yourself

Score ··

Why did the Part 6 bot forget your name between two messages?

What is a thread_id?

After adding the checkpointer, the server crashed with ValueError: Checkpointer requires one or more of the following 'configurable' keys. Why?

With memory on, why does the backend still send only the newest message to the graph, not the whole history?

The series uses InMemorySaver. What happens to every conversation when the server restarts?

Commit it, from the project root, in a terminal that isn't hosting a server:

BASH

git add .
git commit -m "part 7: give the bot conversation memory with a checkpointer and threads"

Your bot remembers you now, runs tools, and streams its answers, the whole thing humming on your laptop. There's one wall left: it only exists on your machine. In Part 8 you'll put it on the public internet, share the link, and watch it forget everyone the first time you redeploy, which turns out to be the perfect reason to care about everything this part taught you.

The complete, tested code for this part lives in part-07-memory in the companion repo. Code blocks with a GitHub icon link straight to the exact file; "View full file" shows the whole file in place with this section's changes highlighted.