AI Agent doesn't store the Tool usages in memory #14361

Open
fjrdomingues opened this issue Apr 2, 2025 · 19 comments

@fjrdomingues

Bug Description

The current implementation of the AI Agent and the Memory nodes stores only the input and output messages, not the Tool messages.

Why is this important?
Have you noticed the agent claiming it called a tool when it didn't? Models have their flaws, but this problem greatly aggravates them. The context window fills with exchanges where the user asks for an action, the AI replies with success, and the user responds with positive feedback. Without the tool messages, the LLM learns this pattern and repeats it, so the next time it won't call a tool and will instead just reply directly to the user.

I have a fork with a working suggestion on how to fix it: fjrdomingues@1af9450
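
For illustration, a minimal sketch of the underlying LangChain JS pattern (the wiring here is assumed for the example, not n8n's actual code): the memory only ever receives the final input/output pair, so the intermediate tool messages are dropped.

// Minimal sketch, assuming LangChain JS BufferMemory semantics.
// saveContext persists only the input/output pair, so the intermediate
// AIMessage carrying tool_calls and the matching ToolMessage from the agent
// loop never reach memory.
import { BufferMemory } from "langchain/memory";

const memory = new BufferMemory({ returnMessages: true });

// Roughly what happens after each agent run:
await memory.saveContext(
  { input: "Please save this contact" },      // stored as a HumanMessage
  { output: "Done, the contact was saved." }, // stored as the final AIMessage
);

// The tool-call request and the tool's result were never passed in, so the
// next turn's history shows only "user asks -> assistant claims success".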

To Reproduce

Use the Simple Memory node or the Postgres one (the only ones I tested) and inspect the messages that were saved. The tool calls are always an empty array, both on save and on load.

Expected behavior

The tool_calls array should be populated when saving memories.
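
For illustration (shapes assumed from LangChain's StoredMessage serialization; the values are made up), here is roughly what gets saved today versus what should be saved:

// What the memory nodes persist today: tool_calls is always empty.
const savedToday = {
  type: "ai",
  data: { content: "I created the contact for you.", tool_calls: [] },
};

// What should be persisted so later turns can see the call and its result:
const expected = [
  {
    type: "ai",
    data: {
      content: "",
      tool_calls: [{ id: "call_1", name: "create_contact", args: { name: "Ada" } }],
    },
  },
  {
    type: "tool",
    data: { content: '{"contactId":"c_123"}', tool_call_id: "call_1" },
  },
];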

Operating System

NA

n8n Version

1.83.2

Node.js Version

20.18.3

Database

PostgreSQL

Execution mode

main (default)

@Joffcom
Member

Joffcom commented Apr 2, 2025

Hey @fjrdomingues,

We have created an internal ticket to look into this which we will be tracking as "GHC-1434"

@Joffcom Joffcom added the in linear Issue or PR has been created in Linear for internal review label Apr 2, 2025
@Joffcom
Member

Joffcom commented Apr 2, 2025

Hey @fjrdomingues,

Is this a bug or an enhancement request?

@Joffcom Joffcom added the Needs Feedback Waiting for further input or clarification. label Apr 2, 2025
@fjrdomingues
Author

Hey @Joffcom, if there's such a category then it may fit better as an enhancement request.

@Joffcom Joffcom removed the Needs Feedback Waiting for further input or clarification. label Apr 2, 2025
@davidsula

I totally agree. I want my AI to remember the output of a tool call, which currently stays empty. For now, agents only seem to remember tool outputs within the turn in which the tool was called.
AND YOU ARE SO RIGHT. My AI agent will call the tool the first two times it is meant to, and then eventually just stop when it should still be calling it, because it learns from the pattern that there were no tool calls before, even though there were.

@GuillaumeRoy

I'd suggest this should rank much higher than "enhancement request". Consider this scenario:

  • You're interacting with an agent. It creates a record in a data store at your request via a tool. Let's say, saving a contact. This returns a contact id.
  • N messages later in the memory window, you ask the AI to add an email address to that contact.
  • With the current state of things, it has no knowledge of the id returned by the tool and will require a search first to reacquire that id.

When working directly with langchain, that tool output including the id would be in the memory context and no round-trip would be necessary.

This is significantly holding back agent tool usage via n8n, IMHO. Working with langchain directly this was never an issue, though persisting tool output does introduce concerns around managing tool verbosity versus filling the context with a lot of tokens.
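
For comparison, a sketch of the history plain LangChain JS would keep in this scenario (the message classes are real; save_contact, update_contact, and the ids are hypothetical):

import { HumanMessage, AIMessage, ToolMessage } from "@langchain/core/messages";

const history = [
  new HumanMessage("Save John Doe as a contact"),
  new AIMessage({
    content: "",
    tool_calls: [{ id: "call_1", name: "save_contact", args: { name: "John Doe" } }],
  }),
  new ToolMessage({ content: '{"contactId":"c_123"}', tool_call_id: "call_1" }),
  new AIMessage("Saved John Doe."),
  // ... N messages later ...
  new HumanMessage("Add john@example.com to that contact"),
];

// Because the ToolMessage with contactId "c_123" is still in the window, the
// model can call update_contact for that id directly. n8n's memory drops the
// two middle messages, forcing the extra search round-trip described above.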

@Merlin-Richter

Yes, this is a bug for sure.

Many applications of AI agents don't work at the moment because of this issue, and many more problems come downstream from it.

Multi-turn Agents with tools are currently not working.

@GuillaumeRoy

Right now I'm working around this by having the tools manually insert into memory using the memory manager, but it's an ugly patch with many pitfalls.

Not sure what the proper etiquette is here to get an update and/or more eyes on this?

@GuillaumeRoy

Hi @Joffcom,

Just want to bring this to your attention. In addition to the example I've given above, I am now seeing multiple models hallucinating tool calls due to this shortcoming.

  • User request leads to a tool call.
  • Agent executes tool call (nothing persisted to memory, neither input nor output).
  • Agent replies to user to indicate tool was called.
  • User requests another tool call.
  • Agent does not see the prior tool call's input and output, only the conversation about it. From that pattern, it learns that replying about a tool call (without making it) is sufficient to satisfy the user request.
  • Agent replies to the user to indicate tool was called, without having actually called the tool, as this fits with the precedent established in its memory.

IMHO this is totally kneecapping n8n's agents when compared to straight Langchain/Langgraph implementations.

@civilcoder55

Totally agree with you @GuillaumeRoy

@davidsula

Hi @Joffcom,

Just want to bring this to your attention. In addition to the example I've given above, I am now seeing multiple models hallucinating tool calls due to this shortcoming.

  • User request leads to a tool call.
  • Agent executes tool call (nothing persisted to memory, neither input nor output).
  • Agent replies to user to indicate tool was called.
  • User requests another tool call.
  • Agent does not see the prior tool call's input and output, only the conversation about it. From that pattern, it learns that replying about a tool call (without making it) is sufficient to satisfy the user request.
  • Agent replies to the user to indicate tool was called, without having actually called the tool, as this fits with the precedent established in its memory.

IMHO this is totally kneecapping n8n's agents when compared to straight Langchain/Langgraph implementations.

A perfect explanation of the issue. Alongside this, it would be better for the AI to remember the output of a tool call: the tool output can contain information the model doesn't include in its initial reply but that may be relevant later on.

@caiosa1337

I encountered a similar issue while building an appointment scheduling system via API using an AI agent in n8n. At one point, the agent would call a tool that returned crucial data like available staff IDs and time slots. Initially, everything worked fine — the AI had access to those values in the current turn and could make correct suggestions.

The problem started when the customer confirmed an option in a later turn. At that point, the AI no longer had access to the previous tool response, and it started "making up" IDs and times because that data was no longer available in context.

The root cause is that tool outputs are not automatically persisted or injected into the prompt context across turns. And since n8n lets us configure the memory window, even if the tool response were saved to memory, it could be forgotten quickly as the conversation grows.

My solution was to manually inject all essential data (like IDs, times, names) into the prompt so it would remain available across turns. This worked and made the system more reliable — but it increased prompt complexity and maintenance.

It would be extremely useful if n8n provided an option to persist tool outputs directly into the prompt context, as a kind of “prompt extension,” without relying solely on memory. That would reduce complexity and help avoid fragile behavior in multi-turn flows.
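
A minimal sketch of that workaround (a hypothetical helper, not an n8n feature): pin the essential tool outputs and re-inject them into the system prompt on every turn, so they survive regardless of the memory window.

interface PinnedFact {
  key: string;
  value: string;
}

// Re-build the system prompt each turn from the pinned facts.
function buildSystemPrompt(base: string, facts: PinnedFact[]): string {
  if (facts.length === 0) return base;
  const pinned = facts.map((f) => `- ${f.key}: ${f.value}`).join("\n");
  return `${base}\n\nKnown values from earlier tool calls (authoritative, never invent):\n${pinned}`;
}

// After the scheduling tool returns, store the essentials and re-inject them:
const prompt = buildSystemPrompt("You are a scheduling assistant.", [
  { key: "staffId", value: "st_42" },
  { key: "slot", value: "2025-05-02T10:00" },
]);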

@Merlin-Richter

This needs to be fixed asap. I am currently doing a workaround with normal HTTP requests and firestore. It's so ugly, lol.
But the AI Agent is currently unusable with tools.

@gradox2020

Without this fix, Agent AI with tools is not usable in real workflows. The agent does not retain tool output and therefore cannot reliably act on previous results. Please consider increasing the priority of this issue. Thank you!

@kshwetabh

Right now I'm working around this by having the tools manually insert into memory using the memory manager, but it's an ugly patch with many pitfalls.

Not sure what the proper etiquette is here to get an update and/or more eyes on this?

Can you please provide more details on how you achieved this using the memory manager? I tried but failed pathetically.

@GuillaumeRoy

Using Redis memory and workflow tool nodes, at the end of the subworkflow I was injecting a message directly into the redis context using the memory manager insert functionality. It kinda sucks because 1) limited applicability 2) manual 3) still can lead to hallucinations 4) message ordering and type is not exactly what it should be.

I ended up implementing my own memory layer as a stopgap and I'm moving serious agent use cases away from n8n 💔
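
Roughly what that Memory Manager insert amounts to at the LangChain JS level (assumed setup; the session id, URL, and message text are illustrative):

import { RedisChatMessageHistory } from "@langchain/redis";
import { AIMessage } from "@langchain/core/messages";

const history = new RedisChatMessageHistory({
  sessionId: "chat-123", // must match the session key the agent's memory uses
  config: { url: "redis://localhost:6379" },
});

// Appended as a plain AIMessage rather than a proper tool-call/ToolMessage
// pair, which is why the message ordering and type end up not exactly what
// they should be.
await history.addMessage(
  new AIMessage('Tool save_contact returned: {"contactId":"c_123"}'),
);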

@Nervo24

Nervo24 commented May 15, 2025

Using Redis memory and workflow tool nodes, at the end of the subworkflow I was injecting a message directly into the redis context using the memory manager insert functionality. It kinda sucks because 1) limited applicability 2) manual 3) still can lead to hallucinations 4) message ordering and type is not exactly what it should be.

I ended up implementing my own memory layer as a stopgap and I'm moving serious agent use cases away from n8n 💔

I tried the same with PostgreSQL, but when inserting the data into memory at the end of the sub-workflow, the tool response ends up inserted before the user prompt.

@ptr-bloch

Is there any movement on this?
The problem runs a bit deeper than what's described here about the model learning from previous messages: some models call the same tool multiple times, once with each new user request, perhaps because they see that the previous conversation doesn't mention the tool usage required by the system prompt. So instead of one call, I get extra calls with each agent execution.

@GuillaumeRoy

@ptr-bloch I went out on a limb and reached out to a member of the n8n team directly over LinkedIn on May 2nd to bring this issue and thread to their attention, and learned that "AI Squad (...) already looking into it and discussing".

@aouicher

aouicher commented May 31, 2025

I've tested different LLM providers, and there seems to be an issue when the provider responds with:

[
    {
      "response": {
        "generations": [
          [
            {
              "text": "bla bla bla bla bla bla bla bla bla bla",
              "generationInfo": {
                "prompt": 0,
                "completion": 0,
                "finish_reason": "tool_calls",
                "system_fingerprint": "fp",
                "model_name": "custom_model"
              }
            }
          ]
        ]
      },
      "tokenUsage": {
        "completionTokens": 328,
        "promptTokens": 2852,
        "totalTokens": 3180
      }
    }
  ]

It seems to work when the response looks like this instead:

[
    {
      "response": {
        "generations": [
          [
            {
              "text": "bla bla bla bla bla bla bla bla bla bla",
              "generationInfo": {
                "finish_reason": "tool_calls"
              }
            }
          ]
        ]
      },
      "tokenUsage": {
        "completionTokens": 328,
        "promptTokens": 2852,
        "totalTokens": 3180
      }
    }
  ]

One other thing: for each failing call to the chat model, the only memory action afterwards is loadMemoryVariables; saveContext is never called. When it works, saveContext is called.
