Quick Reference
| Workflows | Agents | |
|---|---|---|
| Free | 2 concurrent runs | 5 concurrent interactions |
| Pro | 5 concurrent runs | 25 concurrent interactions |
| Enterprise | 15 concurrent runs (customizable) | 100 concurrent interactions (customizable) |
| When limit is hit | Queued (Enterprise) or rejected | Queued (Enterprise) or rejected |
| Scope | Organization-wide (Pro & Enterprise) | Organization-wide (Pro & Enterprise) |
Once a workflow run or agent interaction finishes, it frees up a slot for the next one.
How Rate Limits Work
Every time you start a workflow or agent interaction, the system checks whether your organization has an available slot. If it does, the execution starts immediately. If all slots are in use, the request is either queued or rejected depending on your plan.Request arrives
A workflow is triggered or an agent interaction starts — whether from the UI, API, Slack, Teams, or a scheduled trigger.
Slot check
The system checks how many executions are currently running across your organization against your concurrency limit.
Workflow Rate Limits
Workflow rate limits control how many workflow runs can execute at the same time across your organization.Limits by Plan
| Plan | Concurrent Workflow Runs |
|---|---|
| Free | 2 |
| Pro | 5 |
| Enterprise | 15 (default, customizable per org) |
What Counts Toward the Limit
Counts
- Any workflow run that is currently executing
- Runs triggered via UI, API, webhooks, or scheduled triggers
Does NOT Count
- Runs that have finished (completed, failed, or terminated)
- Runs that are queued (waiting for a slot)
- Subflows (nested workflow runs) — only the parent run counts
How the Limit Is Determined
The system determines your organization’s limit in this order:- Custom organization limit — If your org admin or account manager has set a custom concurrency limit, that value is used.
- Plan default — Otherwise, the default for your subscription plan applies (see table above).
- Per-user cap (optional) — Org admins can set a per-user limit on specific user groups. This prevents any single user from using all the org’s slots.
Example: Per-User Caps Within an Organization
Example: Per-User Caps Within an Organization
Setup:
- Organization concurrency limit: 15
- A user group has a per-user cap of 5
- The organization can have up to 15 workflow runs at once across all members
- Any user in that group can have at most 5 of those 15 running at once
- Both limits must be satisfied for a new run to start
When You Hit the Workflow Limit
- Enterprise (Queued)
- Free & Pro (Rejected)
When an Enterprise organization hits the limit:
- The run is saved as queued
- When another run finishes, the queued run automatically starts
- Queued runs are processed in the order they were submitted
The queue can hold a very large number of items. If the queue is full, the request is rejected — but this is extremely unlikely in practice.
Agent Rate Limits
Agent rate limits control how many agent interactions can run at the same time across your organization. Agents have higher default limits than workflows because interactions tend to be conversational and can run longer.Limits by Plan
| Plan | Concurrent Agent Interactions |
|---|---|
| Free | 5 |
| Pro | 25 |
| Enterprise | 100 (default, customizable per org) |
What Counts Toward the Limit
Counts
- Any agent interaction that is currently running (actively processing a request or executing tools)
- Interactions from all channels: web UI, Slack, Teams, API, and triggers
Does NOT Count
- Interactions that have finished (completed or failed)
- Interactions that are queued (waiting for a slot)
- Interactions that are idle (conversation open but agent is not actively working)
- The Pipeline Builder AI assistant (the in-editor helper for building workflows)
- The Custom Operator Builder AI assistant
- The Gumstack AI assistant
Builder assistants (Pipeline Builder, Custom Operator Builder, Gumstack) are excluded from agent rate limits so that building and editing workflows is never blocked by agent concurrency.
How the Limit Is Determined
Agent limits are determined the same way as workflow limits:- Custom organization limit — A custom agent concurrency limit on the org, if set.
- Plan default — Falls back to the plan-based default shown above.
- Per-user cap (optional) — An optional cap on a user group that restricts how many interactions a single user can run.
Automatic Slot Cleanup
If an agent interaction crashes or is abandoned without finishing properly, the slot it was using is automatically freed after approximately 2 hours. This means you will never get permanently stuck at your limit due to a stuck interaction. Under normal circumstances, slots are released immediately when an interaction finishes — the 2-hour cleanup is only a safety net.What Happens When You’re Rate Limited
How the system communicates rate limiting depends on where the interaction comes from:Web UI
When chatting with an agent in the browser:- Enterprise: The UI shows a “queued” indicator and waits up to 5 minutes for a slot to open. If a slot opens, the interaction starts automatically. If not, you are notified.
- Free & Pro: An error message is shown: “Too many concurrent agent interactions. Please try again shortly.”
API
When starting an agent or workflow via the API:- Enterprise: If the limit is hit, the request is queued. The API returns the interaction with a
queuedstatus and aqueue_positionso you can track progress. - Free & Pro: The API returns HTTP 429 (Too Many Requests).
API Response Examples
API Response Examples
Queued (Enterprise):Rate Limited (Free & Pro):
Slack
When an agent receives a Slack message:- Enterprise: If queued, a message is posted in the thread: “Your agent interaction is queued (position #N). It will be processed shortly.”
- Free & Pro: An ephemeral message is sent: “You have too many agent interactions running at the moment. Please try again shortly.”
Microsoft Teams
Follows the same pattern as Slack — Enterprise users see a queued notification, others see an error message.Scheduled & Event-Based Triggers
- Enterprise: If the limit is hit, the triggered interaction is queued and will start when capacity is available.
- Free & Pro: The trigger execution is skipped.
How the Queue Works
When an Enterprise organization hits the rate limit, requests are placed in a queue instead of being rejected.- When a running execution finishes and frees a slot, the next queued item starts automatically
- The user who freed the slot gets slight priority — their own queued items are checked first, then the org-wide queue
- Within each queue, items are processed in the order they were submitted (first in, first out)
- If a queued item can’t start because of a per-user cap, it is moved to that user’s individual queue (not dropped)
Tips for Avoiding Rate Limits
Stagger Your Triggers
If you’re triggering many workflows at once (e.g., via webhooks), add small delays between them so they don’t all compete for slots at the same time.
Keep Runs Short
Shorter workflow runs and agent interactions free up slots faster. Optimize long-running workflows to reduce their total execution time.
Use Subflows
Subflows (nested workflow runs) don’t count against your concurrency limit. Break large workflows into a parent with subflows to make better use of your slots.
Upgrade Your Plan
If you consistently hit limits, consider upgrading. Enterprise plans have the highest limits and also unlock automatic queuing so requests are never lost.
Enterprise: Request a Custom Limit
Enterprise: Request a Custom Limit
Enterprise organizations can request custom concurrency limits that exceed the defaults. Contact your account manager or support@gumloop.com to adjust:
- Workflow concurrency — the org-wide limit on simultaneous workflow runs
- Agent concurrency — the org-wide limit on simultaneous agent interactions
- Per-user caps via user groups — restrict individual users within the org
Summary
| Aspect | Workflows | Agents |
|---|---|---|
| What’s limited | Concurrent workflow runs | Concurrent agent interactions |
| Limit scope | Organization-wide | Organization-wide |
| Enterprise queuing | Yes (automatic) | Yes (automatic) |
| Free & Pro behavior | Rejected (HTTP 429) | Rejected (error message or HTTP 429) |
| Stuck slot recovery | Periodic cleanup | Automatic (~2 hours) |
| Customizable limits | Yes (Enterprise) | Yes (Enterprise) |
| Per-user caps | Yes (via user groups) | Yes (via user groups) |
Credits
Learn about credit costs for workflows and agents
User Groups
Configure user groups and per-user limits
