Databricks is a unified analytics platform for data engineering, data science, and machine learning. The Databricks MCP server lets you manage clusters, run jobs, execute SQL, and query ML endpoints using natural language.Documentation Index
Fetch the complete documentation index at: https://docs.gumloop.com/llms.txt
Use this file to discover all available pages before exploring further.
What Can It Do?
- Manage clusters by listing, starting, and terminating on demand
- Orchestrate jobs by triggering runs and fetching outputs
- Run SQL on warehouses and return structured data
- Query ML endpoints and vector indexes for AI workflows
Where to Use It
In Agents (Recommended)
Add Databricks as a tool to any agent. The agent can then interact with your workspace conversationally, choosing the right actions based on context. To add an MCP tool to your agent:- Open your agent’s configuration
- Click Add tools → Connect an app with MCP
- Search for the integration and select it
- Authenticate with your account
In Workflows (Via Agent Node)
For automated pipelines, use an Agent Node with Databricks tools. This gives you the flexibility of an agent within a deterministic workflow.As a Custom MCP Node
You can also create a standalone MCP node for a specific action. This generates a reusable node that performs one task, useful when you need the same operation repeatedly in workflows.- Go to your node library and search for the integration
- Click Create a node with AI
- Describe the specific action you want (e.g., “List all active clusters”)
- Test the node and save it for reuse
Custom MCP nodes are single-purpose by design. For tasks that require multiple steps or dynamic decision-making, use an agent instead.
Available Tools
| Tool | Description |
|---|---|
| Get Me | Get authenticated user information |
| List Clusters | List all pinned and active clusters |
| Start Cluster | Start a terminated cluster |
| Terminate Cluster | Terminate a running cluster |
| List Jobs | List jobs with pagination |
| Run Job | Trigger a new job run |
| Manage Job Run | Cancel or delete a job run |
| Get Job Run Output | Get output from a job run |
| Execute SQL | Run SQL on a warehouse |
| List Warehouses | List all SQL warehouses |
| Query Serving Endpoint | Query a model serving endpoint |
| List Serving Endpoints | List all serving endpoints |
| Query Vector Index | Query a vector index |
| List Vector Search Endpoints | List vector search endpoints |
Example Prompts
Use these with your agent or in the Agent Node: Manage clusters:Troubleshooting
| Issue | Solution |
|---|---|
| Agent not finding the right data | Use specific cluster or job names |
| Action not completing | Check that you’ve authenticated and have the necessary workspace permissions |
| Unexpected results | The agent may chain multiple tools (e.g., listing jobs first, then running one). Review the agent’s reasoning to understand its approach. |
| Tool not available | Verify the tool is enabled in your agent’s MCP configuration |
Need Help?
- Agents documentation for setup and best practices
- Agent Node guide for workflow integration
- Gumloop Community for questions and examples
- Contact support@gumloop.com for assistance
Use this integration directly in Claude or Cursor. Connect remotely via the Databricks MCP server using your Gumloop credentials.
