Exa MCP | API Key Exa connector for AI agents

Tools your research agent reaches for on Exa, scoped per user.

CALL ANY TOOL

Run semantic and keyword searches, extract page contents, and find similar URLs across the web.

exa_answer

Answer

Get a natural language answer to a question by searching the web with Exa and synthesizing results. Returns a direct answer with citations to the source pages. Ideal for factual questions, current events, and research queries. Rate limit: 60 requests/minute.

Parameters

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| query | string | Required | The question or query to answer from web sources. |
| exclude_domains | array | Optional | JSON array of domains to exclude from answer sources. |
| include_domains | array | Optional | JSON array of domains to restrict source search to. Example: ["reuters.com","bbc.com"] |
| include_text | boolean | Optional | When true, also returns the source page text alongside the synthesized answer. |
| num_results | integer | Optional | Number of web sources to use when generating the answer (1–20). More sources improve accuracy but cost more credits. |
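The parameter list above can be checked client-side before a call is made. The following is a minimal sketch, assuming the arguments are passed as a plain object; the `ExaAnswerArgs` interface and `validateAnswerArgs` helper are hypothetical names introduced for illustration, not part of any SDK.

```typescript
// Hypothetical argument shape for exa_answer, mirroring the documented parameters.
interface ExaAnswerArgs {
  query: string;
  exclude_domains?: string[];
  include_domains?: string[];
  include_text?: boolean;
  num_results?: number;
}

// Returns a list of problems; an empty list means the arguments look valid.
function validateAnswerArgs(args: ExaAnswerArgs): string[] {
  const problems: string[] = [];
  if (args.query.trim().length === 0) {
    problems.push("query is required");
  }
  if (args.num_results !== undefined) {
    const n = args.num_results;
    // The documented range for exa_answer is 1 to 20 sources.
    if (!Number.isInteger(n) || n < 1 || n > 20) {
      problems.push("num_results must be an integer from 1 to 20");
    }
  }
  return problems;
}

const answerArgs: ExaAnswerArgs = {
  query: "Which countries ratified the treaty this year?",
  include_domains: ["reuters.com", "bbc.com"],
  num_results: 5,
};

console.log(validateAnswerArgs(answerArgs)); // prints []
```

Validating locally is optional; it only saves a round trip (and a rate-limited request) when an argument is obviously out of range.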

exa_crawl

Crawl

exa_delete_webset

Delete Webset

exa_find_similar

Find Similar

exa_get_webset

Get Webset

exa_list_webset_items

List Webset Items

exa_list_websets

List Websets

exa_research

Research

exa_search

Search the web using Exa's AI-powered semantic or keyword search engine. Supports filtering by domain, date range, content category, and result type. Optionally returns page text, highlights, or summaries alongside search results. Rate limit: 60 requests/minute.

Parameters

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| query | string | Required | The search query. For neural/auto type, natural language works best. For keyword type, use specific terms. |
| category | string | Optional | Restrict results to a specific content category. |
| end_crawl_date | string | Optional | Only return pages crawled (discovered) before this date. ISO 8601 format. |
| end_published_date | string | Optional | Only return pages published before this date. ISO 8601 format: YYYY-MM-DDTHH:MM:SS.000Z |
| exclude_domains | array | Optional | JSON array of domains to exclude from results. Example: ["reddit.com","quora.com"] |
| include_domains | array | Optional | JSON array of domains to restrict results to. Example: ["techcrunch.com","wired.com"] |
| include_highlights | boolean | Optional | When true, returns relevant text snippets from each result page. |
| include_summary | boolean | Optional | When true, returns an LLM-generated summary for each result page. |
| include_text | boolean | Optional | When true, returns the full text content of each result page (up to max_characters). |
| max_age_hours | integer | Optional | Maximum age of cached content in hours. 0 fetches fresh content; -1 always uses cache; omit for fallback. Max 720. |
| max_characters | integer | Optional | Maximum characters of page text to return per result when include_text is true. Defaults to 3000. |
| moderation | boolean | Optional | When true, enables content moderation to filter unsafe content from results. |
| num_results | integer | Optional | Number of results to return (1–100). Defaults to 10. |
| start_crawl_date | string | Optional | Only return pages crawled (discovered) after this date. ISO 8601 format. |
| start_published_date | string | Optional | Only return pages published after this date. ISO 8601 format: YYYY-MM-DDTHH:MM:SS.000Z |
| system_prompt | string | Optional | Additional instructions that guide generated output, source preferences, or agent behavior. |
| type | string | Optional | Search type: 'neural' for semantic AI search (best for natural language), 'keyword' for exact-match keyword search, 'auto' to let Exa decide. |
| use_autoprompt | boolean | Optional | When true, Exa automatically rewrites the query to be more semantically effective. |
| user_location | string | Optional | Two-letter ISO country code of the user, used to localize results, e.g. US, GB, DE. |
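As a rough illustration of how these parameters combine, the sketch below builds an `exa_search` argument object with an ISO 8601 date filter and checks the documented numeric ranges. The `ExaSearchArgs` interface, `daysBefore`, and `validateSearchArgs` are hypothetical helpers; treating -1 to 720 as the valid `max_age_hours` range and requiring an uppercase country code are assumptions read off the descriptions above.

```typescript
// Hypothetical argument shape covering a subset of the exa_search parameters.
interface ExaSearchArgs {
  query: string;
  type?: "neural" | "keyword" | "auto";
  num_results?: number;
  start_published_date?: string;
  include_domains?: string[];
  include_text?: boolean;
  max_characters?: number;
  max_age_hours?: number;
  user_location?: string;
}

// Date.prototype.toISOString() yields YYYY-MM-DDTHH:MM:SS.sssZ in UTC.
function daysBefore(now: Date, days: number): string {
  return new Date(now.getTime() - days * 24 * 60 * 60 * 1000).toISOString();
}

// Returns a list of problems; an empty list means the arguments look valid.
function validateSearchArgs(args: ExaSearchArgs): string[] {
  const problems: string[] = [];
  const n = args.num_results;
  if (n !== undefined && (!Number.isInteger(n) || n < 1 || n > 100)) {
    problems.push("num_results must be an integer from 1 to 100");
  }
  const age = args.max_age_hours;
  // Assumption: -1 (always cache) through 720 is the accepted range.
  if (age !== undefined && (!Number.isInteger(age) || age < -1 || age > 720)) {
    problems.push("max_age_hours must be an integer from -1 to 720");
  }
  const loc = args.user_location;
  // Assumption: codes are uppercase, as in the documented examples.
  if (loc !== undefined && !/^[A-Z]{2}$/.test(loc)) {
    problems.push("user_location must be a two-letter country code");
  }
  return problems;
}

const fixedNow = new Date("2025-06-15T12:00:00.000Z");
const searchArgs: ExaSearchArgs = {
  query: "recent advances in battery recycling",
  type: "auto",
  num_results: 5,
  start_published_date: daysBefore(fixedNow, 30),
  include_text: true,
  max_characters: 3000,
  user_location: "US",
};

console.log(searchArgs.start_published_date); // prints 2025-05-16T12:00:00.000Z
```

A fixed `now` is used here only to keep the example deterministic; in practice `new Date()` would be passed instead.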

exa_websets

Websets

Build your Agent

Drop the toolkit in, point it at the user, and your research agent can use Exa from the first run.

// LangChain (LangGraph): wrap each scoped Exa tool as a DynamicStructuredTool.
// Assumes envUrl, clientId, clientSecret, and llm (a LangChain chat model) are defined elsewhere.
import { ScalekitClient } from "@scalekit-sdk/node";
import { DynamicStructuredTool } from "@langchain/core/tools";
import { createReactAgent } from "@langchain/langgraph/prebuilt";
import { z } from "zod";

const sk = new ScalekitClient(envUrl, clientId, clientSecret);

// Fetch only the Exa tools this user is allowed to call.
const { tools } = await sk.tools.listScopedTools("user_123", {
  filter: { connectionNames: ["exa"], toolNames: ["exa_search", "exa_find_similar", "exa_get_contents"] },
  pageSize: 100,
});

const lcTools = tools.map((t) => new DynamicStructuredTool({
  name: t.tool.definition.name,
  description: t.tool.definition.description,
  // Accept any arguments here; the tool call is executed on the user's behalf below.
  schema: z.object({}).passthrough(),
  func: async (args) => {
    const { data } = await sk.tools.executeTool({
      toolName: t.tool.definition.name,
      identifier: "user_123",
      params: args,
    });
    return JSON.stringify(data);
  },
}));

const agent = createReactAgent({ llm, tools: lcTools });

// OpenAI (Responses API): expose each scoped Exa tool as a function tool.
// Assumes envUrl, clientId, clientSecret, and prompt are defined elsewhere.
import { ScalekitClient } from "@scalekit-sdk/node";
import OpenAI from "openai";

const sk = new ScalekitClient(envUrl, clientId, clientSecret);
const openai = new OpenAI(); // reads OPENAI_API_KEY from the environment

const { tools } = await sk.tools.listScopedTools("user_123", {
  filter: { connectionNames: ["exa"], toolNames: ["exa_search", "exa_find_similar", "exa_get_contents"] },
  pageSize: 100,
});

// The Responses API takes function tools in a flat shape (no nested "function" key).
const llmTools = tools.map((t) => ({
  type: "function" as const,
  name: t.tool.definition.name,
  description: t.tool.definition.description,
  parameters: t.tool.definition.input_schema,
  strict: false,
}));

const resp = await openai.responses.create({
  model: "gpt-4o", input: prompt, tools: llmTools,
});

// Anthropic (Messages API): pass each scoped Exa tool with its input_schema.
// Assumes envUrl, clientId, clientSecret, and prompt are defined elsewhere.
import { ScalekitClient } from "@scalekit-sdk/node";
import Anthropic from "@anthropic-ai/sdk";

const sk = new ScalekitClient(envUrl, clientId, clientSecret);
const anthropic = new Anthropic(); // reads ANTHROPIC_API_KEY from the environment

const { tools } = await sk.tools.listScopedTools("user_123", {
  filter: { connectionNames: ["exa"], toolNames: ["exa_search", "exa_find_similar", "exa_get_contents"] },
  pageSize: 100,
});

const llmTools = tools.map((t) => ({
  name: t.tool.definition.name,
  description: t.tool.definition.description,
  input_schema: t.tool.definition.input_schema,
}));

const msg = await anthropic.messages.create({
  model: "claude-sonnet-4-6", max_tokens: 1024,
  tools: llmTools,
  messages: [{ role: "user", content: prompt }],
});
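The Anthropic snippet stops once the model has replied. A minimal sketch of the next step, under stated assumptions, is shown here: pull the `tool_use` blocks out of the reply and wrap each executed result as a `tool_result` block for the follow-up message. The block shapes follow the Messages API; `extractToolCalls`, `toToolResult`, and the sample data are hypothetical, and the actual execution (for example through `sk.tools.executeTool`) is replaced by a stub value.

```typescript
// Local copies of the content block shapes returned by the Anthropic Messages API.
type ContentBlock =
  | { type: "text"; text: string }
  | { type: "tool_use"; id: string; name: string; input: unknown };

interface ToolCall {
  id: string;
  name: string;
  input: unknown;
}

interface ToolResultBlock {
  type: "tool_result";
  tool_use_id: string;
  content: string;
}

// Collect every tool_use block the model emitted, in order.
function extractToolCalls(content: ContentBlock[]): ToolCall[] {
  const calls: ToolCall[] = [];
  for (const block of content) {
    if (block.type === "tool_use") {
      calls.push({ id: block.id, name: block.name, input: block.input });
    }
  }
  return calls;
}

// Wrap executed tool output so it can be sent back in a follow-up user message.
function toToolResult(call: ToolCall, data: unknown): ToolResultBlock {
  return { type: "tool_result", tool_use_id: call.id, content: JSON.stringify(data) };
}

// Stand-in for msg.content; the values are invented for illustration.
const sampleContent: ContentBlock[] = [
  { type: "text", text: "Searching the web." },
  { type: "tool_use", id: "toolu_01", name: "exa_search", input: { query: "solid-state batteries" } },
];

const calls = extractToolCalls(sampleContent);
// In a real loop, the data would come from executing each call for the user.
const toolResults = calls.map((call) => toToolResult(call, { results: [] }));

console.log(toolResults[0].tool_use_id); // prints toolu_01
```

The resulting blocks would be sent back as the `content` of a `user` message, after the assistant turn that requested the tools.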

// Google ADK: connect to the hosted Exa MCP endpoint over streamable HTTP.
// Assumes userScopedToken holds a token already issued for the current user.
import { Agent } from "@google/adk/agents";
import {
  MCPToolset, StreamableHTTPConnectionParams,
} from "@google/adk/tools/mcp";

const toolset = new MCPToolset({
  connectionParams: new StreamableHTTPConnectionParams({
    url: "https://mcp.scalekit.com/exa",
    headers: { Authorization: `Bearer ${userScopedToken}` },
  }),
});

const agent = new Agent({
  name: "agent", model: "gemini-2.0-flash",
  tools: await toolset.getTools(),
});

Try these prompts

Paste any prompt into your agent to start using Exa.

Web & news

Search for [topic] published this month.

Find the latest papers on [research area].

Search news from [domain] this week.

Get the content of [URL].

Research & sourcing

Find pages similar to [URL].

Search case studies on [industry trend].

Pull recent mentions of [company] online.

Find [competitor] product launches this quarter.

Deep extraction

Search and extract full text: [query].

Find top 5 results on [topic] with highlights.

Get contents of these URLs: [url1], [url2].

Search [keyword] restricted to [domain].

SEE HOW AUTH WORKS

Users authorize Exa once. Their credentials stay vaulted, every call is checked, and every action is logged.

Authorize

Your user connects Exa once. We tie it to their identity and the access they approved: no shared bot account, no org-wide access.

Who: user ‘A’

When: once per user

Access: limited to user

Store

Their Exa token lives in a vault scoped to them. User A's credentials are never reachable by an agent acting for user B, even on the same connection.

Vault: encrypted

Scope: per-user

Tokens: auto-refreshed

Resolve

When your agent calls an Exa tool, we fetch the right token server-side. It never touches your agent, never appears in the LLM context, and never shows up in your logs.

Speed: ~40ms

Check: before every call

Seen by: nobody

Audit

Every Exa tool call is logged: who triggered it, which query was run, what came back. 90 days of history, tied to the user who authorized it.

History: 90 days

Export: SIEM-ready

Logged: every call

Test other agents

Same per-user auth pattern across other research agents and MCP connectors. Working code, live demos, fork what fits.

SALES

Outbound prospecting agent

Build targeted prospect lists with Apollo, enrich with firmographic data, and draft personalised outreach. Runs on a schedule.

Apollo

SALES

Sales call prep agent

Pull Granola notes and Attio contact history to draft a pre-call brief before every sales meeting. Zero rep input.

Granola

Attio

Why Scalekit

Secure your agent's access. Connectors ship in minutes

Other connector libraries treat auth as a demo afterthought. Scalekit starts with user identity, scope enforcement, and audit.

01.

Search quota consumed under the wrong identity

A shared Exa API key looks fine in a demo. In production, every search query burns quota against a service account. Per-user rate limits and result caching collapse. Scalekit resolves the user's key so search runs under the right identity.

// shared API key
key = "exa_shared_xxx"
audit → bot_service_account
quota_filter → broken

// scalekit · per-user
key = resolve(user_id)
audit → user_abc
scope → enforced ✓
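The contrast above can be modeled with a toy example. This is a sketch under stated assumptions, not Scalekit's implementation: `UserKeyVault`, `callAsUser`, and the placeholder key are hypothetical, and the vault is an in-memory map. It shows the two properties the comparison names: the key is resolved per user, and the audit entry carries that user's identity rather than a service account.

```typescript
// Toy in-memory stand-in for a per-user credential vault (hypothetical).
class UserKeyVault {
  private readonly keys = new Map<string, string>();

  store(userId: string, apiKey: string): void {
    this.keys.set(userId, apiKey);
  }

  // Fails closed: there is no fallback to a shared key.
  resolve(userId: string): string {
    const key = this.keys.get(userId);
    if (key === undefined) {
      throw new Error(`no Exa key authorized for ${userId}`);
    }
    return key;
  }
}

interface AuditEntry {
  userId: string;
  tool: string;
}

// Resolve the caller's own key, hand it to the transport, and record who acted.
function callAsUser(
  vault: UserKeyVault,
  auditLog: AuditEntry[],
  userId: string,
  tool: string,
  send: (apiKey: string) => void,
): void {
  const apiKey = vault.resolve(userId); // throws before anything is sent or logged
  send(apiKey);
  auditLog.push({ userId, tool });
}

const vault = new UserKeyVault();
vault.store("user_abc", "exa_key_for_abc"); // placeholder value, not a real key
const auditLog: AuditEntry[] = [];
const keysSent: string[] = [];

callAsUser(vault, auditLog, "user_abc", "exa_search", (k) => keysSent.push(k));

// A user who never authorized Exa is rejected instead of borrowing another key.
let rejected = false;
try {
  callAsUser(vault, auditLog, "user_xyz", "exa_search", (k) => keysSent.push(k));
} catch {
  rejected = true;
}

console.log(auditLog.length, rejected); // prints 1 true
```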

02.

Authentication is not authorization

03.

Multi-tenancy is architectural

04.

Exa today. Others tomorrow.

“Our agents act across Salesforce, Gong, Google Drive, and more, on behalf of every customer. Scalekit behind the scenes meant we can keep adding tools without ever rebuilding how credentials or tool calling work.”

Venu Madhav Kattagoni

Head of Engineering / Von

Frequently Asked Questions

Does the agent access Exa as the user or as a shared key?

As the user. Each workspace member authorizes once and Scalekit resolves their credential at request time. Audit logs attribute every action to that user, not a shared service account.

Where is the Exa API key stored?

In Scalekit's managed AES-256 token vault, namespaced per tenant. Refresh is automatic. Revocation is a single dashboard action. Tokens never appear in prompts, logs, or LLM context.

Can I limit what the agent is allowed to do in Exa?

Yes. Pass a tool name filter to listScopedTools so the research agent only sees the subset you authorize. Pre-API-call scope checks block out-of-policy actions before the request reaches Exa.
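A minimal sketch of the same idea on the client side, assuming tools are identified by name: an allowlist decides which tools the agent is shown, and a check runs again before any call is executed. `ScopedTool`, `filterTools`, and `assertAllowed` are hypothetical helpers for illustration; the enforcement described in this answer happens inside Scalekit, not in this code.

```typescript
// Hypothetical, simplified tool record; real tool definitions carry more fields.
interface ScopedTool {
  name: string;
  description: string;
}

// Only these tools are exposed to the research agent.
const ALLOWED_TOOLS: ReadonlySet<string> = new Set(["exa_search", "exa_find_similar"]);

// Keep only the tools on the allowlist before handing them to the model.
function filterTools(tools: ScopedTool[], allowed: ReadonlySet<string>): ScopedTool[] {
  return tools.filter((tool) => allowed.has(tool.name));
}

// Second line of defense: refuse to execute anything off the allowlist.
function assertAllowed(toolName: string, allowed: ReadonlySet<string>): void {
  if (!allowed.has(toolName)) {
    throw new Error(`tool ${toolName} is not permitted for this agent`);
  }
}

const catalog: ScopedTool[] = [
  { name: "exa_search", description: "Search the web" },
  { name: "exa_find_similar", description: "Find similar pages" },
  { name: "exa_delete_webset", description: "Delete a webset" },
];

const visibleTools = filterTools(catalog, ALLOWED_TOOLS);
console.log(visibleTools.map((tool) => tool.name)); // prints [ 'exa_search', 'exa_find_similar' ]

let blocked = false;
try {
  assertAllowed("exa_delete_webset", ALLOWED_TOOLS);
} catch {
  blocked = true;
}
console.log(blocked); // prints true
```

Checking at both points means a model that hallucinates a tool name it was never shown still cannot trigger the call.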

What happens when a user revokes Exa access?

The connection is invalidated on the next tool call. Subsequent requests for that user fail closed with a clear error. Other users in the tenant remain unaffected. The event is logged for audit.

Are Exa search results shared or cached across users?

No. There is no cross-user caching. Each search request hits Exa's API with the authorizing user's key, so per-user rate limits and monthly query quotas apply independently. Search history and results are never shared or reused across users.
