What is Scalable OpenAPI Endpoint Discovery and API Request Tool.?
This MCP server provides a scalable solution for discovering OpenAPI endpoints and making API requests using natural language queries. It utilizes in-memory semantic search for fast endpoint discovery and supports customizable configurations through environment variables.
Documentation
MCP Server: Scalable OpenAPI Endpoint Discovery and API Request Tool
TODO
The docker image is 2GB without pre-downloaded models. Its 3.76GB with pre-downloaded models!! Too big, someone please help me to reduce the size.
Configuration
Customize through environment variables. GLOBAL_TOOL_PROMPT is IMPORTANT!
# Creates tools: custom_api_request_schema and custom_make_request
docker run -e MCP_API_PREFIX=finance ...
GLOBAL_TOOL_PROMPT: Optional text to prepend to all tool descriptions. This is crucial to make the Claude select and not select your tool accurately.
# Adds "Access to insights apis for ACME Financial Services abc.com . " to the beginning of all tool descriptions
docker run -e GLOBAL_TOOL_PROMPT="Access to insights apis for ACME Financial Services abc.com ." ...
TL'DR
Why I create this: I want to serve my private API, whose swagger openapi docs is a few hundreds KB in size.
Claude MCP simply error on processing these size of file
I attempted convert the result to YAML, not small enough and a lot of errors. FAILED
I attempted to provide a API category, then ask MCP Client (Claude Desktop) to get the api doc by group. Still too big, FAILED.
Eventually I came down to this solution:
It uses in-memory semantic search to find relevant Api endpoints by natural language (such as list products)
It returns the complete end-point docs (as I designed it to store one endpoint as one chunk) in millionseconds (as it's in memory)
Boom, Claude now knows what API to call, with the full parameters!
Wait I have to create another tool in this server to make the actual restful request, because "fetch" server simply don't work, and I don't want to debug why.
🧠 Use remote openapi json file as source, no local file system access, no updating required for API changes
🔍 Semantic search using optimized MiniLM-L3 model (43MB vs original 90MB)
🚀 FastAPI-based server with async support
🧠 Endpoint based chunking OpenAPI specs (handles 100KB+ documents), no loss of endpoint context
⚡ In-memory FAISS vector search for instant endpoint discovery
Limitations
Not supporting linux/arm/v7 (build fails on Transformer library)
🐢 Cold start penalty (~15s for model loading) if not using docker image
[Obsolete] Current docker image disabled downloading models. You have a dependency over huggingface. When you load the Claude Desktop, it takes some time to download the model. If huggingface is down, your server will not start.
The latest docker image is embedding pre-downloaded models. If there is issues, I would revert to the old one.
Multi-instance config example
Here is the multi-instance config example. I design it so it can more flexibly used for multiple set of apis:
You should get the api spec details from tools financial_api_request_schema
You task is use financial_make_request tool to make the requests to get response. You should follow the api spec to add authorization header:
Authorization: Bearer <xxxxxxxxx>
Note: The base URL will be returned in the api_request_schema response, you don't need to specify it manually.
In chat, you can do:
Get prices for all stocks
Installation# Installing via Smithery
To install Scalable OpenAPI Endpoint Discovery and API Request Tool for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @baryhuang/mcp-server-any-openapi --client claude
Using pip
pip install mcp-server-any-openapi
Available Tools
The server provides the following tools (where {prefix} is determined by MCP_API_PREFIX):
{prefix}_api_request_schema
Get API endpoint schemas that match your intent. Returns endpoint details including path, method, parameters, and response formats.
Input Schema:
{
"query": {
"type": "string",
"description": "Describe what you want to do with the API (e.g., 'Get user profile information', 'Create a new job posting')"
}
}
{prefix}_make_request
Essential for reliable execution with complex APIs where simplified implementations fail. Provides: