arXiv MCP Server

Connect your AI workflows to arXiv with the arXiv MCP Server. Search, retrieve, and load scholarly articles directly into your LLM-powered research assistants.

arXiv MCP Server

What does “arXiv” MCP Server do?

The arXiv MCP Server is a Model Context Protocol (MCP) server designed to enable seamless interaction with the arXiv API using natural language. It acts as a bridge between AI assistants and the arXiv scholarly article repository, allowing developers and AI agents to retrieve article metadata, perform advanced searches, download PDFs, and load article content directly into a large language model’s context. This enhances research workflows by automating information retrieval, document management, and contextual data enrichment for LLMs, making scholarly research more accessible and efficient.

List of Prompts

No prompt templates are mentioned in the repository.

List of Resources

No explicit MCP resources are listed in the repository.

List of Tools

  • get_article_url
    Retrieves the URL of an article hosted on arXiv.org based on its title.
    Parameters: title (String)

  • download_article
    Downloads the article from arXiv.org as a PDF file to the local machine.
    Parameters: title (String)

  • load_article_to_context
    Loads the article content into the context of a large language model for further processing.
    Parameters: title (String)

  • get_details
    Retrieves metadata of an article from arXiv.org based on its title.
    Parameters: title (String)

  • search_arxiv
    Performs a comprehensive search query on the arXiv API and returns matching article metadata.
    Parameters:

    • all_fields (String): General keyword search
    • title (String): Search within titles
    • author (String): Filter by author name
    • abstract (String): Search within abstracts
    • start (int): Index of the first result to return

Use Cases of this MCP Server

  • Academic Literature Search
    AI assistants or developers can use the server to search arXiv’s vast database for relevant articles using keywords, author names, or subjects, streamlining the research process.
  • Automated Document Retrieval
    Enables downloading of PDFs for specific articles directly via tool calls, facilitating quick access to original research papers.
  • Contextual Data Loading for LLMs
    Article content can be loaded into an LLM’s context, allowing for advanced question answering, summarization, or analysis workflows.
  • Metadata Extraction
    Easily retrieve detailed metadata (authors, titles, abstracts) about articles based on natural language queries, supporting bibliographic applications.
  • Integration in AI-Driven Research Assistants
    Forms the backbone for AI-powered assistants that help researchers stay up to date by surfacing the latest papers on topics of interest.

How to set it up

Windsurf

  1. Ensure you have Python 3.13+ installed as a prerequisite.
  2. Clone the arXiv MCP server repository and install dependencies.
  3. Edit your Windsurf configuration file to add the arXiv MCP server.
  4. Insert the following JSON snippet under the mcpServers object:
{
  "arxiv-mcp": {
    "command": "python",
    "args": ["-m", "arxiv_server"]
  }
}
  1. Save the configuration and restart Windsurf.
  2. Verify the server is running and accessible.

Securing API Keys:
If the server or tools require API keys, store them as environment variables and reference them in your configuration:

{
  "arxiv-mcp": {
    "command": "python",
    "args": ["-m", "arxiv_server"],
    "env": {
      "ARXIV_API_KEY": "<your-api-key>"
    },
    "inputs": {
      "api_key": "${env.ARXIV_API_KEY}"
    }
  }
}

Claude

  1. Install Python 3.13+ and clone the arXiv MCP server repository.
  2. Locate Claude’s MCP server configuration file.
  3. Add the following under mcpServers:
{
  "arxiv-mcp": {
    "command": "python",
    "args": ["-m", "arxiv_server"]
  }
}
  1. Save the changes and restart Claude.
  2. Confirm the server is detected by Claude.

Cursor

  1. Make sure Python 3.13+ is installed and clone the repository.
  2. Open Cursor’s configuration file.
  3. Add the arXiv MCP server as below:
{
  "arxiv-mcp": {
    "command": "python",
    "args": ["-m", "arxiv_server"]
  }
}
  1. Save and restart Cursor.
  2. Test the integration.

Cline

  1. Install Python 3.13+ and clone the server repo.
  2. Edit the Cline configuration file to register the server.
  3. Include this block:
{
  "arxiv-mcp": {
    "command": "python",
    "args": ["-m", "arxiv_server"]
  }
}
  1. Save and restart Cline.
  2. Verify connectivity.

Note:
Always secure sensitive information like API keys using environment variables as shown in the example above.

How to use this MCP inside flows

Using MCP in FlowHunt

To integrate MCP servers into your FlowHunt workflow, start by adding the MCP component to your flow and connecting it to your AI agent:

FlowHunt MCP flow

Click on the MCP component to open the configuration panel. In the system MCP configuration section, insert your MCP server details using this JSON format:

{
  "arxiv-mcp": {
    "transport": "streamable_http",
    "url": "https://yourmcpserver.example/pathtothemcp/url"
  }
}

Once configured, the AI agent is now able to use this MCP as a tool with access to all its functions and capabilities. Remember to change “arxiv-mcp” to your server’s name and replace the URL with your own MCP server URL.


Overview

SectionAvailabilityDetails/Notes
Overview
List of PromptsNone found
List of ResourcesNone found
List of Tools
Securing API KeysInstructed
Sampling Support (less important in evaluation)Not mentioned

Our opinion

The arXiv MCP Server offers a focused set of tools for scholarly article retrieval and integration with LLM workflows. Its documentation is clear about features and setup, and it is open source under MIT. However, it does not provide prompt templates or explicit resources, and there is no mention of sampling or roots support. For users needing arXiv integration, it is a solid, reliable option, but it lacks some advanced MCP features.

MCP Score

Has a LICENSE✅ (MIT)
Has at least one tool
Number of Forks1
Number of Stars4

Frequently asked questions

What is the arXiv MCP Server?

The arXiv MCP Server is a Model Context Protocol server that allows AI assistants and developers to interact with the arXiv scholarly article database using natural language. It supports searching, retrieving metadata, downloading PDFs, and loading article content into large language models.

What tools does the arXiv MCP Server provide?

It provides tools for article URL retrieval, PDF downloading, loading article content into LLMs, metadata extraction, and advanced search queries using various filters like title, author, and abstract.

How do I secure my API keys with the arXiv MCP Server?

API keys (if required) should be stored as environment variables and referenced in your MCP server configuration. Example: { \"arxiv-mcp\": { \"command\": \"python\", \"args\": [\"-m\", \"arxiv_server\"], \"env\": { \"ARXIV_API_KEY\": \"\" }, \"inputs\": { \"api_key\": \"${env.ARXIV_API_KEY}\" } } }

Can I load full article content into my LLM agent?

Yes, the 'load_article_to_context' tool allows you to load the full content of an arXiv article directly into your LLM's context for summarization, question answering, or further analysis.

Is the arXiv MCP Server open source?

Yes, it is open source under the MIT license.

Integrate arXiv with FlowHunt

Supercharge your research flows by connecting arXiv to your AI agents using the arXiv MCP Server. Automate literature reviews, metadata extraction, and more.

Learn more