AI Agent
AI agent that formats sitemap.xml into llms.txt using a detailed prompt and example, given webpage content. Custom prompt provided in 'goal' field.
Transform any sitemap.xml into a well-structured llms.txt format using AI. This workflow fetches URLs from a sitemap, retrieves and processes their content, and leverages an AI agent to generate an optimized llms.txt file suitable for AI training or knowledge ingestion.

Flows
AI agent that formats sitemap.xml into llms.txt using a detailed prompt and example, given webpage content. Custom prompt provided in 'goal' field.
Below is a complete list of all components used in this flow to achieve its functionality. Components are the building blocks of every AI Flow. They allow you to create complex interactions and automate tasks by connecting various functionalities. Each component serves a specific purpose, such as handling user input, processing data, or integrating with external services.
The Chat Opened Trigger component detects when a chat session starts, enabling workflows to respond instantly as soon as a user opens the chat. It initiates flows with the initial chat message, making it essential for building responsive, interactive chatbots.
The Message Widget component displays custom messages within your workflow. Ideal for welcoming users, providing instructions, or showing any important information, it supports Markdown formatting and can be set to appear only once per session.
Discover the Chat Output component in FlowHunt—finalize chatbot responses with flexible, multi-part outputs. Essential for seamless flow completion and creating advanced, interactive AI chatbots.
The Chat Input component in FlowHunt initiates user interactions by capturing messages from the Playground. It serves as the starting point for flows, enabling the workflow to process both text and file-based inputs.
Unlock web content in your workflows with the URL Retriever component. Effortlessly extract and process the text and metadata from any list of URLs—including web articles, documents, and more. Supports advanced options like OCR for images, selective metadata extraction, and customizable caching, making it ideal for building knowledge-rich AI flows and automations.
The AI Agent component in FlowHunt empowers your workflows with autonomous decision-making and tool-using capabilities. It leverages large language models and connects to various tools to solve tasks, follow goals, and provide intelligent responses. Ideal for building advanced automations and interactive AI solutions.
FlowHunt's GoogleSearch component enhances chatbot accuracy using Retrieval-Augmented Generation (RAG) to access up-to-date knowledge from Google. Control results with options like language, country, and query prefixes for precise and relevant outputs.
Flow description
This workflow automates the process of converting a website’s sitemap.xml into a structured and AI-friendly llms.txt format. The flow leverages AI agents and retrieval tools to streamline preparing your site’s content for use in large language models (LLMs) and other AI applications. Below is a detailed breakdown of its steps and components.
Welcome Message on Chat Open
When a user opens the chat, a message widget displays a friendly prompt:
🗂️ Drop your sitemap.xml URL below!
I’ll convert it into a clean llms.txt format, perfect for feeding into AI models 🤖📄
This sets clear expectations and guides the user to submit the correct input.
sitemap.xml file into the chat interface.URL Retriever (Primary)
The workflow uses a URL Content Retriever node to:
sitemap.xml URL.Advanced Settings
Google Search Tool
The AI agent is equipped with a Google Search tool, allowing it to:
Secondary URL Retriever
An additional retriever node can be configured to fetch content from URLs found via Google Search, further enriching the AI agent’s knowledge base if required.
sitemap.xml content into a well-structured llms.txt according to a provided example.llms.txt files.llms.txt requirements for LLM ingestion.llms.txt content) is displayed to the user in chat, ready to be used for AI training or ingestion.| Step | Component | Purpose |
|---|---|---|
| 1 | Chat Opened Trigger | Shows welcome/instruction message |
| 2 | Message Widget | Guides user to input sitemap.xml URL |
| 3 | Chat Input | Receives user-submitted sitemap.xml URL |
| 4 | URL Retriever | Fetches and parses URLs/content from sitemap |
| 5 | Google Search Tool | (Optional) Finds additional context for pages |
| 6 | URL Retriever (Google) | (Optional) Fetches content from Google-found URLs |
| 7 | AI Agent | Converts all page data to formatted llms.txt |
| 8 | Chat Output | Presents formatted llms.txt to user |
Scalability:
Automates a time-consuming manual process, allowing you to convert any site’s sitemap into a usable format for LLMs without technical expertise.
Quality and Consistency:
Ensures that the output matches a strict format, improving the quality of your AI training data.
Extensibility:
Can be customized to include additional knowledge sources or apply more advanced extraction logic.
Efficiency:
Integrates caching and token limits to handle even large websites quickly and reliably.
AI-Driven Decisions:
The agent can prioritize important pages and structure the output intelligently—something that would be tedious or error-prone to do by hand.
This workflow makes it easy, fast, and reliable to convert website sitemaps into AI-optimized text files, saving you hours of manual work and ensuring your AI models get high-quality, structured input.

Transform your website's sitemap.xml into LLM-friendly documentation format automatically. This AI-powered converter extracts, processes, and structures your we...

Learn how LLMs.txt files help AI agents navigate your website efficiently, prioritize important content, and improve AI-driven visibility for your business.

The llms.txt file is a standardized Markdown file designed to optimize how Large Language Models (LLMs) access and process website content. Hosted at a website'...
Cookie Consent
We use cookies to enhance your browsing experience and analyze our traffic. See our privacy policy.