On 31 January, OpenAI launched its next generation of reasoning models with the O3 Mini model. Overall, O3 Mini appears to be about 15 times cheaper than the O1 model, OpenAI's previous generation of reasoning models, with approximately the same accuracy. O3 Mini became available in FlowHunt less than a day after OpenAI's launch. In this blog, we'll dive deep into the model's capabilities and compare it with DeepSeek.
Evaluation Metrics for OpenAI O3 Mini vs DeepSeek
In this evaluation, I used a simple workflow consisting of a chat input, an AI generator, and a chat output to ask each model for a clear and concise explanation of the main line of the Ruy Lopez opening in chess. Comparing DeepSeek against O3 Mini, I assessed how each model outlined the key moves (1. e4 e5, 2. Nf3 Nc6, 3. Bb5 a6, 4. Ba4 Nf6, and 5. O-O Be7) and their subsequent developments. The comparison shows how well each model processes and relays chess theory, and that a simple conversational flow is enough for a readable, practical evaluation.
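One way to make this comparison less subjective is to score each answer against the reference line quoted above. The following is a minimal sketch, not the method used in this evaluation: the function names `parse_moves` and `matching_plies` are hypothetical, and the move filter is a simplified pattern for standard algebraic notation, not a full chess validator.

```python
import re
from typing import List

# Main line as quoted in this article, one token per half-move (ply).
RUY_LOPEZ_MAIN_LINE = ["e4", "e5", "Nf3", "Nc6", "Bb5", "a6", "Ba4", "Nf6", "O-O", "Be7"]

# Simplified pattern for a move in algebraic notation (assumption: it does
# not check legality, only the shape of the token).
SAN_PATTERN = re.compile(
    r"(O-O(-O)?|[KQRBN]?[a-h]?[1-8]?x?[a-h][1-8](=[QRBN])?)[+#]?"
)


def parse_moves(text: str) -> List[str]:
    """Drop move numbers such as '1.' or '3...' and keep only move-like tokens."""
    without_numbers = re.sub(r"\b\d+\.+", " ", text)
    tokens = (tok.strip(".,;") for tok in without_numbers.split())
    return [tok for tok in tokens if SAN_PATTERN.fullmatch(tok)]


def matching_plies(answer: str, reference: List[str] = RUY_LOPEZ_MAIN_LINE) -> int:
    """Count how many leading half-moves of the answer agree with the reference."""
    count = 0
    for got, want in zip(parse_moves(answer), reference):
        if got != want:
            break
        count += 1
    return count


if __name__ == "__main__":
    model_answer = "1. e4 e5, 2. Nf3 Nc6, 3. Bb5 a6, 4. Ba4 Nf6, and 5. O-O Be7"
    print(matching_plies(model_answer), "of", len(RUY_LOPEZ_MAIN_LINE), "plies match")
```

A score like this only measures agreement with one reference line, so a model that correctly describes a different Ruy Lopez variation would still need a manual check.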
This is the flow:

DeepSeek Results
I asked the model the following: “give me first 10 moves of ruy lopez”. The results are as follows. This is the thinking process of DeepSeek:
And this is the final result:
OpenAI O3 Mini Results
O3 Mini wins: its answer is more understandable and correct.
Evaluating DeepSeek and OpenAI O3 Mini on Agentic Behaviour
We evaluated the reasoning models on a task that requires an AI Agent to use 2-3 tools. Here is the task:
Given the company name, search and find the pricing page of product and suggest enhancements.
also take screenshot of pricing page.then send an email to me with the results
The task requires connecting to Google search to find the pricing page, crawling the pricing page, and sending a detailed report by email.
The task was intentionally not described in detail, in order to test the reasoning capabilities of the two models.
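The tool sequence the task implies can be sketched as a small pipeline that records which tools were called and then reports any that were skipped. This is an illustrative sketch only: `google_search`, `scrape_url`, and `send_email` are hypothetical stand-ins that return canned data, not FlowHunt's actual tools, the screenshot step is omitted for brevity, and the `can_email` flag simply simulates an agent that never reaches the email step.

```python
from dataclasses import dataclass, field
from typing import List

# Tools the task implicitly requires, in the order an agent should call them.
EXPECTED_TOOLS = ["google_search", "scrape_url", "send_email"]


@dataclass
class ToolRun:
    """Records which tools an agent actually invoked, in order."""
    calls: List[str] = field(default_factory=list)


# Hypothetical stand-ins for the real tools; each returns canned data.
def google_search(company: str) -> str:
    return f"https://www.example.com/{company.lower()}/pricing/"


def scrape_url(url: str) -> str:
    return f"Pricing page content fetched from {url}"


def send_email(body: str) -> bool:
    return bool(body)


def run_pipeline(company: str, run: ToolRun, can_email: bool = True) -> str:
    """Search, scrape, and (optionally) email a report, logging each tool call."""
    run.calls.append("google_search")
    url = google_search(company)

    run.calls.append("scrape_url")
    page = scrape_url(url)

    report = f"Review of {url}\n{page}"
    if can_email:
        run.calls.append("send_email")
        send_email(report)
    return report


def missing_tools(run: ToolRun) -> List[str]:
    """Return the expected tools that were never called during the run."""
    return [tool for tool in EXPECTED_TOOLS if tool not in run.calls]


if __name__ == "__main__":
    complete, partial = ToolRun(), ToolRun()
    run_pipeline("FlowHunt", complete)
    run_pipeline("FlowHunt", partial, can_email=False)
    print("complete run missing:", missing_tools(complete))
    print("partial run missing:", missing_tools(partial))
```

Comparing the recorded calls against an expected list gives a simple pass or fail signal for tool calling, independent of how good the written report is.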

Using DeepSeek as AI Agent
DeepSeek was not able to send the email. Overall, in our experience DeepSeek is less accurate at tool calling than OpenAI GPT-4o or OpenAI O3 Mini. Below is the report DeepSeek generated when analyzing the FlowHunt pricing page. Along the way, this was DeepSeek's thought process:
1. Call to Google Tool:

2. Scrape URL:

The reasoning:
Subject: FlowHunt Pricing Page Review and Suggestions
Dear [Your Name],
I hope this message finds you well. Below is a summary of the FlowHunt pricing page, along with some suggestions for enhancements.
Pricing Overview:
FlowHunt offers four pricing tiers:
Starter Plan (€10/month):
10 Credits
5 Workflows per Workspace
1 Workspace
1 Team Member per Workspace
5 Chatbots per Workspace
2 Websites per Workspace
5 Documents per Workspace
50 FAQs per Workspace
Access to All Latest OpenAI LLMs, Including Custom Fine-Tuned Models
All Integrations
Email Support
Pro Plan (€50/month):
55 Credits (Includes 10% Discount on Credits)
20 Workflows per Workspace
5 Workspaces
10 Team Members per Workspace
20 Chatbots per Workspace
10 Websites per Workspace
50 Documents per Workspace
100 FAQs per Workspace
Access to All Latest OpenAI LLMs, Including Custom Fine-Tuned Models
Pro Support
API Key Access
Basic Analytics (Coming Soon)
All Integrations
Premium Plan (€500/month):
600 Credits (Includes 20% Discount on Credits)
50 Workflows per Workspace
10 Workspaces
100 Team Members per Workspace
50 Chatbots per Workspace
30 Websites per Workspace
100 Documents per Workspace
200 FAQs per Workspace
Access to All Latest OpenAI LLMs, Including Custom Fine-Tuned Models
Priority Support
API Key Access
Advanced Analytics (Coming Soon)
Initial Workflow Consultation
Initial Workflow Training Support
All Integrations
Enterprise Plan (Custom Pricing starting from €1500/month):
Contracted Tokens (Bring Your Own Key – BYOK)
Unlimited Workflows
Unlimited Workspaces
Unlimited Users
Unlimited Chatbots
Unlimited Schedules
Unlimited Documents
Unlimited Questions and Answers
Unlimited Chat History
Access to All Latest OpenAI LLMs, Including Custom Fine-Tuned Models
Dedicated Priority Support
API Key Access
Advanced Analytics (Coming Soon)
Customizable Workflow Modules
Optional Training Support
Advanced Security and on-premises Deployment
Suggested Enhancements:
Feature Comparison Table:
Create a side-by-side comparison table highlighting the features of each plan. This would make it easier for users to quickly identify the differences between the plans.
Customer Testimonials/Case Studies:
Adding testimonials or case studies from existing customers can build trust and credibility.
Pricing Calculator:
Implement an interactive pricing calculator that allows users to estimate their costs based on their specific needs, especially for the Enterprise plan.
Free Trial Highlight:
Make the free trial option more prominent on the page to encourage users to try the service before committing to a paid plan.
Detailed Documentation/Setup Guides:
Provide links to detailed documentation or setup guides to help users understand how to make the most of the features included in each plan.
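The credit figures in the report above can be sanity-checked with a little arithmetic. The sketch below assumes the Starter plan's €1 per credit is the baseline and attributes each plan's whole price to its credits, which the pricing page itself does not state. Under that assumption, the "10%" and "20%" figures correspond to extra credits for the same money, not to a lower price per credit.

```python
# (monthly price in EUR, included credits), as listed in the report above.
PLANS = {
    "Starter": (10, 10),
    "Pro": (50, 55),
    "Premium": (500, 600),
}

# Assumption: the Starter plan's price per credit is the undiscounted baseline.
BASE_PRICE_PER_CREDIT = PLANS["Starter"][0] / PLANS["Starter"][1]


def bonus_credits(plan: str) -> float:
    """Extra credits received, relative to buying at the baseline price."""
    price, credits = PLANS[plan]
    credits_at_base_price = price / BASE_PRICE_PER_CREDIT
    return credits / credits_at_base_price - 1


def per_credit_discount(plan: str) -> float:
    """Reduction in the effective price per credit, relative to the baseline."""
    price, credits = PLANS[plan]
    return 1 - (price / credits) / BASE_PRICE_PER_CREDIT


if __name__ == "__main__":
    for name in PLANS:
        print(
            f"{name}: {bonus_credits(name):.1%} bonus credits, "
            f"{per_credit_discount(name):.1%} cheaper per credit"
        )
```

The two measures differ because a bonus of 10% more credits lowers the price per credit by about 9.1%, and a 20% bonus lowers it by about 16.7%.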
Using OpenAI O3 Mini as AI Agent
In contrast, even with this minimal description, the OpenAI O3 Mini model was able to work out the exact process:
1. Google Search

2. Scrape URL

3. Email Sending:

This is the email I received:
Hello,
Below are the results of the review of FlowHunt's pricing page and some suggested enhancements:
Pricing Page URL: https://www.flowhunt.io/pricing/
Screenshot: A screenshot of the pricing page was taken. (Please refer to the attached screenshot in your system.)
Summary of the Pricing Page:
The pricing page clearly displays four plans: Starter (€10/month), Pro (€50/month), Premium (€500/month), and Enterprise (from €1500/month). Each plan lists the credits, workflow and team limits, core features, and additional support options. The page includes both an overview and a detailed comparison using a table format, which helps potential customers compare the features across plans.
Suggested Enhancements:
Interactive Pricing Calculator:
Consider adding an interactive pricing calculator that allows users to adjust parameters (e.g., number of workflows, team members) to see a dynamic estimate of their monthly cost.
Visual Enhancements and UX Improvements:
Introduce more engaging visuals (e.g., icons or infographics) to further differentiate the features of each plan.
Add subtle animations when hovering over different plan details to improve user interactivity.
Currency Converter:
Since pricing is in Euros, include a currency converter option to cater to international users for a better localized experience.
Clearer Call-to-Action (CTA):
Enhance the CTA buttons by making them more prominent and possibly adding a secondary CTA (e.g., "Learn More" about plan benefits) for users who need more clarity before signing up.
Enhanced FAQ Section:
Expand the FAQ section with more detailed answers regarding the credits system, potential overages, and common customer scenarios.
Consider using collapsible sections to keep the page clean while providing additional information when needed.
Social Proof and Testimonials:
Incorporate customer testimonials or success stories near the pricing section to build credibility.
Mobile Optimization:
Ensure that the pricing table and all interactive elements render optimally on mobile devices.
Chat Support Integration:
Add an integrated live chat or chatbot support option on the pricing page to instantly address any queries related to pricing or plan details.
By implementing these enhancements, FlowHunt can provide a more engaging and user-friendly experience, potentially increasing conversion rates.
Best regards, [Your Name]
(P.S. This email includes the results and suggestions for FlowHunt’s pricing page as well as information about the captured screenshot.)