bacground gradient shape

Reading Content from Uploaded Files

Learn how to enable your AgentX assistant to read and understand uploaded files using Text Extraction and Deep Image Understanding tools.

Your AgentX assistant can read and understand files you upload -  from PDFs and scanned pages to images and handwritten notes. This helps your agent extract important information or analyze visuals directly from the content you provide.

AgentX provides both basic and advanced skills that allow your agent to process uploaded files, analyze visuals, automate email communication, and improve the chat experience from the first interaction.

Adding skills to your Agent

Available Agent Skills

🧾 Text Extraction

Use this skill to capture any written text visible on an image or document - for example, text from signs, labels, invoices, or screenshots.
Examples: Upload a photo of a contract page, and your agent extracts all readable text for you to edit or search through.

  • Extract text from any image or PDF - instantly capture words from photos, scanned documents, or screenshots.

  • Turn pictures into editable text - quickly copy or edit written content from signs, labels, or handwritten notes.

  • Get clean text from images automatically - perfect for invoices, receipts, forms, or product packaging.


🧠 Deep Image Understanding

This feature allows your agent to analyze the full image, not just the text. It recognizes objects, handwriting, charts, and layouts to understand what’s shown in context.
Example: Send a whiteboard photo, and your agent can summarize the ideas, list the topics, or describe the diagram structure.

  • Understand what’s inside any image - detect objects, scenes, diagrams, and relationships between elements.

  • Analyze visuals beyond text - the AI interprets charts, handwritten notes, and complex document layouts.

  • Comprehend full image meaning - ideal for analyzing infographics, presentations, engineering drawings, or medical scans.


✉️ Send Emails

This skill allows your agent to send emails automatically on request.

What your agent can do:

  • Send emails to you using the default AgentX email

  • Send emails to any external address after connecting your Gmail account

How Email Sending Works

When enabling Send Emails, you can choose:

  • AgentX Email ([email protected])
    Emails are sent only to the agent creator (default).

  • Your Connected Gmail Account
    Allows the agent to send emails to any recipient.

To send emails externally, connect a Gmail or Google Workspace account once - then select it in Agent Skills.

Example command:

“Email today’s summary to [email protected].”


🗨️ Conversation Starters

Conversation Starters help guide users toward the most important or valuable questions from the very first interaction.

They appear as clickable prompts above the chat window.

You can use two types:

  • Dynamic Starters (AI-generated responses)
    Only the question is defined - the agent generates a fresh, contextual answer every time.
    Ideal for discovery, personalization, or up-to-date topics.

  • FAQ-Based Starters (Fixed responses)
    Reuse an existing FAQ to always return a predefined answer.
    Best for instructions, marketing messages, or compliance-sensitive content.

Examples:

  • “How can you help me automate my daily tasks?” → dynamic

  • “How do I integrate this agent with WhatsApp?” → FAQ-based


⚙️ How to Enable These Features

  1. Go to Agent Dashboard → Edit

  2. Open the General tab

  3. Scroll to Agent Skills

  4. Add one or more of the following:

  • Text Extraction

  • Deep Image Understanding

  • Send Emails

  • Conversation Starters

That’s it - your agent is now equipped to read files, analyze images, send emails, and guide conversations intelligently.

circle image

Start Your AI Automation Journey Today

Start Your AI Automation Journey Today

Sign up for Fusion AI and let AI handle your routine tasks - no credit card needed.

Sign up for Fusion AI and let AI handle your routine tasks - no credit card needed.