Document Understanding
Quick Start
What you can do:
- Analyze uploaded documents with AI
- Extract text, labels, and data from documents
- Get descriptions and summaries of visual content
- Compare documents and identify differences
- Answer questions about document content
Quick Start Steps:
1. Navigate to Chat: Go to the AI Chat interface
2. Upload a File:
- Click the `+` icon
- Select "Upload from device" or "Diaflow Drive". You can also drag and drop files, or copy and paste images, into the chat frame.
Supported File Formats (Format support may vary by model):
Images: .jpg, .jpeg, .png, .webp
Documents: .docx, .txt, .pdf
3. Ask Your Question: Type a prompt about the uploaded file (e.g., "Describe this image" or "Extract the text from this receipt")
4. Send: Click Send and wait for AI analysis
5. Review Results: The AI returns structured analysis, descriptions, or extracted data

Expected Result: The AI analyzes the uploaded files and returns detailed insights, descriptions, or extracted information based on your prompt.
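
If you prepare files with a script before uploading, the supported formats listed in step 2 can be checked locally first. The following is a minimal sketch, not part of Diaflow: the function name `classify_upload` and the constant names are hypothetical, and only the extension list itself comes from this page. Because format support may vary by model, treat a match as a first filter rather than a guarantee.

```python
from pathlib import Path

# Extensions copied from the "Supported File Formats" list on this page.
# Actual support may vary by model.
SUPPORTED_IMAGES = {".jpg", ".jpeg", ".png", ".webp"}
SUPPORTED_DOCUMENTS = {".docx", ".txt", ".pdf"}


def classify_upload(filename: str) -> str:
    """Return "image", "document", or "unsupported" based on the file extension."""
    ext = Path(filename).suffix.lower()  # case-insensitive: ".JPG" matches ".jpg"
    if ext in SUPPORTED_IMAGES:
        return "image"
    if ext in SUPPORTED_DOCUMENTS:
        return "document"
    return "unsupported"


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    for name in ["receipt.JPG", "contract.pdf", "slides.pptx"]:
        print(name, "->", classify_upload(name))
```

The check looks only at the file name, so it does not confirm that the file contents match the extension.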

Common Uses
1. Image Description and Analysis
Get detailed descriptions of images for:
- Accessibility (alt-text generation)
- Content moderation
- Visual content understanding
Example Prompts:
- "Describe this image in detail"
- "What objects are visible in this image?"
- "Analyze the composition and color scheme"

2. Data Extraction and Structuring
Extract structured data from files:
- Tables and charts
- Forms and surveys
- Product information
- Contact details
Example Prompts:
- "Extract the table data from this image and format as CSV"
- "Identify all products and their prices in this catalog file"
- "Convert this form into structured JSON data"
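
When you ask for CSV output, the reply is plain text that you can copy out of the chat and load into your own tools. The sketch below assumes the reply contains a header row followed by data rows, possibly wrapped in a Markdown code fence. The function name `parse_csv_reply` and the sample reply are hypothetical, and the real reply format may differ by model.

```python
import csv
import io

FENCE = "`" * 3  # Markdown code-fence marker that may wrap the reply


def parse_csv_reply(reply_text: str) -> list[dict[str, str]]:
    """Parse CSV text copied from a chat reply into a list of row dictionaries."""
    # Drop blank lines and fence lines so only the CSV content remains.
    lines = [
        line
        for line in reply_text.strip().splitlines()
        if line.strip() and not line.strip().startswith(FENCE)
    ]
    reader = csv.DictReader(io.StringIO("\n".join(lines)))
    return [dict(row) for row in reader]


# Hypothetical reply text, for illustration only.
sample_reply = "\n".join([
    FENCE + "csv",
    "product,price",
    "Notebook,4.50",
    '"Pen, blue",1.20',
    FENCE,
])

rows = parse_csv_reply(sample_reply)

if __name__ == "__main__":
    for row in rows:
        print(row["product"], row["price"])
```

Using `csv.DictReader` rather than splitting on commas keeps quoted values such as "Pen, blue" intact. Values are returned as strings, so convert prices to numbers yourself if needed.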

3. Content Analysis
Analyze visual content for:
- Brand compliance
- Design feedback
- Accessibility issues
- Quality assessment
Example Prompts:
- "Check this UI design for accessibility issues"
- "Analyze the color contrast in this image"
- "Provide design feedback on this mockup"
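
If the AI reports specific foreground and background colors, you can cross-check a contrast finding yourself. The sketch below computes the contrast ratio as defined in WCAG 2.x from two sRGB colors. It is an independent check, not a description of how the AI performs its analysis, and the function names are illustrative.

```python
def relative_luminance(rgb: tuple[int, int, int]) -> float:
    """Relative luminance of an 8-bit sRGB color, per the WCAG 2.x definition."""

    def linearize(channel: int) -> float:
        c = channel / 255
        # Piecewise sRGB-to-linear conversion used by WCAG 2.x.
        return c / 12.92 if c <= 0.03928 else ((c + 0.055) / 1.055) ** 2.4

    r, g, b = (linearize(v) for v in rgb)
    return 0.2126 * r + 0.7152 * g + 0.0722 * b


def contrast_ratio(color_a: tuple[int, int, int], color_b: tuple[int, int, int]) -> float:
    """Contrast ratio between two colors, from 1 (identical) to about 21 (black on white)."""
    l1 = relative_luminance(color_a)
    l2 = relative_luminance(color_b)
    lighter, darker = max(l1, l2), min(l1, l2)
    return (lighter + 0.05) / (darker + 0.05)


if __name__ == "__main__":
    # Black text on a white background gives the maximum ratio.
    print(round(contrast_ratio((0, 0, 0), (255, 255, 255)), 2))
```

WCAG 2.x level AA asks for a ratio of at least 4.5:1 for normal-size text and 3:1 for large text, so a value below those thresholds supports an accessibility finding.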

4. Object Detection and Classification
Identify and classify objects in images:
- Product identification
- Scene understanding
- Object counting
- Category classification
Example Prompts:
- "Count the number of people in this image"
- "Identify all products in this store shelf image"
- "What type of vehicle is shown in this image?"