
# Google Gemini

### 1. The "Ready-to-Use" AI (Overview)

Zero setup, zero API keys. Just drag, drop, and start prompting using Diaflow's built-in AI credits.

Gemini’s superpower is multimodal work. It can understand text, images, documents, audio, and video in one node, which makes it ideal for workflows that need one AI brain to handle many content types.

### 2. How to Add the Node

Click on your canvas, search for **Google Gemini**, and click to add it to your workflow.

<figure><img src="/files/bAwLr1dN94rJyDzwtX6v" alt=""><figcaption></figcaption></figure>

### 3. What can you automate? (The "Wow" Use Cases)

#### 1. **The Massive Document Analyzer:**

Upload large contracts, proposals, reports, or policy files and let Gemini turn them into fast business summaries, key risks, and next-step actions.

#### 2. **The Multimodal Scanner:**

Pass image links for messy invoices, receipts, or scanned paperwork and let Gemini pull out totals, vendor names, and important details.

#### 3. **The Content/Data Factory:**

Send unstructured notes, forms, or uploaded content into Gemini and turn them into clean business data, internal summaries, or ready-to-use marketing copy.

### 4. Step-by-Step: Configuration & Smart Data Injection

#### 1. **Choose your Model:**

Select your preferred model from the dropdown.

Use **Flash** or **Flash-Lite** for fast, lower-cost tasks. Use **Pro** for deeper reasoning and more complex work.

#### 2. **Write the Prompt & Inject Data:**

The easiest way to add live data is to type `@` on your keyboard. Diaflow opens a visual dropdown so you can pick values from earlier steps, like a customer name or uploaded content.

Typing `{{` also works perfectly if you prefer the classic bracket syntax.
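To make the bracket syntax concrete, here is a minimal Python sketch of how a `{{node_name.field}}` placeholder could be resolved against the outputs of earlier steps. It is an illustration only: `resolve_prompt`, the `context` dictionary, and the node names `form` and `upload` are hypothetical, and Diaflow performs this substitution for you inside the editor.

```python
import re

# Matches {{ node_name.field }} with optional spaces inside the braces.
PLACEHOLDER = re.compile(r"\{\{\s*([A-Za-z0-9_]+)\.([A-Za-z0-9_]+)\s*\}\}")


def resolve_prompt(template: str, context: dict) -> str:
    """Replace each {{node.field}} with context[node][field].

    Unknown placeholders are left untouched so they are easy to spot.
    """
    def substitute(match):
        node, field = match.group(1), match.group(2)
        try:
            return str(context[node][field])
        except KeyError:
            return match.group(0)

    return PLACEHOLDER.sub(substitute, template)


# Hypothetical outputs from earlier steps in a workflow.
context = {
    "form": {"customer_name": "Ada"},
    "upload": {"data": "Invoice total: 120 USD"},
}
prompt = resolve_prompt(
    "Write a short reply to {{form.customer_name}} about: {{upload.data}}",
    context,
)
print(prompt)
```

Leaving unresolved placeholders in place, rather than replacing them with an empty string, is a design choice in this sketch: a stray `{{...}}` in the final prompt is a visible hint that a variable name is wrong.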

#### 3. **Attach Media (If applicable):**

If your workflow includes files, images, audio, or video, add their links in the file field so Gemini can review them as part of the same step.

<figure><img src="/files/C2I6zEG5OCxDY8B42qY7" alt=""><figcaption></figcaption></figure>

### 5. The Output: Seeing Your Results

This node returns the generated result in its `data` field. Reference it as `{{node_name.data}}` in a later step, such as Gmail or Google Sheets, to pass the generated text or media forward.

The node also returns the model used and the final resolved prompt, which helps you review what ran.

You will also see `input_token` and `output_token`. Since Diaflow handles the billing, these work like a meter so you can track how many credits your workflow is consuming.
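If you export run outputs and want a quick usage tally, the sketch below adds up the two token fields across several runs. The `data`, `input_token`, and `output_token` names come from this page; the `total_tokens` helper and the sample values are hypothetical, and the sketch makes no assumption about how tokens convert into credits.

```python
def total_tokens(runs: list) -> dict:
    """Sum input_token and output_token across a list of run outputs."""
    totals = {"input_token": 0, "output_token": 0}
    for run in runs:
        # Missing fields count as zero so partial records do not break the tally.
        totals["input_token"] += int(run.get("input_token", 0))
        totals["output_token"] += int(run.get("output_token", 0))
    totals["total"] = totals["input_token"] + totals["output_token"]
    return totals


# Hypothetical outputs from two runs of the node.
runs = [
    {"data": "Summary A", "input_token": 1200, "output_token": 300},
    {"data": "Summary B", "input_token": 800, "output_token": 150},
]
usage = total_tokens(runs)
print(usage)
```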

<figure><img src="/files/seDcW4yCbWdvJNfS2Qy1" alt=""><figcaption></figcaption></figure>

### 6. Golden Rules & Guardrails (CRITICAL PRO-TIPS)

#### **File/Image Limits:**

Gemini supports common image, document, audio, and video formats. If a file type is not supported, the run fails immediately.

Supported file types include:

* images such as `jpg`, `jpeg`, `png`, `gif`, and `webp`
* documents such as `pdf`, `txt`, `csv`, `xlsx`, `xls`, and `docx`
* audio and video formats such as `mp3`, `wav`, `aac`, `mp4`, `mov`, and `webm`
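Because an unsupported file type fails the run, it can help to check links before they reach the node. The following Python sketch tests a link's extension against the formats listed above. The `is_supported` helper and the example URLs are hypothetical, and the list on this page may not be exhaustive, so treat a negative result as a warning rather than a verdict.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Extensions copied from the list above.
SUPPORTED_EXTENSIONS = {
    "jpg", "jpeg", "png", "gif", "webp",          # images
    "pdf", "txt", "csv", "xlsx", "xls", "docx",   # documents
    "mp3", "wav", "aac", "mp4", "mov", "webm",    # audio and video
}


def is_supported(file_url: str) -> bool:
    """Return True if the URL path ends in one of the listed extensions."""
    path = urlparse(file_url).path  # drops any ?query or #fragment
    extension = PurePosixPath(path).suffix.lstrip(".").lower()
    return extension in SUPPORTED_EXTENSIONS


# Hypothetical file links.
links = [
    "https://example.com/files/invoice.PDF",
    "https://example.com/files/logo.svg",
    "https://example.com/files/call.mp3?download=1",
]
unsupported = [link for link in links if not is_supported(link)]
print(unsupported)
```

The check looks only at the file name in the link. A link without an extension, or one whose extension does not match the real content, needs a different check.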

#### **The System Prompt Trap:**

The separate system instruction field is currently not applied in this node. To make sure your persona or system instructions take effect, put them directly into the main Prompt box.
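One way to follow this rule is to place the persona text ahead of the task inside a single prompt string. The Python sketch below shows the idea. The `build_prompt` helper and the `Instructions:` and `Task:` labels are hypothetical conventions, not something the node requires.

```python
def build_prompt(persona: str, task: str) -> str:
    """Place persona/system instructions ahead of the task in one prompt."""
    sections = []
    if persona.strip():
        sections.append("Instructions:\n" + persona.strip())
    sections.append("Task:\n" + task.strip())
    return "\n\n".join(sections)


prompt = build_prompt(
    persona="You are a concise legal analyst. Answer in bullet points.",
    task="Summarize the key risks in the attached contract.",
)
print(prompt)
```

The resulting text is what you would paste or assemble in the main Prompt box, with any `@` or `{{` variables placed in the task part.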

#### **Auto-Streaming:**

This node does not stream partial output. The model finishes generating first, and the node then returns the complete answer in one piece.

#### More pro-tips

* If you choose an older Gemini model name, Diaflow may run **Gemini 2.5 Flash** instead without notifying you.
* This node does not keep conversation memory between turns.
* If the response is blocked for safety reasons, revise the input and try again.
* If the reply stops early, increase the allowed output length or simplify the request.
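Since the node keeps no memory between turns, any earlier context has to travel inside the prompt itself. The Python sketch below folds the most recent turns into a new prompt. The `prompt_with_history` helper, the `(role, text)` tuple format, and the `max_turns` cutoff are all hypothetical; where the history is stored is up to your workflow.

```python
def prompt_with_history(history: list, new_message: str, max_turns: int = 5) -> str:
    """Prefix the new message with the most recent turns of a conversation.

    history is a list of (role, text) tuples, oldest first.
    """
    # Slicing with -0 would return the whole list, so handle 0 explicitly.
    recent = history[-max_turns:] if max_turns > 0 else []
    lines = [f"{role}: {text}" for role, text in recent]
    lines.append(f"User: {new_message}")
    return "\n".join(lines)


# Hypothetical earlier turns saved by a previous step.
history = [
    ("User", "Summarize the attached contract."),
    ("Assistant", "The contract covers a 12-month service term."),
]
prompt = prompt_with_history(history, "What are the termination terms?")
print(prompt)
```

Capping the number of turns keeps the prompt short, which also leaves more room for the reply when output length is a concern.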

### Need help?

* Learn the basics in [How a node works](/getting-started/lets-start-with-the-basics/how-a-node-works.md)
* Browse available nodes in [Component List](/workflow-builder/component-list.md)
* Build the full flow in [Create a workflow](/workflow-builder/create-a-workflow.md)

