
Flutter AI in 2026: Genkit Dart vs Firebase AI vs Raw LLM SDKs

Three real paths, one opinionated comparison. Feature matrix, cost math, architecture, and the decision tree I actually use.

Ahmed Gagan
18 min read

If you are shipping a Flutter app with AI in 2026, you have three real paths: Google's new Genkit Dart, the Firebase AI Logic SDKs (formerly Vertex AI in Firebase), or a raw LLM SDK like openai_dart, google_generative_ai, or anthropic_sdk_dart. I have shipped production Flutter apps on all three. This is the comparison I wish someone had handed me six months ago.

Short version, if you read nothing else: pick Genkit Dart if you are building a multi-step AI workflow with tool calls and want the same code running on a Dart server and inside Flutter. Pick Firebase AI Logic if you are already deep into Firebase and want zero backend code. Pick a raw SDK if you care about portability, provider-switching, or shipping Anthropic Claude on the Gemini side of the world. The longer version is below with a feature matrix, real cost math, and the provider-agnostic architecture I use on The Flutter Kit.

What actually changed for Flutter AI in 2026

Two things shifted in the first quarter of 2026 and they matter more than most of the hype. First, Google announced Genkit Dart in early 2026, the first full-featured AI framework that treats Dart as a first-class language. That means the same patterns (flows, tools, structured output, RAG) that JavaScript and Go developers have had since 2024 are now available inside Flutter. Second, Firebase rebranded Vertex AI in Firebase to Firebase AI Logic and opened it up to both Gemini and Imagen with a client-side SDK that no longer requires a server proxy.

Meanwhile, the raw-SDK path did not stand still. The community openai_dart package keeps pace with new models, Google ships an official google_generative_ai Dart package, and anthropic_sdk_dart fills in the Claude gap. The result is that Flutter finally has a real spectrum of choices, not just "call the OpenAI API yourself with http."

The three paths in one paragraph each

Before we go deep, here is the 30-second pitch for each so you know what I am comparing.

  • Genkit Dart. Google's open-source AI framework, now with first-class Dart support. You write flows, plug in providers (Gemini, Vertex AI, third parties), and get built-in observability, tool calls, and structured output. Runs on a Dart server and can be called directly from Flutter. Best when your AI logic is a pipeline, not a single prompt.
  • Firebase AI Logic. The rebranded Vertex-AI-in-Firebase client SDK. You add the firebase_ai package to your Flutter app, initialize it, and call Gemini or Imagen directly from the client. Firebase App Check prevents abuse. No backend required. Best when your AI calls are single-turn and you want to ship in a day.
  • Raw LLM SDKs. Packages like openai_dart, google_generative_ai, and anthropic_sdk_dart. You call them from a backend proxy (Cloud Functions, Render, Fly.io) that holds the API key. Most flexibility, most control, most plumbing. Best when you need provider portability or specific models (Claude, Grok, Mistral) that the first two paths do not cover.

Genkit Dart: the pipeline-first choice

Genkit Dart is the newest option and the one I get the most questions about. The short version is that Genkit gives you a structured way to write AI logic as flows. A flow is a typed function that accepts input, runs one or more LLM calls (possibly with tools), and returns structured output. You get built-in tracing, retries, and a visual inspector that runs locally.

The reason Dart matters here is that your Flutter engineers no longer have to context-switch to TypeScript or Go to write the AI backend. A Genkit flow you wrote for your Dart server can share model classes and validation with your Flutter app. That alone saves hours on a team of two, let alone one.

Where Genkit Dart wins, concretely:

  • Multi-step workflows. Summarize, classify, then generate a response in one flow, with observability at each step.
  • Tool calling. Define Dart functions that the model can invoke (look up the user, write to Firestore, fetch a URL). Genkit handles the plumbing.
  • Structured output. Ask for a typed Dart object back, not a JSON string you have to parse and hope for the best.
  • Evaluation. Genkit has first-class evals. You can regression-test your prompts against a dataset just like you would unit-test a function.

The trade-off: you still need a server. Genkit Dart flows are most often deployed as Cloud Functions or a small Dart server. You do not call Genkit directly from a Flutter client the way you can with Firebase AI Logic. If your AI logic is a single prompt with no tools, Genkit is overkill.
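
To make "flows" concrete, here is the shape of a two-step flow. Genkit Dart is new enough that I will not vouch for exact signatures: `defineFlow` and `generate` below are modeled on Genkit's JavaScript conventions and should be read as pseudocode against the real genkit package docs, not as the shipped API.

```dart
// Pseudocode sketch: names modeled on Genkit's JS API, not verified Dart signatures.
import 'package:genkit/genkit.dart'; // hypothetical import

final summarizeAndTag = defineFlow(
  name: 'summarizeAndTag',
  (String article) async {
    // Step 1: summarize. Each generate() call shows up as a traced
    // step in Genkit's local inspector.
    final summary = await generate(
      model: 'googleai/gemini-2.5-flash',
      prompt: 'Summarize in two sentences:\n$article',
    );
    // Step 2: classify the summary. A second model call in the same flow,
    // which is exactly the multi-step case Genkit exists for.
    final tags = await generate(
      model: 'googleai/gemini-2.5-flash',
      prompt: 'List three topic tags for: ${summary.text}',
    );
    return {'summary': summary.text, 'tags': tags.text};
  },
);
```

The point is the shape, not the names: each step is typed, traced, and individually testable, which is what you give up when you inline two model calls into a single handler.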

Firebase AI Logic: the zero-backend choice

Firebase AI Logic is what you reach for when you want an AI feature shipped this afternoon. Add firebase_ai to pubspec.yaml, initialize Firebase, enable the Firebase AI API in the Google Cloud console, and call Gemini directly from the Flutter client. That is it.

The model call, stripped to its essence, looks like this.

// lib/services/firebase_ai_service.dart
import 'package:firebase_ai/firebase_ai.dart';

// Assumes Firebase.initializeApp() has already completed in main().
final model = FirebaseAI.googleAI().generativeModel(
  model: 'gemini-2.0-flash',
);

Future<String> summarize(String input) async {
  final res = await model.generateContent([Content.text(input)]);
  return res.text ?? '';
}

That really is the whole thing for a basic use case. There is no key to store, no proxy to deploy, no CORS headache. What makes this safe to run client-side is Firebase App Check, which uses Play Integrity on Android and DeviceCheck/App Attest on iOS to verify that requests originate from a legitimate build of your app. You turn it on with a few lines in main.dart and most client-side abuse simply stops working.
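
Those "few lines in main.dart" look roughly like this, using the firebase_app_check package. The provider arguments are the documented ones; MyApp stands in for your root widget.

```dart
import 'package:firebase_app_check/firebase_app_check.dart';
import 'package:firebase_core/firebase_core.dart';
import 'package:flutter/widgets.dart';

Future<void> main() async {
  WidgetsFlutterBinding.ensureInitialized();
  await Firebase.initializeApp();
  // Play Integrity on Android, App Attest on iOS. On emulators you will
  // need the debug provider instead, or App Check rejects local requests.
  await FirebaseAppCheck.instance.activate(
    androidProvider: AndroidProvider.playIntegrity,
    appleProvider: AppleProvider.appAttest,
  );
  runApp(const MyApp()); // your root widget
}
```

After this, every firebase_ai request carries an App Check token and requests from tampered or repackaged builds are refused.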

Where Firebase AI Logic wins:

  • Zero backend. No Cloud Functions, no Render, no Fly.io. The model call lives in the app and Firebase handles authentication.
  • Streaming baked in. generateContentStream() gives you a typewriter effect in about three lines.
  • Imagen for images. Same SDK, one method call, and you are generating images with Imagen 3 or 4.
  • Tight Firestore integration. You already have auth, analytics, and storage in Firebase. Adding AI does not change the operational surface.
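
The streaming bullet above, sketched end to end with generateContentStream (error handling omitted for brevity):

```dart
import 'package:firebase_ai/firebase_ai.dart';

final model = FirebaseAI.googleAI().generativeModel(
  model: 'gemini-2.0-flash',
);

// Yields text chunks as they arrive; pipe this into a StreamBuilder
// for the typewriter effect.
Stream<String> streamSummary(String input) async* {
  final stream = model.generateContentStream([Content.text(input)]);
  await for (final chunk in stream) {
    final text = chunk.text;
    if (text != null && text.isNotEmpty) yield text;
  }
}
```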

The trade-off: you are locked to Gemini and Imagen. If your product needs GPT-4o, Claude, or Mistral, Firebase AI Logic is the wrong tool. You also cannot do complex multi-step workflows the way you can in Genkit. It is a single-turn client SDK.

Raw SDKs: the portability choice

The raw-SDK path is the one I pick when portability matters. Three packages do most of the work in 2026, and each is maintained well enough to ship production apps on top of.

| Package | Providers | Maintainer | Highlights |
| --- | --- | --- | --- |
| openai_dart | OpenAI, Azure OpenAI, any OpenAI-compatible endpoint | Community (davidmigloz) | Full coverage of chat, embeddings, images, vision, assistants, batches. Works with Together, Groq, Ollama. |
| google_generative_ai | Gemini via Google AI Studio | Google (official) | Simplest way to call Gemini from a Dart server without Firebase. Supports streaming and tools. |
| anthropic_sdk_dart | Anthropic Claude | Community | Claude 3.5 and 4.x support with streaming, tool use, vision. The only way to ship Claude in Dart today. |

The pattern is always the same. You install the SDK on your backend (Dart Frog, Shelf, or Cloud Functions with a Node or Dart runtime), hold the API key in an environment variable or Secret Manager, expose an authenticated endpoint, and call that endpoint from Flutter with a normal HTTP request.
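
As a sketch of that pattern, here is a minimal Shelf proxy holding the key server-side. The endpoint shape and JSON field names are mine; the openai_dart calls follow that package's documented client, but verify signatures against its README before shipping.

```dart
import 'dart:convert';
import 'dart:io';

import 'package:openai_dart/openai_dart.dart';
import 'package:shelf/shelf.dart';
import 'package:shelf/shelf_io.dart' as shelf_io;

Future<void> main() async {
  // The key never ships in the app binary; it lives in the server env.
  final client = OpenAIClient(apiKey: Platform.environment['OPENAI_API_KEY']!);

  Future<Response> handler(Request request) async {
    if (request.url.path != 'chat') return Response.notFound('not found');
    final body =
        jsonDecode(await request.readAsString()) as Map<String, dynamic>;
    final res = await client.createChatCompletion(
      request: CreateChatCompletionRequest(
        model: const ChatCompletionModel.modelId('gpt-4o-mini'),
        messages: [
          ChatCompletionMessage.user(
            content: ChatCompletionUserMessageContent.string(
              body['prompt'] as String,
            ),
          ),
        ],
      ),
    );
    return Response.ok(
      jsonEncode({'reply': res.choices.first.message.content}),
      headers: {'content-type': 'application/json'},
    );
  }

  await shelf_io.serve(handler, '0.0.0.0', 8080);
}
```

In production you would put auth middleware and rate limiting in front of the handler; this shows only the key-isolation part of the pattern.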

Where raw SDKs win:

  • Provider switching. Want to A/B test Gemini against Claude against GPT-4o? Swap the service implementation behind the same interface.
  • Model variety. Grok, Mistral, DeepSeek, Llama 3 on Groq, local models via Ollama: all reachable through an OpenAI-compatible SDK.
  • Cost control. You choose exactly which model runs which request, route cheap traffic to Flash or Haiku, expensive traffic to Sonnet or GPT-4o.
  • No platform lock-in. Your AI layer is portable across cloud providers. Leaving Firebase does not mean rewriting your AI code.

The trade-off: you own the plumbing. Authentication, rate limiting, error handling, cost tracking, streaming, abuse prevention. All of it is yours to build. This is the path I default to for anything that might need to ship multiple providers, but it is the most work.

Feature matrix: the honest comparison

Here is how the three paths stack up on the questions that actually decide what you should pick. No scoring system, just what works and what does not.

| Capability | Genkit Dart | Firebase AI Logic | Raw SDKs |
| --- | --- | --- | --- |
| Call directly from Flutter client | No (needs server) | Yes | No (needs proxy) |
| Streaming responses | Yes | Yes | Yes (via SSE) |
| Tool calling | Yes, first-class | Yes | Yes, per SDK |
| Structured output (typed Dart) | Yes, strongest | Yes, via JSON schema | Manual parsing |
| Multiple models in one call | Yes, trivial | No | Yes, manual |
| Image generation | Yes (via Imagen / DALL-E plugins) | Yes (Imagen) | Yes (DALL-E, Imagen, SD) |
| Vision (image input) | Yes | Yes | Yes |
| RAG / embeddings | Yes, built-in | Partial | Yes, manual |
| Evals / prompt testing | Yes, built-in | No | No (BYO) |
| Tracing / observability | Yes, built-in | Basic (Cloud Logging) | DIY |
| Works with Claude | Yes (via plugin) | No | Yes (anthropic_sdk_dart) |
| Abuse protection | Your server + App Check | App Check (built-in) | Your server + App Check |
| Time to first working call | 1 to 2 hours | 15 minutes | 2 to 4 hours |
| Lock-in risk | Low (framework-level) | High (Firebase only) | Lowest |

Cost math at real scale

Pricing numbers below assume a typical chat turn of 300 input tokens and 300 output tokens. I am using published 2026 list prices. Your mileage will vary based on caching, batching, and actual model choice, but the relative ordering is stable.

| Model (via path) | Input / 1M tokens | Output / 1M tokens | Cost per chat turn | Monthly at 10k DAU × 5 turns/day |
| --- | --- | --- | --- | --- |
| Gemini 2.5 Flash (Firebase AI Logic) | $0.10 | $0.40 | $0.00015 | ~$225 |
| Gemini 2.5 Pro (Firebase AI Logic or Genkit) | $1.25 | $10.00 | $0.00338 | ~$5,070 |
| GPT-4o (raw openai_dart) | $2.50 | $10.00 | $0.00375 | ~$5,625 |
| GPT-4o mini (raw openai_dart) | $0.15 | $0.60 | $0.000225 | ~$340 |
| Claude Sonnet 4 (anthropic_sdk_dart) | $3.00 | $15.00 | $0.0054 | ~$8,100 |
| Claude Haiku 4 (anthropic_sdk_dart) | $0.25 | $1.25 | $0.00045 | ~$675 |

Monthly figures assume 30 days, i.e. 10k × 5 × 30 = 1.5M turns a month.

Two practical takeaways. First, the flash tier of every provider is roughly the same price and is where most of your volume should go. GPT-4o mini, Gemini Flash, and Claude Haiku are all in the same order of magnitude and any of them can serve a well-scoped chat feature at 10k DAU for under $500 a month. Second, the pro tier gets expensive fast. Running Gemini 2.5 Pro or GPT-4o on every message is a $5,000-plus monthly line item at indie scale, so route pro models to specific high-value turns (first message, complex queries) and fall back to flash tiers otherwise.
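
That routing advice fits in a few lines of plain Dart. The heuristic and model ids below are illustrative, not a recommendation; the cost helper reproduces the table's 300-in / 300-out arithmetic.

```dart
// Route high-value turns to a pro model, bulk traffic to a flash model.
// The heuristic (first turn, or a long prompt) is illustrative only.
String pickModel({required int turnIndex, required String prompt}) {
  final highValue = turnIndex == 0 || prompt.length > 500;
  return highValue ? 'gemini-2.5-pro' : 'gemini-2.5-flash';
}

// Per-turn cost from list prices in dollars per 1M tokens, matching the
// 300-input / 300-output assumption used in the table above.
double costPerTurn(double inputPerM, double outputPerM,
    {int inputTokens = 300, int outputTokens = 300}) {
  return inputTokens * inputPerM / 1e6 + outputTokens * outputPerM / 1e6;
}
```

For example, `costPerTurn(0.10, 0.40)` reproduces (to floating-point precision) the $0.00015 Gemini Flash figure from the table.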

The architecture I actually ship

Regardless of which path you choose, there is one pattern that has saved me from every provider migration I have ever done: put every LLM call behind a repository interface. Your BLoC, Riverpod, or feature code never imports the AI package directly. It imports an AiRepository abstraction and the concrete implementation lives in a data/ folder. Switching from Firebase AI Logic to Claude becomes a swap of the repository binding, not a rewrite of your UI.

Here is the shape in Dart.

// lib/features/ai/domain/ai_repository.dart
typedef FromJson<T> = T Function(Map<String, dynamic> json);

abstract class AiRepository {
  Stream<String> streamReply(List<ChatMessage> messages);
  Future<GeneratedImage> generateImage(String prompt);
  Future<T> generateStructured<T>(String prompt, FromJson<T> decoder);
}

// lib/features/ai/data/firebase_ai_repository.dart
class FirebaseAiRepository implements AiRepository {
  FirebaseAiRepository(this._model);
  final GenerativeModel _model;

  @override
  Stream<String> streamReply(List<ChatMessage> messages) async* {
    final contents = messages.map((m) => Content.text(m.text)).toList();
    await for (final chunk in _model.generateContentStream(contents)) {
      final text = chunk.text;
      if (text != null && text.isNotEmpty) yield text;
    }
  }
  // ...
}

Your BLoC consumes AiRepository. Your DI container picks the concrete class. A second implementation (say, OpenAiRepository) can ship alongside the first for A/B testing, graceful fallback, or a slow migration. This is the exact pattern The Flutter Kit uses, and it is the single thing I would not compromise on in any production AI Flutter app.
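
Binding the concrete class is then one decision at startup. A minimal sketch with a plain factory function; the flag and the OpenAiRepository name are hypothetical, and a DI package like get_it or Riverpod expresses the same choice.

```dart
// OpenAiRepository is a hypothetical second implementation of AiRepository
// that talks to your backend proxy instead of Firebase AI Logic.
AiRepository buildAiRepository({required bool useOpenAi}) {
  if (useOpenAi) return OpenAiRepository();
  return FirebaseAiRepository(
    FirebaseAI.googleAI().generativeModel(model: 'gemini-2.0-flash'),
  );
}
```

Drive `useOpenAi` from a remote-config flag and you have an A/B test or a kill switch without a new release of the UI code.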

The decision tree I actually use

When someone asks me which path to take, the conversation is usually over in three questions. Here they are, in the order I ask them.

  1. Do you need a model that is not Gemini or Imagen? If yes (GPT-4o, Claude, Mistral, Grok, a local model), Firebase AI Logic is out. Go raw SDK, or Genkit with a plugin.
  2. Is your AI logic one call, or a pipeline? One call with optional tools, stay with Firebase AI Logic or a raw SDK. A real pipeline (summarize, classify, generate), go Genkit.
  3. How much backend do you want to own? None at all, Firebase AI Logic. Willing to run one Cloud Function or a tiny Dart server, everything else is on the table.

The most common answers in practice:

  • Indie consumer app, Gemini is fine, single-turn chat. Firebase AI Logic. Ship today, optimize later.
  • Indie app, needs Claude or GPT-4o, medium complexity. Raw SDK behind a repository, Cloud Functions proxy. This is what most The Flutter Kit users land on.
  • Team of two or more, multi-step agentic feature, observability matters. Genkit Dart on a small server. The eval and trace tooling alone is worth it.

Switching providers later: easier than you think

The most common objection I hear is "what if I pick wrong?" If you follow the repository pattern above, migrating between any two of these paths is a weekend of work, not a rewrite. You create a new repository implementation, flip a flag in your DI container, and ship a release. The UI never changes, because your chat widget only ever knew about an abstract AiRepository.

A realistic migration ladder for a growing product looks like this. Start with Firebase AI Logic because it is the fastest path to a working app. As soon as you need a second provider or a pipeline, add a raw SDK implementation behind the same repository. If your AI feature grows teeth (multi-step, tool calls, evaluations), port the server-side logic to Genkit Dart and keep the same client. You are never stuck.

What The Flutter Kit ships out of the box

Every pattern in this post is pre-wired in The Flutter Kit. The AI module ships with an AiRepository interface, a Firebase AI Logic implementation for zero-backend apps, an OpenAI implementation (with a Cloud Functions proxy) for when you need GPT-4o or Claude, streaming chat UI, DALL-E and Imagen image generation, GPT-4 Vision support, rate limiting, and RevenueCat subscription gating on expensive endpoints. You can switch providers with a single line in the config, or run two in parallel for A/B testing.

It is the provider-agnostic Flutter AI foundation I wish existed when I was building my first AI Flutter app in 2024. One-time $69 for unlimited commercial projects. See every integration on the features page, or skip ahead to checkout.

Final recommendation

There is no universally right answer, but there is a right default. If you are a solo indie developer starting a new AI Flutter app in 2026, begin with Firebase AI Logic behind a repository interface. You will ship in a day. When your product outgrows Gemini or single-turn prompts, extend the repository with a raw SDK or a Genkit flow. You get the speed of the easy path now and the flexibility of the hard path later, without choosing between them.

The worst outcome is picking any of these and letting the SDK leak into your UI code. Put every LLM call behind an interface, and the decision you make this week never has to be the decision you live with forever.


Ready to ship your Flutter app faster?

The Flutter Kit gives you a production-ready Flutter codebase with onboarding, paywalls, auth, AI integrations, and more. Stop building boilerplate. Start building your product.

Get The Flutter Kit