LLM modes

Which model runs my task?

Natoify picks a model per step based on the job. You can override it in Settings if you've added your own OpenAI or Anthropic key.

Gemini 2.5 Flash

Default

Use for: Quick chat, intake, classification, light planning.

Fastest and cheapest. Used by the New Task chat and short tool calls.

Gemini 2.5 Pro

Heavy lifting

Use for: Multi-step workflows, long-context research, multimodal analysis.

Best price-per-token for big context and image-text reasoning.

GPT-5

Precision

Use for: Tasks where nuance and tone matter — writing, sensitive emails, legal-flavoured drafts.

Strongest reasoning and writing in our gateway. More expensive — used selectively.

GPT-5 Mini

Balanced

Use for: High-volume drafting where you want GPT-quality at a fraction of the cost.

Keeps most of GPT-5's strengths at roughly a quarter of the price.

Gemini 3 Flash Image

Visual

Use for: Generating screenshots, charts, social images on the fly.

Fast image generation built into the same gateway — no separate vendor.

Anthropic Claude (Sonnet, Opus) and Perplexity are available via BYOK in Settings.