LLM modes
Which model runs my task?
Natoify picks a model for each step based on the job. You can override that choice in Settings if you've added your own OpenAI or Anthropic key.
Gemini 2.5 Flash
Default
Use for: Quick chat, intake, classification, light planning.
Fastest and cheapest. Used by the New Task chat and short tool calls.
Gemini 2.5 Pro
Heavy lifting
Use for: Multi-step workflows, long-context research, multimodal analysis.
Best price-per-token for big context and image-text reasoning.
GPT-5
Precision
Use for: Tasks where nuance and tone matter, such as writing, sensitive emails, and legal-flavoured drafts.
Strongest reasoning and writing in our gateway. More expensive, so it is used selectively.
GPT-5 Mini
Balanced
Use for: High-volume drafting where you want near-GPT-5 quality at a fraction of the cost.
Keeps most of GPT-5's strengths at roughly a quarter of the price.
Gemini 3 Flash Image
Visual
Use for: Generating screenshots, charts, and social images on the fly.
Fast image generation built into the same gateway, with no separate vendor.
Anthropic Claude (Sonnet, Opus) and Perplexity are available via bring-your-own-key (BYOK) in Settings.
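
To picture how per-step model selection with an override could work, here is a minimal sketch in Python. It is not Natoify's actual routing code: the task-kind labels, the `ROUTES` table, and the `pick_model` helper are all hypothetical, and the mapping is only inferred from the "Use for" lines above.

```python
# Hypothetical routing table inferred from the model cards above.
# The task-kind labels and pick_model() are illustrative, not a Natoify API.
ROUTES = {
    "chat": "Gemini 2.5 Flash",
    "intake": "Gemini 2.5 Flash",
    "classification": "Gemini 2.5 Flash",
    "light_planning": "Gemini 2.5 Flash",
    "multi_step_workflow": "Gemini 2.5 Pro",
    "long_context_research": "Gemini 2.5 Pro",
    "multimodal_analysis": "Gemini 2.5 Pro",
    "nuanced_writing": "GPT-5",
    "bulk_drafting": "GPT-5 Mini",
    "image_generation": "Gemini 3 Flash Image",
}

# Assumption: unknown task kinds fall back to the model tagged "Default".
DEFAULT_MODEL = "Gemini 2.5 Flash"


def pick_model(task_kind, override=None):
    """Return a model name for one step.

    An explicit override (for example, a model chosen in Settings) wins;
    otherwise the task kind is looked up, falling back to the default.
    """
    if override:
        return override
    return ROUTES.get(task_kind, DEFAULT_MODEL)
```

For example, `pick_model("bulk_drafting")` selects GPT-5 Mini, while `pick_model("bulk_drafting", override="Claude Sonnet")` returns the override unchanged. A flat lookup table keeps the sketch readable; real routing may weigh context length, cost, or other signals that are not described here.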