---
title: AI
description: Run AI models on Cloudflare's global network using Workers AI, AI Gateway, and other integrated AI products.
image: https://developers.cloudflare.com/dev-products-preview.png
---

> Documentation Index  
> Fetch the complete documentation index at: https://developers.cloudflare.com/ai/llms.txt  
> Use this file to discover all available pages before exploring further. 

# AI

Run AI models on Cloudflare's global network.

 Available on all plans 

Cloudflare AI provides a unified platform for running AI models, whether hosted on Cloudflare infrastructure (Workers AI) or proxied through AI Gateway to external providers.
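As a rough illustration of the Cloudflare-hosted path, the sketch below builds (but does not send) a request for the Workers AI REST endpoint. `ACCOUNT_ID` and `API_TOKEN` are placeholders, `buildWorkersAiRequest` is a hypothetical helper name, and the `messages` body shape is an assumption that fits chat-style text generation models; check the individual model page for the exact input schema.

```typescript
// Sketch: prepare a Workers AI REST request without sending it.
// ACCOUNT_ID and API_TOKEN are placeholders, not real credentials.

interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface PreparedRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Hypothetical helper: assembles the URL, headers, and JSON body.
function buildWorkersAiRequest(
  accountId: string,
  apiToken: string,
  model: string,
  messages: ChatMessage[],
): PreparedRequest {
  // The model ID (for example "@cf/meta/...") is appended to the path as-is.
  const url = `https://api.cloudflare.com/client/v4/accounts/${accountId}/ai/run/${model}`;
  return {
    url,
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiToken}`,
        "Content-Type": "application/json",
      },
      // Assumed input shape for chat-style models.
      body: JSON.stringify({ messages }),
    },
  };
}

const prepared = buildWorkersAiRequest(
  "ACCOUNT_ID",
  "API_TOKEN",
  "@cf/meta/llama-4-scout-17b-16e-instruct",
  [{ role: "user", content: "Summarize what Workers AI does in one sentence." }],
);

console.log(prepared.url);
// To send it: const response = await fetch(prepared.url, prepared.init);
```

Separating request construction from the `fetch` call keeps the example runnable without credentials and makes the URL and body easy to inspect.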

## Get started

### Models

Explore all AI models available through Cloudflare, including hosted models on Workers AI and external providers through AI Gateway.

[ Browse models ](https://developers.cloudflare.com/ai/models/) 
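In the catalog entries in this document, Cloudflare-hosted models link to IDs that begin with `@cf/`, while third-party entries use a plain `provider/model` ID. The sketch below encodes that observation; it is a pattern read off the listing, not a documented guarantee, and `hostingFor` and `modelPageUrl` are hypothetical helper names.

```typescript
// Sketch: classify a catalog model ID and build its documentation URL.
// The "@cf/" prefix rule is inferred from the catalog listing, not from an official spec.

const MODELS_BASE = "https://developers.cloudflare.com/ai/models/";

type Hosting = "cloudflare-hosted" | "third-party";

// Hypothetical helper: IDs starting with "@cf/" are treated as Workers AI hosted.
function hostingFor(modelId: string): Hosting {
  return modelId.startsWith("@cf/") ? "cloudflare-hosted" : "third-party";
}

// Hypothetical helper: model pages live under the catalog base URL with a trailing slash.
function modelPageUrl(modelId: string): string {
  return `${MODELS_BASE}${modelId}/`;
}

console.log(hostingFor("@cf/openai/gpt-oss-120b")); // cloudflare-hosted
console.log(hostingFor("openai/gpt-5.5")); // third-party
console.log(modelPageUrl("@cf/zai-org/glm-5.2"));
```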

## Related products

**[Workers AI](https://developers.cloudflare.com/workers-ai/)** 

Run machine learning models, powered by serverless GPUs, on Cloudflare's global network.

**[AI Gateway](https://developers.cloudflare.com/ai-gateway/)** 

Observe and control your AI applications with caching, rate limiting, request retries, model fallback, and more.

**[Vectorize](https://developers.cloudflare.com/vectorize/)** 

Build full-stack AI applications with Vectorize, Cloudflare's vector database.

**[Agents](https://developers.cloudflare.com/agents/)** 

Build AI-powered agents to perform tasks, persist state, and interact with external services.

**[AI Search](https://developers.cloudflare.com/ai-search/)** 

Create fully managed RAG pipelines for your AI applications.

**[AI Crawl Control](https://developers.cloudflare.com/ai-crawl-control/)** 

Analyze and control third-party AI crawlers on your website.

**[Browser Rendering](https://developers.cloudflare.com/browser-run/)** 

Control and interact with headless browser instances for AI data extraction.

**[Cloudflare Agent](https://developers.cloudflare.com/cloudflare-agent/)** 

An AI-powered assistant that helps you navigate, configure, and manage Cloudflare.

**[Dynamic Workers](https://developers.cloudflare.com/dynamic-workers/)** 

Spin up isolated Workers on demand to execute code.

**[Sandbox SDK](https://developers.cloudflare.com/sandbox/)** 

Build secure, isolated code execution environments.

```json
{"@context":"https://schema.org","@type":"WebPage","@id":"https://developers.cloudflare.com/ai/#page","headline":"Overview · Cloudflare AI docs","description":"Run AI models on Cloudflare's global network using Workers AI, AI Gateway, and other integrated AI products.","url":"https://developers.cloudflare.com/ai/","inLanguage":"en","image":"https://developers.cloudflare.com/dev-products-preview.png","dateModified":"2026-05-18","publisher":{"@type":"Organization","name":"Cloudflare","url":"https://www.cloudflare.com/"},"isPartOf":{"@type":"WebSite","@id":"https://developers.cloudflare.com/#website","name":"Cloudflare Docs","url":"https://developers.cloudflare.com/"},"keywords":["AI"]}
{"@context":"https://schema.org","@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"item":{"@id":"/directory/","name":"Directory"}},{"@type":"ListItem","position":2,"item":{"@id":"/ai/","name":"AI"}}]}
```

---

---
title: Models
image: https://developers.cloudflare.com/dev-products-preview.png
---


# Models

We found 199 models

📌 **[kimi-k2.7-code](https://developers.cloudflare.com/ai/models/@cf/moonshotai/kimi-k2.7-code/)**

Text Generation • Moonshot AI. Kimi K2.7 is a frontier-scale open-source 1T parameter model with a 262.1k context window, multi-turn tool calling, vision inputs, and structured outputs for agentic workloads. Tags: Cloudflare-hosted, Function calling, Reasoning, Vision.

📌 **[glm-4.7-flash](https://developers.cloudflare.com/ai/models/@cf/zai-org/glm-4.7-flash/)**

Text Generation • Zhipu AI. GLM-4.7-Flash is a fast and efficient multilingual text generation model with a 131,072 token context window. Optimized for dialogue, instruction-following, and multi-turn tool calling across 100+ languages. Tags: Cloudflare-hosted, Function calling, Reasoning.

📌 **[gpt-oss-120b](https://developers.cloudflare.com/ai/models/@cf/openai/gpt-oss-120b/)**

Text Generation • OpenAI. OpenAI's open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases; gpt-oss-120b is for production, general purpose, high reasoning use-cases. Tags: Cloudflare-hosted, Function calling, Reasoning.

📌 **[llama-4-scout-17b-16e-instruct](https://developers.cloudflare.com/ai/models/@cf/meta/llama-4-scout-17b-16e-instruct/)**

Text Generation • Meta. Meta's Llama 4 Scout is a 17 billion parameter model with 16 experts that is natively multimodal. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding. Tags: Cloudflare-hosted, Batch, Function calling, Vision.

**[hh1.1-r2v](https://developers.cloudflare.com/ai/models/alibaba/hh1.1-r2v/)**

Image-to-Video • Alibaba. Alibaba's HappyHorse 1.1 reference-to-video model. Takes 1-9 reference images (characters and scenes) and a prompt that choreographs them into a single video, keeping each subject's identity consistent. Supports 720P and 1080P output with durations from 3 to 15 seconds. Tags: Third-party.

**[hh1.1-i2v](https://developers.cloudflare.com/ai/models/alibaba/hh1.1-i2v/)**

Image-to-Video • Alibaba. Alibaba's HappyHorse 1.1 image-to-video model. Animates a reference image with an optional text prompt, with smoother motion, natural skin textures, and improved close-up quality over 1.0. Supports 720P and 1080P output with durations from 3 to 15 seconds. Tags: Third-party.

**[hh1.1-t2v](https://developers.cloudflare.com/ai/models/alibaba/hh1.1-t2v/)**

Text-to-Video • Alibaba. Alibaba's HappyHorse 1.1 text-to-video model. Generates videos from a text prompt with stronger dynamic expressiveness, better visual quality, and improved instruction following over 1.0. Configurable resolution, aspect ratio, and duration (3-15s). Tags: Third-party.

**[p-video](https://developers.cloudflare.com/ai/models/pruna/p-video/)**

Text-to-Video • Pruna AI. Pruna's P-Video is a premium video generation model supporting text-to-video, image-to-video, and audio-conditioned generation up to 1080p at 24 or 48 fps, with configurable duration up to 20 seconds. Tags: Third-party.

**[p-video-animate](https://developers.cloudflare.com/ai/models/pruna/p-video-animate/)**

Image-to-Video • Pruna AI. Pruna's P-Video-Animate takes a source video and a subject reference image, then animates the referenced subject using the motion and audio from the source video. Tags: Third-party.

**[p-video-replace](https://developers.cloudflare.com/ai/models/pruna/p-video-replace/)**

Image-to-Video • Pruna AI. Pruna's P-Video-Replace takes a source video and one or more identity reference images, then places the referenced person or people into the video while preserving the source motion and audio. Tags: Third-party.

**[p-video-avatar](https://developers.cloudflare.com/ai/models/pruna/p-video-avatar/)**

Image-to-Video • Pruna AI. Pruna's P-Video-Avatar generates talking-head videos from a single portrait image driven by a text script or audio file, with multiple voices, languages, and output resolutions. Tags: Third-party.

**[p-image-try-on](https://developers.cloudflare.com/ai/models/pruna/p-image-try-on/)**

Image-to-Image • Pruna AI. Pruna's P-Image Try-On virtually fits one or more garments onto a person's photo. Provide a photo of a person plus garment reference images and the model realistically dresses the person in the provided garments. Tags: Third-party.

**[p-image-upscale](https://developers.cloudflare.com/ai/models/pruna/p-image-upscale/)**

Image-to-Image • Pruna AI. Pruna's P-Image-Upscale increases image resolution using AI, targeting 1-128 megapixels with optional detail and realism enhancement for sharper, cleaner results. Tags: Third-party.

**[p-image-edit](https://developers.cloudflare.com/ai/models/pruna/p-image-edit/)**

Image-to-Image • Pruna AI. Pruna's P-Image-Edit edits and composes 1-5 reference images with text instructions. It supports complex compositions, style transfers, and targeted edits with flexible output aspect ratios. Tags: Third-party.

**[p-image](https://developers.cloudflare.com/ai/models/pruna/p-image/)**

Text-to-Image • Pruna AI. Pruna's P-Image is an ultra-fast text-to-image model with automatic prompt enhancement and 2-stage refinement, combining exceptional speed with high-quality output and flexible aspect ratios. Tags: Third-party.

**[gemini-3.5-flash](https://developers.cloudflare.com/ai/models/google/gemini-3.5-flash/)**

Text Generation • Google. Gemini 3.5 Flash is Google's fast multimodal model with frontier intelligence, superior search, and grounding capabilities. Tags: Third-party.

**[krea-2-large](https://developers.cloudflare.com/ai/models/krea/krea-2-large/)**

Text-to-Image • krea. More than 2x the size of Medium, with softer post-training. Outputs are rawer, more textured, and more flexible; at its best, Large produces results Medium can't match. Strongest on photorealism, raw looks (motion blur, grain, low dynamic range), and expressive and artistic styles. Tags: Third-party.

**[krea-2-medium](https://developers.cloudflare.com/ai/models/krea/krea-2-medium/)**

Text-to-Image • krea. Smaller, faster, more cost-efficient. Extensive post-training makes outputs especially stable and consistent across generations. Strongest on illustration, anime, painting, and other expressive or artistic styles. Tags: Third-party.

**[krea-2-medium-turbo](https://developers.cloudflare.com/ai/models/krea/krea-2-medium-turbo/)**

Text-to-Image • krea. The fastest Krea 2 model, built for low-cost iteration on expressive illustrations, style-driven concepts, and rapid visual exploration. Keeps the Krea 2 style system and expressive visual range but uses a distilled sampling schedule so you can move through ideas much faster. Especially useful for expressive illustration, graphic styles, typography experiments, and quick campaign or concept directions. Tags: Third-party.

**[aleph-2](https://developers.cloudflare.com/ai/models/runwayml/aleph-2/)**

Text-to-Video • RunwayML. RunwayML's video editing model. Edit one frame to update your whole video, make changes across multiple shots, and work with up to 30 seconds of video. Supports keyframe-guided editing for precise control over specific moments in the clip. Tags: Third-party.

**[glm-5.2](https://developers.cloudflare.com/ai/models/@cf/zai-org/glm-5.2/)**

Text Generation • Zhipu AI. Z.ai's flagship agentic coding model. Tags: Cloudflare-hosted, Function calling, Reasoning.

**[claude-fable-5](https://developers.cloudflare.com/ai/models/anthropic/claude-fable-5/)**

Text Generation • Anthropic. Claude Fable 5 is Anthropic's most capable widely released model, built for the most demanding reasoning and long-horizon agentic work. Adaptive thinking is always on, and the model supports a 1M token context window with up to 128k output tokens per request. Tags: Third-party.

**[deepseek-v4-pro](https://developers.cloudflare.com/ai/models/deepseek/deepseek-v4-pro/)**

Text Generation • deepseek. DeepSeek V4 Pro is a high-capability reasoning model from DeepSeek, served via Fireworks infrastructure for production-grade inference. Tags: Third-party.

**[grok-voice](https://developers.cloudflare.com/ai/models/xai/grok-voice/)**

websocket • xAI. xAI's real-time voice conversation model with low-latency audio input and output streaming. Tags: Third-party.

**[grok-tts](https://developers.cloudflare.com/ai/models/xai/grok-tts/)**

Text-to-Speech • xAI. xAI's Grok text-to-speech model. Generates high-fidelity spoken audio in 5 expressive voices (eve, ara, rex, sal, leo) with 20+ supported languages. Supports inline speech tags for laughter, whispers, and pauses. Tags: Third-party.

**[grok-stt](https://developers.cloudflare.com/ai/models/xai/grok-stt/)**

Automatic Speech Recognition • xAI. xAI's Grok speech-to-text model. Transcribes audio files into text across 25 languages with word-level timestamps, multichannel transcription, speaker diarization, and key-term biasing. Tags: Third-party.

**[grok-imagine-video-1.5-preview](https://developers.cloudflare.com/ai/models/xai/grok-imagine-video-1.5-preview/)**

Image-to-Video • xAI. xAI's next-generation video generation model. Generates, edits, and extends videos from text and image inputs. Supports multiple aspect ratios and resolutions with improved quality over the previous generation. Tags: Third-party.

**[m3](https://developers.cloudflare.com/ai/models/minimax/m3/)**

Text Generation • MiniMax. MiniMax's M3 language model with frontier coding and agentic capabilities, a 1M token context window, and multilingual support. Tags: Third-party, Zero data retention.

**[wan-2.7-i2v](https://developers.cloudflare.com/ai/models/alibaba/wan-2.7-i2v/)**

Image-to-Video • Alibaba. Alibaba's Wan 2.7 image-to-video model that generates videos from a reference image with optional text prompts. Supports 720P and 1080P output with durations from 2 to 15 seconds. Tags: Third-party, Zero data retention.

**[claude-opus-4.8](https://developers.cloudflare.com/ai/models/anthropic/claude-opus-4.8/)**

Text Generation • Anthropic. Claude Opus 4.8 is Anthropic's most capable generally available model, with a step-change improvement in agentic coding over Claude Opus 4.7. It uses adaptive thinking to calibrate reasoning per task and supports a one million token context window at standard pricing. Tags: Third-party, Zero data retention.

**[flux-2-pro-preview](https://developers.cloudflare.com/ai/models/black-forest-labs/flux-2-pro-preview/)**

Text-to-Image • Black Forest Labs. FLUX.2 \[pro\] Preview is Black Forest Labs' recommended default for production image generation and editing. Tracks the latest \[pro\] weights with strong multi-reference support. Tags: Third-party.

**[flux-2-max](https://developers.cloudflare.com/ai/models/black-forest-labs/flux-2-max/)**

Text-to-Image • Black Forest Labs. FLUX.2 \[max\] is Black Forest Labs' highest-quality image model: top editing consistency, strongest prompt following, and grounding search for visualizations of real-time information. Tags: Third-party.

**[flux-2-flex](https://developers.cloudflare.com/ai/models/black-forest-labs/flux-2-flex/)**

Text-to-Image • Black Forest Labs. FLUX.2 \[flex\] is Black Forest Labs' fine-grained control variant of FLUX.2. Exposes tunable inference steps, guidance, and prompt upsampling for typography-heavy and production workflows. Tags: Third-party.

**[grok-imagine-video](https://developers.cloudflare.com/ai/models/xai/grok-imagine-video/)**

Text-to-Video • xAI. xAI's video generation model. Generates, edits, and extends videos from text and image inputs with native synchronized audio including dialogue, sound effects, and music. Supports multiple creative modes (normal, fun, custom). Tags: Third-party.

**[grok-imagine-image-quality](https://developers.cloudflare.com/ai/models/xai/grok-imagine-image-quality/)**

Text-to-Image • xAI. xAI's higher-fidelity text-to-image model optimized for sharper details, more accurate compositions, and stronger text rendering. Supports image editing via reference images and masks. Trades speed for quality compared to grok-imagine-image. Default output at 2k resolution. Tags: Third-party.

**[grok-4.3](https://developers.cloudflare.com/ai/models/xai/grok-4.3/)**

Text Generation • xAI. xAI's Grok 4.3 model with a 1M-token context window and strong agentic tool calling with minimal hallucinations. Accepts text and image inputs, and supports function calling, structured outputs, and configurable reasoning effort (none, low, medium, high). Tags: Third-party.

**[grok-imagine-image](https://developers.cloudflare.com/ai/models/xai/grok-imagine-image/)**

Text-to-Image • xAI. xAI's Grok Imagine image model. Generates and edits images from text and reference-image inputs with configurable aspect ratio and resolution. Tags: Third-party.

**[grok-4.20-multi-agent-0309](https://developers.cloudflare.com/ai/models/xai/grok-4.20-multi-agent-0309/)**

Text Generation • xAI. xAI's Grok 4.20 multi-agent model with a 2M-token context window. Multiple agents collaborate in parallel to perform deep research tasks, with function calling, structured outputs, and reasoning capabilities. Tags: Third-party.

**[grok-4.20-0309-non-reasoning](https://developers.cloudflare.com/ai/models/xai/grok-4.20-0309-non-reasoning/)**

Text Generation • xAI. xAI's Grok 4.20 non-reasoning model. Skips the thinking trace for fast, single-pass responses while keeping the same training as the reasoning variant. Tags: Third-party.

**[grok-4.20-0309-reasoning](https://developers.cloudflare.com/ai/models/xai/grok-4.20-0309-reasoning/)**

Text Generation • xAI. xAI's Grok 4.20 reasoning model. Uses extended thinking to work through complex problems, returning a reasoning trace alongside the final answer. Tags: Third-party.

**[q3-pro](https://developers.cloudflare.com/ai/models/vidu/q3-pro/)**

Text-to-Video • Vidu. Vidu Q3 Pro is a high-quality video generation model supporting text-to-video, image-to-video, and start/end-frame-to-video workflows with audio and up to 16-second clips. Tags: Third-party, Zero data retention.

**[q3-turbo](https://developers.cloudflare.com/ai/models/vidu/q3-turbo/)**

Text-to-Video • Vidu. Vidu Q3 Turbo is a faster version of Vidu Q3 optimized for lower latency video generation while maintaining audio support and up to 16-second clips. Tags: Third-party, Zero data retention.

**[gen-4.5](https://developers.cloudflare.com/ai/models/runwayml/gen-4.5/)**

Text-to-Video • RunwayML. RunwayML's video generation model supporting both text-to-video and image-to-video with customizable duration, aspect ratio, and content moderation controls. Tags: Third-party.

**[recraftv4-vector](https://developers.cloudflare.com/ai/models/recraft/recraftv4-vector/)**

Text-to-Image • Recraft. Generate production-ready SVG vector graphics from text prompts with clean geometry, structured layers, and editable paths. Tags: Third-party, Zero data retention.

**[recraftv4-pro](https://developers.cloudflare.com/ai/models/recraft/recraftv4-pro/)**

Text-to-Image • Recraft. Recraft V4 Pro generates high-resolution, art-directed images at 2048px+ with strong composition, text rendering, and design taste. Built for print and production work. Tags: Third-party, Zero data retention.

**[recraftv4-pro-vector](https://developers.cloudflare.com/ai/models/recraft/recraftv4-pro-vector/)**

Text-to-Image • Recraft. Generate detailed, production-ready SVG vector graphics from text prompts with fine geometry, scalable to any size for print and design work. Tags: Third-party, Zero data retention.

**[recraftv4-1-utility-vector](https://developers.cloudflare.com/ai/models/recraft/recraftv4-1-utility-vector/)**

Text-to-Image • Recraft. Generate production-ready SVG vector graphics from text prompts with a general-purpose model suited for a wide range of design and illustration tasks. Tags: Third-party, Zero data retention.

**[recraftv4-1-vector](https://developers.cloudflare.com/ai/models/recraft/recraftv4-1-vector/)**

Text-to-Image • Recraft. Generate production-ready SVG vector graphics from text prompts with high aesthetic quality, clean geometry, structured layers, and editable paths. Tags: Third-party, Zero data retention.

**[recraftv4-1-utility-pro](https://developers.cloudflare.com/ai/models/recraft/recraftv4-1-utility-pro/)**

Text-to-Image • Recraft. Recraft V4.1 Utility Pro is a general-purpose text-to-image model producing high-resolution 2048px+ output for a wide range of production and print use cases. Tags: Third-party, Zero data retention.

**[recraftv4-1-utility-pro-vector](https://developers.cloudflare.com/ai/models/recraft/recraftv4-1-utility-pro-vector/)**

Text-to-Image • Recraft. Generate detailed, high-resolution SVG vector graphics from text prompts with a general-purpose model, scalable to any size for print and large-scale design work. Tags: Third-party, Zero data retention.

**[recraftv4-1-pro-vector](https://developers.cloudflare.com/ai/models/recraft/recraftv4-1-pro-vector/)**

Text-to-Image • Recraft. Generate detailed, high-resolution SVG vector graphics from text prompts with high aesthetic quality, fine geometry, scalable to any size for print and design work. Tags: Third-party, Zero data retention.

**[recraftv4-1-utility](https://developers.cloudflare.com/ai/models/recraft/recraftv4-1-utility/)**

Text-to-Image • Recraft. Recraft V4.1 Utility is a general-purpose text-to-image model balancing quality and flexibility for a wide range of everyday use cases at standard resolution. Tags: Third-party, Zero data retention.

**[recraftv4-1](https://developers.cloudflare.com/ai/models/recraft/recraftv4-1/)**

Text-to-Image • Recraft. Recraft V4.1 generates art-directed images tuned for high aesthetics, with strong composition, accurate text rendering, and refined design taste. Fast and cost-efficient at standard resolution. Tags: Third-party, Zero data retention.

**[recraftv4-1-pro](https://developers.cloudflare.com/ai/models/recraft/recraftv4-1-pro/)**

Text-to-Image • Recraft. Recraft V4.1 Pro generates high-resolution, art-directed images at 2048px+ tuned for high aesthetics, with strong composition, text rendering, and refined design taste. Built for print and production work. Tags: Third-party, Zero data retention.

**[recraftv4](https://developers.cloudflare.com/ai/models/recraft/recraftv4/)**

Text-to-Image • Recraft. Recraft V4 generates art-directed images with strong composition, accurate text rendering, and design taste built in. Fast and cost-efficient at standard resolution. Tags: Third-party, Zero data retention.

**[v6](https://developers.cloudflare.com/ai/models/pixverse/v6/)**

Text-to-Video • PixVerse. Pixverse v6 is the latest Pixverse video model with support for up to 15-second videos, customizable duration from 1 to 15 seconds, and audio generation. Tags: Third-party, Zero data retention.

**[recraftv3](https://developers.cloudflare.com/ai/models/recraft/recraftv3/)**

Text-to-Image • Recraft. Recraft V3 is the previous-generation text-to-image model from Recraft, well-suited to design-quality compositions, brand-aware imagery, and accurate text rendering. Tags: Third-party, Zero data retention.

**[v5.6](https://developers.cloudflare.com/ai/models/pixverse/v5.6/)**

Text-to-Video • PixVerse. Pixverse v5.6 is a video generation model supporting text-to-video and image-to-video with audio generation, customizable aspect ratios, and up to 1080p output. Tags: Third-party, Zero data retention.

**[tts-1-hd](https://developers.cloudflare.com/ai/models/openai/tts-1-hd/)**

Text-to-Speech • OpenAI. OpenAI's high-definition text-to-speech model producing higher quality audio output. Tags: Third-party, Zero data retention.

**[tts-1](https://developers.cloudflare.com/ai/models/openai/tts-1/)**

Text-to-Speech • OpenAI. OpenAI's text-to-speech model optimized for real-time use with low latency. Tags: Third-party, Zero data retention.

**[o4-mini](https://developers.cloudflare.com/ai/models/openai/o4-mini/)**

Text Generation • OpenAI. OpenAI's fast, lightweight reasoning model optimized for multi-step problem solving at lower cost. Tags: Third-party, Zero data retention.

**[o3-mini](https://developers.cloudflare.com/ai/models/openai/o3-mini/)**

Text Generation • OpenAI. o3-mini is the lightweight, low-cost reasoning variant of o3, well suited to quick analytical tasks at scale. Tags: Third-party, Zero data retention.

**[o3](https://developers.cloudflare.com/ai/models/openai/o3/)**

Text Generation • OpenAI. o3 is OpenAI’s general-purpose reasoning model, balancing strong analytical performance with reasonable latency and cost. Tags: Third-party, Zero data retention.

**[gpt-image-2](https://developers.cloudflare.com/ai/models/openai/gpt-image-2/)**

Text-to-Image • OpenAI. OpenAI's next-generation image model that creates and edits images from text prompts, with support for multiple quality levels, sizes, and output formats. Note: transparent backgrounds are not supported; use openai/gpt-image-1.5 for transparent PNGs. Tags: Third-party, Zero data retention.

**[gpt-5.5-pro](https://developers.cloudflare.com/ai/models/openai/gpt-5.5-pro/)**

Text Generation • OpenAI. GPT-5.5 pro uses OpenAI's Responses API with built-in tools, improved reasoning, and stateful context management. Tags: Third-party, Zero data retention.

**[gpt-image-1.5](https://developers.cloudflare.com/ai/models/openai/gpt-image-1.5/)**

Text-to-Image • OpenAI. OpenAI's image generation model that creates and edits images from text prompts, supporting multiple quality levels and output sizes. Tags: Third-party, Zero data retention.

**[gpt-5.5](https://developers.cloudflare.com/ai/models/openai/gpt-5.5/)**

Text Generation • OpenAI. GPT-5.5 is OpenAI's flagship model with strong coding, reasoning, and multimodal capabilities. Tags: Third-party, Zero data retention.

**[gpt-5.4-nano](https://developers.cloudflare.com/ai/models/openai/gpt-5.4-nano/)**

Text Generation • OpenAI. GPT-5.4 nano is OpenAI's smallest and fastest model, optimized for edge and low-latency use cases. Tags: Third-party, Zero data retention.

**[gpt-5.4-pro](https://developers.cloudflare.com/ai/models/openai/gpt-5.4-pro/)**

Text Generation • OpenAI. GPT-5.4 pro uses OpenAI's Responses API with built-in tools, improved reasoning, and stateful context management. Tags: Third-party, Zero data retention.

**[gpt-5.4](https://developers.cloudflare.com/ai/models/openai/gpt-5.4/)**

Text Generation • OpenAI. GPT-5.4 is OpenAI's flagship model with strong coding, reasoning, and multimodal capabilities. Tags: Third-party, Zero data retention.

**[gpt-5.4-mini](https://developers.cloudflare.com/ai/models/openai/gpt-5.4-mini/)**

Text Generation • OpenAI. GPT-5.4 mini is a smaller, faster, and more cost-efficient version of GPT-5.4 for lightweight tasks. Tags: Third-party, Zero data retention.

**[gpt-5.1-chat](https://developers.cloudflare.com/ai/models/openai/gpt-5.1-chat/)**

Text Generation • OpenAI. GPT-5.1 Chat is the chat-tuned variant of GPT-5.1, optimised for back-and-forth conversation and instruction following. Tags: Third-party.

**[gpt-5.1](https://developers.cloudflare.com/ai/models/openai/gpt-5.1/)**

Text Generation • OpenAI. GPT-5.1 is OpenAI’s incremental improvement over GPT-5, with stronger coding, reasoning, and writing. Tags: Third-party, Zero data retention.

**[gpt-5-nano](https://developers.cloudflare.com/ai/models/openai/gpt-5-nano/)**

Text Generation • OpenAI. GPT-5 Nano is OpenAI’s smallest GPT-5 variant, optimized for low latency and cheap, high-throughput tasks. Tags: Third-party, Zero data retention.

**[gpt-5-chat](https://developers.cloudflare.com/ai/models/openai/gpt-5-chat/)**

Text Generation • OpenAI. GPT-5 Chat is the chat-tuned variant of GPT-5, optimised for back-and-forth conversation and instruction following. Tags: Third-party, Zero data retention.

**[gpt-5-mini](https://developers.cloudflare.com/ai/models/openai/gpt-5-mini/)**

Text Generation • OpenAI. GPT-5 Mini is the lightweight, low-cost variant of GPT-5, well suited to high-volume coding and reasoning tasks. Tags: Third-party, Zero data retention.

**[gpt-5](https://developers.cloudflare.com/ai/models/openai/gpt-5/)**

Text Generation • OpenAI. OpenAI's model excelling at coding, writing, and reasoning. Tags: Third-party, Zero data retention.

**[gpt-4o-transcribe](https://developers.cloudflare.com/ai/models/openai/gpt-4o-transcribe/)**

Automatic Speech Recognition • OpenAI. A speech-to-text model that uses GPT-4o to transcribe audio with improved word error rate and better language recognition compared to original Whisper models. Tags: Third-party, Zero data retention.

**[gpt-4o-mini](https://developers.cloudflare.com/ai/models/openai/gpt-4o-mini/)**

Text Generation • OpenAI. GPT-4o Mini is the lightweight, low-cost variant of GPT-4o, well suited to high-volume tasks with multimodal inputs. Tags: Third-party, Zero data retention.

**[gpt-4o](https://developers.cloudflare.com/ai/models/openai/gpt-4o/)**

Text Generation • OpenAI. GPT-4o is OpenAI’s multimodal flagship, accepting text and images and responding quickly across a wide range of tasks. Tags: Third-party, Zero data retention.

**[gpt-4.1-mini](https://developers.cloudflare.com/ai/models/openai/gpt-4.1-mini/)**

Text Generation • OpenAI. Fast, affordable version of GPT-4.1 with a million-token context window. Tags: Third-party, Zero data retention.

**[gpt-4.1-nano](https://developers.cloudflare.com/ai/models/openai/gpt-4.1-nano/)**

Text Generation • OpenAI. GPT-4.1 Nano is OpenAI’s smallest and cheapest GPT-4.1 variant, optimized for high-throughput, low-latency tasks. Tags: Third-party, Zero data retention.

**[gpt-4.1](https://developers.cloudflare.com/ai/models/openai/gpt-4.1/)**

Text Generation • OpenAI. OpenAI's flagship GPT model for complex tasks with a million-token context window. Tags: Third-party, Zero data retention.

**[speech-2.8-turbo](https://developers.cloudflare.com/ai/models/minimax/speech-2.8-turbo/)**

Text-to-Speech • MiniMax. MiniMax Speech 2.8 Turbo turns text into natural, expressive speech with voice cloning, emotion control, and 40+ language support at faster speeds. Tags: Third-party, Zero data retention.

**[music-2.6](https://developers.cloudflare.com/ai/models/minimax/music-2.6/)**

Music Generation • MiniMax. MiniMax's music generation model that creates full-length songs with vocals from text prompts and lyrics, or instrumental tracks. Supports BPM/key control and auto-generated lyrics. Tags: Third-party, Zero data retention.

**[speech-2.8-hd](https://developers.cloudflare.com/ai/models/minimax/speech-2.8-hd/)**

Text-to-Speech • MiniMax. MiniMax Speech 2.8 HD focuses on studio-grade audio generation with emotion control, multilingual support (40+ languages), and voice cloning. Tags: Third-party, Zero data retention.

**[m2.7](https://developers.cloudflare.com/ai/models/minimax/m2.7/)**

Text Generation • MiniMax. MiniMax's M2.7 language model with multilingual capabilities. Tags: Third-party, Zero data retention.

**[hailuo-2.3-fast](https://developers.cloudflare.com/ai/models/minimax/hailuo-2.3-fast/)**

Text-to-Video • MiniMax. A lower-latency version of Hailuo 2.3 that preserves core motion quality, visual consistency, and stylization while enabling faster iteration. Tags: Third-party, Zero data retention.

[![Inworld logo](https://developers.cloudflare.com/_astro/inworld.BDwMAXI2.svg)tts-2Text-to-Speech • 
InworldInworld's most powerful and expressive text-to-speech model. Builds on TTS 1.5 with rich expressive speech, real-time latency, natural language steering (e.g. \[whisper\], \[say excitedly\]), and stronger multilingual support across 15 production languages plus 90+ experimental languages.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/inworld/tts-2/)[![MiniMax logo](https://developers.cloudflare.com/_astro/minimax.DPZX-zZI.svg)hailuo-2.3Text-to-Video • MiniMaxA high-fidelity video generation model optimized for realistic human motion, cinematic VFX, expressive characters, and strong prompt and style adherence across text-to-video and image-to-video workflows.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/minimax/hailuo-2.3/)[![Inworld logo](https://developers.cloudflare.com/_astro/inworld.BDwMAXI2.svg)tts-1.5-maxText-to-Speech • InworldHighest-quality text-to-speech with under 200ms latency, emotion control, and 15-language support.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/inworld/tts-1.5-max/)[![Inworld logo](https://developers.cloudflare.com/_astro/inworld.BDwMAXI2.svg)tts-1.5-miniText-to-Speech • InworldUltra-fast, cost-efficient text-to-speech with approximately 120ms latency and 15-language support.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/inworld/tts-1.5-mini/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)veo-3.1-fastText-to-Video • GoogleA faster version of Veo 3.1 optimized for lower latency while maintaining high-quality video and audio output.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/google/veo-3.1-fast/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)veo-3-fastText-to-Video • GoogleA faster version of Veo 3 optimized for lower latency video generation with audio support.Third-partyZero data 
retention](https://developers.cloudflare.com/ai/models/google/veo-3-fast/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)veo-3.1Text-to-Video • GoogleGoogle's latest video generation model with improved quality, motion, and audio generation.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/google/veo-3.1/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)nano-banana-proText-to-Image • GoogleGoogle's higher-quality image generation model with improved detail and prompt adherence.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/google/nano-banana-pro/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)veo-3Text-to-Video • GoogleGoogle's video generation model capable of producing high-quality videos with optional audio from text prompts.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/google/veo-3/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)nano-bananaText-to-Image • GoogleGoogle's fast image generation model producing high-quality images from text prompts.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/google/nano-banana/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)nano-banana-2Text-to-Image • GoogleGoogle's second-generation image generation model with improved quality and speed.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/google/nano-banana-2/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)gemini-3.1-proText Generation • GoogleGoogle's most intelligent Gemini model with improved reasoning, a medium thinking level, and a 1M token context window.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/google/gemini-3.1-pro/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)imagen-4Text-to-Image • 
GoogleGoogle's latest image generation model producing high-quality, photorealistic images from text prompts with support for multiple aspect ratios.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/google/imagen-4/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)gemini-3.1-flash-ttsText-to-Speech • GoogleThird-partyZero data retention](https://developers.cloudflare.com/ai/models/google/gemini-3.1-flash-tts/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)gemini-3-flashText Generation • GoogleGemini 3 Flash is Google's fast multimodal model with frontier intelligence, superior search, and grounding capabilities.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/google/gemini-3-flash/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)gemini-3.1-flash-liteText Generation • GoogleGoogle's lightest and most cost-efficient Gemini model for high-throughput tasks.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/google/gemini-3.1-flash-lite/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)gemini-2.5-flash-liteText Generation • GoogleGoogle's lightest and most cost-efficient Gemini 2.5 model for high-throughput tasks.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/google/gemini-2.5-flash-lite/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)gemini-2.5-proText Generation • GoogleGoogle's most capable Gemini 2.5 model with strong reasoning, thinking support, and a 1M token context window.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/google/gemini-2.5-pro/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)gemini-2.5-flashText Generation • GoogleGoogle's fast multimodal Gemini 2.5 model with strong reasoning and a 1M token context window.Third-partyZero data 
retention](https://developers.cloudflare.com/ai/models/google/gemini-2.5-flash/)[![ByteDance logo](https://developers.cloudflare.com/_astro/bytedance.T1uiROQ6.svg)seedream-4.5Text-to-Image • ByteDanceSeedream 4.5 builds on 4.0 with multi-reference image support, batch generation, and sequential image generation.Third-party](https://developers.cloudflare.com/ai/models/bytedance/seedream-4.5/)[![ByteDance logo](https://developers.cloudflare.com/_astro/bytedance.T1uiROQ6.svg)seedream-5-liteText-to-Image • ByteDanceSeedream 5 Lite is a lighter, faster version of the Seedream 5 family with multi-reference and batch generation support.Third-party](https://developers.cloudflare.com/ai/models/bytedance/seedream-5-lite/)[![ByteDance logo](https://developers.cloudflare.com/_astro/bytedance.T1uiROQ6.svg)seedream-4.0Text-to-Image • ByteDanceSeedream 4.0 is ByteDance's image creation model that combines text-to-image generation and image editing into a single architecture, offering fast, high-resolution output up to 4K.Third-party](https://developers.cloudflare.com/ai/models/bytedance/seedream-4.0/)[![ByteDance logo](https://developers.cloudflare.com/_astro/bytedance.T1uiROQ6.svg)seedance-2.0-fastText-to-Video • ByteDanceFaster variant of ByteDance's Seedance 2.0 video model. Trades some quality for speed while sharing the same multimodal architecture. 
Supports text-to-video, image-to-video, native audio generation, multimodal references (images, videos, audio), video editing, and video extension.Third-party](https://developers.cloudflare.com/ai/models/bytedance/seedance-2.0-fast/)[![AssemblyAI logo](https://developers.cloudflare.com/_astro/assemblyai.DKrad3Z3.svg)universal-3-proAutomatic Speech Recognition • AssemblyAIAssemblyAI's Universal 3 Pro speech recognition model for high-accuracy transcription.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/assemblyai/universal-3-pro/)[![ByteDance logo](https://developers.cloudflare.com/_astro/bytedance.T1uiROQ6.svg)seedance-2.0Text-to-Video • ByteDanceByteDance's next-generation video model with a unified multimodal architecture. Generates high-quality video with synchronized audio from text, images, video clips, and audio inputs. Supports multimodal references (up to 9 images, 3 videos, 3 audio files), native audio generation, video editing, video extension, intelligent duration, and adaptive aspect ratio.Third-party](https://developers.cloudflare.com/ai/models/bytedance/seedance-2.0/)[![Anthropic logo](https://developers.cloudflare.com/_astro/anthropic.DbRqBIjP.svg)claude-sonnet-4.5Text Generation • AnthropicClaude Sonnet 4.5 is the best coding model to date, with significant improvements across the entire development lifecycle.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/anthropic/claude-sonnet-4.5/)[![Anthropic logo](https://developers.cloudflare.com/_astro/anthropic.DbRqBIjP.svg)claude-sonnet-4.6Text Generation • AnthropicClaude Sonnet 4.6 is Anthropic's latest balanced model offering strong coding, reasoning, and agentic capabilities with improved instruction following.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/anthropic/claude-sonnet-4.6/)[![Anthropic logo](https://developers.cloudflare.com/_astro/anthropic.DbRqBIjP.svg)claude-opus-4.6Text Generation • AnthropicClaude 
Opus 4.6 is Anthropic's flagship language model built for complex, multi-step work in coding, financial analysis, and legal reasoning. It uses extended thinking to work through complex problems carefully and features a one million token context window.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/anthropic/claude-opus-4.6/)[![Anthropic logo](https://developers.cloudflare.com/_astro/anthropic.DbRqBIjP.svg)claude-opus-4.7Text Generation • AnthropicClaude Opus 4.7 is Anthropic's most capable generally available model, with a step-change improvement in agentic coding over Claude Opus 4.6\. It uses adaptive thinking to calibrate reasoning per task and supports a one million token context window at standard pricing.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/anthropic/claude-opus-4.7/)[![Anthropic logo](https://developers.cloudflare.com/_astro/anthropic.DbRqBIjP.svg)claude-opus-4.5Text Generation • AnthropicClaude Opus 4.5 brings further reasoning, coding, and agentic improvements over Opus 4.1, with stronger tool use and tighter instruction following.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/anthropic/claude-opus-4.5/)[![Alibaba logo](https://developers.cloudflare.com/_astro/alibaba.C3THgr9s.svg)wan-2.6-imageText-to-Image • AlibabaAlibaba's Wan 2.6 text-to-image model generating images from text prompts with optional negative prompts and customizable dimensions.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/alibaba/wan-2.6-image/)[![Anthropic logo](https://developers.cloudflare.com/_astro/anthropic.DbRqBIjP.svg)claude-haiku-4.5Text Generation • AnthropicClaude Haiku 4.5 delivers similar levels of coding performance at one-third the cost and more than twice the speed of larger models.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/anthropic/claude-haiku-4.5/)[![Alibaba 
logo](https://developers.cloudflare.com/_astro/alibaba.C3THgr9s.svg)qwen3-maxText Generation • AlibabaAlibaba's Qwen 3 Max is a large language model with strong coding, reasoning, and multilingual capabilities, served via DashScope's OpenAI-compatible endpoint.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/alibaba/qwen3-max/)[![Alibaba logo](https://developers.cloudflare.com/_astro/alibaba.C3THgr9s.svg)qwen3.5-397b-a17bText Generation • AlibabaAlibaba's Qwen 3.5 is a 397B-parameter mixture-of-experts model with 17B active parameters, offering strong reasoning capabilities with efficient inference.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/alibaba/qwen3.5-397b-a17b/)[![Alibaba logo](https://developers.cloudflare.com/_astro/alibaba.C3THgr9s.svg)hh1-t2vText-to-Video • AlibabaAlibaba's HappyHorse 1.0 text-to-video model. Generates videos from a text prompt with configurable resolution, aspect ratio, and duration (3-15s).Third-partyZero data retention](https://developers.cloudflare.com/ai/models/alibaba/hh1-t2v/)[![Alibaba logo](https://developers.cloudflare.com/_astro/alibaba.C3THgr9s.svg)hh1-i2vImage-to-Video • AlibabaAlibaba's HappyHorse 1.0 image-to-video model. Animates a reference image with an optional text prompt. 
Supports 720P and 1080P output with durations from 3 to 15 seconds.Third-partyZero data retention](https://developers.cloudflare.com/ai/models/alibaba/hh1-i2v/)[![Moonshot AI logo](https://developers.cloudflare.com/_astro/moonshotai.D9EBG7kx.svg)kimi-k2.6Text Generation • Moonshot AIKimi K2.6 is a frontier-scale open-source 1T parameter model with a 262.1k context window, multi-turn tool calling, vision inputs, and structured outputs for agentic workloads.Cloudflare-hostedFunction callingReasoningVision](https://developers.cloudflare.com/ai/models/@cf/moonshotai/kimi-k2.6/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)gemma-4-26b-a4b-itText Generation • GoogleGemma 4 is Google's most intelligent family of open models, built from Gemini 3 research to maximize intelligence-per-parameter.Cloudflare-hostedFunction callingReasoningVision](https://developers.cloudflare.com/ai/models/@cf/google/gemma-4-26b-a4b-it/)[![NVIDIA logo](https://developers.cloudflare.com/_astro/nvidia.y1O6VlZA.svg)nemotron-3-120b-a12bText Generation • NVIDIANVIDIA Nemotron 3 Super is a hybrid MoE model with leading accuracy for multi-agent applications and specialized agentic AI systems.Cloudflare-hostedFunction callingReasoning](https://developers.cloudflare.com/ai/models/@cf/nvidia/nemotron-3-120b-a12b/)[![Moonshot AI logo](https://developers.cloudflare.com/_astro/moonshotai.D9EBG7kx.svg)kimi-k2.5Text Generation • Moonshot AIKimi K2.5 is a frontier-scale open-source model with a 256k context window, multi-turn tool calling, vision inputs, and structured outputs for agentic workloads.Cloudflare-hostedFunction callingDeprecatedReasoningVision](https://developers.cloudflare.com/ai/models/@cf/moonshotai/kimi-k2.5/)[![Black Forest Labs logo](https://developers.cloudflare.com/_astro/blackforestlabs.Ccs-Y4-D.svg)flux-2-klein-9bText-to-Image • Black Forest LabsFLUX.2 \[klein\] 9B is an ultra-fast, distilled image model with enhanced quality. 
It unifies image generation and editing in a single model, delivering state-of-the-art quality enabling interactive workflows, real-time previews, and latency-critical applications.Cloudflare-hostedPartner](https://developers.cloudflare.com/ai/models/@cf/black-forest-labs/flux-2-klein-9b/)[![Black Forest Labs logo](https://developers.cloudflare.com/_astro/blackforestlabs.Ccs-Y4-D.svg)flux-2-klein-4bText-to-Image • Black Forest LabsFLUX.2 \[klein\] is an ultra-fast, distilled image model. It unifies image generation and editing in a single model, delivering state-of-the-art quality enabling interactive workflows, real-time previews, and latency-critical applications.Cloudflare-hostedPartner](https://developers.cloudflare.com/ai/models/@cf/black-forest-labs/flux-2-klein-4b/)[![Black Forest Labs logo](https://developers.cloudflare.com/_astro/blackforestlabs.Ccs-Y4-D.svg)flux-2-devText-to-Image • Black Forest LabsFLUX.2 \[dev\] is an image model from Black Forest Labs where you can generate highly realistic and detailed images, with multi-reference support.Cloudflare-hostedPartner](https://developers.cloudflare.com/ai/models/@cf/black-forest-labs/flux-2-dev/)[![Deepgram logo](https://developers.cloudflare.com/_astro/deepgram.BYzW8KfF.svg)aura-2-esText-to-Speech • DeepgramAura-2 is a context-aware text-to-speech (TTS) model that applies natural pacing, expressiveness, and fillers based on the context of the provided text. The quality of your text input directly impacts the naturalness of the audio output.Cloudflare-hostedBatchPartnerReal-time](https://developers.cloudflare.com/ai/models/@cf/deepgram/aura-2-es/)[![Deepgram logo](https://developers.cloudflare.com/_astro/deepgram.BYzW8KfF.svg)aura-2-enText-to-Speech • DeepgramAura-2 is a context-aware text-to-speech (TTS) model that applies natural pacing, expressiveness, and fillers based on the context of the provided text. 
The quality of your text input directly impacts the naturalness of the audio output.Cloudflare-hostedBatchPartnerReal-time](https://developers.cloudflare.com/ai/models/@cf/deepgram/aura-2-en/)[![IBM logo](https://developers.cloudflare.com/_astro/ibm.CNSuznmO.svg)granite-4.0-h-microText Generation • IBMGranite 4.0 instruct models deliver strong performance across benchmarks, achieving industry-leading results in key agentic tasks like instruction following and function calling. These efficiencies make the models well-suited for a wide range of use cases like retrieval-augmented generation (RAG), multi-agent workflows, and edge deployments.Cloudflare-hostedFunction calling](https://developers.cloudflare.com/ai/models/@cf/ibm-granite/granite-4.0-h-micro/)[![Deepgram logo](https://developers.cloudflare.com/_astro/deepgram.BYzW8KfF.svg)fluxAutomatic Speech Recognition • DeepgramFlux is the first conversational speech recognition model built specifically for voice agents.Cloudflare-hostedPartnerReal-time](https://developers.cloudflare.com/ai/models/@cf/deepgram/flux/)[plamo-embedding-1bText Embeddings • pfnetPLaMo-Embedding-1B is a Japanese text embedding model developed by Preferred Networks, Inc. 
It can convert Japanese text input into numerical vectors and can be used for a wide range of applications, including information retrieval, text classification, and clustering.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/pfnet/plamo-embedding-1b/)[gemma-sea-lion-v4-27b-itText Generation • aisingaporeSEA-LION stands for Southeast Asian Languages In One Network, a collection of Large Language Models (LLMs) that have been pretrained and instruct-tuned for the Southeast Asia (SEA) region.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/aisingapore/gemma-sea-lion-v4-27b-it/)[indictrans2-en-indic-1BTranslation • ai4bharatIndicTrans2 is the first open-source transformer-based multilingual NMT model that supports high-quality translations across all the 22 scheduled Indic languages.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/ai4bharat/indictrans2-en-indic-1B/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)embeddinggemma-300mText Embeddings • GoogleEmbeddingGemma is a 300M parameter, state-of-the-art for its size, open embedding model from Google, built from Gemma 3 (with T5Gemma initialization) and the same research and technology used to create Gemini models. EmbeddingGemma produces vector representations of text, making it well-suited for search and retrieval tasks, including classification, clustering, and semantic similarity search. This model was trained with data in 100+ spoken languages.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/google/embeddinggemma-300m/)[![Deepgram logo](https://developers.cloudflare.com/_astro/deepgram.BYzW8KfF.svg)aura-1Text-to-Speech • DeepgramAura is a context-aware text-to-speech (TTS) model that applies natural pacing, expressiveness, and fillers based on the context of the provided text. 
The quality of your text input directly impacts the naturalness of the audio output.Cloudflare-hostedBatchPartnerReal-time](https://developers.cloudflare.com/ai/models/@cf/deepgram/aura-1/)[![Leonardo logo](https://developers.cloudflare.com/_astro/leonardo.Ch-T5rST.svg)lucid-originText-to-Image • LeonardoLucid Origin from Leonardo.AI is their most adaptable and prompt-responsive model to date. Whether you're generating images with sharp graphic design, stunning full-HD renders, or highly specific creative direction, it adheres closely to your prompts, renders text with accuracy, and supports a wide array of visual styles and aesthetics – from stylized concept art to crisp product mockups. Cloudflare-hostedPartner](https://developers.cloudflare.com/ai/models/@cf/leonardo/lucid-origin/)[![Leonardo logo](https://developers.cloudflare.com/_astro/leonardo.Ch-T5rST.svg)phoenix-1.0Text-to-Image • LeonardoPhoenix 1.0 is a model by Leonardo.Ai that generates images with exceptional prompt adherence and coherent text.Cloudflare-hostedPartner](https://developers.cloudflare.com/ai/models/@cf/leonardo/phoenix-1.0/)[![OpenAI logo](https://developers.cloudflare.com/_astro/openai.BI8PEEzI.svg)gpt-oss-20bText Generation • OpenAIOpenAI's open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases – gpt-oss-20b is for lower latency, and local or specialized use-cases.Cloudflare-hostedFunction callingReasoning](https://developers.cloudflare.com/ai/models/@cf/openai/gpt-oss-20b/)[![Pipecat logo](https://developers.cloudflare.com/_astro/pipecat.B-PNBdef.svg)smart-turn-v2Voice Activity Detection • PipecatAn open source, community-driven, native audio turn detection model in 2nd versionCloudflare-hostedBatchReal-time](https://developers.cloudflare.com/ai/models/@cf/pipecat-ai/smart-turn-v2/)[![Qwen logo](https://developers.cloudflare.com/_astro/qwen.CVqFFn5h.svg)qwen3-embedding-0.6bText Embeddings • QwenThe Qwen3 Embedding model series is the 
latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/qwen/qwen3-embedding-0.6b/)[![Deepgram logo](https://developers.cloudflare.com/_astro/deepgram.BYzW8KfF.svg)nova-3Automatic Speech Recognition • DeepgramTranscribe audio using Deepgram’s speech-to-text modelCloudflare-hostedBatchPartnerReal-time](https://developers.cloudflare.com/ai/models/@cf/deepgram/nova-3/)[![Qwen logo](https://developers.cloudflare.com/_astro/qwen.CVqFFn5h.svg)qwen3-30b-a3b-fp8Text Generation • QwenQwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support.Cloudflare-hostedBatchFunction callingReasoning](https://developers.cloudflare.com/ai/models/@cf/qwen/qwen3-30b-a3b-fp8/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)gemma-3-12b-itText Generation • GoogleGemma 3 models are well-suited for a variety of text generation and image understanding tasks, including question answering, summarization, and reasoning. Gemma 3 models are multimodal, handling text and image input and generating text output, with a large, 128K context window, multilingual support in over 140 languages, and is available in more sizes than previous versions.Cloudflare-hostedLoRADeprecated](https://developers.cloudflare.com/ai/models/@cf/google/gemma-3-12b-it/)[![MistralAI logo](https://developers.cloudflare.com/_astro/mistralai.Bn9UMUMu.svg)mistral-small-3.1-24b-instructText Generation • MistralAIBuilding upon Mistral Small 3 (2501), Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities up to 128k tokens without compromising text performance. 
With 24 billion parameters, this model achieves top-tier capabilities in both text and vision tasks.Cloudflare-hostedFunction calling](https://developers.cloudflare.com/ai/models/@cf/mistralai/mistral-small-3.1-24b-instruct/)[![Qwen logo](https://developers.cloudflare.com/_astro/qwen.CVqFFn5h.svg)qwq-32bText Generation • QwenQwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model, which is capable of achieving competitive performance against state-of-the-art reasoning models, e.g., DeepSeek-R1, o1-mini.Cloudflare-hostedLoRAReasoning](https://developers.cloudflare.com/ai/models/@cf/qwen/qwq-32b/)[![Qwen logo](https://developers.cloudflare.com/_astro/qwen.CVqFFn5h.svg)qwen2.5-coder-32b-instructText Generation • QwenQwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). As of now, Qwen2.5-Coder covers six mainstream model sizes (0.5, 1.5, 3, 7, 14, and 32 billion parameters) to meet the needs of different developers. Qwen2.5-Coder brings the following improvements upon CodeQwen1.5:Cloudflare-hostedLoRA](https://developers.cloudflare.com/ai/models/@cf/qwen/qwen2.5-coder-32b-instruct/)[![BAAI logo](https://developers.cloudflare.com/_astro/baai.mOtdbKlV.svg)bge-reranker-baseText Classification • BAAIUnlike an embedding model, a reranker takes a question and a document as input and directly outputs a similarity score instead of an embedding. You can get a relevance score by passing a query and a passage to the reranker, and the score can be mapped to a float value in \[0,1\] by a sigmoid function. 
Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/baai/bge-reranker-base/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)llama-guard-3-8bText Generation • MetaLlama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM inputs (prompt classification) and in LLM responses (response classification). It acts as an LLM – it generates text in its output that indicates whether a given prompt or response is safe or unsafe, and if unsafe, it also lists the content categories violated.Cloudflare-hostedLoRA](https://developers.cloudflare.com/ai/models/@cf/meta/llama-guard-3-8b/)[![DeepSeek logo](https://developers.cloudflare.com/_astro/deepseek.nPIT6fwR.svg)deepseek-r1-distill-qwen-32bText Generation • DeepSeekDeepSeek-R1-Distill-Qwen-32B is a model distilled from DeepSeek-R1 based on Qwen2.5\. It outperforms OpenAI-o1-mini across various benchmarks, achieving new state-of-the-art results for dense models.Cloudflare-hostedReasoning](https://developers.cloudflare.com/ai/models/@cf/deepseek-ai/deepseek-r1-distill-qwen-32b/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)llama-3.3-70b-instruct-fp8-fastText Generation • MetaLlama 3.3 70B quantized to fp8 precision, optimized to be faster.Cloudflare-hostedBatchFunction calling](https://developers.cloudflare.com/ai/models/@cf/meta/llama-3.3-70b-instruct-fp8-fast/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)llama-3.2-1b-instructText Generation • MetaThe Llama 3.2 instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/meta/llama-3.2-1b-instruct/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)llama-3.2-3b-instructText Generation • MetaThe Llama 3.2 
instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/meta/llama-3.2-3b-instruct/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)llama-3.2-11b-vision-instructText Generation • Meta The Llama 3.2-Vision instruction-tuned models are optimized for visual recognition, image reasoning, captioning, and answering general questions about an image.Cloudflare-hostedLoRAVision](https://developers.cloudflare.com/ai/models/@cf/meta/llama-3.2-11b-vision-instruct/)[![Black Forest Labs logo](https://developers.cloudflare.com/_astro/blackforestlabs.Ccs-Y4-D.svg)flux-1-schnellText-to-Image • Black Forest LabsFLUX.1 \[schnell\] is a 12 billion parameter rectified flow transformer capable of generating images from text descriptions. Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/black-forest-labs/flux-1-schnell/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)llama-3.1-8b-instruct-awqText Generation • MetaQuantized (int4) generative text model with 8 billion parameters from Meta. 
Cloudflare-hostedDeprecated](https://developers.cloudflare.com/ai/models/@cf/meta/llama-3.1-8b-instruct-awq/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)llama-3.1-8b-instruct-fp8Text Generation • MetaLlama 3.1 8B quantized to FP8 precisionCloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/meta/llama-3.1-8b-instruct-fp8/)[![MyShell logo](https://developers.cloudflare.com/_astro/myshell.BpTDMxd2.svg)melottsText-to-Speech • MyShellMeloTTS is a high-quality multi-lingual text-to-speech library by MyShell.ai.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/myshell-ai/melotts/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)llama-3.1-8b-instructText Generation • MetaThe Meta Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction tuned generative models. The Llama 3.1 instruction tuned text only models are optimized for multilingual dialogue use cases and outperform many of the available open source and closed chat models on common industry benchmarks.Cloudflare-hostedDeprecated](https://developers.cloudflare.com/ai/models/@cf/meta/llama-3.1-8b-instruct/)[![BAAI logo](https://developers.cloudflare.com/_astro/baai.mOtdbKlV.svg)bge-m3Text Embeddings • BAAIMulti-Functionality, Multi-Linguality, and Multi-Granularity embeddings model.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/baai/bge-m3/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)meta-llama-3-8b-instructText Generation • MetaGeneration over generation, Meta Llama 3 demonstrates state-of-the-art performance on a wide range of industry benchmarks and offers new capabilities, including improved reasoning. 
Cloudflare-hostedDeprecated](https://developers.cloudflare.com/ai/models/@hf/meta-llama/meta-llama-3-8b-instruct/)[![OpenAI logo](https://developers.cloudflare.com/_astro/openai.BI8PEEzI.svg)whisper-large-v3-turboAutomatic Speech Recognition • OpenAIWhisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. Cloudflare-hostedBatch](https://developers.cloudflare.com/ai/models/@cf/openai/whisper-large-v3-turbo/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)llama-3-8b-instruct-awqText Generation • MetaQuantized (int4) generative text model with 8 billion parameters from Meta.Cloudflare-hostedDeprecated](https://developers.cloudflare.com/ai/models/@cf/meta/llama-3-8b-instruct-awq/)[llava-1.5-7b-hfBetaImage-to-Text • llava-hfLLaVA is an open-source chatbot trained by fine-tuning LLaMA/Vicuna on GPT-generated multimodal instruction-following data. It is an auto-regressive language model, based on the transformer architecture.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/llava-hf/llava-1.5-7b-hf/)[![OpenAI logo](https://developers.cloudflare.com/_astro/openai.BI8PEEzI.svg)whisper-tiny-enBetaAutomatic Speech Recognition • OpenAIWhisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. Trained on 680k hours of labelled data, Whisper models demonstrate a strong ability to generalize to many datasets and domains without the need for fine-tuning. 
This is the English-only version of the Whisper Tiny model which was trained on the task of speech recognition.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/openai/whisper-tiny-en/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)llama-3-8b-instructText Generation • MetaGeneration over generation, Meta Llama 3 demonstrates state-of-the-art performance on a wide range of industry benchmarks and offers new capabilities, including improved reasoning.Cloudflare-hostedDeprecated](https://developers.cloudflare.com/ai/models/@cf/meta/llama-3-8b-instruct/)[![MistralAI logo](https://developers.cloudflare.com/_astro/mistralai.Bn9UMUMu.svg)mistral-7b-instruct-v0.2BetaText Generation • MistralAIThe Mistral-7B-Instruct-v0.2 Large Language Model (LLM) is an instruct fine-tuned version of the Mistral-7B-v0.2\. Mistral-7B-v0.2 has the following changes compared to Mistral-7B-v0.1: 32k context window (vs 8k context in v0.1), rope-theta = 1e6, and no Sliding-Window Attention.Cloudflare-hostedLoRADeprecated](https://developers.cloudflare.com/ai/models/@hf/mistral/mistral-7b-instruct-v0.2/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)gemma-7b-it-loraBetaText Generation • Google This is a Gemma-7B base model that Cloudflare dedicates for inference with LoRA adapters. Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.Cloudflare-hostedLoRA](https://developers.cloudflare.com/ai/models/@cf/google/gemma-7b-it-lora/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)gemma-2b-it-loraBetaText Generation • GoogleThis is a Gemma-2B base model that Cloudflare dedicates for inference with LoRA adapters. 
Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.Cloudflare-hostedLoRA](https://developers.cloudflare.com/ai/models/@cf/google/gemma-2b-it-lora/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)llama-2-7b-chat-hf-loraBetaText Generation • MetaThis is a Llama2 base model that Cloudflare dedicated for inference with LoRA adapters. Llama 2 is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. This is the repository for the 7B fine-tuned model, optimized for dialogue use cases and converted for the Hugging Face Transformers format. Cloudflare-hostedLoRA](https://developers.cloudflare.com/ai/models/@cf/meta-llama/llama-2-7b-chat-hf-lora/)[![Google logo](https://developers.cloudflare.com/_astro/google.DyXKPTPP.svg)gemma-7b-itBetaText Generation • GoogleGemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models. They are text-to-text, decoder-only large language models, available in English, with open weights, pre-trained variants, and instruction-tuned variants.Cloudflare-hostedLoRADeprecated](https://developers.cloudflare.com/ai/models/@hf/google/gemma-7b-it/)[hermes-2-pro-mistral-7bBetaText Generation • nousresearchHermes 2 Pro on Mistral 7B is the new flagship 7B Hermes! 
Hermes 2 Pro is an upgraded, retrained version of Nous Hermes 2, consisting of an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced Function Calling and JSON Mode dataset developed in-house.Cloudflare-hostedFunction callingDeprecated](https://developers.cloudflare.com/ai/models/@hf/nousresearch/hermes-2-pro-mistral-7b/)[![MistralAI logo](https://developers.cloudflare.com/_astro/mistralai.Bn9UMUMu.svg)mistral-7b-instruct-v0.2-loraBetaText Generation • MistralAIThe Mistral-7B-Instruct-v0.2 Large Language Model (LLM) is an instruct fine-tuned version of the Mistral-7B-v0.2.Cloudflare-hostedLoRA](https://developers.cloudflare.com/ai/models/@cf/mistral/mistral-7b-instruct-v0.2-lora/)[![Unum logo](https://developers.cloudflare.com/_astro/unum.Cjjoj0_o.svg)uform-gen2-qwen-500mBetaImage-to-Text • UnumUForm-Gen is a small generative vision-language model primarily designed for Image Captioning and Visual Question Answering. The model was pre-trained on the internal image captioning dataset and fine-tuned on public instructions datasets: SVIT, LVIS, VQAs datasets.Cloudflare-hostedDeprecated](https://developers.cloudflare.com/ai/models/@cf/unum/uform-gen2-qwen-500m/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)bart-large-cnnBetaSummarization • MetaBART is a transformer encoder-decoder (seq2seq) model with a bidirectional (BERT-like) encoder and an autoregressive (GPT-like) decoder. 
You can use this model for text summarization.Cloudflare-hostedDeprecated](https://developers.cloudflare.com/ai/models/@cf/facebook/bart-large-cnn/)[![Microsoft logo](https://developers.cloudflare.com/_astro/microsoft.LujcDJ--.svg)phi-2BetaText Generation • MicrosoftPhi-2 is a Transformer-based model with a next-word prediction objective, trained on 1.4T tokens from multiple passes on a mixture of Synthetic and Web datasets for NLP and coding.Cloudflare-hostedDeprecated](https://developers.cloudflare.com/ai/models/@cf/microsoft/phi-2/)[![Defog logo](https://developers.cloudflare.com/_astro/defog.BeLrxE1p.svg)sqlcoder-7b-2BetaText Generation • DefogThis model is intended to be used by non-technical users to understand data inside their SQL databases. Cloudflare-hostedDeprecated](https://developers.cloudflare.com/ai/models/@cf/defog/sqlcoder-7b-2/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)detr-resnet-50BetaObject Detection • MetaDEtection TRansformer (DETR) model trained end-to-end on COCO 2017 object detection (118k annotated images).Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/facebook/detr-resnet-50/)[![ByteDance logo](https://developers.cloudflare.com/_astro/bytedance.T1uiROQ6.svg)stable-diffusion-xl-lightningBetaText-to-Image • ByteDanceSDXL-Lightning is a lightning-fast text-to-image generation model. 
It can generate high-quality 1024px images in a few steps.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/bytedance/stable-diffusion-xl-lightning/)[dreamshaper-8-lcmText-to-Image • lykonStable Diffusion model that has been fine-tuned to be better at photorealism without sacrificing range.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/lykon/dreamshaper-8-lcm/)[![RunwayML logo](https://developers.cloudflare.com/_astro/runway.Cq8Cjov4.svg)stable-diffusion-v1-5-img2imgBetaText-to-Image • RunwayMLStable Diffusion is a latent text-to-image diffusion model capable of generating photo-realistic images. Img2img generates a new image from an input image with Stable Diffusion. Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/runwayml/stable-diffusion-v1-5-img2img/)[![RunwayML logo](https://developers.cloudflare.com/_astro/runway.Cq8Cjov4.svg)stable-diffusion-v1-5-inpaintingBetaText-to-Image • RunwayMLStable Diffusion Inpainting is a latent text-to-image diffusion model capable of generating photo-realistic images given any text input, with the extra capability of inpainting the pictures by using a mask.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/runwayml/stable-diffusion-v1-5-inpainting/)[![Stability.ai logo](https://developers.cloudflare.com/_astro/stabilityai.CmlmNdqR.svg)stable-diffusion-xl-base-1.0BetaText-to-Image • Stability.aiDiffusion-based text-to-image generative model by Stability AI. 
Generates and modifies images based on text prompts.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/stabilityai/stable-diffusion-xl-base-1.0/)[![BAAI logo](https://developers.cloudflare.com/_astro/baai.mOtdbKlV.svg)bge-large-en-v1.5Text Embeddings • BAAIBAAI general embedding (Large) model that transforms any given text into a 1024-dimensional vectorCloudflare-hostedBatch](https://developers.cloudflare.com/ai/models/@cf/baai/bge-large-en-v1.5/)[![BAAI logo](https://developers.cloudflare.com/_astro/baai.mOtdbKlV.svg)bge-small-en-v1.5Text Embeddings • BAAIBAAI general embedding (Small) model that transforms any given text into a 384-dimensional vectorCloudflare-hostedBatch](https://developers.cloudflare.com/ai/models/@cf/baai/bge-small-en-v1.5/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)llama-2-7b-chat-fp16Text Generation • MetaFull precision (fp16) generative text model with 7 billion parameters from MetaCloudflare-hostedDeprecated](https://developers.cloudflare.com/ai/models/@cf/meta/llama-2-7b-chat-fp16/)[![MistralAI logo](https://developers.cloudflare.com/_astro/mistralai.Bn9UMUMu.svg)mistral-7b-instruct-v0.1Text Generation • MistralAIInstruct fine-tuned version of the Mistral-7b generative text model with 7 billion parametersCloudflare-hostedLoRADeprecated](https://developers.cloudflare.com/ai/models/@cf/mistral/mistral-7b-instruct-v0.1/)[![BAAI logo](https://developers.cloudflare.com/_astro/baai.mOtdbKlV.svg)bge-base-en-v1.5Text Embeddings • BAAIBAAI general embedding (Base) model that transforms any given text into a 768-dimensional vectorCloudflare-hostedBatch](https://developers.cloudflare.com/ai/models/@cf/baai/bge-base-en-v1.5/)[![HuggingFace logo](https://developers.cloudflare.com/_astro/huggingface.ngjt5u2J.svg)distilbert-sst-2-int8Text Classification • HuggingFaceDistilled BERT model that was finetuned on SST-2 for sentiment 
classificationCloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/huggingface/distilbert-sst-2-int8/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)llama-2-7b-chat-int8Text Generation • MetaQuantized (int8) generative text model with 7 billion parameters from MetaCloudflare-hostedDeprecated](https://developers.cloudflare.com/ai/models/@cf/meta/llama-2-7b-chat-int8/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)m2m100-1.2bTranslation • MetaMultilingual encoder-decoder (seq-to-seq) model trained for Many-to-Many multilingual translationCloudflare-hostedBatch](https://developers.cloudflare.com/ai/models/@cf/meta/m2m100-1.2b/)[![Microsoft logo](https://developers.cloudflare.com/_astro/microsoft.LujcDJ--.svg)resnet-50Image Classification • Microsoft50 layers deep image classification CNN trained on more than 1M images from ImageNetCloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/microsoft/resnet-50/)[![OpenAI logo](https://developers.cloudflare.com/_astro/openai.BI8PEEzI.svg)whisperAutomatic Speech Recognition • OpenAIWhisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/openai/whisper/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)llama-3.1-70b-instructText Generation • MetaThe Meta Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction tuned generative models. 
The Llama 3.1 instruction tuned text only models are optimized for multilingual dialogue use cases and outperform many of the available open source and closed chat models on common industry benchmarks.Cloudflare-hostedDeprecated](https://developers.cloudflare.com/ai/models/@cf/meta/llama-3.1-70b-instruct/)[![Meta logo](https://developers.cloudflare.com/_astro/meta.BR4nfp35.svg)llama-3.1-8b-instruct-fastText Generation • Meta\[Fast version\] The Meta Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction tuned generative models. The Llama 3.1 instruction tuned text only models are optimized for multilingual dialogue use cases and outperform many of the available open source and closed chat models on common industry benchmarks.Cloudflare-hosted](https://developers.cloudflare.com/ai/models/@cf/meta/llama-3.1-8b-instruct-fast/)

```json
{"@context":"https://schema.org","@type":"TechArticle","@id":"https://developers.cloudflare.com/ai/models/#page","headline":"Models · Cloudflare AI docs","url":"https://developers.cloudflare.com/ai/models/","inLanguage":"en","image":"https://developers.cloudflare.com/dev-products-preview.png","publisher":{"@type":"Organization","name":"Cloudflare","url":"https://www.cloudflare.com/"},"isPartOf":{"@type":"WebSite","@id":"https://developers.cloudflare.com/#website","name":"Cloudflare Docs","url":"https://developers.cloudflare.com/"}}
```

---

---
title: Related products
description: Explore Cloudflare products that complement AI, including Workers AI, AI Gateway, Vectorize, and more.
image: https://developers.cloudflare.com/dev-products-preview.png
---

# Related products

**[Workers AI](https://developers.cloudflare.com/workers-ai/)** 

Run machine learning models on Cloudflare's GPU-powered infrastructure with serverless inference.

**[AI Gateway](https://developers.cloudflare.com/ai-gateway/)** 

Observe and control your AI applications with caching, rate limiting, and analytics.

**[Agents](https://developers.cloudflare.com/agents/)** 

Build AI-powered agents to perform tasks, persist state, and interact with external services.

**[AI Search](https://developers.cloudflare.com/ai-search/)** 

Create fully managed RAG pipelines for your AI applications.

**[Vectorize](https://developers.cloudflare.com/vectorize/)** 

Store, query, and manage high-dimensional vector databases for AI embeddings.

**[AI Crawl Control](https://developers.cloudflare.com/ai-crawl-control/)** 

Analyze and control third-party AI crawlers on your website.

**[Browser Rendering](https://developers.cloudflare.com/browser-run/)** 

Control and interact with headless browser instances for AI data extraction.

**[Cloudflare Agent](https://developers.cloudflare.com/cloudflare-agent/)** 

An AI-powered assistant that helps you navigate, configure, and manage Cloudflare.

**[Dynamic Workers](https://developers.cloudflare.com/dynamic-workers/)** 

Spin up isolated Workers on demand to execute code.

**[Sandbox SDK](https://developers.cloudflare.com/sandbox-sdk/)** 

Build secure, isolated code execution environments.

```json
{"@context":"https://schema.org","@type":"WebPage","@id":"https://developers.cloudflare.com/ai/related-products/#page","headline":"Related products · Cloudflare AI docs","description":"Explore Cloudflare products that complement AI, including Workers AI, AI Gateway, Vectorize, and more.","url":"https://developers.cloudflare.com/ai/related-products/","inLanguage":"en","image":"https://developers.cloudflare.com/dev-products-preview.png","dateModified":"2026-04-20","publisher":{"@type":"Organization","name":"Cloudflare","url":"https://www.cloudflare.com/"},"isPartOf":{"@type":"WebSite","@id":"https://developers.cloudflare.com/#website","name":"Cloudflare Docs","url":"https://developers.cloudflare.com/"}}
```

---

---
title: Build Agents on Cloudflare
description: Create stateful AI agents with persistent memory, real-time WebSocket connections, and scheduled tasks using the Cloudflare Agents SDK.
image: https://developers.cloudflare.com/dev-products-preview.png
---

> Documentation Index  
> Fetch the complete documentation index at: https://developers.cloudflare.com/agents/llms.txt  
> Use this file to discover all available pages before exploring further. 

# Build Agents on Cloudflare

Build and host Agents on Cloudflare. Connect chat, voice, email, Slack, and webhooks to a durable agent runtime with Browser, Sandbox, AI Search, MCP, Payments, and other tools.

When you host agents on Cloudflare, each agent session has a durable identity, local SQL storage, real-time connections, scheduled work, and recoverable execution.

Deploy once and Cloudflare runs your agents across its global network, scaling to tens of millions of instances. No infrastructure to manage, no sessions to reconstruct, no state to externalize.

**Communication channels:** [Chat](https://developers.cloudflare.com/agents/communication-channels/chat/) · [Email](https://developers.cloudflare.com/agents/communication-channels/email/) · [Voice](https://developers.cloudflare.com/agents/communication-channels/voice/) · [Slack](https://developers.cloudflare.com/agents/communication-channels/slack/) · [Webhook](https://developers.cloudflare.com/agents/communication-channels/webhooks/)

**Agent harness:** Controls planning, tool use, and response flow. [Project Think](https://developers.cloudflare.com/agents/harnesses/think/) · [Build-your-own agent](https://developers.cloudflare.com/agents/runtime/agents-api/)

**Agents SDK runtime:** Durable identity, state, connections, scheduling, and recovery. [Agent class](https://developers.cloudflare.com/agents/runtime/agents-api/) · [State](https://developers.cloudflare.com/agents/runtime/lifecycle/state/) · [Sessions](https://developers.cloudflare.com/agents/runtime/lifecycle/sessions/) · [Routing](https://developers.cloudflare.com/agents/runtime/communication/routing/) · [WebSockets](https://developers.cloudflare.com/agents/runtime/communication/websockets/) · [Scheduling](https://developers.cloudflare.com/agents/runtime/execution/schedule-tasks/) · [Fibers](https://developers.cloudflare.com/agents/runtime/execution/durable-execution/)

**Tools:** [Sandbox](https://developers.cloudflare.com/agents/tools/sandbox/) · [MCP](https://developers.cloudflare.com/agents/tools/mcp/) · [Browser](https://developers.cloudflare.com/agents/tools/browser/) · [AI Search](https://developers.cloudflare.com/agents/tools/ai-search/) · [Payments](https://developers.cloudflare.com/agents/tools/payments/)

**Observability:** [Logs, metrics, and traces](https://developers.cloudflare.com/agents/runtime/operations/observability/)

Agents on Cloudflare are composed from four parts:

* **Communication channels** define how users and systems reach your agent, such as [chat](https://developers.cloudflare.com/agents/communication-channels/chat/), [voice](https://developers.cloudflare.com/agents/communication-channels/voice/), [email](https://developers.cloudflare.com/agents/communication-channels/email/), [Slack](https://developers.cloudflare.com/agents/communication-channels/slack/), [webhooks](https://developers.cloudflare.com/agents/communication-channels/webhooks/), and other event sources.
* **The agent harness** defines the loop: how the agent calls models, selects tools, handles tool results, streams responses, and decides whether to continue. Use [Project Think](https://developers.cloudflare.com/agents/harnesses/think/) for an opinionated harness, or build your own loop directly on the [Agents SDK runtime](https://developers.cloudflare.com/agents/runtime/agents-api/).
* **The Agents SDK runtime** provides durable infrastructure: the [Agent class](https://developers.cloudflare.com/agents/runtime/lifecycle/agent-class/), [state](https://developers.cloudflare.com/agents/runtime/lifecycle/state/), [sessions](https://developers.cloudflare.com/agents/runtime/lifecycle/sessions/), [routing](https://developers.cloudflare.com/agents/runtime/communication/routing/), [WebSockets](https://developers.cloudflare.com/agents/runtime/communication/websockets/), [scheduling](https://developers.cloudflare.com/agents/runtime/execution/schedule-tasks/), [fibers](https://developers.cloudflare.com/agents/runtime/execution/durable-execution/), and [observability](https://developers.cloudflare.com/agents/runtime/operations/observability/).
* **Tools** give the agent capabilities: [browser automation](https://developers.cloudflare.com/agents/tools/browser/), [sandboxed code execution](https://developers.cloudflare.com/agents/tools/sandbox/), [AI Search](https://developers.cloudflare.com/agents/tools/ai-search/), [MCP tools](https://developers.cloudflare.com/agents/tools/mcp/), and [payments](https://developers.cloudflare.com/agents/tools/payments/). [Code Mode](https://developers.cloudflare.com/agents/tools/codemode/) lets models discover and orchestrate multiple tools by writing code.
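
The harness loop described above can be sketched in a few lines. This is a minimal, self-contained illustration of the idea, not the Agents SDK or Project Think API: `ModelStep`, `Tool`, `Model`, `runLoop`, and the stub model are hypothetical names invented for this example, and the loop is synchronous for brevity.

```ts
// Hypothetical types for illustration; not part of the Agents SDK.
type ModelStep =
  | { kind: "tool_call"; tool: string; input: string }
  | { kind: "final"; text: string };

type Tool = (input: string) => string;
type Model = (history: string[]) => ModelStep;

// Run the loop: call the model, execute any tool it selects, feed the
// result back, and stop on a final answer or when maxSteps is reached.
function runLoop(
  model: Model,
  tools: Record<string, Tool>,
  prompt: string,
  maxSteps = 5,
): string {
  const history: string[] = [`user: ${prompt}`];
  for (let step = 0; step < maxSteps; step++) {
    const next = model(history);
    if (next.kind === "final") {
      return next.text;
    }
    const tool = tools[next.tool];
    const result = tool ? tool(next.input) : `error: unknown tool ${next.tool}`;
    history.push(`tool ${next.tool}: ${result}`);
  }
  return "stopped: step limit reached";
}

// Stub model: requests the "add" tool once, then answers with its result.
const stubModel: Model = (history) => {
  const last = history[history.length - 1] ?? "";
  const prefix = "tool add: ";
  if (last.startsWith(prefix)) {
    return { kind: "final", text: `The sum is ${last.slice(prefix.length)}` };
  }
  return { kind: "tool_call", tool: "add", input: "2,3" };
};

const tools: Record<string, Tool> = {
  add: (input) =>
    String(input.split(",").map(Number).reduce((a, b) => a + b, 0)),
};

const answer = runLoop(stubModel, tools, "What is 2 + 3?");
console.log(answer);
```

The step limit is the simplest form of the "decides whether to continue" rule. A real harness would also stream partial responses and persist the history between turns.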

### Get started

Three commands to a running agent. No API keys required — the starter uses [Workers AI](https://developers.cloudflare.com/workers-ai/) by default.

```sh
npx create-cloudflare@latest --template cloudflare/agents-starter
cd agents-starter && npm install
npm run dev
```

The starter includes streaming AI chat, server-side and client-side tools, human-in-the-loop approval, and task scheduling — a foundation you can build on or tear apart. You can also swap in [OpenAI, Anthropic, Google Gemini, or any other provider](https://developers.cloudflare.com/agents/runtime/operations/using-ai-models/).

### Example agents

**[Chat agent](https://developers.cloudflare.com/agents/examples/chat-agent/)** 

Build a streaming AI chat agent with tools and human-in-the-loop approvals.

**[Slack agent](https://developers.cloudflare.com/agents/examples/slack-agent/)** 

Build an agent that responds to Slack messages, mentions, and commands.

**[Voice agent](https://developers.cloudflare.com/agents/examples/voice-agent/)** 

Build a real-time voice agent with speech-to-text and text-to-speech.

**[Browser agent](https://developers.cloudflare.com/agents/examples/browser-agent/)** 

Build an agent that can inspect pages, capture screenshots, and use browser tools.

**[Email agent](https://developers.cloudflare.com/agents/examples/email-agent/)** 

Build an agent that sends, receives, routes, and replies to email.

```json
{"@context":"https://schema.org","@type":"WebPage","@id":"https://developers.cloudflare.com/agents/#page","headline":"Agents · Cloudflare Agents docs","description":"Create stateful AI agents with persistent memory, real-time WebSocket connections, and scheduled tasks using the Cloudflare Agents SDK.","url":"https://developers.cloudflare.com/agents/","inLanguage":"en","image":"https://developers.cloudflare.com/dev-products-preview.png","dateModified":"2026-06-24","publisher":{"@type":"Organization","name":"Cloudflare","url":"https://www.cloudflare.com/"},"isPartOf":{"@type":"WebSite","@id":"https://developers.cloudflare.com/#website","name":"Cloudflare Docs","url":"https://developers.cloudflare.com/"},"keywords":["AI"]}
```

---

---
title: AI Crawl Control
description: Monitor and control how AI services access your website content.
image: https://developers.cloudflare.com/core-services-preview.png
---

> Documentation Index  
> Fetch the complete documentation index at: https://developers.cloudflare.com/ai-crawl-control/llms.txt  
> Use this file to discover all available pages before exploring further. 

# AI Crawl Control

 Available on all plans 

Monitor and control how AI services access your website content.

AI companies use web content to train their models and power AI applications. AI Crawl Control (formerly AI Audit) gives you visibility into which AI services are accessing your content, and provides tools to manage access according to your preferences.

With AI Crawl Control, you can:

* **See which AI services access your content** \- Monitor the dashboard to see crawler activity and request patterns
* **Control access with granular policies** \- Set allow or block rules for individual crawlers
* **Monitor robots.txt compliance** \- Track which crawlers follow your directives and create enforcement rules
* **Explore monetization options** \- Set up pay per crawl pricing for content access [(private beta)](https://developers.cloudflare.com/ai-crawl-control/features/pay-per-crawl/what-is-pay-per-crawl/)
* **Deploy with zero configuration** \- Works automatically on all Cloudflare plans

[ Get started ](https://developers.cloudflare.com/ai-crawl-control/get-started/) 

---

## Features

###  Manage AI crawlers 

Control how AI crawlers interact with your domain.

[ Manage AI crawlers ](https://developers.cloudflare.com/ai-crawl-control/features/manage-ai-crawlers/) 

###  Analyze AI traffic 

Gain insight into how AI crawlers are interacting with your pages.

[ Analyze AI traffic ](https://developers.cloudflare.com/ai-crawl-control/features/analyze-ai-traffic/) 

###  Track robots.txt 

Track the health of `robots.txt` files and identify which crawlers are violating your directives.

[ Track robots.txt ](https://developers.cloudflare.com/ai-crawl-control/features/track-robots-txt/) 
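
To make "violating your directives" concrete, the sketch below checks whether a requested path falls under a `Disallow` rule for a given crawler. It is a simplified illustration, not how AI Crawl Control evaluates compliance: it handles only `User-agent` and `Disallow` lines with plain prefix matching and exact user-agent names, and `parseDisallows`, `violatesRobots`, and `ExampleBot` are hypothetical names.

```ts
// Simplified robots.txt check: supports User-agent and Disallow lines with
// plain prefix matching only (no wildcards, no Allow rules).
function parseDisallows(robotsTxt: string): Map<string, string[]> {
  const rules = new Map<string, string[]>();
  let agents: string[] = [];
  let inRules = false; // true once the current group has seen a Disallow line
  for (const rawLine of robotsTxt.split("\n")) {
    const line = rawLine.replace(/#.*$/, "").trim(); // drop comments
    const sep = line.indexOf(":");
    if (sep === -1) continue;
    const field = line.slice(0, sep).trim().toLowerCase();
    const value = line.slice(sep + 1).trim();
    if (field === "user-agent") {
      if (inRules) {
        // A User-agent line after rules starts a new group.
        agents = [];
        inRules = false;
      }
      const agent = value.toLowerCase();
      agents.push(agent);
      if (!rules.has(agent)) rules.set(agent, []);
    } else if (field === "disallow") {
      inRules = true;
      if (value !== "") {
        for (const agent of agents) rules.get(agent)?.push(value);
      }
    }
  }
  return rules;
}

function violatesRobots(robotsTxt: string, userAgent: string, path: string): boolean {
  const rules = parseDisallows(robotsTxt);
  // Use the crawler's own group if present, otherwise fall back to "*".
  const group = rules.get(userAgent.toLowerCase()) ?? rules.get("*") ?? [];
  return group.some((prefix) => path.startsWith(prefix));
}

const robots = [
  "User-agent: ExampleBot",
  "Disallow: /private/",
  "",
  "User-agent: *",
  "Disallow: /drafts/",
].join("\n");

console.log(violatesRobots(robots, "ExampleBot", "/private/report"));
console.log(violatesRobots(robots, "OtherBot", "/drafts/post"));
```

A crawler with its own group is checked against that group only, so `ExampleBot` requesting `/drafts/post` is not flagged here even though the `*` group disallows it.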

###  Pay Per Crawl 

Allow AI crawlers to access content by paying per crawl.

[ Pay per crawl ](https://developers.cloudflare.com/ai-crawl-control/features/pay-per-crawl/what-is-pay-per-crawl/) 

---

## Use cases

Publishers and content creators 

Publishers and content creators can monitor which AI crawlers are accessing their articles and educational content. Set policies to allow beneficial crawlers while blocking others.

E-commerce and business sites 

E-commerce and business sites can identify AI crawler activity on product pages and business information. Control access to sensitive data like pricing and inventory.

Documentation sites 

Documentation sites can track how AI crawlers are accessing their technical documentation. Gain insight into how AI crawlers are engaging with your site.

---

## Related products

**[Bots](https://developers.cloudflare.com/bots/)** 

Identify and mitigate automated traffic to protect your domain from bad bots.

**[Web Application Firewall](https://developers.cloudflare.com/waf/)** 

Get automatic protection from vulnerabilities and the flexibility to create custom rules.

**[Analytics](https://developers.cloudflare.com/analytics/)** 

View and analyze traffic on your domain.

```json
{"@context":"https://schema.org","@type":"WebPage","@id":"https://developers.cloudflare.com/ai-crawl-control/#page","headline":"Overview · Cloudflare AI Crawl Control docs","description":"Monitor and control how AI services access your website content.","url":"https://developers.cloudflare.com/ai-crawl-control/","inLanguage":"en","image":"https://developers.cloudflare.com/core-services-preview.png","dateModified":"2026-04-23","publisher":{"@type":"Organization","name":"Cloudflare","url":"https://www.cloudflare.com/"},"isPartOf":{"@type":"WebSite","@id":"https://developers.cloudflare.com/#website","name":"Cloudflare Docs","url":"https://developers.cloudflare.com/"},"keywords":["AI"]}
```

---

---
title: Cloudflare AI Gateway
description: Observe and control your AI applications with analytics, caching, rate limiting, and model fallback through AI Gateway.
image: https://developers.cloudflare.com/dev-products-preview.png
---

> Documentation Index  
> Fetch the complete documentation index at: https://developers.cloudflare.com/ai-gateway/llms.txt  
> Use this file to discover all available pages before exploring further. 

# Cloudflare AI Gateway

Observe and control your AI applications.

 Available on all plans 

Cloudflare's AI Gateway gives you visibility into and control over your AI apps. By connecting your apps to AI Gateway, you can use analytics and logging to see how people use your application, then control how it scales with features such as caching, rate limiting, request retries, and model fallback. It only takes one line of code to get started.

Check out the [Get started guide](https://developers.cloudflare.com/ai-gateway/get-started/) to learn how to configure your applications with AI Gateway.
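
The "one line" is typically the base URL your client sends requests to. The sketch below builds such a URL; the `gateway.ai.cloudflare.com/v1/{account}/{gateway}/{provider}` shape is an assumption made for illustration and should be confirmed against the Get started guide, and `gatewayBaseUrl` plus the identifiers passed to it are hypothetical placeholders.

```ts
// Build a provider base URL that routes requests through an AI Gateway.
// The URL shape is assumed for illustration; confirm it in the guide.
function gatewayBaseUrl(accountId: string, gatewayId: string, provider: string): string {
  const parts = [accountId, gatewayId, provider].map((part) => encodeURIComponent(part));
  return `https://gateway.ai.cloudflare.com/v1/${parts.join("/")}`;
}

// Placeholder identifiers, not real account values.
const baseUrl = gatewayBaseUrl("ACCOUNT_ID", "my-gateway", "openai");
console.log(baseUrl);
```

Under that assumption, you would pass `baseUrl` as the base URL option of your provider's client in place of the provider's default endpoint, leaving the rest of the application unchanged.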

## Features

###  Models 

Explore all AI models available through AI Gateway, including OpenAI, Anthropic, Google, and more.

[ Browse models ](https://developers.cloudflare.com/ai/models/) 

###  Analytics 

View metrics such as the number of requests, tokens, and the cost it takes to run your application.

[ View Analytics ](https://developers.cloudflare.com/ai-gateway/observability/analytics/) 

###  Logging 

Gain insight into requests and errors.

[ View Logging ](https://developers.cloudflare.com/ai-gateway/observability/logging/) 

###  Caching 

Serve requests directly from Cloudflare's cache instead of the original model provider for faster requests and cost savings.

[ Use Caching ](https://developers.cloudflare.com/ai-gateway/features/caching/) 

###  Rate limiting 

Control how your application scales by limiting the number of requests your application receives.

[ Use Rate limiting ](https://developers.cloudflare.com/ai-gateway/features/rate-limiting/) 
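
As a rough illustration of what a request limit does, here is a fixed-window limiter. It is a conceptual sketch only: AI Gateway applies rate limiting for you through configuration, and `FixedWindowLimiter` is a hypothetical class that says nothing about how the product implements it.

```ts
// Fixed-window rate limiter: allow at most `limit` requests per window.
class FixedWindowLimiter {
  private readonly limit: number;
  private readonly windowMs: number;
  private windowStart = 0;
  private count = 0;

  constructor(limit: number, windowMs: number) {
    this.limit = limit;
    this.windowMs = windowMs;
  }

  // Returns true if a request arriving at `nowMs` is allowed.
  allow(nowMs: number): boolean {
    if (nowMs - this.windowStart >= this.windowMs) {
      // The previous window has elapsed, so start a new one.
      this.windowStart = nowMs;
      this.count = 0;
    }
    if (this.count >= this.limit) return false;
    this.count += 1;
    return true;
  }
}

// Two requests per second: the third request in the window is rejected.
const limiter = new FixedWindowLimiter(2, 1000);
const decisions = [0, 100, 200, 1000].map((t) => limiter.allow(t));
console.log(decisions);
```

A fixed window is the simplest scheme to reason about; a sliding window smooths out bursts at window boundaries at the cost of tracking more state.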

###  Request retry and fallback 

Improve resilience by defining request retry and model fallbacks in case of an error.

[ Use Request retry and fallback ](https://developers.cloudflare.com/ai-gateway/features/dynamic-routing/) 
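
The retry-then-fallback pattern can be sketched as follows. This is a generic illustration under stated assumptions, not AI Gateway's configuration format or API: `callWithFallback` and the model entries are hypothetical, and the calls are synchronous for brevity where real provider calls would be asynchronous.

```ts
// A call returns a response string or throws on error.
type Attempt = () => string;

interface ModelTarget {
  name: string;
  call: Attempt;
}

// Try each model in order; retry a failing model up to `retries` extra
// times before falling back to the next one.
function callWithFallback(
  models: ModelTarget[],
  retries: number,
): { model: string; response: string } {
  const errors: string[] = [];
  for (const model of models) {
    for (let attempt = 0; attempt <= retries; attempt++) {
      try {
        return { model: model.name, response: model.call() };
      } catch (err) {
        errors.push(`${model.name} attempt ${attempt + 1}: ${String(err)}`);
      }
    }
  }
  throw new Error(`all models failed: ${errors.join("; ")}`);
}

// The primary always fails here, so the fallback serves the request.
let primaryCalls = 0;
const outcome = callWithFallback(
  [
    {
      name: "primary",
      call: () => {
        primaryCalls += 1;
        throw new Error("503");
      },
    },
    { name: "fallback", call: () => "ok" },
  ],
  1,
);
console.log(outcome.model, primaryCalls);
```

Collecting the per-attempt errors keeps the final failure message useful when every model in the chain is unavailable.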

###  Your favorite providers 

Workers AI, Anthropic, Google Gemini, OpenAI, Replicate, and more work with AI Gateway.

[ Use Your favorite providers ](https://developers.cloudflare.com/ai-gateway/usage/providers/) 

---

## Related products

**[Workers AI](https://developers.cloudflare.com/workers-ai/)** 

Run machine learning models, powered by serverless GPUs, on Cloudflare’s global network.

**[Vectorize](https://developers.cloudflare.com/vectorize/)** 

Build full-stack AI applications with Vectorize, Cloudflare's vector database. Adding Vectorize lets you perform tasks such as semantic search, recommendations, and anomaly detection, or provide context and memory to an LLM.

## More resources

[Developer Discord](https://discord.cloudflare.com) 

Connect with the Workers community on Discord to ask questions, show what you are building, and discuss the platform with other developers.

[Use cases](https://developers.cloudflare.com/use-cases/ai/) 

Learn how you can build and deploy ambitious AI applications to Cloudflare's global network.

[@CloudflareDev](https://x.com/cloudflaredev) 

Follow @CloudflareDev on Twitter to learn about product announcements and what is new in Cloudflare Workers.

```json
{"@context":"https://schema.org","@type":"WebPage","@id":"https://developers.cloudflare.com/ai-gateway/#page","headline":"Overview · Cloudflare AI Gateway docs","description":"Observe and control your AI applications with analytics, caching, rate limiting, and model fallback through AI Gateway.","url":"https://developers.cloudflare.com/ai-gateway/","inLanguage":"en","image":"https://developers.cloudflare.com/dev-products-preview.png","dateModified":"2026-04-20","publisher":{"@type":"Organization","name":"Cloudflare","url":"https://www.cloudflare.com/"},"isPartOf":{"@type":"WebSite","@id":"https://developers.cloudflare.com/#website","name":"Cloudflare Docs","url":"https://developers.cloudflare.com/"},"keywords":["AI"]}
{"@context":"https://schema.org","@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"item":{"@id":"/directory/","name":"Directory"}},{"@type":"ListItem","position":2,"item":{"@id":"/ai-gateway/","name":"AI Gateway"}}]}
```

---

---
title: Overview
description: Cloudflare AI Search is a managed search service. Index your content and query it with natural language from a Workers binding, REST API, or MCP server.
image: https://developers.cloudflare.com/dev-products-preview.png
---

> Documentation Index  
> Fetch the complete documentation index at: https://developers.cloudflare.com/ai-search/llms.txt  
> Use this file to discover all available pages before exploring further. 

# Overview

The search primitive for your applications and agents.

 Available on all plans 

AI Search lets you add search to any application or agent without having to build an entire retrieval infrastructure. Create an instance, give it your data, and search it with natural language.

You can use AI Search for:

* Documentation and knowledge base search
* AI agent tool use and memory
* Per-tenant or per-agent file search

[ Get started ](https://developers.cloudflare.com/ai-search/get-started/)[ Watch AI Search demo ](https://www.youtube.com/watch?v=JUFdbkiDN2U)

---

## Features

###  Automated indexing 

Automatically and continuously index your data source, keeping your content fresh without manual reprocessing.

[ View indexing ](https://developers.cloudflare.com/ai-search/configuration/indexing/syncing/) 

###  Metadata filtering 

Define custom metadata fields and filter search results by category, version, language, or any attribute you define.

[ Add filters ](https://developers.cloudflare.com/ai-search/configuration/retrieval/filtering/) 
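
Conceptually, a metadata filter keeps only the results whose attributes match the values you ask for. The sketch below shows that idea with exact-match filtering over in-memory results; the `SearchResult` shape and function name are hypothetical and do not reflect the AI Search API or its filter syntax.

```typescript
// Hypothetical result shape for illustration.
interface SearchResult {
  id: string;
  score: number;
  metadata: Record<string, string>;
}

// Keeps only results whose metadata matches every key/value pair in `filter`.
function filterByMetadata(
  results: SearchResult[],
  filter: Record<string, string>
): SearchResult[] {
  return results.filter((result) =>
    Object.keys(filter).every((key) => result.metadata[key] === filter[key])
  );
}

const sampleResults: SearchResult[] = [
  { id: "intro", score: 0.92, metadata: { category: "guide", language: "en" } },
  { id: "changelog", score: 0.88, metadata: { category: "release", language: "en" } },
  { id: "einfuehrung", score: 0.81, metadata: { category: "guide", language: "de" } },
];

// Only English guides remain.
const englishGuides = filterByMetadata(sampleResults, { category: "guide", language: "en" });
```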

###  Hybrid search 

Combine semantic and keyword matching in the same query for more accurate results.

[ Configure hybrid search ](https://developers.cloudflare.com/ai-search/configuration/indexing/hybrid-search/) 
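
One common way to merge a semantic ranking with a keyword ranking is reciprocal rank fusion (RRF). The sketch below illustrates that technique only; it is an assumption for explanation and not a statement of how AI Search combines results internally. The constant `k = 60` is a conventional default.

```typescript
// Reciprocal rank fusion: each list contributes 1 / (k + rank) per document,
// with rank starting at 1. Illustrative only.
function reciprocalRankFusion(rankings: string[][], k: number = 60): string[] {
  const scores: Record<string, number> = {};
  for (const ranking of rankings) {
    ranking.forEach((id, index) => {
      scores[id] = (scores[id] || 0) + 1 / (k + index + 1);
    });
  }
  // Sort document IDs by fused score, highest first.
  return Object.keys(scores).sort((a, b) => scores[b] - scores[a]);
}

const semanticRanking = ["doc-a", "doc-b", "doc-c"];
const keywordRanking = ["doc-b", "doc-d", "doc-a"];
const fused = reciprocalRankFusion([semanticRanking, keywordRanking]);
```

Documents that rank well in both lists (here `doc-b` and `doc-a`) rise above documents that appear in only one.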

###  MCP and UI snippets 

Every instance includes a built-in MCP endpoint for AI agents and embeddable search components for your website.

[ Connect agents ](https://developers.cloudflare.com/ai-search/api/search/mcp/) 

---

## Related products

**[Workers AI](https://developers.cloudflare.com/workers-ai/)** 

Run machine learning models, powered by serverless GPUs, on Cloudflare's global network.

**[AI Gateway](https://developers.cloudflare.com/ai-gateway/)** 

Observe and control your AI applications with caching, rate limiting, request retries, model fallback, and more.

**[Vectorize](https://developers.cloudflare.com/vectorize/)** 

Build full-stack AI applications with Vectorize, Cloudflare's vector database.

**[Workers](https://developers.cloudflare.com/workers/)** 

Build serverless applications and deploy instantly across the globe for exceptional performance, reliability, and scale.

**[R2](https://developers.cloudflare.com/r2/)** 

Store large amounts of unstructured data without the costly egress bandwidth fees associated with typical cloud storage services.

---

## More resources

[Get started](https://developers.cloudflare.com/ai-search/get-started/) 

Create your first AI Search instance and run your first query.

[Developer Discord](https://discord.cloudflare.com) 

Connect with the Workers community on Discord to ask questions, share what you are building, and discuss the platform with other developers.

[@CloudflareDev](https://x.com/cloudflaredev) 

Follow @CloudflareDev on Twitter to learn about product announcements and what is new in Cloudflare Workers.

```json
{"@context":"https://schema.org","@type":"WebPage","@id":"https://developers.cloudflare.com/ai-search/#page","headline":"Cloudflare AI Search · Cloudflare AI Search docs","description":"Cloudflare AI Search is a managed search service. Index your content and query it with natural language from a Workers binding, REST API, or MCP server.","url":"https://developers.cloudflare.com/ai-search/","inLanguage":"en","image":"https://developers.cloudflare.com/dev-products-preview.png","dateModified":"2026-06-19","publisher":{"@type":"Organization","name":"Cloudflare","url":"https://www.cloudflare.com/"},"isPartOf":{"@type":"WebSite","@id":"https://developers.cloudflare.com/#website","name":"Cloudflare Docs","url":"https://developers.cloudflare.com/"},"keywords":["AI"]}
{"@context":"https://schema.org","@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"item":{"@id":"/directory/","name":"Directory"}},{"@type":"ListItem","position":2,"item":{"@id":"/ai-search/","name":"AI Search"}}]}
```

---

---
title: Browser Run
description: Control headless browsers with Cloudflare's Workers Browser Run API. Automate tasks, take screenshots, convert pages to PDFs, and test web apps.
image: https://developers.cloudflare.com/dev-products-preview.png
---

> Documentation Index  
> Fetch the complete documentation index at: https://developers.cloudflare.com/browser-run/llms.txt  
> Use this file to discover all available pages before exploring further. 

# Browser Run

Run headless Chrome on [Cloudflare's global network](https://developers.cloudflare.com/workers/) for browser automation, web scraping, testing, and content generation.

 Available on Free and Paid plans 

Browser Run, formerly known as Browser Rendering, enables developers to programmatically control and interact with headless browser instances running on Cloudflare’s global network.

## Use cases

Programmatically load and fully render dynamic webpages or raw HTML and capture specific outputs such as:

* [Markdown](https://developers.cloudflare.com/browser-run/quick-actions/markdown-endpoint/)
* [Screenshots](https://developers.cloudflare.com/browser-run/quick-actions/screenshot-endpoint/)
* [PDFs](https://developers.cloudflare.com/browser-run/quick-actions/pdf-endpoint/)
* [Snapshots](https://developers.cloudflare.com/browser-run/quick-actions/snapshot/)
* [Links](https://developers.cloudflare.com/browser-run/quick-actions/links-endpoint/)
* [HTML elements](https://developers.cloudflare.com/browser-run/quick-actions/scrape-endpoint/)
* [Structured data](https://developers.cloudflare.com/browser-run/quick-actions/json-endpoint/)
* [Crawled web content](https://developers.cloudflare.com/browser-run/quick-actions/crawl-endpoint/)

## Integration methods

Browser Run offers two categories of integration methods:

* **[Quick Actions](https://developers.cloudflare.com/browser-run/quick-actions/)**: Simple, stateless browser tasks like screenshots, PDFs, and scraping. No code deployment needed.
* **Browser Sessions**: Direct browser control via [Puppeteer](https://developers.cloudflare.com/browser-run/puppeteer/), [Playwright](https://developers.cloudflare.com/browser-run/playwright/), [CDP](https://developers.cloudflare.com/browser-run/cdp/), or [Stagehand](https://developers.cloudflare.com/browser-run/stagehand/). Deploy within Cloudflare Workers or connect from any environment via CDP.

| Use case                                    | Recommended                                                                                                                                                                                                  | Why                                                              |
| ------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | ---------------------------------------------------------------- |
| Simple screenshot, PDF, or scrape           | [Quick Actions](https://developers.cloudflare.com/browser-run/quick-actions/)                                                                                                                                | No code deployment; single HTTP request                          |
| Browser automation                          | [Playwright](https://developers.cloudflare.com/browser-run/playwright/), [Puppeteer](https://developers.cloudflare.com/browser-run/puppeteer/), or [CDP](https://developers.cloudflare.com/browser-run/cdp/) | Full browser control with scripting                              |
| Porting existing scripts                    | [Puppeteer](https://developers.cloudflare.com/browser-run/puppeteer/), [Playwright](https://developers.cloudflare.com/browser-run/playwright/), or [CDP](https://developers.cloudflare.com/browser-run/cdp/) | Minimal code changes from standard libraries                     |
| AI-powered data extraction                  | [JSON endpoint](https://developers.cloudflare.com/browser-run/quick-actions/json-endpoint/)                                                                                                                  | Structured data via natural language prompts                     |
| Site-wide crawling                          | [Crawl endpoint](https://developers.cloudflare.com/browser-run/quick-actions/crawl-endpoint/)                                                                                                                | Multi-page content extraction with async results                 |
| AI agent browsing                           | [Playwright MCP](https://developers.cloudflare.com/browser-run/playwright/playwright-mcp/) or [CDP with MCP clients](https://developers.cloudflare.com/browser-run/cdp/mcp-clients/)                         | LLMs control browsers via MCP                                    |
| Resilient scraping                          | [Stagehand](https://developers.cloudflare.com/browser-run/stagehand/)                                                                                                                                        | AI finds elements by intent, not selectors                       |
| Direct browser control from any environment | [CDP](https://developers.cloudflare.com/browser-run/cdp/)                                                                                                                                                    | WebSocket access from local machines, CI/CD, or external servers |

## Key features

* **Scale to thousands of browsers**: Instant access to a global pool of browsers with low cold-start time, ideal for high-volume screenshot generation, data extraction, or automation at scale
* **Global by default**: Browser sessions run on Cloudflare's edge network, so browsers open close to your users for better speed and availability worldwide
* **Easy to integrate**: [Quick Actions](https://developers.cloudflare.com/browser-run/quick-actions/) for common tasks, [Puppeteer](https://developers.cloudflare.com/browser-run/puppeteer/) and [Playwright](https://developers.cloudflare.com/browser-run/playwright/) for complex workflows, and [CDP](https://developers.cloudflare.com/browser-run/cdp/) for direct browser control from any environment
* **Session management**: [Reuse browser sessions](https://developers.cloudflare.com/browser-run/features/reuse-sessions/) across requests to improve performance and reduce cold-start overhead
* **Flexible pricing**: Pay only for the browser time you use, with a generous free tier ([view pricing](https://developers.cloudflare.com/browser-run/pricing/))

## Related products

**[Workers](https://developers.cloudflare.com/workers/)** 

Build serverless applications and deploy instantly across the globe for exceptional performance, reliability, and scale.

**[Durable Objects](https://developers.cloudflare.com/durable-objects/)** 

A globally distributed coordination API with strongly consistent storage. Using Durable Objects to [persist browser sessions](https://developers.cloudflare.com/browser-run/how-to/browser-run-with-do/) improves performance by eliminating the time that it takes to spin up a new browser session.

**[Agents](https://developers.cloudflare.com/agents/)** 

Build AI-powered agents that autonomously navigate websites and perform tasks using [Playwright MCP](https://developers.cloudflare.com/browser-run/playwright/playwright-mcp/) or [Stagehand](https://developers.cloudflare.com/browser-run/stagehand/).

## More resources

[Get started](https://developers.cloudflare.com/browser-run/get-started/) 

Choose an integration method and deploy your first project.

[Limits](https://developers.cloudflare.com/browser-run/limits/) 

Learn about Browser Run limits.

[Pricing](https://developers.cloudflare.com/browser-run/pricing/) 

Learn about Browser Run pricing.

[Playwright API](https://developers.cloudflare.com/browser-run/playwright/) 

Use Cloudflare's fork of Playwright for testing and automation.

[Developer Discord](https://discord.cloudflare.com) 

Connect with the Workers community on Discord to ask questions, show what you are building, and discuss the platform with other developers.

[@CloudflareDev](https://x.com/cloudflaredev) 

Follow @CloudflareDev on Twitter to learn about product announcements and what is new in Cloudflare Workers.

```json
{"@context":"https://schema.org","@type":"WebPage","@id":"https://developers.cloudflare.com/browser-run/#page","headline":"Browser Run · Cloudflare Browser Run docs","description":"Control headless browsers with Cloudflare's Workers Browser Run API. Automate tasks, take screenshots, convert pages to PDFs, and test web apps.","url":"https://developers.cloudflare.com/browser-run/","inLanguage":"en","image":"https://developers.cloudflare.com/dev-products-preview.png","dateModified":"2026-04-21","publisher":{"@type":"Organization","name":"Cloudflare","url":"https://www.cloudflare.com/"},"isPartOf":{"@type":"WebSite","@id":"https://developers.cloudflare.com/#website","name":"Cloudflare Docs","url":"https://developers.cloudflare.com/"}}
{"@context":"https://schema.org","@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"item":{"@id":"/directory/","name":"Directory"}},{"@type":"ListItem","position":2,"item":{"@id":"/browser-run/","name":"Browser Run"}}]}
```

---

---
title: Agent Lee
description: Ask questions, run diagnostics, and take actions across your Cloudflare account using an AI-powered dashboard assistant.
image: https://developers.cloudflare.com/dev-products-preview.png
---

> Documentation Index  
> Fetch the complete documentation index at: https://developers.cloudflare.com/agent-lee/llms.txt  
> Use this file to discover all available pages before exploring further. 

# Agent Lee

An AI co-pilot built into the Cloudflare dashboard. Ask questions about your account, take actions, and run diagnostics, all in plain language.

Beta

Agent Lee is currently in beta and only available to accounts on a Free plan. Features and behaviors may change.

With Agent Lee, you can:

* Ask questions about your account configuration and get answers based on your actual data.
* Make changes to DNS records, zone settings, and security rules, with your approval required before anything executes.
* Run network diagnostics like DNS lookups and certificate checks.
* Generate inline charts and visualizations from your account analytics.

To get started, log in to the [Cloudflare dashboard ↗](https://dash.cloudflare.com) and select **Ask AI** in the upper-right corner of any dashboard page.

---

## Capabilities

### Account-aware answers

Agent Lee answers based on your actual account data, not just documentation. When you ask a question, it fetches your zone configuration, DNS records, and security settings before responding.

### Write operations

You can ask Agent Lee to create, update, or delete resources across your account using natural language. Every write operation requires your explicit approval before it executes: Agent Lee shows you exactly what it plans to do and waits for confirmation.

Example requests:

* "Add an A record for blog.example.com pointing to 192.0.2.10."
* "Enable Always Use HTTPS on my zone."
* "Set the SSL mode for example.com to Full (strict)."
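
The approval step follows a general human-in-the-loop pattern, which can be sketched as below. This is a simplified illustration, not Agent Lee's actual code; `PendingWrite` and `runWithApproval` are hypothetical names.

```typescript
// Hypothetical description of a change awaiting confirmation.
interface PendingWrite {
  description: string; // human-readable summary shown to the user
  execute: () => string; // the change to apply once approved
}

// Runs the write only if the approver returns true; otherwise nothing executes.
function runWithApproval(
  write: PendingWrite,
  approve: (description: string) => boolean
): string {
  if (!approve(write.description)) {
    return "cancelled";
  }
  return write.execute();
}
```

The key design point is that `execute` is never called on the rejection path, so a declined request has no side effects.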

### Network diagnostics

Run diagnostic commands to troubleshoot connectivity and configuration issues:

* **DNS lookups**: Query DNS records for any domain
* **Certificate checks**: Inspect TLS/SSL certificates
* **Domain information**: Look up WHOIS and RDAP registration data

### Generative UI

Agent Lee renders inline charts and data visualizations directly in the chat panel based on your account analytics. Example requests:

* "Show me a chart of my traffic over the last 7 days."
* "What does my error rate look like for the past 24 hours?"

---

## Data access and privacy

### What Agent Lee can access

* Zone settings, DNS records, firewall and WAF rules
* Workers scripts, routes, and bindings
* R2 bucket names, Cloudflare Tunnel configuration, cache rules
* Registrar domain data, account plan and usage metadata

Agent Lee fetches this data on demand when your question requires it.

### What Agent Lee cannot access

* Payment methods, billing history, or invoice details
* Account passwords, login credentials, or API tokens
* Raw log data or Logpush datasets
* Data from other Cloudflare accounts

### Conversation storage

Conversations are stored per user using [Durable Objects](https://developers.cloudflare.com/durable-objects/), isolated to your account. Conversation data is retained for one year in accordance with Cloudflare's data retention policy. Agent Lee does not currently reference previous conversation context when responding.

### Data usage

Agent Lee does not currently use your conversations, prompts, or account data to train AI models, nor do we share your data with other Cloudflare customers. Should these practices change in the future, we will provide advance notice to keep you informed. For Cloudflare's authoritative data handling commitments, refer to the [Cloudflare Privacy Policy ↗](https://www.cloudflare.com/privacypolicy/).

---

## Limitations

Agent Lee cannot:

* Write Workers scripts or generate application code
* Replace [Cloudflare Support ↗](https://support.cloudflare.com) for billing issues, account recovery, or outages
* Access payment methods, billing history, or API tokens
* Operate across multiple accounts: sessions are scoped to your authenticated account
* Remember previous conversations: each session starts fresh
* Query raw log data or Logpush datasets
* Execute write operations without your explicit approval

Agent Lee is entirely optional. If you do not open the Ask AI panel, none of your data is sent to or processed by it.

---

## Built on Cloudflare

Agent Lee is built on Cloudflare's own developer platform using the same primitives available to any Cloudflare developer.

| Component                                                                                                | Role                                                  |
| -------------------------------------------------------------------------------------------------------- | ----------------------------------------------------- |
| [Agents SDK](https://developers.cloudflare.com/agents/)                                                  | Agent lifecycle, state management, and scheduling     |
| [Durable Objects](https://developers.cloudflare.com/durable-objects/)                                    | Per-user conversation storage and write approval gate |
| [Workers AI](https://developers.cloudflare.com/workers-ai/)                                              | LLM inference                                         |
| [Cloudflare MCP server](https://developers.cloudflare.com/agents/model-context-protocol/apis/agent-api/) | Tool definitions for Cloudflare API operations        |

---

## Related resources

* [Agents SDK](https://developers.cloudflare.com/agents/)
* [Human in the Loop](https://developers.cloudflare.com/agents/concepts/agentic-patterns/human-in-the-loop/)
* [Workers AI](https://developers.cloudflare.com/workers-ai/)
* [Blog post: Introducing Agent Lee ↗](https://blog.cloudflare.com/introducing-agent-lee)

```json
{"@context":"https://schema.org","@type":"WebPage","@id":"https://developers.cloudflare.com/agent-lee/#page","headline":"Overview · Agent Lee docs","description":"Ask questions, run diagnostics, and take actions across your Cloudflare account using an AI-powered dashboard assistant.","url":"https://developers.cloudflare.com/agent-lee/","inLanguage":"en","image":"https://developers.cloudflare.com/dev-products-preview.png","dateModified":"2026-06-03","publisher":{"@type":"Organization","name":"Cloudflare","url":"https://www.cloudflare.com/"},"isPartOf":{"@type":"WebSite","@id":"https://developers.cloudflare.com/#website","name":"Cloudflare Docs","url":"https://developers.cloudflare.com/"},"keywords":["AI"]}
{"@context":"https://schema.org","@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"item":{"@id":"/directory/","name":"Directory"}},{"@type":"ListItem","position":2,"item":{"@id":"/agent-lee/","name":"Agent Lee"}}]}
```

---

---
title: Dynamic Workers
description: Spin up isolated Workers on demand to execute code.
image: https://developers.cloudflare.com/dev-products-preview.png
---

> Documentation Index  
> Fetch the complete documentation index at: https://developers.cloudflare.com/dynamic-workers/llms.txt  
> Use this file to discover all available pages before exploring further. 

# Dynamic Workers

Spin up Workers at runtime to execute code on-demand in a secure, sandboxed environment.

Dynamic Workers let you spin up an unlimited number of Workers to execute arbitrary code specified at runtime. Dynamic Workers can be used as a lightweight alternative to containers for securely sandboxing code you don't trust.

Dynamic Workers are the lowest-level primitive for spinning up a Worker, giving you full control over defining how the Worker is composed, which bindings it receives, whether it can reach the network, and more.

### Get started

Deploy the [Dynamic Workers Playground ↗](https://github.com/cloudflare/agents/tree/main/examples/dynamic-workers-playground) to create and run Workers dynamically from code you write or import from GitHub, with real-time logs and observability.

[![Deploy to Cloudflare](https://deploy.workers.cloudflare.com/button)](https://deploy.workers.cloudflare.com/?url=https://github.com/dinasaur404/dynamic-workers-playground)

## Use Dynamic Workers for

Use this pattern when code needs to run quickly in a secure, isolated environment.

* **AI Agent "Code Mode"**: LLMs are trained to write code. Instead of supplying an agent with tool calls to perform tasks, give it an API and let it write and execute code. Save up to 80% in inference tokens and cost by allowing the agent to programmatically process data instead of sending it all through the LLM.
* **AI-generated applications / "Vibe Code"**: Run generated code for prototypes, projects, and automations in a secure, isolated sandboxed environment.
* **Fast development and previews**: Load prototypes, previews, and playgrounds in milliseconds.
* **Custom automations**: Create custom tools on the fly that execute a task, call an integration, or automate a workflow.
* **Platforms**: Run applications uploaded by your users.

## Features

Because you compose the Worker that runs the code at runtime, you control how that Worker is configured and what it can access.

* **[Bindings](https://developers.cloudflare.com/dynamic-workers/usage/bindings/)**: Decide which bindings and structured data the dynamic Worker receives.
* **[Observability](https://developers.cloudflare.com/dynamic-workers/usage/observability/)**: Attach Tail Workers and capture logs for each run.
* **[Network access](https://developers.cloudflare.com/dynamic-workers/usage/egress-control/)**: Intercept or block Internet access for outbound requests.
* **[Limits](https://developers.cloudflare.com/dynamic-workers/usage/limits/)**: Enforce custom limits on the dynamic Worker's resource usage.
* **[Durable Object Facets](https://developers.cloudflare.com/dynamic-workers/usage/durable-object-facets/)**: Run dynamically-loaded code as a Durable Object with its own isolated SQLite storage.

```json
{"@context":"https://schema.org","@type":"WebPage","@id":"https://developers.cloudflare.com/dynamic-workers/#page","headline":"Dynamic Workers · Cloudflare Dynamic Workers docs","description":"Spin up isolated Workers on demand to execute code.","url":"https://developers.cloudflare.com/dynamic-workers/","inLanguage":"en","image":"https://developers.cloudflare.com/dev-products-preview.png","dateModified":"2026-04-21","publisher":{"@type":"Organization","name":"Cloudflare","url":"https://www.cloudflare.com/"},"isPartOf":{"@type":"WebSite","@id":"https://developers.cloudflare.com/#website","name":"Cloudflare Docs","url":"https://developers.cloudflare.com/"}}
{"@context":"https://schema.org","@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"item":{"@id":"/directory/","name":"Directory"}},{"@type":"ListItem","position":2,"item":{"@id":"/dynamic-workers/","name":"Dynamic Workers"}}]}
```

---

---
title: Cloudflare Vectorize
description: Build full-stack AI applications with Vectorize, Cloudflare's vector database.
image: https://developers.cloudflare.com/dev-products-preview.png
---

> Documentation Index  
> Fetch the complete documentation index at: https://developers.cloudflare.com/vectorize/llms.txt  
> Use this file to discover all available pages before exploring further. 

# Cloudflare Vectorize

Build full-stack AI applications with Vectorize, Cloudflare's powerful vector database.

Vectorize is a globally distributed vector database that enables you to build full-stack, AI-powered applications with [Cloudflare Workers](https://developers.cloudflare.com/workers/). Vectorize makes querying embeddings (representations of values or objects like text, images, and audio that are designed to be consumed by machine learning models and semantic search algorithms) faster, easier, and more affordable.

Vectorize is now Generally Available

To report bugs or give feedback, go to the [#vectorize Discord channel ↗](https://discord.cloudflare.com). If you are having issues with Wrangler, report issues in the [Wrangler GitHub repository ↗](https://github.com/cloudflare/workers-sdk/issues/new/choose).

For example, by storing the embeddings (vectors) generated by a machine learning model, including those built into [Workers AI](https://developers.cloudflare.com/workers-ai/) or those you bring from platforms like OpenAI, you can build applications with powerful search, similarity, recommendation, classification, and anomaly detection capabilities based on your own data.

The vectors returned can reference images stored in Cloudflare R2, documents in KV, and/or user profiles stored in D1 — enabling you to go from vector search result to concrete object all within the Workers platform, and without standing up additional infrastructure.
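
To show what a similarity query does conceptually, the sketch below ranks a few in-memory vectors by cosine similarity and returns the closest IDs, which an application could then use as keys into R2, KV, or D1. It is a brute-force illustration with hypothetical names, not the Vectorize API, and it assumes all vectors are non-zero and share the same dimension.

```typescript
// Hypothetical stored vector: an ID plus its embedding values.
interface StoredVector {
  id: string;
  values: number[];
}

// Cosine similarity of two equal-length, non-zero vectors.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Returns the IDs of the `topK` stored vectors most similar to `query`.
function nearestIds(query: number[], vectors: StoredVector[], topK: number): string[] {
  return vectors
    .map((v) => ({ id: v.id, score: cosineSimilarity(query, v.values) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, topK)
    .map((match) => match.id);
}
```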

---

## Features

###  Vector database 

Learn how to create your first Vectorize database, upload vector embeddings, and query those embeddings from [Cloudflare Workers](https://developers.cloudflare.com/workers/).

[ Create your Vector database ](https://developers.cloudflare.com/vectorize/get-started/intro/) 

###  Vector embeddings using Workers AI 

Learn how to use Vectorize to generate vector embeddings using Workers AI.

[ Create vector embeddings using Workers AI ](https://developers.cloudflare.com/vectorize/get-started/embeddings/) 

###  Search using Vectorize and AI Search 

Learn how to automatically index your data and store it in Vectorize, then query it to generate context-aware responses using AI Search.

[ Build a RAG with Vectorize ](https://developers.cloudflare.com/ai-search/) 

---

## Related products

**[Workers AI](https://developers.cloudflare.com/workers-ai/)** 

Run machine learning models, powered by serverless GPUs, on Cloudflare’s global network.

**[R2 Storage](https://developers.cloudflare.com/r2/)** 

Store large amounts of unstructured data without the costly egress bandwidth fees associated with typical cloud storage services.

---

## More resources

[Limits](https://developers.cloudflare.com/vectorize/platform/limits/) 

Learn about Vectorize limits and how to work within them.

[Use cases](https://developers.cloudflare.com/use-cases/ai/) 

Learn how you can build and deploy ambitious AI applications to Cloudflare's global network.

[Storage options](https://developers.cloudflare.com/workers/platform/storage-options/) 

Learn more about the storage and database options you can build on with Workers.

[Developer Discord](https://discord.cloudflare.com) 

Connect with the Workers community on Discord to ask questions, join the `#vectorize` channel to show what you are building, and discuss the platform with other developers.

[@CloudflareDev](https://x.com/cloudflaredev) 

Follow @CloudflareDev on Twitter to learn about product announcements and what is new in the Cloudflare Developer Platform.

```json
{"@context":"https://schema.org","@type":"WebPage","@id":"https://developers.cloudflare.com/vectorize/#page","headline":"Overview · Cloudflare Vectorize docs","description":"Build full-stack AI applications with Vectorize, Cloudflare's vector database.","url":"https://developers.cloudflare.com/vectorize/","inLanguage":"en","image":"https://developers.cloudflare.com/dev-products-preview.png","dateModified":"2026-04-21","publisher":{"@type":"Organization","name":"Cloudflare","url":"https://www.cloudflare.com/"},"isPartOf":{"@type":"WebSite","@id":"https://developers.cloudflare.com/#website","name":"Cloudflare Docs","url":"https://developers.cloudflare.com/"},"keywords":["AI"]}
{"@context":"https://schema.org","@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"item":{"@id":"/directory/","name":"Directory"}},{"@type":"ListItem","position":2,"item":{"@id":"/vectorize/","name":"Vectorize"}}]}
```

---

---
title: Cloudflare Workers AI
description: Run machine learning models, powered by serverless GPUs, on Cloudflare's global network.
image: https://developers.cloudflare.com/dev-products-preview.png
---

> Documentation Index  
> Fetch the complete documentation index at: https://developers.cloudflare.com/workers-ai/llms.txt  
> Use this file to discover all available pages before exploring further. 

# Cloudflare Workers AI

Run machine learning models, powered by serverless GPUs, on Cloudflare's global network.

 Available on Free and Paid plans 

Workers AI allows you to run AI models in a serverless way, without having to worry about scaling, maintaining, or paying for unused infrastructure. You can invoke models running on GPUs on Cloudflare's network from your own code — from [Workers](https://developers.cloudflare.com/workers/), [Pages](https://developers.cloudflare.com/pages/), or anywhere via [the Cloudflare API](https://developers.cloudflare.com/api/resources/ai/methods/run/).
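As a rough illustration of the "anywhere via the Cloudflare API" path, the sketch below builds and sends a request to the model run endpoint. The account ID, API token, and model name (`@cf/meta/llama-3.1-8b-instruct`) are placeholder assumptions, and `buildRunRequest` and `runModel` are hypothetical helper names used only for this example. Check the API reference and model catalog for the exact input fields each model accepts.

```typescript
// Sketch: calling a Workers AI model through the Cloudflare REST API.
// Account ID, API token, and model name are placeholders (assumptions).

interface RunRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Build the HTTP request for the model run endpoint without sending it.
function buildRunRequest(
  accountId: string,
  apiToken: string,
  model: string,
  input: Record<string, unknown>,
): RunRequest {
  return {
    url: `https://api.cloudflare.com/client/v4/accounts/${accountId}/ai/run/${model}`,
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiToken}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify(input),
    },
  };
}

// Send the request and return the parsed JSON response body.
async function runModel(
  accountId: string,
  apiToken: string,
  model: string,
  input: Record<string, unknown>,
): Promise<unknown> {
  const { url, init } = buildRunRequest(accountId, apiToken, model, input);
  const res = await fetch(url, init);
  if (!res.ok) {
    throw new Error(`Workers AI request failed with status ${res.status}`);
  }
  return res.json();
}
```

A call might look like `runModel(accountId, apiToken, "@cf/meta/llama-3.1-8b-instruct", { prompt: "Hello" })`. Keeping request construction separate from the network call makes the URL and headers easy to inspect before anything is sent.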

Workers AI gives you access to:

* **50+ [open-source models](https://developers.cloudflare.com/workers-ai/models/)**, available as part of our model catalog
* Serverless, **pay-for-what-you-use** [pricing model](https://developers.cloudflare.com/workers-ai/platform/pricing/)
* A **fully featured developer platform**, including [AI Gateway](https://developers.cloudflare.com/ai-gateway/), [Vectorize](https://developers.cloudflare.com/vectorize/), [Workers](https://developers.cloudflare.com/workers/), and more

[ Get started ](https://developers.cloudflare.com/workers-ai/get-started)[ Watch a Workers AI demo ](https://youtu.be/cK%5FleoJsBWY?si=4u6BIy%5FuBOZf9Ve8)

Custom requirements

If you have custom requirements like private custom models or higher limits, complete the [Custom Requirements Form ↗](https://forms.gle/axnnpGDb6xrmR31T6). Cloudflare will contact you with next steps.

Workers AI is now Generally Available

To report bugs or give feedback, go to the [#workers-ai Discord channel ↗](https://discord.cloudflare.com). If you are having issues with Wrangler, report issues in the [Wrangler GitHub repository ↗](https://github.com/cloudflare/workers-sdk/issues/new/choose).

---

## Features

###  Models 

Workers AI comes with a curated set of popular open-source models that enable you to do tasks such as image classification, text generation, object detection and more.

[ Browse models ](https://developers.cloudflare.com/workers-ai/models/) 

---

## Related products

**[AI Gateway](https://developers.cloudflare.com/ai-gateway/)** 

Observe and control your AI applications with caching, rate limiting, request retries, model fallback, and more.

**[Vectorize](https://developers.cloudflare.com/vectorize/)** 

Build full-stack AI applications with Vectorize, Cloudflare’s vector database. Adding Vectorize enables you to perform tasks such as semantic search, recommendations, and anomaly detection, or to provide context and memory to an LLM.

**[Workers](https://developers.cloudflare.com/workers/)** 

Build serverless applications and deploy instantly across the globe for exceptional performance, reliability, and scale.

**[Pages](https://developers.cloudflare.com/pages/)** 

Create full-stack applications that are instantly deployed to the Cloudflare global network.

**[R2](https://developers.cloudflare.com/r2/)** 

Store large amounts of unstructured data without the costly egress bandwidth fees associated with typical cloud storage services.

**[D1](https://developers.cloudflare.com/d1/)** 

Create new serverless SQL databases to query from your Workers and Pages projects.

**[Durable Objects](https://developers.cloudflare.com/durable-objects/)** 

A globally distributed coordination API with strongly consistent storage.

**[KV](https://developers.cloudflare.com/kv/)** 

Create a global, low-latency key-value data store.

---

## More resources

[Get started](https://developers.cloudflare.com/workers-ai/get-started/workers-wrangler/) 

Build and deploy your first Workers AI application.

[Plans](https://developers.cloudflare.com/workers-ai/platform/pricing/) 

Learn about Free and Paid plans.

[Limits](https://developers.cloudflare.com/workers-ai/platform/limits/) 

Learn about Workers AI limits.

[Use cases](https://developers.cloudflare.com/use-cases/ai/) 

Learn how you can build and deploy ambitious AI applications to Cloudflare's global network.

[Storage options](https://developers.cloudflare.com/workers/platform/storage-options/) 

Learn which storage option is best for your project.

[Developer Discord](https://discord.cloudflare.com) 

Connect with the Workers community on Discord to ask questions, share what you are building, and discuss the platform with other developers.

[@CloudflareDev](https://x.com/cloudflaredev) 

Follow @CloudflareDev on Twitter to learn about product announcements and what is new in Cloudflare Workers.

```json
{"@context":"https://schema.org","@type":"WebPage","@id":"https://developers.cloudflare.com/workers-ai/#page","headline":"Overview · Cloudflare Workers AI docs","description":"Run machine learning models, powered by serverless GPUs, on Cloudflare's global network.","url":"https://developers.cloudflare.com/workers-ai/","inLanguage":"en","image":"https://developers.cloudflare.com/dev-products-preview.png","dateModified":"2026-04-21","publisher":{"@type":"Organization","name":"Cloudflare","url":"https://www.cloudflare.com/"},"isPartOf":{"@type":"WebSite","@id":"https://developers.cloudflare.com/#website","name":"Cloudflare Docs","url":"https://developers.cloudflare.com/"},"keywords":["AI"]}
{"@context":"https://schema.org","@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"item":{"@id":"/directory/","name":"Directory"}},{"@type":"ListItem","position":2,"item":{"@id":"/workers-ai/","name":"Workers AI"}}]}
```
