Cloudflare Workers AI runs serverless inference at the edge with 90+ open-source models.
| Model | Model ID |
|---|---|
| gemma-7b-it | `@hf/google/gemma-7b-it` |
| gemma-7b-it-lora | `@cf/google/gemma-7b-it-lora` |
| mistral-7b-instruct-v0.1 | `@cf/mistral/mistral-7b-instruct-v0.1` |
| mistral-7b-instruct-v0.1-awq | `@hf/thebloke/mistral-7b-instruct-v0.1-awq` |
| mistral-7b-instruct-v0.2 | `@hf/mistral/mistral-7b-instruct-v0.2` |
| mistral-7b-instruct-v0.2-lora | `@cf/mistral/mistral-7b-instruct-v0.2-lora` |
| glm-4.7-flash | `@cf/zai-org/glm-4.7-flash` |
| llama-4-scout-17b-16e-instruct | `@cf/meta/llama-4-scout-17b-16e-instruct` |
| llama-3.3-70b-instruct-fp8-fast | `@cf/meta/llama-3.3-70b-instruct-fp8-fast` |
| llama-3.2-11b-vision-instruct | `@cf/meta/llama-3.2-11b-vision-instruct` |
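
As a rough illustration of how a model ID from the table might be used, the sketch below builds a request for the Workers AI REST endpoint. It assumes a text-generation model that accepts a `prompt` field. The helper name `buildRunRequest` and the `ACCOUNT_ID` and `API_TOKEN` values are hypothetical placeholders, and the `@cf/` and `@hf/` prefix check is inferred only from the IDs listed above. The code only constructs the request and does not send it.

```typescript
// Sketch only: builds (but does not send) a Workers AI REST request.
// accountId and apiToken are placeholders the caller must supply.
interface RunRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

function buildRunRequest(
  accountId: string,
  apiToken: string,
  modelId: string,
  prompt: string
): RunRequest {
  // Assumption: catalog IDs start with "@cf/" or "@hf/", as in the table above.
  if (!modelId.startsWith("@cf/") && !modelId.startsWith("@hf/")) {
    throw new Error(`Unexpected model ID: ${modelId}`);
  }
  return {
    url: `https://api.cloudflare.com/client/v4/accounts/${accountId}/ai/run/${modelId}`,
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiToken}`,
        "Content-Type": "application/json",
      },
      // Assumption: the chosen model takes a plain text prompt.
      body: JSON.stringify({ prompt }),
    },
  };
}

const example = buildRunRequest(
  "ACCOUNT_ID",
  "API_TOKEN",
  "@cf/meta/llama-3.3-70b-instruct-fp8-fast",
  "Say hello."
);
console.log(example.url);
```

The returned `url` and `init` can be passed to `fetch(example.url, example.init)` once real credentials are substituted. Other model types, such as the vision model in the table, may expect different input fields.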