Skip to content

Demos and architectures

Workers AI can be used to build dynamic and performant services. The following demo applications and reference architectures showcase how to use Workers AI optimally within your architecture.

Demos

Explore the following demo applications for Workers AI.

  • Gamertown Customer Support Assistant: A RAG based AI Chat app that uses Vectorize to access video game data for employees of Gamertown.
  • Jobs At Conf: A job lisiting website to add jobs you find at in-person conferences. Built with Cloudflare Pages, R2, D1, Queues, and Workers AI.
  • LoRA News Summarizer: This application uses Cloudflare Workers AI, Streamlit, and Beautiful Soup to summarize input news article URLs in a variety of tones.
  • shrty.dev: A URL shortener that makes use of KV and Workers Analytics Engine. The admin interface uses Function Calling. Go Shorty!
  • Fanfic Generator: This application uses Cloudflare Workers AI, Streamlit, and AstraDB to generate personal scifi fanfiction.
  • Homie - Home Automation using Function Calling: A home automation tool that uses AI Function calling to change the color of lightbulbs in your home.
  • Hackathon Helper: A series of starters for Hackathons. Get building quicker! Python, Streamlit, Workers, and Pages starters for all your AI needs!
  • NBA Finals Polling and Predictor: This stateful polling application uses Cloudflare Workers AI, Cloudflare Pages, Cloudflare Durable Objects, and Hono to keep track of users' votes for different basketball teams and generates personal predictions for the series.
  • Multimodal AI Translator: This application uses Cloudflare Workers AI to perform multimodal translation of languages via audio and text in the browser.
  • Floor is Llava: This is an example repo to explore using the AI Vision model Llava hosted on Cloudflare Workers AI. This is a SvelteKit app hosted on Pages.
  • Workers AI Object Detector: Detect objects from a webcam in a Cloudflare Worker web app with detr-resnet-50 hosted on Cloudflare using Cloudflare Workers AI.
  • Comically Bad Art Generation: This app uses the wonderful Python UI Framework Streamlit and Cloudflare Workers AI.
  • Whatever-ify: Turn yourself into...whatever. Take a photo, get a description, generate a scene and character, then generate an image based on that calendar.
  • Phoney AI: This application uses Cloudflare Workers AI, Twilio, and AssemblyAI. Your phone is an input and output device.
  • Image Model Streamlit starters: Collection of Streamlit applications that are making use of Cloudflare Workers AI.
  • Vanilla JavaScript Chat Application using Cloudflare Workers AI: A web based chat interface built on Cloudflare Pages that allows for exploring Text Generation models on Cloudflare Workers AI. Design is built using tailwind.

Reference architectures

Explore the following reference architectures that use Workers AI:

  • Content-based asset creation : Combining text-generation models with text-to-image models can lead to powerful AI systems capable of generating visual content based on input prompts. This integration can be achieved through a collaborative framework where a text-generation model generates prompts for the text-to-image model based on input text.
  • Composable AI architecture : The architecture diagram illustrates how AI applications can be built end-to-end on Cloudflare, or single services can be integrated with external infrastructure and services.
  • Multi-vendor AI observability and control : By shifting features such as rate limiting, caching, and error handling to the proxy layer, organizations can apply unified configurations across services and inference service providers.
  • Retrieval Augmented Generation (RAG) : Retrieval-Augmented Generation (RAG) is an innovative approach in natural language processing that integrates retrieval mechanisms with generative models to enhance text generation.
  • Automatic captioning for video uploads : By integrating automatic speech recognition technology into video platforms, content creators, publishers, and distributors can reach a broader audience, including individuals with hearing impairments or those who prefer to consume content in different languages.
  • Ingesting BigQuery Data into Workers AI : You can connect a Cloudflare Worker to get data from Google BigQuery and pass it to Workers AI, to run AI Models, powered by serverless GPUs. This will allow you to enhance data with AI-generated responses, such as detecting the sentiment score of some text or generating tags for an article. This document describes a simple way to get started if you are looking to give Workers AI a try and see how the new and different AI models would perform with your data hosted in BigQuery.
  • Fullstack applications : Full-stack web applications leverage a combination of frontend and backend technologies, collectively forming a stack that powers the entire application. This technology stack encompasses various tools, frameworks, and languages, each serving a specific purpose within the development ecosystem.
  • Serverless image content management : In this reference architecture diagram, we reveal how to leverage various components of Cloudflare’s ecosystem to construct a scalable image management solution. This solution integrates moderation principles via Cloudflare’s Workers AI platform and performs image classification through inference at the edge. The storage of images is handled by Cloudflare’s R2 product, an S3 API-like object storage system, while metadata is stored in a key/value store to enable content augmentation.