Whisper Web
Convert spoken words into text
A collection of my favorite WebML demos, built with Transformers.js!
In-browser background removal
Experiment with and compare different tokenizers
Generate a depth map from an image
A private and powerful AI that runs locally in your browser
Segment objects in your images
In-browser text-to-music w/ Transformers.js!
Real-time object detection w/ Transformers.js
Convert text to speech effortlessly
In-browser WebGPU background removal
Generate text using Transformer models
Draw and get a caption for your doodle
Find images by entering text
Segment objects in images using points
Find images by typing a description
Generate code snippets based on your input
Search music using keywords
Classify text into categories without training
Find objects in images using Transformers.js
Upload an image to segment the face
In-browser speech recognition w/ word-level timestamps
Generate realistic images from textual descriptions
Classify images in real-time using your webcam
Estimate depth from your webcam video
Generate a depth map from an image
Convert spoken words to text
Transcribe voice to text