zhang
AI & ML interests
Recent Activity
Organizations
-
Running77
Browser only - Screen Capture & OCR
πOne-minute creation by AI Coding Autonomous Agent MOUSE-I
-
Running575575
First Agent Template
β‘Get current time in any timezone
-
Runtime error127127
OctoTools
πAn Agentic Framework with Tools for Complex Reasoning
-
Running137137
smolagents LLM leaderboard
πA leaderboard for LLMs powering smolagents
-
Running on Zero1.47k1.47k
Joy Caption Alpha Two
πGenerate captions for images in various styles
-
Running on Zero4040
Florence Llama
π¬Generate responses using images and text
-
trollek/ImagePromptHelper-danube3-500M
Text Generation β’ 0.5B β’ Updated β’ 33 β’ 3 -
trollek/ImagePromptHelper-danube3-500M-GGUF
0.5B β’ Updated β’ 375 β’ 2
-
laion/laion-audio-preview
Viewer β’ Updated β’ 4.15M β’ 1.38k β’ 11 -
Running on Zero2.6k2.6k
F5-TTS
π£F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
Running on L40S2.15k2.15k
FacePoke
πImport a portrait, click to move the head!
-
Running on L4631631
OpenAudio S1
πGenerate speech from text
-
allenai/olmOCR-7B-0225-preview
Image-to-Text β’ 8B β’ Updated β’ 181k β’ 701 -
Runtime error8181
Nanonets OCR
πDemo for Nanonets-OCR
-
Running on ZeroMCP347347
OCR
πolmocr / nanonets ocr / qwen2vl ocr / aya vision / rolmocr
-
Running on ZeroMCP128128
OCR2
π»nanonets ocr / smoldocling / monkey ocr / typhoon ocr
-
Running on Zero1.53k1.53k
Flux.1-dev Upscaler
πUpscale low-resolution images to high resolution
-
Running on Zero426426
InvSR
πImage Super-resolution via Diffusion Inversion
-
Running241241
FLUX Upsacle Image
π₯Upscale images with control and customization
-
Running on L4276276
Thera Arbitrary-Scale Super-Resolution
π₯Enhance image resolution with Thera
-
Djrango/Qwen2vl-Flux
Text-to-Image β’ Updated β’ 507 -
Running on Zero914914
OminiControl
πGenerate an edited image based on text and input image
-
Running on Zero393393
FLUXllama gpt-oss
πmcp_server & FLUX 4-bit Quantization + Enhanced
-
Running on L42.08k2.08k
MagicQuill
πͺΆGenerate edited images using scribble inputs
-
Running77
Browser only - Screen Capture & OCR
πOne-minute creation by AI Coding Autonomous Agent MOUSE-I
-
Running575575
First Agent Template
β‘Get current time in any timezone
-
Runtime error127127
OctoTools
πAn Agentic Framework with Tools for Complex Reasoning
-
Running137137
smolagents LLM leaderboard
πA leaderboard for LLMs powering smolagents
-
allenai/olmOCR-7B-0225-preview
Image-to-Text β’ 8B β’ Updated β’ 181k β’ 701 -
Runtime error8181
Nanonets OCR
πDemo for Nanonets-OCR
-
Running on ZeroMCP347347
OCR
πolmocr / nanonets ocr / qwen2vl ocr / aya vision / rolmocr
-
Running on ZeroMCP128128
OCR2
π»nanonets ocr / smoldocling / monkey ocr / typhoon ocr
-
Running on Zero1.47k1.47k
Joy Caption Alpha Two
πGenerate captions for images in various styles
-
Running on Zero4040
Florence Llama
π¬Generate responses using images and text
-
trollek/ImagePromptHelper-danube3-500M
Text Generation β’ 0.5B β’ Updated β’ 33 β’ 3 -
trollek/ImagePromptHelper-danube3-500M-GGUF
0.5B β’ Updated β’ 375 β’ 2
-
Running on Zero1.53k1.53k
Flux.1-dev Upscaler
πUpscale low-resolution images to high resolution
-
Running on Zero426426
InvSR
πImage Super-resolution via Diffusion Inversion
-
Running241241
FLUX Upsacle Image
π₯Upscale images with control and customization
-
Running on L4276276
Thera Arbitrary-Scale Super-Resolution
π₯Enhance image resolution with Thera
-
Djrango/Qwen2vl-Flux
Text-to-Image β’ Updated β’ 507 -
Running on Zero914914
OminiControl
πGenerate an edited image based on text and input image
-
Running on Zero393393
FLUXllama gpt-oss
πmcp_server & FLUX 4-bit Quantization + Enhanced
-
Running on L42.08k2.08k
MagicQuill
πͺΆGenerate edited images using scribble inputs
-
laion/laion-audio-preview
Viewer β’ Updated β’ 4.15M β’ 1.38k β’ 11 -
Running on Zero2.6k2.6k
F5-TTS
π£F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
Running on L40S2.15k2.15k
FacePoke
πImport a portrait, click to move the head!
-
Running on L4631631
OpenAudio S1
πGenerate speech from text