Extract and visualize layout from PDFs or images
Generate Gradio app code from user requests
MOSS-TTSD: Text to Spoken Dialogue Generation
nanonets ocr / smoldocling / monkey ocr / typhoon ocr