A data extraction tool to convert PDF to Markdown and JSON
Scrape a website and download its content as markdown
Convert PDFs to a Hugging Face dataset