Spaces:

Agents-MCP-Hackathon
/

ExplainAnything-AI

Sleeping

App Files Files Community

MonaHamid commited on Jun 10

Commit

0cafd22

verified ·

1 Parent(s): 6d08633

Update README.md

Browse files

Files changed (1) hide show

README.md +46 -31

README.md CHANGED Viewed

@@ -12,53 +12,68 @@ pinned: true
-📚 ExplainAnything.AI
-ExplainAnything.AI is an interactive multimodal science explainer built with Gradio. It allows users to:
-Ask science questions or upload diagrams/PDFs
-Get clear explanations using Google Gemini or Mistral
-Generate visual diagrams via FLUX (Stable Diffusion)
-Take auto-generated quizzes to test understanding
-Download personalized science reports in PDF or Markdown
-How It Works
-Ask a Question
-Type a science question (e.g., How do volcanoes erupt?)
-→ Choose level: Kid, Beginner, or Advanced
-Upload a File (Optional)
-You can upload a PDF, image, or diagram to get contextual explanations
-View Explanation + Diagram
-The app generates a clear explanation and a visual illustration using generative models
-Take the Quiz
-Answer auto-generated multiple choice questions to reinforce learning
-Download Report
-Export everything to a PDF or Markdown file for later review
-🔧 Technologies Used
-Gradio (UI)
-Google Gemini API (explanation from images + PDFs)
-Mistral via OpenRouter (text-based science explanations)
-Stable Diffusion FLUX (diagram generation)
-PDFPlumber + FPDF (report creation)
-Hugging Face Spaces (deployment)
- Environment Variables (set under Settings > Secrets)
-Make sure to add the following API keys as secrets in your Space:
-GEMINI_API_KEY
-OPENROUTER_API_KEY
-HF_TOKEN

+# ExplainAnything.AI
+**Track:** agent-demo-track
+**ExplainAnything.AI** is a multimodal agent that helps users understand any science topic — either by **asking a question** or **uploading an image or PDF**. It then builds an interactive explainer package with:
+- 🧠 Easy-to-understand explanation (via Mistral)
+- 🖼️ Auto-generated visual diagram (via Flux)
+- ❓ Quiz questions to test understanding (via Mistral)
+- 📄 Downloadable report summarizing everything
+---
+## 🚀 How It Works
+You have two options to get started:
+1. **Ask a question**, e.g. *"How do solar panels work?"*
+2. **Upload an image or PDF**, like a diagram or worksheet.
+The agent then:
+- Uses **Gemini Vision** (for image/PDF) or **your question** as input
+- Generates an explanation using **Mistral**
+- Creates a visual diagram with **Flux**
+- Generates quiz questions using **Mistral**
+- Compiles everything into a downloadable **learning report**
+---
+## 🚧 Build Status
+This Space is currently **building**, but all logic, tools, and functionality were submitted **before the deadline**.
+---
+## 🎥 Video Overview
+**Video Overview:** [Coming Soon – will be added post-deadline]
+*A short walkthrough of the app’s flow and learning experience will be uploaded here.*
+---
+## 🛠️ Tech Stack
+- **Mistral** – for science explanations + quiz generation
+- **Flux** – for diagram/image generation
+- **Gemini Vision** – for reading image and PDF content
+- **Gradio** – chat + upload interface
+- Manual orchestration (no MCP yet)
+---
+## 📘 Use Cases
+- 🧑‍🎓 Students learning STEM with visual + interactive help
+- 👩‍🏫 Teachers turning textbook pages into visual lessons
+- 🧠 Self-learners asking "why/how" questions and getting full reports
+---
+## 🧠 Future Plans
+- Integrate Hugging Face MCP for agent orchestration
+- Add TTS narration for accessibility
+- Generate downloadable PDF learning packs