|
--- |
|
title: ExplainAnything.AI |
|
emoji: π |
|
colorFrom: blue |
|
colorTo: purple |
|
sdk: gradio |
|
sdk_version: "4.20.0" |
|
app_file: app.py |
|
pinned: true |
|
--- |
|
|
|
|
|
|
|
|
|
# ExplainAnything.AI |
|
|
|
**Track:** agent-demo-track |
|
|
|
**ExplainAnything.AI** is a multimodal agent that helps users understand any science topic β either by **asking a question** or **uploading an image or PDF**. It then builds an interactive explainer package with: |
|
- π§ Easy-to-understand explanation (via Mistral) |
|
- πΌοΈ Auto-generated visual diagram (via Flux) |
|
- β Quiz questions to test understanding (via Mistral) |
|
- π Downloadable report summarizing everything |
|
|
|
--- |
|
|
|
## π How It Works |
|
|
|
You have two options to get started: |
|
1. **Ask a question**, e.g. *"How do solar panels work?"* |
|
2. **Upload an image or PDF**, like a diagram or worksheet. |
|
|
|
The agent then: |
|
- Uses **Gemini Vision** (for image/PDF) or **your question** as input |
|
- Generates an explanation using **Mistral** |
|
- Creates a visual diagram with **Flux** |
|
- Generates quiz questions using **Mistral** |
|
- Compiles everything into a downloadable **learning report** |
|
|
|
--- |
|
|
|
## π§ Build Status |
|
|
|
This Space is currently **building**, but all logic, tools, and functionality were submitted **before the deadline**. |
|
|
|
--- |
|
|
|
## π₯ Video Overview |
|
|
|
**Video Overview:** [Coming Soon β will be added post-deadline] |
|
|
|
*A short walkthrough of the appβs flow and learning experience will be uploaded here.* |
|
|
|
--- |
|
|
|
## π οΈ Tech Stack |
|
|
|
- **Mistral** β for science explanations + quiz generation |
|
- **Flux** β for diagram/image generation |
|
- **Gemini Vision** β for reading image and PDF content |
|
- **Gradio** β chat + upload interface |
|
- Manual orchestration (no MCP yet) |
|
|
|
--- |
|
|
|
## π Use Cases |
|
|
|
- π§βπ Students learning STEM with visual + interactive help |
|
- π©βπ« Teachers turning textbook pages into visual lessons |
|
- π§ Self-learners asking "why/how" questions and getting full reports |
|
|
|
--- |
|
|
|
## π§ Future Plans |
|
|
|
- Integrate Hugging Face MCP for agent orchestration |
|
- Add TTS narration for accessibility |
|
- Generate downloadable PDF learning packs |
|
|
|
|