---
title: ExplainAnything.AI
emoji: 📚
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: "4.20.0"
app_file: app.py
pinned: true
---




# ExplainAnything.AI

**Track:** agent-demo-track

**ExplainAnything.AI** is a multimodal agent that helps users understand any science topic, either by **asking a question** or by **uploading an image or PDF**. It then builds an interactive explainer package with:
- 🧠 Easy-to-understand explanation (via Mistral)
- 🖼️ Auto-generated visual diagram (via Flux)
- ❓ Quiz questions to test understanding (via Mistral)
- 📄 Downloadable report summarizing everything

---

## 🚀 How It Works

You have two options to get started:
1. **Ask a question**, e.g. *"How do solar panels work?"*  
2. **Upload an image or PDF**, like a diagram or worksheet.

The agent then:
- Uses **Gemini Vision** (for image/PDF) or **your question** as input
- Generates an explanation using **Mistral**
- Creates a visual diagram with **Flux**
- Generates quiz questions using **Mistral**
- Compiles everything into a downloadable **learning report**
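
The flow above can be sketched as a small orchestration function. This is a minimal illustration, not the Space's actual `app.py`: the names `ExplainerPackage`, `build_package`, and the four callables (`read_upload`, `explain`, `draw`, `make_quiz`) are hypothetical stand-ins for the Gemini Vision, Mistral, and Flux calls, and it assumes the quiz is generated from the explanation text and the report is rendered as Markdown.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class ExplainerPackage:
    """Hypothetical container for one generated learning package."""
    topic: str
    explanation: str
    diagram_path: str
    quiz: List[str]

    def to_report(self) -> str:
        """Render the package as a Markdown learning report (assumed format)."""
        questions = "\n".join(
            f"{i}. {q}" for i, q in enumerate(self.quiz, start=1)
        )
        return (
            f"# {self.topic}\n\n"
            f"## Explanation\n\n{self.explanation}\n\n"
            f"## Diagram\n\n![diagram]({self.diagram_path})\n\n"
            f"## Quiz\n\n{questions}\n"
        )


def build_package(
    question: Optional[str],
    upload_path: Optional[str],
    *,
    read_upload: Callable[[str], str],     # stand-in for the Gemini Vision step
    explain: Callable[[str], str],         # stand-in for the Mistral explanation step
    draw: Callable[[str], str],            # stand-in for the Flux step; returns an image path
    make_quiz: Callable[[str], List[str]]  # stand-in for the Mistral quiz step
) -> ExplainerPackage:
    """Run the manual pipeline: pick the input, explain, draw, quiz."""
    if upload_path is not None:
        # An uploaded image or PDF is first turned into a text topic.
        topic = read_upload(upload_path)
    elif question and question.strip():
        topic = question.strip()
    else:
        raise ValueError("Provide a question or upload an image/PDF.")

    explanation = explain(topic)
    diagram_path = draw(topic)
    quiz = make_quiz(explanation)  # assumption: quiz is based on the explanation
    return ExplainerPackage(topic, explanation, diagram_path, quiz)
```

Passing the model calls in as plain callables keeps the orchestration testable without API keys; each real client can be wrapped in a function with the same signature.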

---

## 🚧 Build Status

This Space is currently **building**, but all logic, tools, and functionality were submitted **before the deadline**.

---

## 🎥 Video Overview

**Video Overview:** [Coming Soon – will be added post-deadline]

*A short walkthrough of the app's flow and learning experience will be uploaded here.*

---

## 🛠️ Tech Stack

- **Mistral** – for science explanations + quiz generation
- **Flux** – for diagram/image generation
- **Gemini Vision** – for reading image and PDF content
- **Gradio** – chat + upload interface
- Manual orchestration (no MCP yet)

---

## 📘 Use Cases

- 🧑‍🎓 Students learning STEM with visual + interactive help
- 👩‍🏫 Teachers turning textbook pages into visual lessons
- 🧠 Self-learners asking "why/how" questions and getting full reports

---

## 🧠 Future Plans

- Integrate Hugging Face MCP for agent orchestration
- Add TTS narration for accessibility
- Generate downloadable PDF learning packs