File size: 2,026 Bytes
25ec88f
 
 
 
 
 
 
 
 
 
 
72df28d
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
c1fe8cc
72df28d
 
 
 
 
 
 
 
 
 
 
 
 
 
c1fe8cc
 
72df28d
c1fe8cc
72df28d
 
 
 
 
 
 
 
 
c1fe8cc
7c61c59
72df28d
7c61c59
72df28d
7c61c59
 
 
 
72df28d
 
7c61c59
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
---
title: CodeMentor AI
emoji: 🧠
colorFrom: purple
colorTo: blue
sdk: streamlit
sdk_version: "1.30.0"
app_file: app/app.py
pinned: true
---

# CodeMentor AI – ChatGPT for Coding Interviews (Fine-Tuned Flan-T5)

CodeMentor AI is a fine-tuned language model specialized for solving **coding interview questions**, built on top of **TinyLlama-1.1B-Chat**, trained with 20K+ prompts, and deployed with a sleek **ChatGPT-style UI using Streamlit**.

---

##  Features

-  Fine-tuned LLM using HuggingFace Transformers
-  Trained on 20K+ high-quality coding problems (CodeAlpaca dataset)
-  Clean ChatGPT-style frontend built with Streamlit
-  Docker-ready for easy deployment
-  Optimized for local + cloud usage
-  Can run inference via terminal or web UI

---

##  Tech Stack

- `Flan-T5-small` (HuggingFace)
- `Transformers` + `Datasets`
- `Streamlit`
- `Docker` for packaging
- `Render` or `HuggingFace Spaces` for deployment

---

##  Training Details

| Config         | Value                   |
|----------------|-------------------------|
| Model          | `google/flan-t5-small`  |
| Epochs         | 6                       |
| Batch Size     | 1 (with gradient accumulation) |
| Learning Rate  | 5e-5                    |
| Max Length     | 512 tokens              |
| GPU            | GTX 1650 (4GB VRAM)     |
| Total Samples  | ~20,000 examples        |
| Training Time  | ~4 hours                |

---

##  Folder Structure

CodeMentor-AI/
β”‚
β”œβ”€β”€ data/ # Raw + Processed Datasets
β”œβ”€β”€ model/codementor-flan/ # Saved fine-tuned model
β”œβ”€β”€ train/ # Preprocessing + Training scripts
β”œβ”€β”€ app/app.py # Streamlit Chat UI
β”œβ”€β”€ requirements.txt # All dependencies
β”œβ”€β”€ Dockerfile # Docker config
β”œβ”€β”€ render.yaml # Optional Render deployment config


---

##  to Run Locally

```bash
git clone https://github.com/chetan10510/CodeMentor-AI.git
cd CodeMentor-AI
python -m venv .venv
.venv\Scripts\activate       # Windows
pip install -r requirements.txt
streamlit run app/app.py