scdong committed on
Commit bd30f7c · verified · 1 Parent(s): b03ee16

Upload 10 files

Dockerfile ADDED
@@ -0,0 +1,26 @@
+ # Base image
+ FROM python:3.10-slim
+
+ # System dependencies
+ RUN apt-get update && apt-get install -y \
+ build-essential \
+ cmake \
+ git \
+ curl \
+ ninja-build \
+ libopenblas-dev \
+ && apt-get clean
+
+ # Set working directory
+ WORKDIR /app
+
+ # Copy requirements and install
+ COPY requirements.txt .
+ RUN pip install --upgrade pip && \
+ pip install --no-cache-dir --prefer-binary -r requirements.txt
+
+ # Copy the rest of the app
+ COPY . .
+
+ # Run the app
+ CMD ["streamlit", "run", "app_local.py", "--server.port=8501", "--server.address=0.0.0.0"]
Flan-T5_on_Kaggle_Dataset.ipynb ADDED
@@ -0,0 +1,859 @@
+ {
+ "cells": [
+ {
+ "cell_type": "markdown",
+ "id": "01991402-68d2-4cfb-9b3a-22f170ccf74b",
+ "metadata": {},
+ "source": [
+ "# Build a smart assistant to help mental health counselors respond thoughtfully to patients. "
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "f643f217-faf3-4b5a-9f14-63b898a4d7b8",
+ "metadata": {},
+ "source": [
+ "## The Mental Health Chatbot (Multi-Turn with LLM + Classifier) works in **two steps**:\n",
+ "\n",
+ "1. **Understanding the situation**: When you describe a patient's issue, the system uses a machine learning model to figure out what kind of response might be most helpful—like giving advice, validating feelings, asking a follow-up question, or sharing some mental health information.\n",
+ "\n",
+ "2. **Generating a helpful reply**: After the system decides what type of response is appropriate, it asks a language model (Flan-T5) to write a suggestion based on that need. For example, if the model thinks the user needs validation, it will ask the LLM to generate an empathetic and supportive response.\n",
+ "\n"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "0cae93e4-7e58-4497-8047-4a069bc7a6c6",
+ "metadata": {},
+ "source": [
+ "## The following Python code does:\n",
+ "- Classifies user messages into response types (advice, validation, information, question)\n",
+ "- Uses a language model (Flan-T5) to generate counselor-like responses\n",
+ "- Maintains a limited conversation history\n",
+ "- Allows exporting conversation history to a JSON file"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "d29686fa-8db3-4447-9970-da864a96dc64",
+ "metadata": {},
+ "source": [
+ "### Load Required Libraries"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 1,
+ "id": "8375290a-0034-4050-af72-c76183020bec",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stderr",
+ "output_type": "stream",
+ "text": [
+ "/Users/Pi/miniconda3/envs/myenv/lib/python3.10/site-packages/tqdm/auto.py:21: TqdmWarning: IProgress not found. Please update jupyter and ipywidgets. See https://ipywidgets.readthedocs.io/en/stable/user_install.html\n",
+ " from .autonotebook import tqdm as notebook_tqdm\n"
+ ]
+ }
+ ],
+ "source": [
+ "import json\n",
+ "import pandas as pd\n",
+ "import numpy as np\n",
+ "import matplotlib.pyplot as plt\n",
+ "import seaborn as sns\n",
+ "from sklearn.model_selection import train_test_split\n",
+ "from sklearn.feature_extraction.text import TfidfVectorizer\n",
+ "from sklearn.preprocessing import LabelEncoder\n",
+ "from xgboost import XGBClassifier\n",
+ "from transformers import pipeline"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "c15554b4-d47f-4052-a18d-a51742065fc4",
+ "metadata": {},
+ "source": [
+ "### Load and Label Dataset "
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 2,
+ "id": "dbb763bb-eaa1-45ce-ad03-3d6bb300c4e4",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "df = pd.read_csv(\"dataset/Kaggle_Mental_Health_Conversations_train.csv\")\n",
+ "df = df[['Context', 'Response']].dropna().copy()\n",
+ "\n",
+ "keywords_to_labels = {\n",
+ " 'advice': ['try', 'should', 'suggest', 'recommend'],\n",
+ " 'validation': ['understand', 'feel', 'valid', 'normal'],\n",
+ " 'information': ['cause', 'often', 'disorder', 'symptom'],\n",
+ " 'question': ['how', 'what', 'why', 'have you']\n",
+ "}\n",
+ "\n",
+ "def auto_label_response(response):\n",
+ " response = response.lower()\n",
+ " for label, keywords in keywords_to_labels.items():\n",
+ " if any(word in response for word in keywords):\n",
+ " return label\n",
+ " return 'information'\n",
+ "\n",
+ "df['response_type'] = df['Response'].apply(auto_label_response)\n",
+ "\n"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "3cc76163-b688-49b1-a5ab-3f991e6a790b",
+ "metadata": {},
+ "source": [
+ "### Train on Combined Context + Response"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 3,
+ "id": "5aff7aff-7756-4c28-9424-9a9963580883",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stderr",
+ "output_type": "stream",
+ "text": [
+ "/Users/Pi/miniconda3/envs/myenv/lib/python3.10/site-packages/xgboost/training.py:183: UserWarning: [22:52:39] WARNING: /Users/runner/work/xgboost/xgboost/src/learner.cc:738: \n",
+ "Parameters: { \"use_label_encoder\" } are not used.\n",
+ "\n",
+ " bst.update(dtrain, iteration=i, fobj=obj)\n"
+ ]
+ },
+ {
+ "data": {
+ "text/plain": [
+ "XGBClassifier(base_score=None, booster=None, callbacks=None,\n",
+ " colsample_bylevel=None, colsample_bynode=None,\n",
+ " colsample_bytree=None, device=None, early_stopping_rounds=None,\n",
+ " enable_categorical=False, eval_metric='mlogloss',\n",
+ " feature_types=None, feature_weights=None, gamma=None,\n",
+ " grow_policy=None, importance_type=None,\n",
+ " interaction_constraints=None, learning_rate=0.1, max_bin=None,\n",
+ " max_cat_threshold=None, max_cat_to_onehot=None,\n",
+ " max_delta_step=None, max_depth=6, max_leaves=None,\n",
+ " min_child_weight=None, missing=nan, monotone_constraints=None,\n",
+ " multi_strategy=None, n_estimators=100, n_jobs=None, num_class=4, ...)"
+ ]
+ },
+ "execution_count": 3,
+ "metadata": {},
+ "output_type": "execute_result"
+ }
+ ],
+ "source": [
+ "df['combined_text'] = df['Context'] + \" \" + df['Response']\n",
+ "\n",
+ "le = LabelEncoder()\n",
+ "y = le.fit_transform(df['response_type'])\n",
+ "\n",
+ "vectorizer = TfidfVectorizer(max_features=2000, ngram_range=(1, 2))\n",
+ "X = vectorizer.fit_transform(df['combined_text'])\n",
+ "\n",
+ "X_train, X_test, y_train, y_test = train_test_split(\n",
+ " X, y, test_size=0.2, stratify=y, random_state=42\n",
+ ")\n",
+ "\n",
+ "xgb_model = XGBClassifier(\n",
+ " objective='multi:softmax',\n",
+ " num_class=len(le.classes_),\n",
+ " eval_metric='mlogloss',\n",
+ " use_label_encoder=False,\n",
+ " max_depth=6,\n",
+ " learning_rate=0.1,\n",
+ " n_estimators=100\n",
+ ")\n",
+ "xgb_model.fit(X_train, y_train)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "c7b4e142-8479-4da0-8f6a-70e488f90349",
+ "metadata": {},
+ "source": [
+ "### Load LLM (Flan-T5)"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 4,
+ "id": "ea35abb0-f2df-4331-b41a-965a9e42b4c5",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Loading Flan-T5 model... (this may take a few seconds)\n"
+ ]
+ },
+ {
+ "name": "stderr",
+ "output_type": "stream",
+ "text": [
+ "Device set to use mps:0\n"
+ ]
+ }
+ ],
+ "source": [
+ "print(\"Loading Flan-T5 model... (this may take a few seconds)\")\n",
+ "llm = pipeline(\"text2text-generation\", model=\"google/flan-t5-base\")"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "c1ccf043-7431-4767-9ac8-73c59fe46ccf",
+ "metadata": {},
+ "source": [
+ "### Prediction + Prompt Functions"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 5,
+ "id": "982161f8-824c-45c2-b968-e6e7ad5e0874",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "def predict_response_type(user_input):\n",
+ " combined = user_input + \" placeholder_response\"\n",
+ " vec = vectorizer.transform([combined])\n",
+ " prediction = xgb_model.predict(vec)[0]\n",
+ " predicted_class = le.inverse_transform([prediction])[0]\n",
+ " confidence = np.max(xgb_model.predict_proba(vec))\n",
+ " return predicted_class, confidence\n",
+ "\n",
+ "def prompt_templates(user_input, response_type):\n",
+ " templates = {\n",
+ " \"advice\": f\"A student said: \\\"{user_input}\\\". What practical advice should a mental health counselor offer?\",\n",
+ " \"validation\": f\"A student said: \\\"{user_input}\\\". Respond with an emotionally supportive message that shows empathy and validates their feelings.\",\n",
+ " \"information\": f\"A student said: \\\"{user_input}\\\". Explain what might be happening emotionally from a counselor's perspective.\",\n",
+ " \"question\": f\"A student said: \\\"{user_input}\\\". What are 1-2 thoughtful follow-up questions a counselor might ask?\"\n",
+ " }\n",
+ " return templates.get(response_type, templates[\"information\"])\n",
+ "\n",
+ "def generate_llm_response(user_input, response_type):\n",
+ " prompt = prompt_templates(user_input, response_type)\n",
+ " result = llm(prompt, max_length=150, do_sample=True, temperature=0.7, top_p=0.9)\n",
+ " return result[0][\"generated_text\"].strip()"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "8599f696-9a22-4f03-9021-596b46febf26",
+ "metadata": {},
+ "source": [
+ "### Conversation Memory + Exporting"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 6,
+ "id": "f95ba731-e303-42b2-a46d-143f5aaeb914",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "MAX_MEMORY_TURNS = 6\n",
+ "history = []\n",
+ "\n",
+ "def trim_memory(history, max_turns=MAX_MEMORY_TURNS):\n",
+ " return history[-max_turns:]\n",
+ "\n",
+ "def save_conversation(history, filename=\"chat_history.json\"):\n",
+ " with open(filename, \"w\") as f:\n",
+ " json.dump(history, f, indent=2)\n",
+ " print(f\"\\nConversation saved to {filename}\")"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "74e2aba2-20fd-48b7-b770-8206aa9fb396",
+ "metadata": {},
+ "source": [
+ "### Intro + Chat"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 7,
+ "id": "f3e4b5a8-f2b0-4ed4-b6d3-23541f615c0a",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "\n",
+ "--- Multi-Turn Mental Health Chatbot ---\n",
+ "This assistant simulates a counselor's conversation using AI.\n",
+ "- Type something your patient/student might say\n",
+ "- Type 'save' to export the conversation\n",
+ "- Type 'exit' to quit\n",
+ "\n",
+ "Example:\n",
+ "User: I feel like I’ll mess up my big presentation tomorrow.\n",
+ "Counselor: It’s completely normal to feel nervous before a big event...\n",
+ "\n"
+ ]
+ }
+ ],
+ "source": [
+ "def show_intro():\n",
+ " print(\"\\n--- Multi-Turn Mental Health Chatbot ---\")\n",
+ " print(\"This assistant simulates a counselor's conversation using AI.\")\n",
+ " print(\"- Type something your patient/student might say\")\n",
+ " print(\"- Type 'save' to export the conversation\")\n",
+ " print(\"- Type 'exit' to quit\\n\")\n",
+ "\n",
+ " print(\"Example:\")\n",
+ " print(\"User: I feel like I’ll mess up my big presentation tomorrow.\")\n",
+ " print(\"Counselor: It’s completely normal to feel nervous before a big event...\\n\")\n",
+ "\n",
+ "show_intro()"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "1b818787-dab2-4d0c-a384-55cfc6072b9d",
+ "metadata": {},
+ "source": [
+ "### Chat Loop"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 8,
+ "id": "748b8c59-fbc0-4ca3-814f-5f299128fddd",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdin",
+ "output_type": "stream",
+ "text": [
+ "User: i'm nervous\n"
+ ]
+ },
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "(Predicted: information, Confidence: 85.5%)\n",
+ "Counselor: The student might be feeling anxious or uncertain about the situation.\n"
+ ]
+ },
+ {
+ "name": "stdin",
+ "output_type": "stream",
+ "text": [
+ "User: exit\n"
+ ]
+ },
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Goodbye\n"
+ ]
+ }
+ ],
+ "source": [
+ "while True:\n",
+ " user_input = input(\"User: \").strip()\n",
+ "\n",
+ " if user_input.lower() == \"exit\":\n",
+ " print(\"Goodbye\")\n",
+ " break\n",
+ " elif user_input.lower() == \"save\":\n",
+ " save_conversation(history)\n",
+ " continue\n",
+ "\n",
+ " predicted_type, confidence = predict_response_type(user_input)\n",
+ " print(f\"(Predicted: {predicted_type}, Confidence: {confidence:.1%})\")\n",
+ "\n",
+ " llm_reply = generate_llm_response(user_input, predicted_type)\n",
+ "\n",
+ " history.append({\"role\": \"user\", \"content\": user_input})\n",
+ " history.append({\"role\": \"assistant\", \"content\": llm_reply})\n",
+ " history = trim_memory(history)\n",
+ "\n",
+ " print(\"Counselor:\", llm_reply)"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "72251dc9-6090-4448-8fe9-04ad82079520",
+ "metadata": {},
+ "outputs": [],
+ "source": []
+ }
+ ],
+ "metadata": {
+ "kernelspec": {
+ "display_name": "Python (myenv)",
+ "language": "python",
+ "name": "myenv"
+ },
+ "language_info": {
+ "codemirror_mode": {
+ "name": "ipython",
+ "version": 3
+ },
+ "file_extension": ".py",
+ "mimetype": "text/x-python",
+ "name": "python",
+ "nbconvert_exporter": "python",
+ "pygments_lexer": "ipython3",
+ "version": "3.10.16"
+ }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+ }
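
Taken together, the notebook implements the two-step flow described in its opening markdown: an XGBoost classifier picks a response type, a template turns that type into a prompt, and Flan-T5 writes the reply. For readers who want to try the flow outside the notebook, the following is a minimal, self-contained sketch: the notebook's keyword rules stand in for the trained XGBoost classifier (so no dataset or fitting is needed), and only the `transformers` package is assumed to be installed. Note also that the recorded stderr above shows xgboost ignoring `use_label_encoder`, so that parameter can simply be dropped in newer xgboost releases.

# Sketch of the classify-then-generate flow. The keyword rules below are a
# stand-in for the notebook's trained XGBoost model; the prompt templates and
# generation settings mirror the notebook. Not the committed app code.
from transformers import pipeline

llm = pipeline("text2text-generation", model="google/flan-t5-base")

KEYWORDS = {
    "advice": ["try", "should", "suggest", "recommend"],
    "validation": ["understand", "feel", "valid", "normal"],
    "information": ["cause", "often", "disorder", "symptom"],
    "question": ["how", "what", "why", "have you"],
}

TEMPLATES = {
    "advice": 'A student said: "{msg}". What practical advice should a mental health counselor offer?',
    "validation": 'A student said: "{msg}". Respond with an emotionally supportive message that shows empathy and validates their feelings.',
    "information": 'A student said: "{msg}". Explain what might be happening emotionally from a counselor\'s perspective.',
    "question": 'A student said: "{msg}". What are 1-2 thoughtful follow-up questions a counselor might ask?',
}

def classify(message: str) -> str:
    # Step 1: pick a response type (first keyword match wins, as in the
    # notebook's auto-labeling heuristic; the real pipeline uses XGBoost here).
    lowered = message.lower()
    for label, words in KEYWORDS.items():
        if any(w in lowered for w in words):
            return label
    return "information"

def respond(message: str) -> str:
    # Step 2: turn the response type into a prompt and let Flan-T5 generate.
    prompt = TEMPLATES[classify(message)].format(msg=message)
    result = llm(prompt, max_length=150, do_sample=True, temperature=0.7, top_p=0.9)
    return result[0]["generated_text"].strip()

print(respond("I feel like I'll mess up my big presentation tomorrow."))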
LICENSE ADDED
@@ -0,0 +1,21 @@
+ MIT License
+
+ Copyright (c) 2025 Stella Dong
+
+ Permission is hereby granted, free of charge, to any person obtaining a copy
+ of this software and associated documentation files (the "Software"), to deal
+ in the Software without restriction, including without limitation the rights
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+ copies of the Software, and to permit persons to whom the Software is
+ furnished to do so, subject to the following conditions:
+
+ The above copyright notice and this permission notice shall be included in all
+ copies or substantial portions of the Software.
+
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+ SOFTWARE.
ML_on_Kaggle_Dataset.ipynb ADDED
The diff for this file is too large to render. See raw diff
 
Mistral-7B_on_Kaggle_Dataset.ipynb ADDED
@@ -0,0 +1,721 @@
+ {
+ "cells": [
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "ae79196c-1b84-4cde-8051-6cb113b80373",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stderr",
+ "output_type": "stream",
+ "text": [
+ "/Users/Pi/miniconda3/envs/myenv/lib/python3.10/site-packages/xgboost/training.py:183: UserWarning: [00:11:22] WARNING: /Users/runner/work/xgboost/xgboost/src/learner.cc:738: \n",
+ "Parameters: { \"use_label_encoder\" } are not used.\n",
+ "\n",
+ " bst.update(dtrain, iteration=i, fobj=obj)\n",
+ "llama_model_load_from_file_impl: using device Metal (Apple M2 Pro) - 21845 MiB free\n",
+ "llama_model_loader: loaded meta data with 20 key-value pairs and 291 tensors from /Users/Pi/models/mistral/mistral-7b-instruct-v0.1.Q4_K_M.gguf (version GGUF V2)\n",
+ "llama_model_loader: Dumping metadata keys/values. Note: KV overrides do not apply in this output.\n",
+ "llama_model_loader: - kv 0: general.architecture str = llama\n",
+ "llama_model_loader: - kv 1: general.name str = mistralai_mistral-7b-instruct-v0.1\n",
+ "llama_model_loader: - kv 2: llama.context_length u32 = 32768\n",
+ "llama_model_loader: - kv 3: llama.embedding_length u32 = 4096\n",
+ "llama_model_loader: - kv 4: llama.block_count u32 = 32\n",
+ "llama_model_loader: - kv 5: llama.feed_forward_length u32 = 14336\n",
+ "llama_model_loader: - kv 6: llama.rope.dimension_count u32 = 128\n",
+ "llama_model_loader: - kv 7: llama.attention.head_count u32 = 32\n",
+ "llama_model_loader: - kv 8: llama.attention.head_count_kv u32 = 8\n",
+ "llama_model_loader: - kv 9: llama.attention.layer_norm_rms_epsilon f32 = 0.000010\n",
+ "llama_model_loader: - kv 10: llama.rope.freq_base f32 = 10000.000000\n",
+ "llama_model_loader: - kv 11: general.file_type u32 = 15\n",
+ "llama_model_loader: - kv 12: tokenizer.ggml.model str = llama\n",
+ "llama_model_loader: - kv 13: tokenizer.ggml.tokens arr[str,32000] = [\"<unk>\", \"<s>\", \"</s>\", \"<0x00>\", \"<...\n",
+ "llama_model_loader: - kv 14: tokenizer.ggml.scores arr[f32,32000] = [0.000000, 0.000000, 0.000000, 0.0000...\n",
+ "llama_model_loader: - kv 15: tokenizer.ggml.token_type arr[i32,32000] = [2, 3, 3, 6, 6, 6, 6, 6, 6, 6, 6, 6, ...\n",
+ "llama_model_loader: - kv 16: tokenizer.ggml.bos_token_id u32 = 1\n",
+ "llama_model_loader: - kv 17: tokenizer.ggml.eos_token_id u32 = 2\n",
+ "llama_model_loader: - kv 18: tokenizer.ggml.unknown_token_id u32 = 0\n",
+ "llama_model_loader: - kv 19: general.quantization_version u32 = 2\n",
+ "llama_model_loader: - type f32: 65 tensors\n",
+ "llama_model_loader: - type q4_K: 193 tensors\n",
+ "llama_model_loader: - type q6_K: 33 tensors\n",
+ "print_info: file format = GGUF V2\n",
+ "print_info: file type = Q4_K - Medium\n",
+ "print_info: file size = 4.07 GiB (4.83 BPW) \n",
+ "init_tokenizer: initializing tokenizer for type 1\n",
+ "load: control token: 2 '</s>' is not marked as EOG\n",
+ "load: control token: 1 '<s>' is not marked as EOG\n",
+ "load: special_eos_id is not in special_eog_ids - the tokenizer config may be incorrect\n",
+ "load: special tokens cache size = 3\n",
+ "load: token to piece cache size = 0.1637 MB\n",
+ "print_info: arch = llama\n",
+ "print_info: vocab_only = 0\n",
+ "print_info: n_ctx_train = 32768\n",
+ "print_info: n_embd = 4096\n",
+ "print_info: n_layer = 32\n",
+ "print_info: n_head = 32\n",
+ "print_info: n_head_kv = 8\n",
+ "print_info: n_rot = 128\n",
+ "print_info: n_swa = 0\n",
+ "print_info: n_embd_head_k = 128\n",
+ "print_info: n_embd_head_v = 128\n",
+ "print_info: n_gqa = 4\n",
+ "print_info: n_embd_k_gqa = 1024\n",
+ "print_info: n_embd_v_gqa = 1024\n",
+ "print_info: f_norm_eps = 0.0e+00\n",
+ "print_info: f_norm_rms_eps = 1.0e-05\n",
+ "print_info: f_clamp_kqv = 0.0e+00\n",
+ "print_info: f_max_alibi_bias = 0.0e+00\n",
+ "print_info: f_logit_scale = 0.0e+00\n",
+ "print_info: f_attn_scale = 0.0e+00\n",
+ "print_info: n_ff = 14336\n",
+ "print_info: n_expert = 0\n",
+ "print_info: n_expert_used = 0\n",
+ "print_info: causal attn = 1\n",
+ "print_info: pooling type = 0\n",
+ "print_info: rope type = 0\n",
+ "print_info: rope scaling = linear\n",
+ "print_info: freq_base_train = 10000.0\n",
+ "print_info: freq_scale_train = 1\n",
+ "print_info: n_ctx_orig_yarn = 32768\n",
+ "print_info: rope_finetuned = unknown\n",
+ "print_info: ssm_d_conv = 0\n",
+ "print_info: ssm_d_inner = 0\n",
+ "print_info: ssm_d_state = 0\n",
+ "print_info: ssm_dt_rank = 0\n",
+ "print_info: ssm_dt_b_c_rms = 0\n",
+ "print_info: model type = 7B\n",
+ "print_info: model params = 7.24 B\n",
+ "print_info: general.name = mistralai_mistral-7b-instruct-v0.1\n",
+ "print_info: vocab type = SPM\n",
+ "print_info: n_vocab = 32000\n",
+ "print_info: n_merges = 0\n",
+ "print_info: BOS token = 1 '<s>'\n",
+ "print_info: EOS token = 2 '</s>'\n",
+ "print_info: UNK token = 0 '<unk>'\n",
+ "print_info: LF token = 13 '<0x0A>'\n",
+ "print_info: EOG token = 2 '</s>'\n",
+ "print_info: max token length = 48\n",
+ "load_tensors: loading model tensors, this can take a while... (mmap = true)\n",
+ "load_tensors: layer 0 assigned to device CPU\n",
+ "load_tensors: layer 1 assigned to device CPU\n",
+ "load_tensors: layer 2 assigned to device CPU\n",
+ "load_tensors: layer 3 assigned to device CPU\n",
+ "load_tensors: layer 4 assigned to device CPU\n",
+ "load_tensors: layer 5 assigned to device CPU\n",
+ "load_tensors: layer 6 assigned to device CPU\n",
+ "load_tensors: layer 7 assigned to device CPU\n",
+ "load_tensors: layer 8 assigned to device CPU\n",
+ "load_tensors: layer 9 assigned to device CPU\n",
+ "load_tensors: layer 10 assigned to device CPU\n",
+ "load_tensors: layer 11 assigned to device CPU\n",
+ "load_tensors: layer 12 assigned to device CPU\n",
+ "load_tensors: layer 13 assigned to device CPU\n",
+ "load_tensors: layer 14 assigned to device CPU\n",
+ "load_tensors: layer 15 assigned to device CPU\n",
+ "load_tensors: layer 16 assigned to device CPU\n",
+ "load_tensors: layer 17 assigned to device CPU\n",
+ "load_tensors: layer 18 assigned to device CPU\n",
+ "load_tensors: layer 19 assigned to device CPU\n",
+ "load_tensors: layer 20 assigned to device CPU\n",
+ "load_tensors: layer 21 assigned to device CPU\n",
+ "load_tensors: layer 22 assigned to device CPU\n",
+ "load_tensors: layer 23 assigned to device CPU\n",
+ "load_tensors: layer 24 assigned to device CPU\n",
+ "load_tensors: layer 25 assigned to device CPU\n",
+ "load_tensors: layer 26 assigned to device CPU\n",
+ "load_tensors: layer 27 assigned to device CPU\n",
+ "load_tensors: layer 28 assigned to device CPU\n",
+ "load_tensors: layer 29 assigned to device CPU\n",
+ "load_tensors: layer 30 assigned to device CPU\n",
+ "load_tensors: layer 31 assigned to device CPU\n",
+ "load_tensors: layer 32 assigned to device CPU\n",
+ "load_tensors: tensor 'token_embd.weight' (q4_K) (and 290 others) cannot be used with preferred buffer type CPU_AARCH64, using CPU instead\n",
+ "load_tensors: offloading 0 repeating layers to GPU\n",
+ "load_tensors: offloaded 0/33 layers to GPU\n",
+ "load_tensors: CPU_Mapped model buffer size = 4165.37 MiB\n",
+ ".................................................................................................\n",
+ "llama_init_from_model: n_seq_max = 1\n",
+ "llama_init_from_model: n_ctx = 2048\n",
+ "llama_init_from_model: n_ctx_per_seq = 2048\n",
+ "llama_init_from_model: n_batch = 512\n",
+ "llama_init_from_model: n_ubatch = 512\n",
+ "llama_init_from_model: flash_attn = 0\n",
+ "llama_init_from_model: freq_base = 10000.0\n",
+ "llama_init_from_model: freq_scale = 1\n",
+ "llama_init_from_model: n_ctx_per_seq (2048) < n_ctx_train (32768) -- the full capacity of the model will not be utilized\n",
+ "ggml_metal_init: allocating\n",
+ "ggml_metal_init: found device: Apple M2 Pro\n",
+ "ggml_metal_init: picking default device: Apple M2 Pro\n",
+ "ggml_metal_load_library: using embedded metal library\n",
+ "ggml_metal_init: GPU name: Apple M2 Pro\n",
+ "ggml_metal_init: GPU family: MTLGPUFamilyApple8 (1008)\n",
+ "ggml_metal_init: GPU family: MTLGPUFamilyCommon3 (3003)\n",
+ "ggml_metal_init: GPU family: MTLGPUFamilyMetal3 (5001)\n",
+ "ggml_metal_init: simdgroup reduction = true\n",
+ "ggml_metal_init: simdgroup matrix mul. = true\n",
+ "ggml_metal_init: has residency sets = true\n",
+ "ggml_metal_init: has bfloat = true\n",
+ "ggml_metal_init: use bfloat = false\n",
+ "ggml_metal_init: hasUnifiedMemory = true\n",
+ "ggml_metal_init: recommendedMaxWorkingSetSize = 22906.50 MB\n",
+ "ggml_metal_init: loaded kernel_add 0x11f38a990 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_add_row 0x12ef6f310 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_sub 0x11f405df0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_sub_row 0x11f38b080 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul 0x11bc91a30 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_row 0x11f1aa2f0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_div 0x12ee21f20 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_div_row 0x12ef82170 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_repeat_f32 0x11bc91cc0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_repeat_f16 0x13b813e80 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_repeat_i32 0x11f38b2e0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_repeat_i16 0x11f1aaa50 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_scale 0x12ef826e0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_scale_4 0x11bc92230 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_clamp 0x11f406d10 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_tanh 0x11bc92580 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_relu 0x11f423300 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_sigmoid 0x13a3d6930 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_gelu 0x11bc92a60 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_gelu_4 0x11f1aacb0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_gelu_quick 0x11bc93220 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_gelu_quick_4 0x11bc937a0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_silu 0x11bc93ce0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_silu_4 0x13a3d7120 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_elu 0x13a3d7600 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_soft_max_f16 0x11bc93fa0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_soft_max_f16_4 0x12ef82940 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_soft_max_f32 0x11bc94400 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_soft_max_f32_4 0x13a3d7860 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_diag_mask_inf 0x12ef82ba0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_diag_mask_inf_8 0x13a3d7ac0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_f32 0x13a3d7d20 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_f16 0x11bc94660 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: skipping kernel_get_rows_bf16 (not supported)\n",
+ "ggml_metal_init: loaded kernel_get_rows_q4_0 0x13a3d7f80 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_q4_1 0x11bc948c0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_q5_0 0x11bc94b20 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_q5_1 0x11f423890 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_q8_0 0x11bc94d80 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_q2_K 0x11f1ab1d0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_q3_K 0x11bc94fe0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_q4_K 0x11bc95240 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_q5_K 0x11bc954a0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_q6_K 0x11bc95700 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_iq2_xxs 0x11f1ab430 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_iq2_xs 0x12ef82e00 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_iq3_xxs 0x13a3d81e0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_iq3_s 0x11bc95960 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_iq2_s 0x12ee225b0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_iq1_s 0x11bc95bc0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_iq1_m 0x13a3d6e10 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_iq4_nl 0x11f423c10 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_iq4_xs 0x13a3d8440 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_get_rows_i32 0x11bc95e30 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_rms_norm 0x13a3d86a0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_group_norm 0x11f1ab690 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_norm 0x11bc96090 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_ssm_conv_f32 0x12ef83060 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_ssm_scan_f32 0x11bc962f0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_f32_f32 0x12ef833e0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: skipping kernel_mul_mv_bf16_f32 (not supported)\n",
+ "ggml_metal_init: skipping kernel_mul_mv_bf16_f32_1row (not supported)\n",
+ "ggml_metal_init: skipping kernel_mul_mv_bf16_f32_l4 (not supported)\n",
+ "ggml_metal_init: skipping kernel_mul_mv_bf16_bf16 (not supported)\n",
+ "ggml_metal_init: loaded kernel_mul_mv_f16_f32 0x13a3d8900 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_f16_f32_1row 0x11f1aba20 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_f16_f32_l4 0x11bc96580 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_f16_f16 0x11bc967e0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_q4_0_f32 0x12ef83760 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_q4_1_f32 0x12ef83ae0 | th_max = 832 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_q5_0_f32 0x11bc96a40 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_q5_1_f32 0x13a3d8b60 | th_max = 576 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_q8_0_f32 0x13a3d8dc0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_f16_f32_r1_2 0x11bc96ca0 | th_max = 896 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_f16_f32_r1_3 0x11f38b7d0 | th_max = 832 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_f16_f32_r1_4 0x11f424100 | th_max = 832 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_f16_f32_r1_5 0x13a3d9020 | th_max = 768 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q4_0_f32_r1_2 0x13a3d9280 | th_max = 832 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q4_0_f32_r1_3 0x11bc96f00 | th_max = 704 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q4_0_f32_r1_4 0x13a3d94e0 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q4_0_f32_r1_5 0x11bc97160 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q4_1_f32_r1_2 0x12ef83e60 | th_max = 832 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q4_1_f32_r1_3 0x11f1abf10 | th_max = 704 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q4_1_f32_r1_4 0x11bc973c0 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q4_1_f32_r1_5 0x13a3d9740 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q5_0_f32_r1_2 0x13a3d99a0 | th_max = 704 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q5_0_f32_r1_3 0x11bc97620 | th_max = 704 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q5_0_f32_r1_4 0x13a3d9c00 | th_max = 576 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q5_0_f32_r1_5 0x13a3d9e60 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q5_1_f32_r1_2 0x11bc97880 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q5_1_f32_r1_3 0x13a3da0c0 | th_max = 704 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q5_1_f32_r1_4 0x11bc97ae0 | th_max = 576 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q5_1_f32_r1_5 0x11f424360 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q8_0_f32_r1_2 0x11f1ac170 | th_max = 832 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q8_0_f32_r1_3 0x11f38ba30 | th_max = 704 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q8_0_f32_r1_4 0x13a3da320 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q8_0_f32_r1_5 0x11f1ac670 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q4_K_f32_r1_2 0x13a3da580 | th_max = 832 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q4_K_f32_r1_3 0x11f1ac8d0 | th_max = 704 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q4_K_f32_r1_4 0x12ef841e0 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q4_K_f32_r1_5 0x11bc97d40 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q5_K_f32_r1_2 0x13a3da7e0 | th_max = 704 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q5_K_f32_r1_3 0x13a3daa40 | th_max = 704 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q5_K_f32_r1_4 0x11bc97fa0 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q5_K_f32_r1_5 0x11bc98200 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q6_K_f32_r1_2 0x11f38bf30 | th_max = 832 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q6_K_f32_r1_3 0x13a3daca0 | th_max = 704 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q6_K_f32_r1_4 0x11f424880 | th_max = 704 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_q6_K_f32_r1_5 0x12ef84740 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_iq4_nl_f32_r1_2 0x13b814370 | th_max = 832 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_iq4_nl_f32_r1_3 0x11f424ae0 | th_max = 704 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_iq4_nl_f32_r1_4 0x11f425270 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_ext_iq4_nl_f32_r1_5 0x12ef84ac0 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_q2_K_f32 0x11bc98460 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_q3_K_f32 0x13b8145d0 | th_max = 576 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_q4_K_f32 0x12ef84e40 | th_max = 576 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_q5_K_f32 0x11f4254d0 | th_max = 576 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_q6_K_f32 0x11f425730 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_iq2_xxs_f32 0x12ef85250 | th_max = 832 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_iq2_xs_f32 0x11f425ce0 | th_max = 704 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_iq3_xxs_f32 0x11f425f40 | th_max = 768 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_iq3_s_f32 0x11f4262c0 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_iq2_s_f32 0x11f426640 | th_max = 640 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_iq1_s_f32 0x11f426c30 | th_max = 448 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_iq1_m_f32 0x12ef85720 | th_max = 576 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_iq4_nl_f32 0x11f4271e0 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_iq4_xs_f32 0x11f427440 | th_max = 896 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_id_f32_f32 0x13a3daf00 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: loaded kernel_mul_mv_id_f16_f32 0x13b814830 | th_max = 1024 | th_width = 32\n",
+ "ggml_metal_init: skipping kernel_mul_mv_id_bf16_f32 (not supported)\n",
+ "ggml_metal_init: loaded kernel_mul_mv_id_q4_0_f32 0x11f4277c0 | th_max = 832 | th_width = 32\n",
294
+ "ggml_metal_init: loaded kernel_mul_mv_id_q4_1_f32 0x13a3db160 | th_max = 832 | th_width = 32\n",
295
+ "ggml_metal_init: loaded kernel_mul_mv_id_q5_0_f32 0x11f427b40 | th_max = 576 | th_width = 32\n",
296
+ "ggml_metal_init: loaded kernel_mul_mv_id_q5_1_f32 0x12ef85980 | th_max = 576 | th_width = 32\n",
297
+ "ggml_metal_init: loaded kernel_mul_mv_id_q8_0_f32 0x13b814a90 | th_max = 1024 | th_width = 32\n",
298
+ "ggml_metal_init: loaded kernel_mul_mv_id_q2_K_f32 0x13b814cf0 | th_max = 576 | th_width = 32\n",
299
+ "ggml_metal_init: loaded kernel_mul_mv_id_q3_K_f32 0x12ef85d00 | th_max = 576 | th_width = 32\n",
300
+ "ggml_metal_init: loaded kernel_mul_mv_id_q4_K_f32 0x12ef86080 | th_max = 576 | th_width = 32\n",
301
+ "ggml_metal_init: loaded kernel_mul_mv_id_q5_K_f32 0x11bc987f0 | th_max = 576 | th_width = 32\n",
302
+ "ggml_metal_init: loaded kernel_mul_mv_id_q6_K_f32 0x13b815070 | th_max = 1024 | th_width = 32\n",
303
+ "ggml_metal_init: loaded kernel_mul_mv_id_iq2_xxs_f32 0x11bc98a50 | th_max = 768 | th_width = 32\n",
304
+ "ggml_metal_init: loaded kernel_mul_mv_id_iq2_xs_f32 0x11f1acdd0 | th_max = 640 | th_width = 32\n",
305
+ "ggml_metal_init: loaded kernel_mul_mv_id_iq3_xxs_f32 0x11bc98cb0 | th_max = 704 | th_width = 32\n",
306
+ "ggml_metal_init: loaded kernel_mul_mv_id_iq3_s_f32 0x13a3db3c0 | th_max = 640 | th_width = 32\n",
307
+ "ggml_metal_init: loaded kernel_mul_mv_id_iq2_s_f32 0x13a3db620 | th_max = 640 | th_width = 32\n",
308
+ "ggml_metal_init: loaded kernel_mul_mv_id_iq1_s_f32 0x11f4280f0 | th_max = 448 | th_width = 32\n",
309
+ "ggml_metal_init: loaded kernel_mul_mv_id_iq1_m_f32 0x11bc98f10 | th_max = 576 | th_width = 32\n",
310
+ "ggml_metal_init: loaded kernel_mul_mv_id_iq4_nl_f32 0x13b8153f0 | th_max = 1024 | th_width = 32\n",
311
+ "ggml_metal_init: loaded kernel_mul_mv_id_iq4_xs_f32 0x11f4285c0 | th_max = 1024 | th_width = 32\n",
312
+ "ggml_metal_init: loaded kernel_mul_mm_f32_f32 0x12ee22810 | th_max = 832 | th_width = 32\n",
313
+ "ggml_metal_init: loaded kernel_mul_mm_f16_f32 0x11f428820 | th_max = 832 | th_width = 32\n",
314
+ "ggml_metal_init: skipping kernel_mul_mm_bf16_f32 (not supported)\n",
315
+ "ggml_metal_init: loaded kernel_mul_mm_q4_0_f32 0x11f428ba0 | th_max = 832 | th_width = 32\n",
316
+ "ggml_metal_init: loaded kernel_mul_mm_q4_1_f32 0x12ef86400 | th_max = 832 | th_width = 32\n",
317
+ "ggml_metal_init: loaded kernel_mul_mm_q5_0_f32 0x11f428f20 | th_max = 768 | th_width = 32\n",
318
+ "ggml_metal_init: loaded kernel_mul_mm_q5_1_f32 0x12ef86780 | th_max = 768 | th_width = 32\n",
319
+ "ggml_metal_init: loaded kernel_mul_mm_q8_0_f32 0x11f4292a0 | th_max = 832 | th_width = 32\n",
320
+ "ggml_metal_init: loaded kernel_mul_mm_q2_K_f32 0x12ef86d30 | th_max = 832 | th_width = 32\n",
321
+ "ggml_metal_init: loaded kernel_mul_mm_q3_K_f32 0x13a3db880 | th_max = 832 | th_width = 32\n",
322
+ "ggml_metal_init: loaded kernel_mul_mm_q4_K_f32 0x11f429620 | th_max = 832 | th_width = 32\n",
323
+ "ggml_metal_init: loaded kernel_mul_mm_q5_K_f32 0x12ee22b90 | th_max = 832 | th_width = 32\n",
324
+ "ggml_metal_init: loaded kernel_mul_mm_q6_K_f32 0x11bc99180 | th_max = 832 | th_width = 32\n",
325
+ "ggml_metal_init: loaded kernel_mul_mm_iq2_xxs_f32 0x11f38c190 | th_max = 768 | th_width = 32\n",
326
+ "ggml_metal_init: loaded kernel_mul_mm_iq2_xs_f32 0x11f4299a0 | th_max = 832 | th_width = 32\n",
327
+ "ggml_metal_init: loaded kernel_mul_mm_iq3_xxs_f32 0x12ee22df0 | th_max = 832 | th_width = 32\n",
328
+ "ggml_metal_init: loaded kernel_mul_mm_iq3_s_f32 0x11bc993e0 | th_max = 832 | th_width = 32\n",
329
+ "ggml_metal_init: loaded kernel_mul_mm_iq2_s_f32 0x11bc99640 | th_max = 832 | th_width = 32\n",
330
+ "ggml_metal_init: loaded kernel_mul_mm_iq1_s_f32 0x12ee233a0 | th_max = 832 | th_width = 32\n",
331
+ "ggml_metal_init: loaded kernel_mul_mm_iq1_m_f32 0x11f429d20 | th_max = 832 | th_width = 32\n",
332
+ "ggml_metal_init: loaded kernel_mul_mm_iq4_nl_f32 0x12ef86f90 | th_max = 832 | th_width = 32\n",
333
+ "ggml_metal_init: loaded kernel_mul_mm_iq4_xs_f32 0x12ee23870 | th_max = 832 | th_width = 32\n",
334
+ "ggml_metal_init: loaded kernel_mul_mm_id_f32_f32 0x13b815950 | th_max = 832 | th_width = 32\n",
335
+ "ggml_metal_init: loaded kernel_mul_mm_id_f16_f32 0x12ef87320 | th_max = 832 | th_width = 32\n",
336
+ "ggml_metal_init: skipping kernel_mul_mm_id_bf16_f32 (not supported)\n",
337
+ "ggml_metal_init: loaded kernel_mul_mm_id_q4_0_f32 0x11f42a0a0 | th_max = 832 | th_width = 32\n",
338
+ "ggml_metal_init: loaded kernel_mul_mm_id_q4_1_f32 0x11bc998a0 | th_max = 832 | th_width = 32\n",
339
+ "ggml_metal_init: loaded kernel_mul_mm_id_q5_0_f32 0x11f42a420 | th_max = 768 | th_width = 32\n",
340
+ "ggml_metal_init: loaded kernel_mul_mm_id_q5_1_f32 0x12ef878d0 | th_max = 768 | th_width = 32\n",
341
+ "ggml_metal_init: loaded kernel_mul_mm_id_q8_0_f32 0x11f42a7a0 | th_max = 832 | th_width = 32\n",
342
+ "ggml_metal_init: loaded kernel_mul_mm_id_q2_K_f32 0x13b815bb0 | th_max = 832 | th_width = 32\n",
343
+ "ggml_metal_init: loaded kernel_mul_mm_id_q3_K_f32 0x11f42ab20 | th_max = 832 | th_width = 32\n",
344
+ "ggml_metal_init: loaded kernel_mul_mm_id_q4_K_f32 0x12ee23ad0 | th_max = 832 | th_width = 32\n",
345
+ "ggml_metal_init: loaded kernel_mul_mm_id_q5_K_f32 0x11f42afc0 | th_max = 832 | th_width = 32\n",
346
+ "ggml_metal_init: loaded kernel_mul_mm_id_q6_K_f32 0x12ef87c50 | th_max = 832 | th_width = 32\n",
347
+ "ggml_metal_init: loaded kernel_mul_mm_id_iq2_xxs_f32 0x12ee23d30 | th_max = 832 | th_width = 32\n",
348
+ "ggml_metal_init: loaded kernel_mul_mm_id_iq2_xs_f32 0x12ef88120 | th_max = 832 | th_width = 32\n",
349
+ "ggml_metal_init: loaded kernel_mul_mm_id_iq3_xxs_f32 0x11f42b490 | th_max = 832 | th_width = 32\n",
350
+ "ggml_metal_init: loaded kernel_mul_mm_id_iq3_s_f32 0x13a3dbae0 | th_max = 832 | th_width = 32\n",
351
+ "ggml_metal_init: loaded kernel_mul_mm_id_iq2_s_f32 0x12ef88380 | th_max = 832 | th_width = 32\n",
352
+ "ggml_metal_init: loaded kernel_mul_mm_id_iq1_s_f32 0x11f42b6f0 | th_max = 832 | th_width = 32\n",
353
+ "ggml_metal_init: loaded kernel_mul_mm_id_iq1_m_f32 0x11bc99b10 | th_max = 832 | th_width = 32\n",
354
+ "ggml_metal_init: loaded kernel_mul_mm_id_iq4_nl_f32 0x11f42ba80 | th_max = 832 | th_width = 32\n",
355
+ "ggml_metal_init: loaded kernel_mul_mm_id_iq4_xs_f32 0x12ef885e0 | th_max = 832 | th_width = 32\n",
356
+ "ggml_metal_init: loaded kernel_rope_norm_f32 0x12ef88840 | th_max = 1024 | th_width = 32\n",
357
+ "ggml_metal_init: loaded kernel_rope_norm_f16 0x12ef88d60 | th_max = 1024 | th_width = 32\n",
358
+ "ggml_metal_init: loaded kernel_rope_neox_f32 0x13a3dbd40 | th_max = 1024 | th_width = 32\n",
359
+ "ggml_metal_init: loaded kernel_rope_neox_f16 0x12ee240b0 | th_max = 1024 | th_width = 32\n",
360
+ "ggml_metal_init: loaded kernel_im2col_f16 0x12ef88fc0 | th_max = 1024 | th_width = 32\n",
361
+ "ggml_metal_init: loaded kernel_im2col_f32 0x13b815f30 | th_max = 1024 | th_width = 32\n",
362
+ "ggml_metal_init: loaded kernel_im2col_ext_f16 0x11f42be00 | th_max = 1024 | th_width = 32\n",
363
+ "ggml_metal_init: loaded kernel_im2col_ext_f32 0x11f42c180 | th_max = 1024 | th_width = 32\n",
364
+ "ggml_metal_init: loaded kernel_conv_transpose_1d_f32_f32 0x11f42c500 | th_max = 1024 | th_width = 32\n",
365
+ "ggml_metal_init: loaded kernel_conv_transpose_1d_f16_f32 0x11f42ca60 | th_max = 1024 | th_width = 32\n",
366
+ "ggml_metal_init: loaded kernel_upscale_f32 0x11bc99d70 | th_max = 1024 | th_width = 32\n",
367
+ "ggml_metal_init: loaded kernel_pad_f32 0x11f42ccc0 | th_max = 1024 | th_width = 32\n",
368
+ "ggml_metal_init: loaded kernel_pad_reflect_1d_f32 0x12ee24490 | th_max = 1024 | th_width = 32\n",
369
+ "ggml_metal_init: loaded kernel_timestep_embedding_f32 0x11bc9a000 | th_max = 1024 | th_width = 32\n",
370
+ "ggml_metal_init: loaded kernel_arange_f32 0x11bc9a260 | th_max = 1024 | th_width = 32\n",
371
+ "ggml_metal_init: loaded kernel_argsort_f32_i32_asc 0x11f42d040 | th_max = 1024 | th_width = 32\n",
372
+ "ggml_metal_init: loaded kernel_argsort_f32_i32_desc 0x11f42d3c0 | th_max = 1024 | th_width = 32\n",
373
+ "ggml_metal_init: loaded kernel_leaky_relu_f32 0x11f42ddc0 | th_max = 1024 | th_width = 32\n",
374
+ "ggml_metal_init: loaded kernel_flash_attn_ext_f16_h64 0x11f42e2b0 | th_max = 640 | th_width = 32\n",
375
+ "ggml_metal_init: loaded kernel_flash_attn_ext_f16_h80 0x12ef89620 | th_max = 640 | th_width = 32\n",
376
+ "ggml_metal_init: loaded kernel_flash_attn_ext_f16_h96 0x11f42e510 | th_max = 576 | th_width = 32\n",
377
+ "ggml_metal_init: loaded kernel_flash_attn_ext_f16_h112 0x12ef89c80 | th_max = 576 | th_width = 32\n",
378
+ "ggml_metal_init: loaded kernel_flash_attn_ext_f16_h128 0x11f42e770 | th_max = 512 | th_width = 32\n",
379
+ "ggml_metal_init: loaded kernel_flash_attn_ext_f16_h256 0x12ee24980 | th_max = 512 | th_width = 32\n",
380
+ "ggml_metal_init: skipping kernel_flash_attn_ext_bf16_h64 (not supported)\n",
381
+ "ggml_metal_init: skipping kernel_flash_attn_ext_bf16_h80 (not supported)\n",
382
+ "ggml_metal_init: skipping kernel_flash_attn_ext_bf16_h96 (not supported)\n",
383
+ "ggml_metal_init: skipping kernel_flash_attn_ext_bf16_h112 (not supported)\n",
384
+ "ggml_metal_init: skipping kernel_flash_attn_ext_bf16_h128 (not supported)\n",
385
+ "ggml_metal_init: skipping kernel_flash_attn_ext_bf16_h256 (not supported)\n",
386
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q4_0_h64 0x12ee24be0 | th_max = 704 | th_width = 32\n",
387
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q4_0_h80 0x12ef8a000 | th_max = 896 | th_width = 32\n",
388
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q4_0_h96 0x12ef8a260 | th_max = 896 | th_width = 32\n",
389
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q4_0_h112 0x12ef8a5e0 | th_max = 896 | th_width = 32\n",
390
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q4_0_h128 0x11f42ecd0 | th_max = 832 | th_width = 32\n",
391
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q4_0_h256 0x13b8162b0 | th_max = 832 | th_width = 32\n",
392
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q4_1_h64 0x11f42ef30 | th_max = 768 | th_width = 32\n",
393
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q4_1_h80 0x12ef8ab40 | th_max = 896 | th_width = 32\n",
394
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q4_1_h96 0x12ef8ada0 | th_max = 896 | th_width = 32\n",
395
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q4_1_h112 0x12ef8b250 | th_max = 896 | th_width = 32\n",
396
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q4_1_h128 0x11f42f6c0 | th_max = 832 | th_width = 32\n",
397
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q4_1_h256 0x11bc9a4c0 | th_max = 832 | th_width = 32\n",
398
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q5_0_h64 0x11bc9a990 | th_max = 576 | th_width = 32\n",
399
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q5_0_h80 0x11bc9abf0 | th_max = 832 | th_width = 32\n",
400
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q5_0_h96 0x13a3dbfa0 | th_max = 832 | th_width = 32\n",
401
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q5_0_h112 0x13a3dc200 | th_max = 832 | th_width = 32\n",
402
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q5_0_h128 0x11f38c3f0 | th_max = 768 | th_width = 32\n",
403
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q5_0_h256 0x11bc9ae50 | th_max = 768 | th_width = 32\n",
404
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q5_1_h64 0x12ef8b8e0 | th_max = 576 | th_width = 32\n",
405
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q5_1_h80 0x12ef8bb40 | th_max = 832 | th_width = 32\n",
406
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q5_1_h96 0x13a3dc460 | th_max = 832 | th_width = 32\n",
407
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q5_1_h112 0x11bc9b0b0 | th_max = 832 | th_width = 32\n",
408
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q5_1_h128 0x13a3dc6c0 | th_max = 768 | th_width = 32\n",
409
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q5_1_h256 0x11bc9b310 | th_max = 768 | th_width = 32\n",
410
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q8_0_h64 0x13a3dc920 | th_max = 704 | th_width = 32\n",
411
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q8_0_h80 0x11bc9b570 | th_max = 1024 | th_width = 32\n",
412
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q8_0_h96 0x11f38c770 | th_max = 1024 | th_width = 32\n",
413
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q8_0_h112 0x13a3dcb80 | th_max = 1024 | th_width = 32\n",
414
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q8_0_h128 0x13a3dcde0 | th_max = 896 | th_width = 32\n",
415
+ "ggml_metal_init: loaded kernel_flash_attn_ext_q8_0_h256 0x11bc9b7d0 | th_max = 896 | th_width = 32\n",
416
+ "ggml_metal_init: loaded kernel_flash_attn_ext_vec_f16_h128 0x12ef8c040 | th_max = 1024 | th_width = 32\n",
417
+ "ggml_metal_init: skipping kernel_flash_attn_ext_vec_bf16_h128 (not supported)\n",
418
+ "ggml_metal_init: loaded kernel_flash_attn_ext_vec_q4_0_h128 0x13a3dd040 | th_max = 1024 | th_width = 32\n",
419
+ "ggml_metal_init: loaded kernel_flash_attn_ext_vec_q4_1_h128 0x13a3dd2a0 | th_max = 1024 | th_width = 32\n",
420
+ "ggml_metal_init: loaded kernel_flash_attn_ext_vec_q5_0_h128 0x13a3dd500 | th_max = 768 | th_width = 32\n",
421
+ "ggml_metal_init: loaded kernel_flash_attn_ext_vec_q5_1_h128 0x11bc9bc90 | th_max = 768 | th_width = 32\n",
422
+ "ggml_metal_init: loaded kernel_flash_attn_ext_vec_q8_0_h128 0x11bc9bef0 | th_max = 1024 | th_width = 32\n",
423
+ "ggml_metal_init: loaded kernel_flash_attn_ext_vec_f16_h256 0x11bc9c150 | th_max = 1024 | th_width = 32\n",
424
+ "ggml_metal_init: skipping kernel_flash_attn_ext_vec_bf16_h256 (not supported)\n",
425
+ "ggml_metal_init: loaded kernel_flash_attn_ext_vec_q4_0_h256 0x11f1ad030 | th_max = 1024 | th_width = 32\n",
426
+ "ggml_metal_init: loaded kernel_flash_attn_ext_vec_q4_1_h256 0x11bc9c3b0 | th_max = 896 | th_width = 32\n",
427
+ "ggml_metal_init: loaded kernel_flash_attn_ext_vec_q5_0_h256 0x11f38caf0 | th_max = 704 | th_width = 32\n",
428
+ "ggml_metal_init: loaded kernel_flash_attn_ext_vec_q5_1_h256 0x11bc9c610 | th_max = 704 | th_width = 32\n",
429
+ "ggml_metal_init: loaded kernel_flash_attn_ext_vec_q8_0_h256 0x11bc9c870 | th_max = 832 | th_width = 32\n",
430
+ "ggml_metal_init: loaded kernel_set_f32 0x11bc9cc90 | th_max = 1024 | th_width = 32\n",
431
+ "ggml_metal_init: loaded kernel_set_i32 0x12ef8c2a0 | th_max = 1024 | th_width = 32\n",
432
+ "ggml_metal_init: loaded kernel_cpy_f32_f32 0x13a3dd760 | th_max = 1024 | th_width = 32\n",
433
+ "ggml_metal_init: loaded kernel_cpy_f32_f16 0x11f1ad530 | th_max = 1024 | th_width = 32\n",
434
+ "ggml_metal_init: skipping kernel_cpy_f32_bf16 (not supported)\n",
435
+ "ggml_metal_init: loaded kernel_cpy_f16_f32 0x13a3dd9c0 | th_max = 1024 | th_width = 32\n",
436
+ "ggml_metal_init: loaded kernel_cpy_f16_f16 0x13a3ddc20 | th_max = 1024 | th_width = 32\n",
437
+ "ggml_metal_init: skipping kernel_cpy_bf16_f32 (not supported)\n",
438
+ "ggml_metal_init: skipping kernel_cpy_bf16_bf16 (not supported)\n",
439
+ "ggml_metal_init: loaded kernel_cpy_f32_q8_0 0x11bc9cef0 | th_max = 1024 | th_width = 32\n",
440
+ "ggml_metal_init: loaded kernel_cpy_f32_q4_0 0x11bc9d150 | th_max = 1024 | th_width = 32\n",
441
+ "ggml_metal_init: loaded kernel_cpy_f32_q4_1 0x11bc9d3b0 | th_max = 1024 | th_width = 32\n",
442
+ "ggml_metal_init: loaded kernel_cpy_f32_q5_0 0x13a3dde80 | th_max = 1024 | th_width = 32\n",
443
+ "ggml_metal_init: loaded kernel_cpy_f32_q5_1 0x11f1ad790 | th_max = 1024 | th_width = 32\n",
444
+ "ggml_metal_init: loaded kernel_cpy_f32_iq4_nl 0x12ef8c920 | th_max = 1024 | th_width = 32\n",
445
+ "ggml_metal_init: loaded kernel_cpy_q4_0_f32 0x12ee250e0 | th_max = 1024 | th_width = 32\n",
446
+ "ggml_metal_init: loaded kernel_cpy_q4_0_f16 0x11bc9d610 | th_max = 1024 | th_width = 32\n",
447
+ "ggml_metal_init: loaded kernel_cpy_q4_1_f32 0x11bc9d870 | th_max = 1024 | th_width = 32\n",
448
+ "ggml_metal_init: loaded kernel_cpy_q4_1_f16 0x11bc9dad0 | th_max = 1024 | th_width = 32\n",
449
+ "ggml_metal_init: loaded kernel_cpy_q5_0_f32 0x11bc9dd30 | th_max = 1024 | th_width = 32\n",
450
+ "ggml_metal_init: loaded kernel_cpy_q5_0_f16 0x11bc9df90 | th_max = 1024 | th_width = 32\n",
451
+ "ggml_metal_init: loaded kernel_cpy_q5_1_f32 0x13a3de0e0 | th_max = 1024 | th_width = 32\n",
452
+ "ggml_metal_init: loaded kernel_cpy_q5_1_f16 0x11bc9e1f0 | th_max = 1024 | th_width = 32\n",
453
+ "ggml_metal_init: loaded kernel_cpy_q8_0_f32 0x13a3de340 | th_max = 1024 | th_width = 32\n",
454
+ "ggml_metal_init: loaded kernel_cpy_q8_0_f16 0x11bc9e450 | th_max = 1024 | th_width = 32\n",
455
+ "ggml_metal_init: loaded kernel_concat 0x11f1ad9f0 | th_max = 1024 | th_width = 32\n",
456
+ "ggml_metal_init: loaded kernel_sqr 0x11f1adf00 | th_max = 1024 | th_width = 32\n",
457
+ "ggml_metal_init: loaded kernel_sqrt 0x11f38d440 | th_max = 1024 | th_width = 32\n",
458
+ "ggml_metal_init: loaded kernel_sin 0x11f1ae820 | th_max = 1024 | th_width = 32\n",
459
+ "ggml_metal_init: loaded kernel_cos 0x13a3de820 | th_max = 1024 | th_width = 32\n",
460
+ "ggml_metal_init: loaded kernel_sum_rows 0x12ef8ce10 | th_max = 1024 | th_width = 32\n",
461
+ "ggml_metal_init: loaded kernel_argmax 0x11f38d6a0 | th_max = 1024 | th_width = 32\n",
462
+ "ggml_metal_init: loaded kernel_pool_2d_avg_f32 0x11f1aeba0 | th_max = 1024 | th_width = 32\n",
463
+ "ggml_metal_init: loaded kernel_pool_2d_max_f32 0x13a3dea80 | th_max = 1024 | th_width = 32\n",
464
+ "llama_kv_cache_init: kv_size = 2048, offload = 1, type_k = 'f16', type_v = 'f16', n_layer = 32, can_shift = 1\n",
465
+ "llama_kv_cache_init: layer 0: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
466
+ "llama_kv_cache_init: layer 1: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
467
+ "llama_kv_cache_init: layer 2: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
468
+ "llama_kv_cache_init: layer 3: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
469
+ "llama_kv_cache_init: layer 4: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
470
+ "llama_kv_cache_init: layer 5: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
471
+ "llama_kv_cache_init: layer 6: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
472
+ "llama_kv_cache_init: layer 7: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
473
+ "llama_kv_cache_init: layer 8: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
474
+ "llama_kv_cache_init: layer 9: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
475
+ "llama_kv_cache_init: layer 10: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
476
+ "llama_kv_cache_init: layer 11: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
477
+ "llama_kv_cache_init: layer 12: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
478
+ "llama_kv_cache_init: layer 13: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
479
+ "llama_kv_cache_init: layer 14: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
480
+ "llama_kv_cache_init: layer 15: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
481
+ "llama_kv_cache_init: layer 16: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
482
+ "llama_kv_cache_init: layer 17: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
483
+ "llama_kv_cache_init: layer 18: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
484
+ "llama_kv_cache_init: layer 19: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
485
+ "llama_kv_cache_init: layer 20: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
486
+ "llama_kv_cache_init: layer 21: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
487
+ "llama_kv_cache_init: layer 22: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
488
+ "llama_kv_cache_init: layer 23: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
489
+ "llama_kv_cache_init: layer 24: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
490
+ "llama_kv_cache_init: layer 25: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
491
+ "llama_kv_cache_init: layer 26: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
492
+ "llama_kv_cache_init: layer 27: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
493
+ "llama_kv_cache_init: layer 28: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
494
+ "llama_kv_cache_init: layer 29: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
495
+ "llama_kv_cache_init: layer 30: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
496
+ "llama_kv_cache_init: layer 31: n_embd_k_gqa = 1024, n_embd_v_gqa = 1024\n",
497
+ "llama_kv_cache_init: CPU KV buffer size = 256.00 MiB\n",
498
+ "llama_init_from_model: KV self size = 256.00 MiB, K (f16): 128.00 MiB, V (f16): 128.00 MiB\n",
499
+ "llama_init_from_model: CPU output buffer size = 0.12 MiB\n",
500
+ "llama_init_from_model: CPU compute buffer size = 164.01 MiB\n",
501
+ "llama_init_from_model: graph nodes = 1030\n",
502
+ "llama_init_from_model: graph splits = 514 (with bs=512), 1 (with bs=1)\n",
503
+ "Metal : EMBED_LIBRARY = 1 | CPU : NEON = 1 | ARM_FMA = 1 | FP16_VA = 1 | MATMUL_INT8 = 1 | DOTPROD = 1 | ACCELERATE = 1 | AARCH64_REPACK = 1 | \n",
504
+ "Model metadata: {'general.quantization_version': '2', 'tokenizer.ggml.unknown_token_id': '0', 'tokenizer.ggml.eos_token_id': '2', 'tokenizer.ggml.bos_token_id': '1', 'tokenizer.ggml.model': 'llama', 'llama.attention.head_count_kv': '8', 'llama.context_length': '32768', 'llama.attention.head_count': '32', 'llama.rope.freq_base': '10000.000000', 'llama.rope.dimension_count': '128', 'general.file_type': '15', 'llama.feed_forward_length': '14336', 'llama.embedding_length': '4096', 'llama.block_count': '32', 'general.architecture': 'llama', 'llama.attention.layer_norm_rms_epsilon': '0.000010', 'general.name': 'mistralai_mistral-7b-instruct-v0.1'}\n",
505
+ "Using fallback chat format: llama-2\n"
506
+ ]
507
+ },
508
+ {
509
+ "name": "stdout",
510
+ "output_type": "stream",
511
+ "text": [
512
+ "\n",
513
+ "--- Multi-Turn Mental Health Chatbot ---\n",
514
+ "This assistant simulates a counselor's conversation using AI.\n",
515
+ "- Type something your patient/student might say\n",
516
+ "- Type 'save' to export the conversation\n",
517
+ "- Type 'exit' to quit\n",
518
+ "\n",
519
+ "Example:\n",
520
+ "User: I feel like I’ll mess up my big presentation tomorrow.\n",
521
+ "Counselor: It’s completely normal to feel nervous before a big event...\n",
522
+ "\n"
523
+ ]
524
+ },
525
+ {
526
+ "name": "stdin",
527
+ "output_type": "stream",
528
+ "text": [
529
+ "User: i'm nervous\n"
530
+ ]
531
+ },
532
+ {
533
+ "name": "stdout",
534
+ "output_type": "stream",
535
+ "text": [
536
+ "(Predicted: information, Confidence: 85.5%)\n"
537
+ ]
538
+ },
539
+ {
540
+ "name": "stderr",
541
+ "output_type": "stream",
542
+ "text": [
543
+ "llama_perf_context_print: load time = 766.62 ms\n",
544
+ "llama_perf_context_print: prompt eval time = 766.43 ms / 23 tokens ( 33.32 ms per token, 30.01 tokens per second)\n",
545
+ "llama_perf_context_print: eval time = 17184.04 ms / 149 runs ( 115.33 ms per token, 8.67 tokens per second)\n",
546
+ "llama_perf_context_print: total time = 17971.68 ms / 172 tokens\n"
547
+ ]
548
+ },
549
+ {
550
+ "name": "stdout",
551
+ "output_type": "stream",
552
+ "text": [
553
+ "Counselor: From a mental health perspective, feeling nervous or anxious is a common and normal human experience. It's a natural response to stress or a challenging situation. However, if the feelings of nervousness persist or interfere with daily life, it could be a sign of an anxiety disorder.\n",
554
+ "\n",
555
+ "Anxiety disorders are a group of mental health conditions in which a person feels excessive and persistent worry or fear. These feelings can interfere with daily life, such as work, school, relationships, and personal activities. The symptoms of anxiety disorders vary from person to person, but typically include physical symptoms like sweating, trembling, and difficulty sleeping, as well as psychological symptoms like feelings of worry, fear, and avoidance\n"
556
+ ]
557
+ }
558
+ ],
559
+ "source": [
560
+ "# Multi-Turn Mental Health Chatbot using XGBoost + Mistral-7B\n",
561
+ "\n",
562
+ "import os\n",
563
+ "import pandas as pd\n",
564
+ "from sklearn.feature_extraction.text import TfidfVectorizer\n",
565
+ "from sklearn.preprocessing import LabelEncoder\n",
566
+ "from sklearn.model_selection import train_test_split\n",
567
+ "from xgboost import XGBClassifier\n",
568
+ "from llama_cpp import Llama\n",
569
+ "import json\n",
570
+ "\n",
571
+ "# --- Load and prepare the dataset ---\n",
572
+ "df = pd.read_csv(\"dataset/Kaggle_Mental_Health_Conversations_train.csv\")\n",
573
+ "df = df[['Context', 'Response']].dropna().copy()\n",
574
+ "\n",
575
+ "# Auto-labeling\n",
576
+ "keywords_to_labels = {\n",
577
+ " 'advice': ['try', 'should', 'suggest', 'recommend'],\n",
578
+ " 'validation': ['understand', 'feel', 'valid', 'normal'],\n",
579
+ " 'information': ['cause', 'often', 'disorder', 'symptom'],\n",
580
+ " 'question': ['how', 'what', 'why', 'have you']\n",
581
+ "}\n",
582
+ "\n",
583
+ "def auto_label_response(response):\n",
584
+ " response = response.lower()\n",
585
+ " for label, keywords in keywords_to_labels.items():\n",
586
+ " if any(word in response for word in keywords):\n",
587
+ " return label\n",
588
+ " return 'information'\n",
589
+ "\n",
590
+ "df['response_type'] = df['Response'].apply(auto_label_response)\n",
591
+ "\n",
592
+ "# Combine Context and Response for richer features\n",
593
+ "df['combined_text'] = df['Context'] + \" \" + df['Response']\n",
594
+ "\n",
595
+ "# Encode labels\n",
596
+ "le = LabelEncoder()\n",
597
+ "y = le.fit_transform(df['response_type'])\n",
598
+ "\n",
599
+ "# TF-IDF Vectorization\n",
600
+ "vectorizer = TfidfVectorizer(max_features=2000, ngram_range=(1, 2))\n",
601
+ "X = vectorizer.fit_transform(df['combined_text'])\n",
602
+ "\n",
603
+ "# Train/Test Split\n",
604
+ "X_train, X_test, y_train, y_test = train_test_split(\n",
605
+ " X, y, test_size=0.2, stratify=y, random_state=42\n",
606
+ ")\n",
607
+ "\n",
608
+ "# Train XGBoost Classifier\n",
609
+ "xgb_model = XGBClassifier(\n",
610
+ " objective='multi:softmax',\n",
611
+ " num_class=len(le.classes_),\n",
612
+ " eval_metric='mlogloss',\n",
613
+ " use_label_encoder=False,\n",
614
+ " max_depth=6,\n",
615
+ " learning_rate=0.1,\n",
616
+ " n_estimators=100\n",
617
+ ")\n",
618
+ "xgb_model.fit(X_train, y_train)\n",
619
+ "\n",
620
+ "# Load Mistral model\n",
621
+ "MODEL_PATH = os.path.expanduser(\"/Users/Pi/models/mistral/mistral-7b-instruct-v0.1.Q4_K_M.gguf\")\n",
622
+ "llm = Llama(model_path=MODEL_PATH, n_ctx=2048, n_threads=os.cpu_count())\n",
623
+ "\n",
624
+ "# --- Chatbot Functions ---\n",
625
+ "def predict_response_type(user_input):\n",
626
+ " vec = vectorizer.transform([user_input])\n",
627
+ " pred = xgb_model.predict(vec)\n",
628
+ " proba = xgb_model.predict_proba(vec).max()\n",
629
+ " label = le.inverse_transform(pred)[0]\n",
630
+ " return label, proba\n",
631
+ "\n",
632
+ "def build_prompt(user_input, response_type):\n",
633
+ " prompts = {\n",
634
+ " \"advice\": f\"A student said: \\\"{user_input}\\\". What advice should a counselor give to help?\",\n",
635
+ " \"validation\": f\"A student said: \\\"{user_input}\\\". How can a counselor validate the student's emotions?\",\n",
636
+ " \"information\": f\"A student said: \\\"{user_input}\\\". Explain what might be happening from a mental health perspective.\",\n",
637
+ " \"question\": f\"A student said: \\\"{user_input}\\\". What follow-up questions should a counselor ask?\"\n",
638
+ " }\n",
639
+ " return prompts.get(response_type, prompts[\"information\"])\n",
640
+ "\n",
641
+ "def generate_llm_response(user_input, response_type):\n",
642
+ " prompt = build_prompt(user_input, response_type)\n",
643
+ " result = llm(prompt, max_tokens=150, temperature=0.7)\n",
644
+ " return result['choices'][0]['text'].strip()\n",
645
+ "\n",
646
+ "def trim_memory(history, max_turns=6):\n",
647
+ " return history[-max_turns * 2:]\n",
648
+ "\n",
649
+ "def save_conversation(history):\n",
650
+ " with open(\"chat_history.json\", \"w\") as f:\n",
651
+ " json.dump(history, f, indent=2)\n",
652
+ " print(\"✅ Conversation saved to chat_history.json\\n\")\n",
653
+ "\n",
654
+ "def show_intro():\n",
655
+ " print(\"\\n--- Multi-Turn Mental Health Chatbot ---\")\n",
656
+ " print(\"This assistant simulates a counselor's conversation using AI.\")\n",
657
+ " print(\"- Type something your patient/student might say\")\n",
658
+ " print(\"- Type 'save' to export the conversation\")\n",
659
+ " print(\"- Type 'exit' to quit\\n\")\n",
660
+ "\n",
661
+ " print(\"Example:\")\n",
662
+ " print(\"User: I feel like I’ll mess up my big presentation tomorrow.\")\n",
663
+ " print(\"Counselor: It’s completely normal to feel nervous before a big event...\\n\")\n",
664
+ "\n",
665
+ "# --- Run Chatbot ---\n",
666
+ "history = []\n",
667
+ "show_intro()\n",
668
+ "\n",
669
+ "while True:\n",
670
+ " user_input = input(\"User: \").strip()\n",
671
+ "\n",
672
+ " if user_input.lower() == \"exit\":\n",
673
+ " print(\"Goodbye\")\n",
674
+ " break\n",
675
+ " elif user_input.lower() == \"save\":\n",
676
+ " save_conversation(history)\n",
677
+ " continue\n",
678
+ "\n",
679
+ " predicted_type, confidence = predict_response_type(user_input)\n",
680
+ " print(f\"(Prfedicted: {predicted_type}, Confidence: {confidence:.1%})\")\n",
681
+ "\n",
682
+ " llm_reply = generate_llm_response(user_input, predicted_type)\n",
683
+ "\n",
684
+ " history.append({\"role\": \"user\", \"content\": user_input})\n",
685
+ " history.append({\"role\": \"assistant\", \"content\": llm_reply})\n",
686
+ " history = trim_memory(history)\n",
687
+ "\n",
688
+ " print(\"Counselor:\", llm_reply)"
689
+ ]
690
+ },
691
+ {
692
+ "cell_type": "code",
693
+ "execution_count": null,
694
+ "id": "8e2f9122-f3dd-4d09-bfb1-c0438df2e8f1",
695
+ "metadata": {},
696
+ "outputs": [],
697
+ "source": []
698
+ }
699
+ ],
700
+ "metadata": {
701
+ "kernelspec": {
702
+ "display_name": "Python (myenv)",
703
+ "language": "python",
704
+ "name": "myenv"
705
+ },
706
+ "language_info": {
707
+ "codemirror_mode": {
708
+ "name": "ipython",
709
+ "version": 3
710
+ },
711
+ "file_extension": ".py",
712
+ "mimetype": "text/x-python",
713
+ "name": "python",
714
+ "nbconvert_exporter": "python",
715
+ "pygments_lexer": "ipython3",
716
+ "version": "3.10.16"
717
+ }
718
+ },
719
+ "nbformat": 4,
720
+ "nbformat_minor": 5
721
+ }
README.md CHANGED
@@ -1,13 +1,106 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
- title: Mental Health Chatbot
3
- emoji:
4
- colorFrom: indigo
5
- colorTo: green
6
- sdk: streamlit
7
- sdk_version: 1.44.1
8
- app_file: app.py
9
- pinned: false
10
- license: mit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  ---
12
 
13
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # 🧠 Mental Health Counselor Assistant
2
+
3
+ A privacy-friendly, intelligent assistant designed to help mental health professionals explore **response suggestions** based on patient input. Built with `XGBoost` for intent classification and `Mistral-7B` for natural language generation, this app helps support clinical conversations with empathy and structure — while keeping you in full control.
4
+
5
+ ---
6
+
7
+ ## ✅ Problem Statement
8
+
9
+ Mental health counselors are tasked with responding to a wide range of complex emotions and situations. In high-pressure or uncertain moments, this tool helps:
10
+
11
+ - Predict the **intent** behind a patient's message (advice, validation, information, or question; see the sketch below)
12
+ - Generate a supportive, **AI-assisted counselor response**
13
+ - Log conversations for review and learning
14
+ - Preserve privacy with local inference — built for **HIPAA-aware workflows**
15
+
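+ A minimal sketch of that two-step flow, assuming the helpers defined in `app.py` (`predict_response_type`, `generate_llm_response`) are already in scope:
+
+ ```python
+ # Illustrative only: run inside the app module where the helpers are defined.
+ user_input = "I just feel like I'm never going to get better."
+
+ # Step 1: the XGBoost classifier predicts the kind of support the message calls for.
+ intent, confidence = predict_response_type(user_input)   # e.g. ("validation", 0.82)
+
+ # Step 2: the intent selects a prompt template and the local LLM drafts a reply.
+ suggestion = generate_llm_response(user_input, intent)
+
+ print(f"Intent: {intent} ({confidence:.0%})")
+ print("Suggested counselor response:", suggestion)
+ ```
+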
16
+ ---
17
+
18
+ ## ✨ Features
19
+
20
+ - 🔍 **Intent Prediction** — XGBoost classifier trained on annotated mental health dialogue
21
+ - 💬 **Response Generation** — LLM-backed replies using quantized [Mistral-7B](https://mistral.ai/news/announcing-mistral-7b/)
22
+ - 🧠 **Multi-turn Memory** — Maintains up to 6 rounds of conversation
23
+ - ✅ **Export to JSON + CSV** — With timestamps for traceability
24
+ - 🧰 **HIPAA-conscious Design** — No third-party API calls required; all runs locally
25
+
26
+ ---
27
+
28
+ ## 🖼️ App Preview
29
+
30
+ ![counselor assistant demo](demo/demo.gif)
31
+
32
+ ---
33
+
34
+ ## 🚀 How to Run
35
+
36
+ ### 🔧 Local (with Mistral-7B)
37
+
38
+ > 🧩 You must download a quantized `.gguf` model file from Hugging Face or TheBloke, e.g.:
39
+ >
40
+ > `/Users/yourname/models/mistral/mistral-7b-instruct-v0.1.Q4_K_M.gguf`
41
+
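+ One way to fetch it, assuming you have the `huggingface_hub` CLI available (the target directory below is a placeholder; point it wherever you keep models):
+
+ ```bash
+ # Download TheBloke's Q4_K_M quantization into a local models directory
+ pip install -U huggingface_hub
+ huggingface-cli download TheBloke/Mistral-7B-Instruct-v0.1-GGUF \
+   mistral-7b-instruct-v0.1.Q4_K_M.gguf --local-dir ~/models/mistral
+ ```
+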
42
+ ```bash
43
+ git clone https://github.com/yourname/mental-health-chatbot.git
44
+ cd mental-health-chatbot
45
+ pip install -r requirements.txt
46
+ streamlit run app_local.py
47
+ ```
48
+
49
  ---
50
+
51
+ ### 🐳 Docker (Recommended)
52
+
53
+ ```bash
54
+ docker build -t counselor-assistant .
55
+ docker run -p 8501:8501 counselor-assistant
56
+ ```
57
+
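+ Because the app hardcodes an absolute `MODEL_PATH`, the container will likely need the model directory mounted at that same path (a sketch; substitute your own host and container paths):
+
+ ```bash
+ # Mount the host model directory at the path the app expects inside the container
+ docker run -p 8501:8501 \
+   -v "$HOME/models/mistral:/Users/Pi/models/mistral" \
+   counselor-assistant
+ ```
+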
58
+ ---
59
+
60
+ ## 📁 Project Structure
61
+
62
+ ```
63
+ mental-health-chatbot/
64
+
65
+ ├── app_local.py # Streamlit app (uses llama-cpp + XGBoost)
66
+ ├── requirements.txt
67
+ ├── Dockerfile
68
+ ├── demo/ # Optional: screenshots or .gif
69
+ ├── dataset/
70
+ │ └── Kaggle_Mental_Health_Conversations_train.csv
71
+ ├── models/
72
+ │ └── mistral-7b-instruct-v0.1.Q4_K_M.gguf
73
+ ├── README.md
74
+ └── LICENSE
75
+ ```
76
+
77
+ ---
78
+
79
+ ## 📌 Design Decisions
80
+
81
+ - 🧠 Used `TfidfVectorizer` + `XGBoost` as a **lightweight, interpretable** classifier
82
+ - 🔒 LLM inference handled locally with `llama-cpp-python` — no external data leaks
83
+ - 📦 Docker support to simulate realistic deployment and improve portability
84
+ - 🎯 Optimized for educational and prototyping use in **clinician settings**
85
+
86
  ---
87
 
88
+ ## 📍 Future Improvements
89
+
90
+ - Add long-form audio transcription support
91
+ - Integrate feedback loop for model tuning
92
+ - Auto-summary of sessions
93
+ - Hugging Face Space version (hosted with opt-in privacy tradeoffs)
94
+
95
+ ---
96
+
97
+ ## 👩‍⚕️ A Note to Reviewers
98
+
99
+ This project was developed for an interview to showcase how **AI and empathy** can work hand-in-hand to support mental health professionals. Thank you for reviewing, and I hope it sparks a great conversation about the intersection of tech and care. 💙
100
+
101
+ ---
102
+
103
+ ## 📄 License
104
+
105
+ MIT — free to use and modify, but please be thoughtful in healthcare applications.
106
+
app.py ADDED
@@ -0,0 +1,175 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ import streamlit as st
2
+ import os
3
+ import pandas as pd
4
+ import json
5
+ import time
6
+ import csv
7
+ from datetime import datetime
8
+ from sklearn.feature_extraction.text import TfidfVectorizer
9
+ from sklearn.preprocessing import LabelEncoder
10
+ from sklearn.model_selection import train_test_split
11
+ from xgboost import XGBClassifier
12
+ from llama_cpp import Llama
13
+
14
+ st.set_page_config(page_title="Counselor Assistant", layout="centered")
15
+
16
+ st.markdown("""
17
+ <style>
18
+ .main { background-color: #f9f9f9; padding: 1rem 2rem; border-radius: 12px; }
19
+ h1 { color: #2c3e50; text-align: center; font-size: 2.4rem; }
20
+ .user { color: #1f77b4; font-weight: bold; }
21
+ .assistant { color: #2ca02c; font-weight: bold; }
22
+ </style>
23
+ """, unsafe_allow_html=True)
24
+
25
+ st.title("🧠 Mental Health Counselor Assistant")
26
+ st.markdown("""
27
+ Hi there, counselor 👋
28
+
29
+ This tool is here to offer **supportive, AI-generated suggestions** when you’re not quite sure how to respond to a patient.
30
+
31
+ ### How it helps:
32
+ - 🧩 Predicts the type of support your patient might need (advice, validation, information, or question)
33
+ - 💬 Generates a supportive counselor response
34
+ - 📁 Lets you save and track conversations for reflection
35
+
36
+ It's a sidekick, not a substitute for your clinical judgment 💚
37
+ """)
38
+
39
+ df = pd.read_csv("dataset/Kaggle_Mental_Health_Conversations_train.csv")
40
+ df = df[['Context', 'Response']].dropna().copy()
41
+
42
+ keywords_to_labels = {
43
+ 'advice': ['try', 'should', 'suggest', 'recommend'],
44
+ 'validation': ['understand', 'feel', 'valid', 'normal'],
45
+ 'information': ['cause', 'often', 'disorder', 'symptom'],
46
+ 'question': ['how', 'what', 'why', 'have you']
47
+ }
48
+
49
+ def auto_label_response(response):
50
+ response = response.lower()
51
+ for label, keywords in keywords_to_labels.items():
52
+ if any(word in response for word in keywords):
53
+ return label
54
+ return 'information'
55
+
56
+ df['response_type'] = df['Response'].apply(auto_label_response)
57
+ df['combined_text'] = df['Context'] + " " + df['Response']
58
+
59
+ le = LabelEncoder()
60
+ y = le.fit_transform(df['response_type'])
61
+
62
+ vectorizer = TfidfVectorizer(max_features=2000, ngram_range=(1, 2))
63
+ X = vectorizer.fit_transform(df['combined_text'])
64
+
65
+ X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, stratify=y, random_state=42)
66
+
67
+ xgb_model = XGBClassifier(
68
+ objective='multi:softmax',
69
+ num_class=len(le.classes_),
70
+ eval_metric='mlogloss',
71
+ use_label_encoder=False,
72
+ max_depth=6,
73
+ learning_rate=0.1,
74
+ n_estimators=100
75
+ )
76
+ xgb_model.fit(X_train, y_train)
77
+
78
+ MODEL_PATH = os.path.expanduser("/Users/Pi/models/mistral/mistral-7b-instruct-v0.1.Q4_K_M.gguf")
79
+ @st.cache_resource(show_spinner=True)
80
+ def load_llm():
81
+ return Llama(model_path=MODEL_PATH, n_ctx=2048, n_threads=os.cpu_count())
82
+
83
+ llm = load_llm()
84
+
85
+ def predict_response_type(user_input):
86
+ vec = vectorizer.transform([user_input])
87
+ pred = xgb_model.predict(vec)
88
+ proba = xgb_model.predict_proba(vec).max()
89
+ label = le.inverse_transform(pred)[0]
90
+ return label, proba
91
+
92
+ def build_prompt(user_input, response_type):
93
+ prompts = {
94
+ "advice": f"A patient said: \"{user_input}\". What advice should a mental health counselor give to support them?",
95
+ "validation": f"A patient said: \"{user_input}\". How can a counselor validate and empathize with their emotions?",
96
+ "information": f"A patient said: \"{user_input}\". Explain what might be happening from a mental health perspective.",
97
+ "question": f"A patient said: \"{user_input}\". What thoughtful follow-up questions should a counselor ask?"
98
+ }
99
+ return prompts.get(response_type, prompts["information"])
100
+
101
+ def generate_llm_response(user_input, response_type):
102
+ prompt = build_prompt(user_input, response_type)
103
+ start = time.time()
104
+ with st.spinner("Thinking through a helpful response for your patient..."):
105
+ result = llm(prompt, max_tokens=300, temperature=0.7)
106
+ end = time.time()
107
+ st.info(f"Response generated in {end - start:.1f} seconds")
108
+ return result['choices'][0]['text'].strip()
109
+
110
+ def trim_memory(history, max_turns=6):
111
+ return history[-max_turns * 2:]
112
+
113
+ def save_conversation(history):
114
+ now = datetime.now().strftime("%Y-%m-%d %H:%M:%S")
115
+ with open("chat_log_combined.csv", "w", newline='') as f:
116
+ writer = csv.writer(f)
117
+ writer.writerow(["Timestamp", "Role", "Content", "Intent", "Confidence"])
118
+ for entry in history:
119
+ writer.writerow([
120
+ now,
121
+ entry.get("role", ""),
122
+ entry.get("content", ""),
123
+ entry.get("label", ""),
124
+ round(float(entry.get("confidence", 0)), 2)
125
+ ])
126
+ st.success("Saved to chat_log_combined.csv")
127
+
128
+ if "history" not in st.session_state:
129
+ st.session_state.history = []
130
+ if "user_input" not in st.session_state:
131
+ st.session_state.user_input = ""
132
+
133
+ MAX_WORDS = 1000
134
+ word_count = len(st.session_state.user_input.split())
135
+ st.markdown(f"**📝 Input Length:** {word_count} / {MAX_WORDS} words")
136
+ st.session_state.user_input = st.text_area(
137
+ "💬 What did your patient say?",
138
+ value=st.session_state.user_input,
139
+ placeholder="e.g. I just feel like I'm never going to get better.",
140
+ height=100
141
+ )
142
+
143
+ col1, col2, col3 = st.columns([2, 1, 1])
144
+ with col1:
145
+ send = st.button("💡 Suggest Response")
146
+ with col2:
147
+ save = st.button("📁 Save This")
148
+ with col3:
149
+ reset = st.button("🔁 Reset")
150
+
151
+ if send and st.session_state.user_input:
152
+ user_input = st.session_state.user_input
153
+ predicted_type, confidence = predict_response_type(user_input)
154
+ reply = generate_llm_response(user_input, predicted_type)
155
+
156
+ st.session_state.history.append({"role": "user", "content": user_input})
157
+ st.session_state.history.append({"role": "assistant", "content": reply, "label": predicted_type, "confidence": confidence})
158
+ st.session_state.history = trim_memory(st.session_state.history)
159
+
160
+ if save:
161
+ save_conversation(st.session_state.history)
162
+
163
+ if reset:
164
+ st.session_state.history = []
165
+ st.session_state.user_input = ""
166
+ st.success("Conversation has been cleared.")
167
+
168
+ st.markdown("---")
169
+ for turn in st.session_state.history:
170
+ if turn["role"] == "user":
171
+ st.markdown(f"🧍‍♀️ **Patient:** {turn['content']}")
172
+ else:
173
+ st.markdown(f"👩‍⚕️👨‍⚕️ **Suggested Counselor Response:** {turn['content']}")
174
+ st.caption(f"_Intent: {turn['label']} (Confidence: {turn['confidence']:.0%})_")
175
+ st.markdown("---")
app_slow_version.py ADDED
@@ -0,0 +1,172 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Streamlit App: Counselor Assistant using XGBoost + Flan-T5 (Cloud Version)
2
+
3
+ import streamlit as st
4
+ import os
5
+ import pandas as pd
6
+ import json
7
+ import time
8
+ import csv
9
+ from sklearn.feature_extraction.text import TfidfVectorizer
10
+ from sklearn.preprocessing import LabelEncoder
11
+ from sklearn.model_selection import train_test_split
12
+ from xgboost import XGBClassifier
13
+ from transformers import pipeline
14
+
15
+ st.set_page_config(page_title="Counselor Assistant", layout="centered")
16
+
17
+ st.markdown("""
18
+ <style>
19
+ .main { background-color: #f4f4f9; padding: 1rem 2rem; border-radius: 12px; }
20
+ h1 { color: #2c3e50; text-align: center; font-size: 2.4rem; }
21
+ .user { color: #1f77b4; font-weight: bold; }
22
+ .assistant { color: #2ca02c; font-weight: bold; }
23
+ </style>
24
+ """, unsafe_allow_html=True)
25
+
26
+ st.title("Mental Health Counselor Assistant")
27
+ st.markdown("""
28
+ Welcome, counselor 👩‍⚕️👨‍⚕️
29
+
30
+ This assistant is designed to provide you with **supportive, evidence-based suggestions** when you're unsure how to best respond to a patient’s concerns.
31
+
32
+ Just enter what your patient shared with you, and this tool will:
33
+ - Predict the type of support that fits best (e.g., advice, validation, information, or question)
34
+ - Generate a suggested counselor reply
35
+ - Let you save the conversation for your records
36
+
37
+ This is not a diagnostic tool — it’s here to support **your clinical intuition**.
38
+ """)
39
+
40
+ # Load and prepare the dataset
41
+ df = pd.read_csv("dataset/Kaggle_Mental_Health_Conversations_train.csv")
42
+ df = df[['Context', 'Response']].dropna().copy()
43
+
44
+ keywords_to_labels = {
45
+ 'advice': ['try', 'should', 'suggest', 'recommend'],
46
+ 'validation': ['understand', 'feel', 'valid', 'normal'],
47
+ 'information': ['cause', 'often', 'disorder', 'symptom'],
48
+ 'question': ['how', 'what', 'why', 'have you']
49
+ }
50
+
51
+ def auto_label_response(response):
52
+ response = response.lower()
53
+ for label, keywords in keywords_to_labels.items():
54
+ if any(word in response for word in keywords):
55
+ return label
56
+ return 'information'
57
+
58
+ df['response_type'] = df['Response'].apply(auto_label_response)
59
+ df['combined_text'] = df['Context'] + " " + df['Response']
60
+
61
+ le = LabelEncoder()
62
+ y = le.fit_transform(df['response_type'])
63
+
64
+ vectorizer = TfidfVectorizer(max_features=2000, ngram_range=(1, 2))
65
+ X = vectorizer.fit_transform(df['combined_text'])
66
+
67
+ X_train, X_test, y_train, y_test = train_test_split(
68
+ X, y, test_size=0.2, stratify=y, random_state=42
69
+ )
70
+
71
+ xgb_model = XGBClassifier(
72
+ objective='multi:softmax',
73
+ num_class=len(le.classes_),
74
+ eval_metric='mlogloss',
75
+ use_label_encoder=False,
76
+ max_depth=6,
77
+ learning_rate=0.1,
78
+ n_estimators=100
79
+ )
80
+ xgb_model.fit(X_train, y_train)
81
+
82
+ # Replace Mistral-7B with Flan-T5 hosted model
83
+ @st.cache_resource(show_spinner="Loading Flan-T5 model...")
84
+ def load_llm():
85
+ return pipeline("text2text-generation", model="google/flan-t5-base")
86
+
87
+ llm = load_llm()
88
+
89
+ def predict_response_type(user_input):
90
+ vec = vectorizer.transform([user_input])
91
+ pred = xgb_model.predict(vec)
92
+ proba = xgb_model.predict_proba(vec).max()
93
+ label = le.inverse_transform(pred)[0]
94
+ return label, proba
95
+
96
+ def build_prompt(user_input, response_type):
97
+ prompts = {
98
+ "advice": f"A patient said: \"{user_input}\". What advice should a mental health counselor give to support them?",
99
+ "validation": f"A patient said: \"{user_input}\". How can a counselor validate and empathize with their emotions?",
100
+ "information": f"A patient said: \"{user_input}\". Explain what might be happening from a mental health perspective.",
101
+ "question": f"A patient said: \"{user_input}\". What thoughtful follow-up questions should a counselor ask?"
102
+ }
103
+ return prompts.get(response_type, prompts["information"])
104
+
105
+ def generate_llm_response(user_input, response_type):
106
+ prompt = build_prompt(user_input, response_type)
107
+ start = time.time()
108
+ with st.spinner("Thinking through a helpful response for your patient..."):
109
+ result = llm(prompt, max_length=150, do_sample=True, temperature=0.7)
110
+ end = time.time()
111
+ st.info(f"Response generated in {end - start:.1f} seconds")
112
+ return result[0]["generated_text"].strip()
113
+
114
+ def trim_memory(history, max_turns=6):
115
+ return history[-max_turns * 2:]
116
+
117
+ def save_conversation(history):
118
+ with open("chat_history.json", "w") as f:
119
+ json.dump(history, f, indent=2)
120
+ with open("chat_log.csv", "w", newline='') as f:
121
+ writer = csv.writer(f)
122
+ writer.writerow(["Role", "Content"])
123
+ for entry in history:
124
+ writer.writerow([entry.get("role", ""), entry.get("content", "")])
125
+ st.success("Saved to chat_history.json and chat_log.csv")
126
+
127
+ # Streamlit UI
128
+ if "history" not in st.session_state:
129
+ st.session_state.history = []
130
+
131
+ with st.expander("💡 Sample inputs you can try"):
132
+ st.markdown("""
133
+ - My patient is constantly feeling overwhelmed at work.
134
+ - A student says they panic every time they have to speak in class.
135
+ - Someone told me they think they’ll never feel okay again.
136
+ """)
137
+
138
+ user_input = st.text_area("💬 What did your patient say?", placeholder="e.g. I just feel like I'm never going to get better.", height=100)
139
+
140
+ col1, col2, col3 = st.columns([2, 1, 1])
141
+
142
+ with col1:
143
+ send = st.button("Suggest Response")
144
+ with col2:
145
+ save = st.button("📁 Save This")
146
+ with col3:
147
+ reset = st.button("🔁 Reset")
148
+
149
+ if send and user_input:
150
+ predicted_type, confidence = predict_response_type(user_input)
151
+ reply = generate_llm_response(user_input, predicted_type)
152
+
153
+ st.session_state.history.append({"role": "user", "content": user_input})
154
+ st.session_state.history.append({"role": "assistant", "content": reply, "label": predicted_type, "confidence": confidence})
155
+ st.session_state.history = trim_memory(st.session_state.history)
156
+
157
+ if save:
158
+ save_conversation(st.session_state.history)
159
+
160
+ if reset:
161
+ st.session_state.history = []
162
+ st.success("Conversation has been cleared.")
163
+
164
+ st.markdown("---")
165
+ for turn in st.session_state.history:
166
+ if turn["role"] == "user":
167
+ st.markdown(f"🧍‍♀️ **Patient:** {turn['content']}")
168
+ else:
169
+ st.markdown(f"👩‍⚕️👨‍⚕️ **Suggested Counselor Response:** {turn['content']}")
170
+ st.caption(f"_Intent: {turn['label']} (Confidence: {turn['confidence']:.0%})_")
171
+ st.markdown("---")
172
+
chat_log_combined.csv ADDED
@@ -0,0 +1,19 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Timestamp,Role,Content,Intent,Confidence
2
+ 2025-04-11 03:11:14,user,i am nervous about my incoming interview ,,0.0
3
+ 2025-04-11 03:11:14,assistant,"From a mental health perspective, it is understandable that the patient may be feeling nervous about their upcoming interview. Anxiety is a common and natural response to stressful situations, and it can manifest in various physical and emotional symptoms.
4
+
5
+ The patient's nervousness may be related to their fear of failure, uncertainty about the interview process, or concerns about their ability to perform well. These feelings of anxiety may be affecting their self-confidence, sleep, appetite, and overall well-being.
6
+
7
+ It is important for the patient to acknowledge and validate their feelings, and to take steps to manage their anxiety. Some effective coping strategies for managing anxiety include deep breathing exercises, progressive muscle relaxation, cognitive-behavioral therapy, and mindfulness practices. It may also be helpful for the patient to practice positive self-talk and visualization techniques to boost their confidence and self-esteem.
8
+
9
+ Overall, it is essential for the patient to prioritize their mental health and well-being during this time, and to seek support from friends, family, or a mental health professional if needed.",information,0.84
10
+ 2025-04-11 03:11:14,user,i cannot sleep ,,0.0
11
+ 2025-04-11 03:11:14,assistant,"There can be a variety of reasons why someone might have trouble sleeping from a mental health perspective. Some common causes include:
12
+
13
+ 1. Stress and anxiety: High levels of stress and anxiety can interfere with the ability to fall and stay asleep. This can be caused by a number of factors, such as work or family problems, financial difficulties, or health concerns.
14
+ 2. Depression: Depression can cause changes in sleep patterns, including difficulty falling and staying asleep, or feeling tired during the day.
15
+ 3. Bipolar disorder: Bipolar disorder is a mental health condition characterized by extreme mood swings, including periods of depression and mania. Sleep disturbances are common in people with bipolar disorder.
16
+ 4. Post-traumatic stress disorder (PTSD): PTSD is a mental health condition that can develop after experiencing or witnessing a traumatic event, such as a natural disaster, a serious accident, or a sexual assault. People with PTSD may have difficulty sleeping, as they may be haunted by memories of the event.
17
+ 5. Substance abuse: The use of drugs and alcohol can interfere with sleep, making it difficult to fall and stay asleep.
18
+
19
+ It is important to note that these are just a few examples, and that there may be other underlying factors that are contributing to the difficulty sleeping. If the patient is experiencing persistent difficulty sleeping, it may be helpful to seek professional help",information,0.85
requirements.txt ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ streamlit
2
+ pandas
3
+ scikit-learn
4
+ xgboost
5
+ seaborn
6
+ matplotlib
7
+ llama-cpp-python