license: apache-2.0
language:
- es
- en
tags:
- lora
- openllama
- agriculture
- climate
- potato
- few-shot
- classification
- low-resource
- causal-lm
datasets:
- custom
base_model: openlm-research/open_llama_3b
model-index:
- name: lora-lateblight-v5
results: []
lora-lateblight-v5 🌱 — OpenLLaMA-3B Fine-Tuned for Potato Late Blight Risk Prediction in Huancavelica, Peru
Author: Jorge Luis Alonso (@jalonso24)
Model Base: openlm-research/open_llama_3b
Adapter Format: LoRA (PEFT)
Location: Huancavelica, Andes, Peru
Languages: Spanish, English
Task: Multi-class classification (Bajo
, Moderado
, Alto
) of late blight disease risk using natural language prompts
🔍 What is this?
lora-lateblight-v5
is a lightweight, prompt-based classifier that predicts potato late blight risk levels using natural language inputs. It was fine-tuned with LoRA adapters on a curated dataset of expert-labelled and synthetic examples from Huancavelica, Peru.
It supports bilingual prompts, runs on low-cost hardware, and helps agronomists and rural farmers make better decisions using just weather and crop descriptions in plain language.
🧪 How was it trained?
- Fine-Tuning Method: LoRA (Low-Rank Adaptation)
r=32
,alpha=16
,dropout=0.1
- Training Framework: PEFT with
transformers
, Colab T4 GPU - Epochs: 5, using mixed precision with float32 stabilization
- Data Sources:
- Historical climate + crop risk data (SENAMHI, CIP)
- Synthetic "Alto" risk cases for balancing
- Distilled "Moderado" cases from GPT-4 + Gemini consensus
- Prompt Style:
- Few-shot (3 labeled examples:
Bajo
,Moderado
,Alto
) - Natural, bilingual inputs (e.g., Spanish + English)
- Example format:
La variedad es INIA-302 Amarilis, sembrada en noviembre. La precipitación fue 18.4 mm, la temperatura máxima 17.2°C, la mínima 6.1°C y la humedad promedio 84.12%. ¿Cuál es el riesgo de tizón tardío? Riesgo:
- Few-shot (3 labeled examples:
📊 Evaluation Summary
Metric | Score |
---|---|
Accuracy | 73% |
Recall (Alto) | 100% |
F1-score (Alto) | 0.78 |
F1-score (Mod.) | 0.25 |
- ✅ Model reliably detects all high-risk (Alto) cases
- ⚠️ Struggles with moderate-risk (Moderado) due to timing subtleties
- ⚖️ Prioritizes early warning over precision — suitable for advisory tools
📁 Files Included
adapter_model.safetensors
— LoRA adapter weightstokenizer/
— Tokenizer from OpenLLaMA 3BREADME.md
— This model card
🧠 Intended Use
- Forecast potato late blight risk in high-altitude Andean regions
- Run lightweight inference on devices like Beelink Mini PCs
- Support local extension workers, agronomists, or AI-powered alert tools
- Educate farmers using mobile-friendly prompts in Spanish or English
🚧 Known Limitations
- Overpredicts
Alto
in ambiguous cases — intended as a safety trade-off - Struggles with unclear or borderline
Moderado
cases - Not designed for automated spraying systems or IoT-triggered actions
- Needs expert validation before field-level decisions
🫷 Try It (Coming Soon)
We’re working on a Gradio demo where you’ll be able to enter:
- Crop variety
- Planting month
- Weekly humidity, temperature, and rainfall
👉 and receive a natural-language Riesgo prediction.
📚 Citation
@misc{alonso2025lateblight,
author = {Jorge Luis Alonso},
title = {Predicting Potato Late Blight in Huancavelica, Peru, Using OpenLLaMA 3B + LoRA},
year = {2025}, {\url{https://www.linkedin.com/pulse/improving-potato-late-blight-forecasting-huancavelica-alonso-5h1af}}
note = {Fine-tuning based on expert-labelled and synthetic data with few-shot prompt classification},
howpublished = {\url{https://huggingface.co/jalonso24/lora-lateblight-v5}}
}
Based on foundational studies by Gastelo et al. (2025), Giraldo et al. (2010), and Saffer et al. (2024).