GoConqurer committed on Jun 22

Commit 67e2508 (1 parent: 796f321)

🔧 Fix Gradio API name conflicts and upgrade version

✅ Fixes:
- Add unique api_name to avoid duplicate function warnings
- Upgrade Gradio from 4.0.0 to 4.44.0+ for latest features
- Separate API endpoints for upload vs click events

🚀 Deployment:
- Eliminates warning: 'api_name extract_text_from_image already exists'
- Uses latest Gradio version with bug fixes and improvements
- Maintains both auto-upload and manual extract functionality
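
The warning quoted above appears when two event listeners register the same handler without an explicit `api_name`, so both fall back to the function's name. The sketch below is a minimal, self-contained reproduction of the fix, assuming Gradio 4.44+ is installed. The endpoint names come from this commit; the component labels and the placeholder handler body are illustrative assumptions, not the Space's actual code.

```python
# Endpoint names taken from this commit: one distinct name per event.
API_NAMES = {"upload": "extract_on_upload", "click": "extract_on_click"}


def extract_text_from_image(image):
    """Placeholder handler (assumption): the real one runs OCR on the image."""
    return "" if image is None else "extracted text goes here"


def build_demo():
    """Build a small Blocks app where two events share one handler."""
    import gradio as gr  # imported lazily so this module loads without Gradio

    with gr.Blocks() as demo:
        image_input = gr.Image(type="pil", label="Image")
        text_output = gr.Textbox(label="Extracted text")
        extract_btn = gr.Button("Extract")

        # Same function on both events, but each gets its own endpoint name.
        image_input.upload(
            fn=extract_text_from_image,
            inputs=image_input,
            outputs=text_output,
            api_name=API_NAMES["upload"],
        )
        extract_btn.click(
            fn=extract_text_from_image,
            inputs=image_input,
            outputs=text_output,
            api_name=API_NAMES["click"],
        )
    return demo
```

Calling `build_demo().launch()` starts the app; with distinct names, the duplicate-name warning should no longer be emitted for these two events.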

Files changed (3)

README.md +363 -176
requirements.txt +1 -1
ui/interface.py +4 -2

README.md CHANGED

@@ -12,200 +12,215 @@ license: mit
 # 🔍 TextLens - AI-Powered OCR
-A modern Vision-Language Model (VLM) based OCR application that extracts text from images using Microsoft Florence-2 model with intelligent fallback systems.
-## ✨ Features
-- **🤖 Advanced VLM OCR**: Uses Microsoft Florence-2 for state-of-the-art text extraction
-- **🔄 Smart Fallback System**: Automatically falls back to EasyOCR if Florence-2 fails
-- **🧪 Demo Mode**: Test mode for demonstration when other methods are unavailable
-- **🎨 Modern UI**: Clean, responsive Gradio interface with excellent UX
-- **📱 Multiple Input Methods**: Upload, webcam, clipboard support
-- **⚡ Real-time Processing**: Automatic text extraction on image upload
-- **📋 Copy Functionality**: Easy text copying from results
-- **🚀 GPU Acceleration**: Supports CUDA, MPS, and CPU inference
-- **🛡️ Error Handling**: Robust error handling and user-friendly messages
 ## 🏗️ Architecture
 ```
 textlens-ocr/
-├── app.py                 # Main Gradio application
-├── requirements.txt       # Python dependencies
-├── README.md             # Project documentation
-├── models/               # OCR processing modules
-│   ├── __init__.py
-│   └── ocr_processor.py  # Advanced OCR class with fallbacks
-├── utils/                # Utility functions
-│   ├── __init__.py
-│   └── image_utils.py    # Image preprocessing utilities
-└── ui/                   # User interface components
-    ├── __init__.py
-    ├── interface.py      # Gradio interface
-    ├── handlers.py       # Event handlers
-    └── styles.py         # CSS styling
 ```
 ## 🚀 Quick Start
-### Local Development
-1. **Clone the repository**
-   ```bash
-   git clone https://github.com/KumarAmrit30/textlens-ocr.git
-   cd textlens-ocr
-   ```
-2. **Set up Python environment**
    ```bash
-   python3 -m venv textlens_env
-   source textlens_env/bin/activate  # On Windows: textlens_env\Scripts\activate
    ```
-3. **Install dependencies**
    ```bash
    pip install -r requirements.txt
    ```
-4. **Run the application**
    ```bash
    python app.py
    ```
-5. **Open your browser**
-   Navigate to `http://localhost:7860`
-### Quick Test
-Run the test suite to verify everything works:
 ```bash
-python test_ocr.py
 ```
-## 🔧 Technical Details
-### OCR Processing Pipeline
-1. **Primary**: Microsoft Florence-2 VLM
-   - State-of-the-art vision-language model
-   - Supports both basic OCR and region-based extraction
-   - GPU accelerated inference
-2. **Fallback**: EasyOCR
-   - Traditional OCR with good accuracy
-   - Works when Florence-2 fails to load
-   - Multi-language support
-3. **Demo Mode**: Test Mode
-   - Demonstration functionality
-   - Shows interface working correctly
-   - Used when other methods are unavailable
-### Model Loading Strategy
-The application uses an intelligent loading strategy:
-```python
-try:
-    # Try Florence-2 with specific revision
-    model = AutoModelForCausalLM.from_pretrained(
-        "microsoft/Florence-2-base",
-        revision='refs/pr/6',
-        trust_remote_code=True
-    )
-except:
-    # Fall back to default Florence-2
-    model = AutoModelForCausalLM.from_pretrained(
-        "microsoft/Florence-2-base",
-        trust_remote_code=True
-    )
 ```
-### Device Detection
-Automatically detects and uses the best available device:
-- **CUDA**: NVIDIA GPUs with CUDA support
-- **MPS**: Apple Silicon Macs (M1/M2/M3)
-- **CPU**: Fallback for all systems
-## 📊 Performance
-| Model            | Size   | Speed  | Accuracy  | Use Case              |
-| ---------------- | ------ | ------ | --------- | --------------------- |
-| Florence-2-base  | 230M   | Fast   | High      | General OCR           |
-| Florence-2-large | 770M   | Medium | Very High | High accuracy needs   |
-| EasyOCR          | ~100MB | Medium | Good      | Fallback/Multilingual |
-## 🔍 Supported Image Formats
-- **JPEG** (.jpg, .jpeg)
-- **PNG** (.png)
-- **WebP** (.webp)
-- **BMP** (.bmp)
-- **TIFF** (.tiff, .tif)
-- **GIF** (.gif)
-## 🎯 Use Cases
-- **📄 Document Digitization**: Convert physical documents to text
-- **🏪 Receipt Processing**: Extract data from receipts and invoices
-- **📱 Screenshot Text Extraction**: Get text from app screenshots
-- **🚗 License Plate Reading**: Extract text from vehicle plates
-- **📚 Book/Article Scanning**: Digitize printed materials
-- **🌐 Multilingual Text**: Process text in various languages
-## 🛠️ Configuration
-### Model Selection
-Change the model in `models/ocr_processor.py`:
-```python
-# For faster inference
-ocr = OCRProcessor(model_name="microsoft/Florence-2-base")
-# For higher accuracy
-ocr = OCRProcessor(model_name="microsoft/Florence-2-large")
-```
-### UI Customization
-Modify the Gradio interface in `app.py`:
-- Update colors and styling in the CSS section
-- Change layout in the `create_interface()` function
-- Add new features or components
-## 🧪 Testing
-The project includes comprehensive tests:
-```bash
-# Run all tests
-python test_ocr.py
-# Test specific functionality
-python -c "from models.ocr_processor import OCRProcessor; ocr = OCRProcessor(); print(ocr.get_model_info())"
 ```
-## 🚀 Deployment
-### HuggingFace Spaces
-1. Fork this repository
-2. Create a new Space on HuggingFace
-3. Connect your repository
-4. The app will automatically deploy
-### Docker Deployment
 ```dockerfile
 FROM python:3.9-slim
@@ -220,92 +235,264 @@ EXPOSE 7860
 CMD ["python", "app.py"]
 ```
-### Local Server
 ```bash
-# Production server
-pip install gunicorn
-gunicorn -w 4 -b 0.0.0.0:7860 app:create_interface().app
 ```
-## 🔐 Environment Variables
-| Variable               | Description           | Default                |
-| ---------------------- | --------------------- | ---------------------- |
-| `GRADIO_SERVER_PORT`   | Server port           | 7860                   |
-| `TRANSFORMERS_CACHE`   | Model cache directory | `~/.cache/huggingface` |
-| `CUDA_VISIBLE_DEVICES` | GPU device selection  | All available          |
-## 🤝 Contributing
-1. Fork the repository
-2. Create a feature branch
-3. Make your changes
-4. Add tests for new functionality
-5. Submit a pull request
-## 📝 API Reference
 ### OCRProcessor Class
 ```python
 from models.ocr_processor import OCRProcessor
-# Initialize
-ocr = OCRProcessor(model_name="microsoft/Florence-2-base")
-# Extract text
 text = ocr.extract_text(image)
-# Extract with regions
 result = ocr.extract_text_with_regions(image)
-# Get model info
 info = ocr.get_model_info()
 ```
-## 🐛 Troubleshooting
 ### Common Issues
-1. **Model Loading Errors**
    ```bash
-   # Install missing dependencies
-   pip install einops timm
    ```
-2. **CUDA Out of Memory**
-   ```python
-   # Use CPU instead
-   ocr = OCRProcessor()
-   ocr.device = "cpu"
    ```
-3. **SSL Certificate Errors**
    ```bash
-   # Update certificates (macOS)
-   /Applications/Python\ 3.x/Install\ Certificates.command
    ```
 ## 📄 License
-This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
-## 🙏 Acknowledgments
-- **Microsoft** for the Florence-2 model
-- **HuggingFace** for the transformers library
-- **Gradio** for the web interface framework
-- **EasyOCR** for fallback OCR capabilities
-## 📞 Support
-- Create an issue for bug reports
-- Start a discussion for feature requests
-- Check existing issues before posting
 ---
 **Made with ❤️ for the AI community**

 # 🔍 TextLens - AI-Powered OCR
+[![Deploy to HuggingFace](https://img.shields.io/badge/🤗-Deploy%20to%20Spaces-blue)](https://huggingface.co/spaces/GoConqurer/textlens-ocr)
+[![GitHub](https://img.shields.io/badge/GitHub-Repository-green)](https://github.com/KumarAmrit30/textlens-ocr)
+[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)
+[![Python 3.9+](https://img.shields.io/badge/python-3.9+-blue.svg)](https://www.python.org/downloads/)
+A state-of-the-art Vision-Language Model (VLM) based OCR application that extracts text from images using Microsoft Florence-2 with intelligent fallback systems and enterprise-grade zero downtime deployment.
+## 🚀 Live Demo
+**🔗 Try it now:** [https://huggingface.co/spaces/GoConqurer/textlens-ocr](https://huggingface.co/spaces/GoConqurer/textlens-ocr)
+![TextLens Demo](https://img.shields.io/badge/Demo-Live-brightgreen)
+## ✨ Key Features
+### 🤖 Advanced AI-Powered OCR
+- **Microsoft Florence-2 VLM**: State-of-the-art vision-language model for text extraction
+- **Intelligent Fallback System**: Automatic fallback to EasyOCR if primary model fails
+- **Multi-Model Support**: Florence-2-base and Florence-2-large variants
+- **Real-time Processing**: Instant text extraction on image upload
+### 🎨 Modern User Experience
+- **Clean UI**: Professional Gradio interface with intuitive design
+- **Multiple Input Methods**: Upload files, use webcam, or paste from clipboard
+- **Copy-to-Clipboard**: One-click text copying functionality
+- **Responsive Design**: Works seamlessly on desktop and mobile devices
+- **Dark/Light Theme**: Automatic theme adaptation
+### ⚡ Performance & Reliability
+- **GPU Acceleration**: Supports CUDA, MPS (Apple Silicon), and CPU inference
+- **Smart Device Detection**: Automatically uses best available hardware
+- **Error Resilience**: Robust error handling with graceful degradation
+- **Memory Optimization**: Efficient model loading and cleanup
+### 🛡️ Enterprise Features
+- **Zero Downtime Deployment**: Blue-green deployment with health checks
+- **Health Monitoring**: Built-in `/health` and `/ready` endpoints
+- **Graceful Shutdown**: Signal handling for clean application restarts
+- **Production Ready**: Scalable architecture with automated deployment
 ## 🏗️ Architecture
 ```
 textlens-ocr/
+├── 📱 Frontend (Gradio UI)
+│   ├── ui/interface.py      # Main interface components
+│   ├── ui/handlers.py       # Event handlers & logic
+│   └── ui/styles.py         # CSS styling & themes
+├── 🧠 AI Models
+│   └── models/ocr_processor.py  # OCR engine with fallbacks
+├── 🔧 Utilities
+│   └── utils/image_utils.py     # Image preprocessing
+├── 🚀 Deployment
+│   ├── .github/workflows/       # CI/CD pipelines
+│   ├── scripts/deploy.py        # Manual deployment tools
+│   └── deployment.config.yml    # Deployment configuration
+├── 📚 Documentation
+│   ├── README.md               # Main documentation
+│   └── DEPLOYMENT.md           # Deployment guide
+└── ⚙️ Configuration
+    ├── app.py                  # Main application entry
+    └── requirements.txt        # Dependencies
 ```
 ## 🚀 Quick Start
+### 🌐 Online (Recommended)
+**Instant access** - No installation required:
+👉 [**Launch TextLens**](https://huggingface.co/spaces/GoConqurer/textlens-ocr)
+### 💻 Local Development
+1. **Clone Repository**
    ```bash
+   git clone https://github.com/KumarAmrit30/textlens-ocr.git
+   cd textlens-ocr
    ```
+2. **Setup Environment**
    ```bash
+   python -m venv textlens_env
+   source textlens_env/bin/activate  # Windows: textlens_env\Scripts\activate
    pip install -r requirements.txt
    ```
+3. **Launch Application**
    ```bash
    python app.py
    ```
+   🌐 Open: `http://localhost:7860`
+### 🧪 Quick Test
 ```bash
+# Verify installation
+python -c "from models.ocr_processor import OCRProcessor; print('✅ TextLens ready!')"
 ```
+## 📊 Model Performance
+| Model                | Size  | Speed     | Accuracy     | Best For               |
+| -------------------- | ----- | --------- | ------------ | ---------------------- |
+| **Florence-2-base**  | 230M  | ⚡ Fast   | 📈 High      | General OCR, Real-time |
+| **Florence-2-large** | 770M  | 🐌 Medium | 📊 Very High | High accuracy needs    |
+| **EasyOCR**          | ~100M | 🚀 Medium | 📋 Good      | Fallback, Multilingual |
+## 🎯 Supported Use Cases
+| Category            | Examples                        | Performance |
+| ------------------- | ------------------------------- | ----------- |
+| 📄 **Documents**    | PDFs, Scanned papers, Forms     | ⭐⭐⭐⭐⭐  |
+| 🧾 **Receipts**     | Shopping receipts, Invoices     | ⭐⭐⭐⭐    |
+| 📱 **Screenshots**  | App interfaces, Error messages  | ⭐⭐⭐⭐⭐  |
+| 🚗 **Vehicle**      | License plates, VIN numbers     | ⭐⭐⭐⭐    |
+| 📚 **Books**        | Printed text, Handwritten notes | ⭐⭐⭐⭐    |
+| 🌐 **Multilingual** | Multiple languages              | ⭐⭐⭐      |
+## 🔧 Configuration
+### 🎛️ Model Selection
+```python
+from models.ocr_processor import OCRProcessor
+# Fast inference (recommended)
+ocr = OCRProcessor(model_name="microsoft/Florence-2-base")
+# Maximum accuracy
+ocr = OCRProcessor(model_name="microsoft/Florence-2-large")
 ```
+### 🎨 UI Customization
+Modify `ui/styles.py` to customize appearance:
+```python
+# Change color scheme
+PRIMARY_COLOR = "#1f77b4"
+SECONDARY_COLOR = "#ff7f0e"
+# Update layout
+INTERFACE_WIDTH = "100%"
+```
+### ⚙️ Environment Variables
+| Variable               | Description          | Default                |
+| ---------------------- | -------------------- | ---------------------- |
+| `SPACE_ID`             | HuggingFace Space ID | Auto-detected          |
+| `DEPLOYMENT_STAGE`     | Deployment stage     | `production`           |
+| `TRANSFORMERS_CACHE`   | Model cache path     | `~/.cache/huggingface` |
+| `CUDA_VISIBLE_DEVICES` | GPU selection        | All available          |
+## 🚀 Deployment
+### 🤗 HuggingFace Spaces (Recommended)
+**Automatic Deployment:**
+1. Fork this repository
+2. Push to `main`/`master` branch
+3. GitHub Actions automatically deploys to HuggingFace Spaces
+4. Access your deployed app at: `https://huggingface.co/spaces/USERNAME/textlens-ocr`
+**Manual Deployment:**
+1. Go to [GitHub Actions](https://github.com/KumarAmrit30/textlens-ocr/actions)
+2. Select "Deploy to HuggingFace Spaces"
+3. Click "Run workflow"
+4. Choose deployment type:
+   - **Direct**: Quick deployment to production
+   - **Blue-Green**: Zero downtime with staging validation
+### 🔄 Zero Downtime Deployment
+Our enterprise-grade deployment system ensures **zero downtime** for users:
+**Features:**
+- 🔵 **Blue-Green Deployment**: Test in staging before production
+- 🏥 **Health Monitoring**: Automatic health checks with retry logic
+- 🔄 **Graceful Shutdown**: Clean application restarts
+- 📊 **Real-time Monitoring**: Deployment status tracking
+**Health Endpoints:**
+- `GET /health` - Application health status
+- `GET /ready` - Application readiness check
+**Deployment Flow:**
+```mermaid
+graph LR
+    A[Code Push] --> B[Validate]
+    B --> C[Deploy Staging]
+    C --> D[Health Check]
+    D --> E[Deploy Production]
+    E --> F[Verify]
+    F --> G[Complete ✅]
 ```
+### 🐳 Docker Deployment
 ```dockerfile
 FROM python:3.9-slim
 CMD ["python", "app.py"]
 ```
+Build and run:
 ```bash
+docker build -t textlens-ocr .
+docker run -p 7860:7860 textlens-ocr
 ```
+### ☁️ Cloud Platforms
+| Platform               | Status        | Guide                                                               |
+| ---------------------- | ------------- | ------------------------------------------------------------------- |
+| **HuggingFace Spaces** | ✅ Ready      | [Deploy Now](https://huggingface.co/spaces/GoConqurer/textlens-ocr) |
+| **Google Colab**       | ✅ Compatible | Open in Colab                                                       |
+| **AWS/GCP/Azure**      | 🔧 Docker     | Use Docker deployment                                               |
+| **Heroku**             | ⚠️ Limited    | GPU not available                                                   |
+## 🧪 Testing & Development
+### 🔍 Running Tests
+```bash
+# Basic functionality test
+python -c "
+from models.ocr_processor import OCRProcessor
+ocr = OCRProcessor()
+print(f'✅ Model loaded: {ocr.get_model_info()}')
+"
+# Test with sample image
+python -c "
+from PIL import Image
+from models.ocr_processor import OCRProcessor
+import requests
+# Download test image
+img_url = 'https://via.placeholder.com/300x100/000000/FFFFFF?text=Hello+World'
+image = Image.open(requests.get(img_url, stream=True).raw)
+# Test OCR
+ocr = OCRProcessor()
+result = ocr.extract_text(image)
+print(f'✅ OCR Result: {result}')
+"
+```
+### 🛠️ Development Tools
+```bash
+# Install development dependencies
+pip install -r requirements.txt
+# Format code
+black . --line-length 88
+# Type checking
+mypy models/ utils/ ui/
+# Lint code
+flake8 --max-line-length 88
+```
+## 📚 API Reference
 ### OCRProcessor Class
 ```python
 from models.ocr_processor import OCRProcessor
+# Initialize processor
+ocr = OCRProcessor(
+    model_name="microsoft/Florence-2-base",  # Model selection
+    device=None,                             # Auto-detect device
+    torch_dtype=None                         # Auto-select dtype
+)
+# Extract text from image
 text = ocr.extract_text(image)
+# Returns: str
+# Extract text with bounding boxes
 result = ocr.extract_text_with_regions(image)
+# Returns: dict with text and regions
+# Get model information
 info = ocr.get_model_info()
+# Returns: dict with model details
+# Cleanup resources
+ocr.cleanup()
+```
+### Health Check API
+```bash
+# Check application health
+curl https://huggingface.co/spaces/GoConqurer/textlens-ocr/health
+# Response:
+{
+  "status": "healthy",
+  "timestamp": 1640995200,
+  "version": "1.0.0",
+  "environment": "production"
+}
+# Check readiness
+curl https://huggingface.co/spaces/GoConqurer/textlens-ocr/ready
+# Response:
+{
+  "status": "ready",
+  "timestamp": 1640995200
+}
 ```
+## 🚨 Troubleshooting
 ### Common Issues
+| Issue                   | Symptoms                 | Solution                                |
+| ----------------------- | ------------------------ | --------------------------------------- |
+| **Model Loading Error** | ImportError, CUDA errors | Check GPU drivers, install CUDA toolkit |
+| **Memory Error**        | Out of memory            | Reduce batch size, use CPU inference    |
+| **SSL Certificate**     | SSL errors on macOS      | Run certificate update command          |
+| **Permission Error**    | File access denied       | Check file permissions, run as admin    |
+### Debug Commands
+```bash
+# Check CUDA availability
+python -c "import torch; print(f'CUDA: {torch.cuda.is_available()}')"
+# Check transformers version
+python -c "import transformers; print(f'Transformers: {transformers.__version__}')"
+# Test health endpoint locally
+curl http://localhost:7860/health
+# View application logs
+tail -f textlens.log
+```
+### Getting Help
+1. 📋 **Check existing issues**: [GitHub Issues](https://github.com/KumarAmrit30/textlens-ocr/issues)
+2. 🆕 **Create new issue**: Provide error details and environment info
+3. 💬 **Join discussion**: [GitHub Discussions](https://github.com/KumarAmrit30/textlens-ocr/discussions)
+4. 📧 **Contact**: Create an issue for direct support
+## 🤝 Contributing
+We welcome contributions! Here's how to get started:
+### 🔧 Development Setup
+1. **Fork & Clone**
    ```bash
+   git clone https://github.com/YOUR_USERNAME/textlens-ocr.git
+   cd textlens-ocr
    ```
+2. **Create Branch**
+   ```bash
+   git checkout -b feature/your-feature-name
    ```
+3. **Make Changes**
+   - Add new features or fix bugs
+   - Update tests and documentation
+   - Follow code style guidelines
+4. **Test Changes**
    ```bash
+   python -m pytest tests/
+   python -c "from models.ocr_processor import OCRProcessor; OCRProcessor()"
    ```
+5. **Submit PR**
+   ```bash
+   git add .
+   git commit -m "feat: add your feature description"
+   git push origin feature/your-feature-name
+   ```
+### 📝 Contribution Guidelines
+- **Code Style**: Follow PEP 8, use Black formatter
+- **Documentation**: Update README and docstrings
+- **Tests**: Add tests for new functionality
+- **Commits**: Use conventional commit messages
+- **Issues**: Link PRs to relevant issues
 ## 📄 License
+This project is licensed under the **MIT License** - see the [LICENSE](LICENSE) file for details.
+### 🙏 Third-Party Licenses
+- **Microsoft Florence-2**: [MIT License](https://github.com/microsoft/Florence)
+- **HuggingFace Transformers**: [Apache License 2.0](https://github.com/huggingface/transformers)
+- **Gradio**: [Apache License 2.0](https://github.com/gradio-app/gradio)
+- **EasyOCR**: [Apache License 2.0](https://github.com/JaidedAI/EasyOCR)
+## 🌟 Acknowledgments
+Special thanks to:
+- **Microsoft Research** for the incredible Florence-2 vision-language model
+- **HuggingFace** for the transformers library and Spaces platform
+- **Gradio Team** for the amazing web interface framework
+- **JaidedAI** for EasyOCR fallback capabilities
+- **Open Source Community** for continuous support and contributions
+## 📈 Project Status
+| Component         | Status        | Version |
+| ----------------- | ------------- | ------- |
+| **Core OCR**      | ✅ Stable     | v1.0.0  |
+| **Web UI**        | ✅ Stable     | v1.0.0  |
+| **Deployment**    | ✅ Production | v1.0.0  |
+| **API**           | ✅ Stable     | v1.0.0  |
+| **Documentation** | ✅ Complete   | v1.0.0  |
+### 🎯 Roadmap
+- [ ] **Multi-language UI** support
+- [ ] **Batch processing** for multiple images
+- [ ] **API rate limiting** and authentication
+- [ ] **Custom model** fine-tuning support
+- [ ] **Mobile app** development
+- [ ] **Cloud storage** integration
+## 📞 Support & Community
+### 🔗 Links
+- **🏠 Homepage**: [GitHub Repository](https://github.com/KumarAmrit30/textlens-ocr)
+- **🚀 Live Demo**: [HuggingFace Spaces](https://huggingface.co/spaces/GoConqurer/textlens-ocr)
+- **📋 Issues**: [Report Bugs](https://github.com/KumarAmrit30/textlens-ocr/issues)
+- **💬 Discussions**: [GitHub Discussions](https://github.com/KumarAmrit30/textlens-ocr/discussions)
+- **📖 Documentation**: [Deployment Guide](DEPLOYMENT.md)
+### 📊 Stats
+![GitHub stars](https://img.shields.io/github/stars/KumarAmrit30/textlens-ocr?style=social)
+![GitHub forks](https://img.shields.io/github/forks/KumarAmrit30/textlens-ocr?style=social)
+![GitHub watchers](https://img.shields.io/github/watchers/KumarAmrit30/textlens-ocr?style=social)
 ---
+<div align="center">
 **Made with ❤️ for the AI community**
+[⭐ Star this repo](https://github.com/KumarAmrit30/textlens-ocr) • [🔗 Try the demo](https://huggingface.co/spaces/GoConqurer/textlens-ocr) • [📖 Read docs](DEPLOYMENT.md)
+</div>
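
The new README describes `GET /health` returning a JSON body with `"status": "healthy"`, checked "with retry logic" during deployment. A deployment script might poll it roughly as sketched below, using only the standard library. The function names, retry count, and delay are hypothetical, and the base URL to poll is left to the caller because this diff does not show how the endpoint is served.

```python
import json
import time
import urllib.request


def fetch_status(url, timeout=5.0):
    """GET a URL and return the decoded JSON body."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


def wait_until_healthy(url, fetch=fetch_status, retries=5, delay=2.0,
                       sleep=time.sleep):
    """Poll `url` until the body reports status 'healthy'.

    Returns True on the first healthy response, False once all
    attempts are used up. `fetch` and `sleep` are injectable so the
    loop can be exercised without a network.
    """
    for attempt in range(1, retries + 1):
        try:
            if fetch(url).get("status") == "healthy":
                return True
        except (OSError, ValueError, AttributeError):
            # Network failure, non-JSON body, or unexpected JSON shape:
            # treat all of them as "not healthy yet".
            pass
        if attempt < retries:
            sleep(delay)
    return False
```

For a local run, `wait_until_healthy("http://localhost:7860/health")` matches the debug command the README lists, assuming the app actually exposes that route.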

requirements.txt CHANGED

@@ -6,7 +6,7 @@ sentencepiece>=0.1.97
 protobuf>=3.20.0
 # UI and web interface
-gradio>=4.0.0
 # Image processing
 pillow>=9.0.0

 protobuf>=3.20.0
 # UI and web interface
+gradio>=4.44.0
 # Image processing
 pillow>=9.0.0
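
Because the pin moves from `gradio>=4.0.0` to `gradio>=4.44.0`, an environment created before this commit may still satisfy the old constraint but not the new one. The following is a small startup check, sketched with the standard library only. The helper names are hypothetical, and the parser handles plain dotted numeric versions only: pre-release suffixes such as `rc1` are ignored rather than ordered correctly.

```python
from importlib.metadata import PackageNotFoundError, version

MIN_GRADIO = "4.44.0"  # minimum from the updated requirements.txt


def parse_release(v):
    """Turn '4.44.1' into (4, 44, 1); stops at the first non-numeric part."""
    parts = []
    for piece in v.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def meets_minimum(installed, minimum=MIN_GRADIO):
    """Compare two dotted versions, padding the shorter one with zeros."""
    a, b = parse_release(installed), parse_release(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def gradio_is_recent_enough():
    """True if an installed Gradio satisfies the new minimum."""
    try:
        return meets_minimum(version("gradio"))
    except PackageNotFoundError:
        return False
```

If `gradio_is_recent_enough()` returns `False`, re-running `pip install -r requirements.txt` should pull in a compatible release.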

ui/interface.py CHANGED

@@ -103,13 +103,15 @@ def create_interface():
         image_input.upload(
             fn=extract_text_from_image,
             inputs=image_input,
-            outputs=text_output
         )
         extract_btn.click(
             fn=extract_text_from_image,
             inputs=image_input,
-            outputs=text_output
         )
         refresh_status_btn.click(

         image_input.upload(
             fn=extract_text_from_image,
             inputs=image_input,
+            outputs=text_output,
+            api_name="extract_on_upload"
         )
         extract_btn.click(
             fn=extract_text_from_image,
             inputs=image_input,
+            outputs=text_output,
+            api_name="extract_on_click"
         )
         refresh_status_btn.click(
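
With the two listeners now named, each becomes addressable as its own API endpoint. The sketch below shows how a caller might target one of them with `gradio_client`, assuming that package is installed and the Space is running. The helper names are hypothetical, and the return value is assumed to be the extracted text as a string, since `text_output` is the only output.

```python
def endpoint_path(api_name: str) -> str:
    """Return the route form gradio_client expects: a single leading slash."""
    return "/" + api_name.lstrip("/")


def extract_via_api(image_path: str,
                    space: str = "GoConqurer/textlens-ocr",
                    api_name: str = "extract_on_click"):
    """Send a local image file to the named endpoint of the Space."""
    # Imported lazily so this module loads without gradio_client installed.
    from gradio_client import Client, handle_file

    client = Client(space)
    return client.predict(
        handle_file(image_path),
        api_name=endpoint_path(api_name),
    )
```

Passing `api_name="extract_on_upload"` targets the upload listener instead; both run the same `extract_text_from_image` handler.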