Hi, Iโm Sabarinathan, an AI Engineer and developer passionate about building robust and responsible multimodal models, including OCR and document understanding for English, Japanese, and Tamil.
My work spans speech recognition, image understanding, and transformer-based architectures.
Interests:
Vision-Language Models (VLM) and lightweight computer vision models
Natural Language Processing (NLP)
Automatic Speech Recognition (ASR)