Arabic Small Nougat
#1, opened by johnlockejrr
Sorry to disturb you. Could you kindly share the method you used to train this beautiful model (maybe a Python script or notebook)? I'm trying to train an Arabic model for some medieval manuscripts. I have ground truth as ALTO (I can easily convert it to image/CSV or text). What should the original dataset look like? I see the dataset you used, but it is already in pickle format, so I don't know what the raw data looked like. Thank you!
Hello @johnlockejrr ,
I am working on a larger variant of this model, and with its release I will open-source my datasets, training code, and a paper explaining everything.
Happy that the model is useful to you ^^
Wow! Thank you so much @MohamedRashad ! Can't wait!
Any updates? 😇
Any news, brother?