Detailed Architecture information.
#71
by
Sharath07
- opened
Is it just the weights which are open sourced or do we have any detailed information on how it is trained and what kind of vision encoders and language decoders are involved. Like exact architecture