Detailed Architecture information.

#71
by Sharath07 - opened

Is it just the weights which are open sourced or do we have any detailed information on how it is trained and what kind of vision encoders and language decoders are involved. Like exact architecture

I wish they published a paper or something. Would be definitely educational and interesting

Sign up or log in to comment