Nordic embedding training data Collection This is a collection of synthetic datasets for embedding model training in Danish, Swedish and Norwegian (bokmål). • 15 items • Updated 3 days ago • 2
NB-Whisper Collection Models based on Whisper from OpenAI, and trained on data from Språkbanken and the digital collection at the National Library of Norway. • 7 items • Updated Nov 30, 2024 • 11