David Dale
cointegrated
AI & ML interests
Machine translation, Chatbots, applied NLU, controllable text generation (in particular, text style transfer), miniature models.
Recent Activity
new activity
1 day ago
cointegrated/SONAR_200_text_encoder:can you please do the same for decoder
new activity
1 day ago
slone/finugorbib:[bot] Conversion to Parquet
liked
a dataset
1 day ago
udmurtNLP/udmurt-russian-parallel-corpora
Organizations
cointegrated's activity
can you please do the same for decoder
1
#2 opened about 1 month ago
by
damerajee
[bot] Conversion to Parquet
#1 opened 2 days ago
by
parquet-converter
Added Dargwa dev set to flores_plus
2
#3 opened about 2 months ago
by
Murtazali
Fix encoding at chv devtest
3
#9 opened 2 days ago
by
alexantonov
[DRAFT] Fix orthography in the Russian dev set
2
#4 opened about 2 months ago
by
cointegrated
Add data integrity tests
1
#7 opened about 1 month ago
by
cointegrated
Two sentences in the dev set (one Lombard and one Tamasheq-Tifinagh) seem to be missing
#6 opened about 1 month ago
by
cointegrated
Adding `safetensors` variant of this model
#1 opened 3 months ago
by
SFconvertbot
[bot] Conversion to Parquet
#1 opened 3 months ago
by
parquet-converter
Optimize the preprocessing and generation
#11 opened 4 months ago
by
cointegrated
More Details about the model
1
#1 opened 9 months ago
by
sanjay73
Training script?
1
#2 opened 9 months ago
by
vdmbrsv
Some issues with models loading
1
#1 opened 9 months ago
by
matveymih
Adding ONNX file of this model
#2 opened 10 months ago
by
optowo
"max_seq_length": 512 in sentence_bert_config.json
1
#3 opened 10 months ago
by
sergeyzh
Please use materials from the Internet Archive, but not this way.
3
#2 opened 11 months ago
by
markgraham
Fix task tags
#2 opened over 2 years ago
by
albertvillanova
[messed up] Adding language tags and a small description
1
#1 opened over 1 year ago
by
cointegrated
Adding language tags and a small description
#2 opened over 1 year ago
by
cointegrated