Qwen models with custom class for bidirectional attention
Joao Coelho
jmvcoelho
·
AI & ML interests
None yet
Recent Activity
authored
a paper
2 months ago
Dwell in the Beginning: How Language Models Embed Long Documents for
Dense Retrieval
authored
a paper
2 months ago
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation
Sandbox for Deep Research
upvoted
a
paper
2 months ago
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation
Sandbox for Deep Research