Safetensors
qwen2

Improve language tag

#1
by lbourdois - opened
Files changed (1) hide show
  1. README.md +40 -26
README.md CHANGED
@@ -1,27 +1,41 @@
1
- ---
2
- license: mit
3
- base_model:
4
- - Qwen/Qwen2.5-3B-Instruct
5
- ---
6
- # DeepRetrieval
7
- ## Overview
8
-
9
- DeepRetrieval is a novel approach that uses reinforcement learning (RL) to train Large Language Models (LLMs) for query generation without requiring supervised data. Instead of relying on expensive human-annotated or distilled reference queries, DeepRetrieval enables LLMs to learn through direct trial and error, using retrieval metrics as rewards.
10
- ## Key Features
11
-
12
- - **No Supervision Required**: Eliminates the need for expensive human-annotated or distilled reference queries
13
- - **RL-Based Framework**: Uses reinforcement learning to optimize query generation directly for retrieval performance
14
- - **State-of-the-Art Performance**: Achieves remarkable results across diverse retrieval tasks
15
-
16
- Please view our [GitHub page](https://github.com/pat-jj/DeepRetrieval) for instructions.
17
-
18
- [DeepRetrieval Paper](arxiv.org/abs/2503.00223)
19
- ```
20
- @article{jiang2025deepretrievalhackingrealsearch,
21
- title={DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement Learning},
22
- author={Pengcheng Jiang and Jiacheng Lin and Lang Cao and Runchu Tian and SeongKu Kang and Zifeng Wang and Jimeng Sun and Jiawei Han},
23
- year={2025},
24
- journal = {arXiv preprint arXiv: 2503.00223},
25
- url={https://arxiv.org/abs/2503.00223}
26
- }
 
 
 
 
 
 
 
 
 
 
 
 
 
 
27
  ```
 
1
+ ---
2
+ license: mit
3
+ base_model:
4
+ - Qwen/Qwen2.5-3B-Instruct
5
+ language:
6
+ - zho
7
+ - eng
8
+ - fra
9
+ - spa
10
+ - por
11
+ - deu
12
+ - ita
13
+ - rus
14
+ - jpn
15
+ - kor
16
+ - vie
17
+ - tha
18
+ - ara
19
+ ---
20
+ # DeepRetrieval
21
+ ## Overview
22
+
23
+ DeepRetrieval is a novel approach that uses reinforcement learning (RL) to train Large Language Models (LLMs) for query generation without requiring supervised data. Instead of relying on expensive human-annotated or distilled reference queries, DeepRetrieval enables LLMs to learn through direct trial and error, using retrieval metrics as rewards.
24
+ ## Key Features
25
+
26
+ - **No Supervision Required**: Eliminates the need for expensive human-annotated or distilled reference queries
27
+ - **RL-Based Framework**: Uses reinforcement learning to optimize query generation directly for retrieval performance
28
+ - **State-of-the-Art Performance**: Achieves remarkable results across diverse retrieval tasks
29
+
30
+ Please view our [GitHub page](https://github.com/pat-jj/DeepRetrieval) for instructions.
31
+
32
+ [DeepRetrieval Paper](arxiv.org/abs/2503.00223)
33
+ ```
34
+ @article{jiang2025deepretrievalhackingrealsearch,
35
+ title={DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement Learning},
36
+ author={Pengcheng Jiang and Jiacheng Lin and Lang Cao and Runchu Tian and SeongKu Kang and Zifeng Wang and Jimeng Sun and Jiawei Han},
37
+ year={2025},
38
+ journal = {arXiv preprint arXiv: 2503.00223},
39
+ url={https://arxiv.org/abs/2503.00223}
40
+ }
41
  ```