Commit e1b6475
Parent(s): 1960aa4
Create README.md
Signed-off-by: jinjieyuan <[email protected]>
README.md
ADDED
@@ -0,0 +1,57 @@
---
language: en
license: apache-2.0
datasets:
- nyu-mll/glue
---

# EFTNAS Model Card: eftnas-s1-bert-base

Super-networks fine-tuned from BERT-base on the [GLUE benchmark](https://gluebenchmark.com/) using EFTNAS.

## Model Details

### Information

- **Model name:** eftnas-s1-bert-base-[TASK]
- **Base model:** [bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased)
- **Subnetwork version:** Super-network
- **NNCF Configurations:** [eftnas_configs](https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/EFTNAS/eftnas_configs)
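
The `[TASK]` placeholder in the model name suggests one checkpoint per GLUE task. As a minimal sketch, the template could be expanded as shown below. The lowercase task slugs are assumptions for illustration only; the card gives just the template, so check the actual repository names before relying on them.

```python
# Template taken from the model card; the task slugs below are ASSUMED
# (hypothetical lowercase forms of the GLUE tasks in the results table).
MODEL_NAME_TEMPLATE = "eftnas-s1-bert-base-[TASK]"
ASSUMED_TASK_SLUGS = ["mnli", "qnli", "qqp", "sst2", "cola", "mrpc", "rte"]


def model_name_for(task: str) -> str:
    """Fill the [TASK] placeholder of the model-name template."""
    if task not in ASSUMED_TASK_SLUGS:
        raise ValueError(f"unknown task slug: {task!r}")
    return MODEL_NAME_TEMPLATE.replace("[TASK]", task)


if __name__ == "__main__":
    for task in ASSUMED_TASK_SLUGS:
        print(model_name_for(task))
```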

### Training and Evaluation

Trained and evaluated on the [GLUE benchmark](https://gluebenchmark.com/).

## Results

Results of the optimal sub-network discovered from the super-network:

| Model                | GFLOPs | GLUE Avg. | MNLI-m   | QNLI | QQP      | SST-2    | CoLA     | MRPC     | RTE  |
|----------------------|--------|-----------|----------|------|----------|----------|----------|----------|------|
| **Development Set:** |        |           |          |      |          |          |          |          |      |
| **EFTNAS-S1**        | 5.7    | **82.9**  | **84.6** | 90.8 | **91.2** | **93.5** | **60.6** | **90.8** | 69.0 |
| **Test Set:**        |        |           |          |      |          |          |          |          |      |
| **EFTNAS-S1**        | 5.7    | 77.7      | 83.7     | 89.9 | **71.8** | **93.4** | **52.6** | 87.6     | 65.0 |
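
The card does not state how the GLUE Avg. column is computed. The sketch below assumes it is the unweighted mean of the seven listed task scores, rounded to one decimal; under that assumption the reported averages are reproduced from the per-task numbers in the table.

```python
from statistics import mean

# Per-task scores copied from the results table above.
DEV_SCORES = {
    "MNLI-m": 84.6, "QNLI": 90.8, "QQP": 91.2, "SST-2": 93.5,
    "CoLA": 60.6, "MRPC": 90.8, "RTE": 69.0,
}
TEST_SCORES = {
    "MNLI-m": 83.7, "QNLI": 89.9, "QQP": 71.8, "SST-2": 93.4,
    "CoLA": 52.6, "MRPC": 87.6, "RTE": 65.0,
}


def glue_average(scores: dict) -> float:
    """Unweighted mean over tasks, rounded to one decimal (ASSUMED formula)."""
    return round(mean(scores.values()), 1)


if __name__ == "__main__":
    print("dev:", glue_average(DEV_SCORES))    # 82.9, matches the table
    print("test:", glue_average(TEST_SCORES))  # 77.7, matches the table
```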

## Model Sources

- **Repository:** [https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/EFTNAS](https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/EFTNAS)
- **Paper:** [Searching for Efficient Language Models in First-Order Weight-Reordered Super-Networks]()

## Citation

```bibtex
@inproceedings{eftnas2024,
  title={Searching for Efficient Language Models in First-Order Weight-Reordered Super-Networks},
  author={J. Pablo Munoz and Yi Zheng and Nilesh Jain},
  booktitle={The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation},
  year={2024},
  url={}
}
```

## License

Apache-2.0