IntelLabs
/

eftnas-s1-bert-base

Model card Files Files and versions

jinjieyuan commited on Mar 19, 2024

Commit

e1b6475

·

1 Parent(s): 1960aa4

Create README.md

Signed-off-by: jinjieyuan <[email protected]>

Files changed (1) hide show

README.md +57 -0

README.md ADDED Viewed

	@@ -0,0 +1,57 @@

+---
+language: en
+license: apache-2.0
+datasets:
+- nyu-mll/glue
+---
+# EFTNAS Model Card: eftnas-s1-bert-base
+The super-networks fine-tuned on BERT-base with [GLUE benchmark](https://gluebenchmark.com/) using EFTNAS.
+## Model Details
+### Information
+- **Model name:** eftnas-s1-bert-base-[TASK]
+- **Base model:** [bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased)
+- **Subnetwork version:** Super-network
+- **NNCF Configurations:** [eftnas_configs](https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/EFTNAS/eftnas_configs)
+### Training and Evaluation
+[GLUE benchmark](https://gluebenchmark.com/)
+## Results
+Results of the optimal sub-network discoverd from the super-network:
+| Model                         | GFLOPs    | GLUE Avg.     | MNLI-m   | QNLI | QQP      | SST-2    | CoLA     | MRPC     | RTE  |
+|-------------------------------|-----------|---------------|----------|------|----------|----------|----------|----------|------|
+| **Development Set:**           |
+| **EFTNAS-S1**          | 5.7       | **82.9**      | **84.6** | 90.8 | **91.2** | **93.5** | **60.6** | **90.8** | 69.0 |
+| **Test Set:**                  |
+| **EFTNAS-S1**          | 5.7       | 77.7          | 83.7     | 89.9 | **71.8** | **93.4** | **52.6** | 87.6     | 65.0 |
+## Model Sources
+- **Repository:** [https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/EFTNAS](https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/EFTNAS)
+- **Paper:** [Searching for Efficient Language Models in First-Order Weight-Reordered Super-Networks]()
+## Citation
+```bibtex
+@inproceedings{
+  eftnas2024,
+  title={Searching for Efficient Language Models in First-Order Weight-Reordered Super-Networks},
+  author={J. Pablo Munoz and Yi Zheng and Nilesh Jain},
+  booktitle={The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation},
+  year={2024},
+  url={}
+}
+```
+## License
+Apache-2.0