Update README.md
Browse files
README.md
CHANGED
|
@@ -79,7 +79,7 @@ Also cite the ByT5 paper:
|
|
| 79 |
|
| 80 |
## Model Details
|
| 81 |
|
| 82 |
-
This is the model card for the
|
| 83 |
|
| 84 |
- **Developed by:** Julie Kallini, Shikhar Murty, Christopher D. Manning, Christopher Potts, R贸bert Csord谩s
|
| 85 |
- **Model type:** MrT5
|
|
|
|
| 79 |
|
| 80 |
## Model Details
|
| 81 |
|
| 82 |
+
This is the model card for the 1.23B-parameter **MrT5 Large** (`mrt5-large`), a more efficient variant of ByT5 Large (`google/byt5-large`). This model is trained to reduce sequence lengths by ~50% on average.
|
| 83 |
|
| 84 |
- **Developed by:** Julie Kallini, Shikhar Murty, Christopher D. Manning, Christopher Potts, R贸bert Csord谩s
|
| 85 |
- **Model type:** MrT5
|