t5-small-finetuned-cnndm_3epoch

This model is a fine-tuned version of t5-small on the cnn_dailymail dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss	Rouge1	Rouge2	Rougel	Rougelsum	Gen Len
1.9113	0.14	5000	1.7162	24.4374	11.6932	20.1741	23.0427	18.9997
1.8772	0.28	10000	1.7008	24.3715	11.6699	20.1387	22.9772	18.9997
1.8609	0.42	15000	1.6911	24.4174	11.6986	20.1756	23.0205	18.9997
1.8564	0.56	20000	1.6871	24.4374	11.6801	20.1663	23.0366	18.9995
1.8495	0.7	25000	1.6796	24.4019	11.6901	20.177	23.034	18.999
1.8448	0.84	30000	1.6787	24.4813	11.7227	20.1985	23.0847	18.999
1.8427	0.98	35000	1.6762	24.4905	11.7591	20.2548	23.1006	18.9993
1.8341	1.11	40000	1.6747	24.4743	11.7124	20.1782	23.0726	18.9996
1.822	1.25	45000	1.6753	24.4797	11.7292	20.2319	23.0816	18.9993
1.8262	1.39	50000	1.6713	24.4865	11.7079	20.2214	23.0919	18.9986
1.8281	1.53	55000	1.6702	24.5095	11.7364	20.2534	23.1264	18.9991
1.8228	1.67	60000	1.6678	24.5153	11.7595	20.2544	23.1138	18.9993
1.824	1.81	65000	1.6662	24.5324	11.7804	20.2671	23.1498	18.9997
1.8265	1.95	70000	1.6648	24.5795	11.7917	20.2935	23.1855	18.9992
1.8179	2.09	75000	1.6658	24.5426	11.804	20.2861	23.1586	18.9996
1.8147	2.23	80000	1.6646	24.5429	11.7914	20.2889	23.1542	18.9993
1.8026	2.37	85000	1.6632	24.5451	11.8045	20.2781	23.1555	18.9996
1.8141	2.51	90000	1.6643	24.5078	11.7781	20.2631	23.121	18.9996
1.8124	2.65	95000	1.6628	24.5728	11.7958	20.2875	23.178	18.9996
1.8098	2.79	100000	1.6635	24.5534	11.7998	20.2979	23.169	18.9996
1.8153	2.93	105000	1.6622	24.5435	11.7919	20.2929	23.1661	18.9996