Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2806.9
TFLOPS
65
148
183
Qian Liu
SivilTaram
Follow
Pent's profile picture
NickyNicky's profile picture
KatyaKnw's profile picture
84 followers
ยท
74 following
http://siviltaram.github.io/
sivil_taram
SivilTaram
AI & ML interests
Cooking cool things
Recent Activity
authored
a paper
about 6 hours ago
ZeCO: Zero Communication Overhead Sequence Parallelism for Linear Attention
upvoted
a
paper
1 day ago
ZeCO: Zero Communication Overhead Sequence Parallelism for Linear Attention
commented
on
a paper
1 day ago
ZeCO: Zero Communication Overhead Sequence Parallelism for Linear Attention
View all activity
Organizations
SivilTaram
's models
19
Sort:ย Recently updated
SivilTaram/tongyao_models_0504
Updated
May 5
SivilTaram/tongyao_models_v2
Updated
Mar 22
SivilTaram/tongyao_models
Updated
Mar 22
SivilTaram/mingzhe_models_llama3_1_8b_full_0119_dpo_ds3_2e-6
Updated
Jan 24
SivilTaram/mingzhe_models_llama3_1_8b_full_0119_sft_ds3_2e-6
Updated
Jan 24
SivilTaram/zephyr-7b-gemma-dpo-freeze-mlp
Updated
Apr 14, 2024
SivilTaram/tapex-t5-large-finetuned-wtq
Text Generation
โข
Updated
Jun 30, 2022
โข
18
SivilTaram/tapex-t5-xl-finetuned-wtq
Text Generation
โข
Updated
Jun 30, 2022
โข
12
SivilTaram/tapex-t5-small-lm-adapt
Text Generation
โข
Updated
Jun 30, 2022
โข
22
SivilTaram/tapex-t5-large-lm-adapt
Text Generation
โข
Updated
Jun 30, 2022
โข
32
SivilTaram/tapex-t5-xl-lm-adapt
Text Generation
โข
Updated
Jun 30, 2022
โข
21
SivilTaram/tapex-t5-base-lm-adapt
Updated
Jun 30, 2022
SivilTaram/poet-sql-finetuned-hotpotqa
Updated
Jun 30, 2022
SivilTaram/poet-sql-roberta
Updated
Jun 30, 2022
โข
17
SivilTaram/poet-sql-digit-finetuned-drop
Updated
Jun 29, 2022
SivilTaram/poet-math-digit
Updated
Jun 29, 2022
SivilTaram/poet-math-digit-finetuned-drop
Updated
Jun 29, 2022
SivilTaram/poet-sql-digit
Feature Extraction
โข
Updated
May 27, 2022
โข
18
SivilTaram/poet-sql
Feature Extraction
โข
Updated
May 27, 2022
โข
37