KCSG Knowledge Computing and Service Group, IIE, CAS

university

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

Suu authored a paper 23 days ago

Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval

Suu authored a paper 23 days ago

CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts

Suu authored a paper 23 days ago

Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts

View all activity

Suu

authored 6 papers 23 days ago

Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval

Paper • 2408.10613 • Published Aug 20, 2024

CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts

Paper • 2410.16077 • Published Oct 21, 2024 • 1

Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts

Paper • 2502.12928 • Published Feb 18

UniAttn: Reducing Inference Costs via Softmax Unification for Post-Training LLMs

Paper • 2502.00439 • Published Feb 1 • 1

LightRetriever: A LLM-based Hybrid Retrieval Architecture with 1000x Faster Query Inference

Paper • 2505.12260 • Published May 18

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

Paper • 2508.07629 • Published 26 days ago • 39

Suu

updated a collection 25 days ago

Libra

Collection

This collection hosts the transformers and original repos of the Libra-Guard releases. • 8 items • Updated 25 days ago • 1

ffgcc

updated 4 datasets about 1 month ago

ffgcc

updated 5 models about 1 month ago

caskcsg/Llama-3-8B-LongMagpie-p-mix-512K-Instruct

8B • Updated Aug 2 • 5

caskcsg/Llama-3-8B-LongMagpie-512K-Instruct

8B • Updated Aug 2 • 4

caskcsg/Llama-3-8B-NExtLong-512K-Base

8B • Updated Aug 2 • 3

caskcsg/Llama-3-8B-NExtLong-512K-Instruct

8B • Updated Aug 2 • 3 • 4

caskcsg/Llama-3-8B-NExtLong-128K-Base

8B • Updated Aug 2 • 5

ffgcc

updated 4 datasets about 1 month ago

caskcsg/NExtLong-Instruct-dataset-Magpie-Llama-3.3-Pro-1M-v0.1

Preview • Updated Aug 2 • 174

caskcsg/NExtLong-512K-dataset-subset

Updated Aug 2 • 122

caskcsg/NExtLong-64K-dataset

Viewer • Updated Aug 2 • 24.8k • 342

caskcsg/NExtLong-128K-dataset

Viewer • Updated Aug 2 • 14.5k • 193

AI & ML interests

Recent Activity

Team members 5

caskcsg's activity