wing lian PRO

winglian

AI & ML interests

None yet

Recent Activity

liked a dataset 9 days ago

common-pile/peS2o_filtered

liked a dataset 9 days ago

common-pile/pubmed_filtered

liked a dataset 9 days ago

common-pile/arxiv_papers_filtered

New activity in kernels-community/scattermoe 8 months ago

Update build/torch-universal/scattermoe/layers.py

#1 opened 8 months ago by

New activity in mlabonne/LFM2-1.2B-Pirate about 1 year ago

Update config with YAML

#1 opened about 1 year ago by

New activity in allenai/Llama-3.1-Tulu-3-8B-DPO about 1 year ago

Batch Size

#4 opened about 1 year ago by

New activity in moonshotai/Moonlight-16B-A3B-Instruct over 1 year ago

remove import/call to code no longer in latest transformers

#3 opened over 1 year ago by

New activity in nvidia/Hymba-1.5B-Base over 1 year ago

fix int/str for conv_dim indexing

#5 opened over 1 year ago by

New activity in axolotl-ai-co/romulus-mistral-nemo-12b-simpo almost 2 years ago

Update README.md

#2 opened almost 2 years ago by

New activity in deepseek-ai/DeepSeek-Prover-V1.5-Base almost 2 years ago

Match the config class name to what the modeling code expects

#4 opened almost 2 years ago by

New activity in meta-llama/Meta-Llama-3-8B over 2 years ago

Rename original/tokenizer.model to tokenizer.model

#6 opened over 2 years ago by

commented on a paper over 2 years ago

Octopus v2: On-device language model for super agent

Paper • 2404.01744 • Published Apr 2, 2024 • 59

New activity in ai21labs/Jamba-v0.1 over 2 years ago

finetuning issues

#9 opened over 2 years ago by

New activity in open-llm-leaderboard/open_llm_leaderboard over 2 years ago

latest commit breaks ability to submit mistral finetunes

#410 opened over 2 years ago by

New activity in Open-Orca/Mistral-7B-OpenOrca over 2 years ago

Can you share the training configuration of Axolotl?

#24 opened over 2 years ago by

New activity in open-llm-leaderboard/open_llm_leaderboard over 2 years ago

please remove openaccess-ai-collective/grendel

#387 opened over 2 years ago by

Unable to submit public models for evaluation

#379 opened over 2 years ago by

New activity in microsoft/phi-1_5 almost 3 years ago

Attention mask not working during training

#34 opened almost 3 years ago by

New activity in Open-Orca/OpenOrcaxOpenChat-Preview2-13B almost 3 years ago

Unreliable Benchmarks. Definitely worse than LLaMA2-13b

#5 opened almost 3 years ago by

New activity in winglian/t5-large-flan-cot about 3 years ago

Adding `safetensors` variant of this model

#1 opened about 3 years ago by

New activity in openaccess-ai-collective/openllama-7b-4k about 3 years ago

What does the 4k stand for?

#1 opened about 3 years ago by

New activity in openaccess-ai-collective/hippogriff-30b-chat about 3 years ago

Will you consider releasing a public dataset?

#1 opened about 3 years ago by

Set use_cache to True, otherwise inference performance is poor

#2 opened about 3 years ago by