25 12 2

Princeton NLP group

princeton-nlp

BhaskarSteve's profile picture

Keron's profile picture

davzoku's profile picture

https://princeton-nlp.github.io

princeton_nlp
princeton-nlp

AI & ML interests

None yet

Recent Activity

new activity 8 days ago

HuggingFaceTB/FineMath-Llama-3B:Hyperparameters

updated a collection 2 months ago

RLMT Experiments

updated a collection 2 months ago

RLMT Experiments

View all activity

Organizations

princeton-nlp 's collections 6

RLMT Experiments

The *RLMT* collection. Coming soon!

princeton-nlp/warm-start__sft__think__Llama-3.1-8B-Instruct

8B • Updated Sep 22 • 7
princeton-nlp/warm-start__sft__nothink__Qwen2.5-7B-Instruct

8B • Updated Sep 22 • 57
princeton-nlp/warm-start__sft__think__Llama-3.1-8B

8B • Updated Sep 22 • 8
princeton-nlp/warm-start__sft__think__Qwen2.5-7B

8B • Updated Sep 22 • 11

SWE-bench

SWE-bench is a benchmark for evaluating Language Models and AI Systems on their ability resolve real world GitHub Issues.

princeton-nlp/SWE-bench

Viewer • Updated Mar 3 • 21.5k • 22k • 128
princeton-nlp/SWE-bench_Lite

Viewer • Updated Mar 3 • 323 • 32.6k • 50
princeton-nlp/SWE-bench_Multimodal

Viewer • Updated Jan 13 • 612 • 1.16k • 21
princeton-nlp/SWE-bench_Verified

Viewer • Updated Feb 18 • 500 • 588k • 233

Sheared Llama

princeton-nlp/Sheared-LLaMA-1.3B

Text Generation • Updated Jan 23, 2024 • 4.33k • 98
princeton-nlp/Sheared-LLaMA-2.7B

Text Generation • Updated Jan 23, 2024 • 2.49k • 61
princeton-nlp/Sheared-LLaMA-1.3B-ShareGPT

Text Generation • Updated Dec 4, 2023 • 977 • 10
princeton-nlp/Sheared-LLaMA-2.7B-ShareGPT

Text Generation • Updated Dec 4, 2023 • 1.16k • 8

SimPO

This collections contains a list of SimPO and baseline models.

princeton-nlp/gemma-2-9b-it-SimPO

Text Generation • 9B • Updated Aug 2, 2024 • 1.53k • • 170
princeton-nlp/gemma-2-9b-it-DPO

Text Generation • 9B • Updated Jul 18, 2024 • 38 • • 9
princeton-nlp/Llama-3-Base-8B-SFT-IPO

Text Generation • 8B • Updated Jun 17, 2024 • 24 • • 1
princeton-nlp/Llama-3-Base-8B-SFT-DPO

Text Generation • 8B • Updated Jun 17, 2024 • 144 •

ProLong

ProLong is a family of long-context models that are continued trained and supervised fine-tuned from Llama-3-8B, with a maximum context window of 512K

princeton-nlp/Llama-3-8B-ProLong-64k-Base

Text Generation • 8B • Updated Oct 31, 2024 • 8.36k • • 5
princeton-nlp/Llama-3-8B-ProLong-64k-Instruct

Text Generation • 8B • Updated Oct 31, 2024 • 8.47k • • 13
princeton-nlp/Llama-3-8B-ProLong-512k-Base

8B • Updated Oct 31, 2024 • 7.84k • 9
princeton-nlp/Llama-3-8B-ProLong-512k-Instruct

8B • Updated Oct 31, 2024 • 8k • 24

SimCSE

princeton-nlp/unsup-simcse-bert-base-uncased

Feature Extraction • Updated Nov 11, 2022 • 26.4k • • 5
princeton-nlp/unsup-simcse-bert-large-uncased

Feature Extraction • Updated Nov 15, 2022 • 48 • 1
princeton-nlp/unsup-simcse-roberta-base

Feature Extraction • Updated Jun 16, 2021 • 3.54k • • 9
princeton-nlp/unsup-simcse-roberta-large

Feature Extraction • Updated Jun 16, 2021 • 610 • 3

RLMT Experiments

The *RLMT* collection. Coming soon!

princeton-nlp/warm-start__sft__think__Llama-3.1-8B-Instruct

8B • Updated Sep 22 • 7
princeton-nlp/warm-start__sft__nothink__Qwen2.5-7B-Instruct

8B • Updated Sep 22 • 57
princeton-nlp/warm-start__sft__think__Llama-3.1-8B

8B • Updated Sep 22 • 8
princeton-nlp/warm-start__sft__think__Qwen2.5-7B

8B • Updated Sep 22 • 11

SimPO

This collections contains a list of SimPO and baseline models.

princeton-nlp/gemma-2-9b-it-SimPO

Text Generation • 9B • Updated Aug 2, 2024 • 1.53k • • 170
princeton-nlp/gemma-2-9b-it-DPO

Text Generation • 9B • Updated Jul 18, 2024 • 38 • • 9
princeton-nlp/Llama-3-Base-8B-SFT-IPO

Text Generation • 8B • Updated Jun 17, 2024 • 24 • • 1
princeton-nlp/Llama-3-Base-8B-SFT-DPO

Text Generation • 8B • Updated Jun 17, 2024 • 144 •

SWE-bench

SWE-bench is a benchmark for evaluating Language Models and AI Systems on their ability resolve real world GitHub Issues.

princeton-nlp/SWE-bench

Viewer • Updated Mar 3 • 21.5k • 22k • 128
princeton-nlp/SWE-bench_Lite

Viewer • Updated Mar 3 • 323 • 32.6k • 50
princeton-nlp/SWE-bench_Multimodal

Viewer • Updated Jan 13 • 612 • 1.16k • 21
princeton-nlp/SWE-bench_Verified

Viewer • Updated Feb 18 • 500 • 588k • 233

ProLong

ProLong is a family of long-context models that are continued trained and supervised fine-tuned from Llama-3-8B, with a maximum context window of 512K

princeton-nlp/Llama-3-8B-ProLong-64k-Base

Text Generation • 8B • Updated Oct 31, 2024 • 8.36k • • 5
princeton-nlp/Llama-3-8B-ProLong-64k-Instruct

Text Generation • 8B • Updated Oct 31, 2024 • 8.47k • • 13
princeton-nlp/Llama-3-8B-ProLong-512k-Base

8B • Updated Oct 31, 2024 • 7.84k • 9
princeton-nlp/Llama-3-8B-ProLong-512k-Instruct

8B • Updated Oct 31, 2024 • 8k • 24

Sheared Llama

princeton-nlp/Sheared-LLaMA-1.3B

Text Generation • Updated Jan 23, 2024 • 4.33k • 98
princeton-nlp/Sheared-LLaMA-2.7B

Text Generation • Updated Jan 23, 2024 • 2.49k • 61
princeton-nlp/Sheared-LLaMA-1.3B-ShareGPT

Text Generation • Updated Dec 4, 2023 • 977 • 10
princeton-nlp/Sheared-LLaMA-2.7B-ShareGPT

Text Generation • Updated Dec 4, 2023 • 1.16k • 8

SimCSE

princeton-nlp/unsup-simcse-bert-base-uncased

Feature Extraction • Updated Nov 11, 2022 • 26.4k • • 5
princeton-nlp/unsup-simcse-bert-large-uncased

Feature Extraction • Updated Nov 15, 2022 • 48 • 1
princeton-nlp/unsup-simcse-roberta-base

Feature Extraction • Updated Jun 16, 2021 • 3.54k • • 9
princeton-nlp/unsup-simcse-roberta-large

Feature Extraction • Updated Jun 16, 2021 • 610 • 3