vikarti-anatra
's Collections
Interesting ones
updated
LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B
Paper
•
2310.20624
•
Published
•
13
Unleashing the Power of Pre-trained Language Models for Offline
Reinforcement Learning
Paper
•
2310.20587
•
Published
•
18
BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B
Paper
•
2311.00117
•
Published
VideoFusion: Decomposed Diffusion Models for High-Quality Video
Generation
Paper
•
2303.08320
•
Published
•
3
Vikhrmodels/Vikhr-7B-instruct_0.4
Text Generation
•
Updated
•
1.61k
•
33
IlyaGusev/saiga_llama3_8b
Text Generation
•
Updated
•
35k
•
123
cognitivecomputations/wizard_vicuna_70k_unfiltered
Viewer
•
Updated
•
34.6k
•
106
•
164
failspy/llama-3-70B-Instruct-abliterated
Text Generation
•
Updated
•
2.47k
•
104
Zoyd/Sao10K_L3-8B-Stheno-v3.1-8_0bpw_exl2
Text Generation
•
Updated
•
11
•
3
Zoyd/Sao10K_L3-8B-Stheno-v3.1-6_5bpw_exl2
Text Generation
•
Updated
•
5
•
1
sophosympatheia/Aurora-Nights-70B-v1.0
Text Generation
•
Updated
•
440
•
22
PygmalionAI/mythalion-13b
Text Generation
•
Updated
•
1.14k
•
158
Nitral-AI/Poppy_Porpoise-1.0-L3-8B
Text Generation
•
Updated
•
53
•
24
NeverSleep/Noromaid-v0.4-Mixtral-Instruct-8x7b-Zloss
Text Generation
•
Updated
•
32
•
37
microsoft/Phi-3-medium-128k-instruct
Text Generation
•
Updated
•
18.5k
•
•
380
Azazelle/L3-RP_io
Text Generation
•
Updated
•
6
•
3
Lewdiculous/Poppy_Porpoise-1.0-L3-8B-GGUF-IQ-Imatrix
Updated
•
136
•
15
ACECODER: Acing Coder RL via Automated Test-Case Synthesis
Paper
•
2502.01718
•
Published
•
28
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training
Tokens
Paper
•
2504.07096
•
Published
•
70