Upload 20B_tokenizer.json
#26 opened 11 months ago
by
tonymhl
I did some little experiments with cfc 'liquid neural network' and wondered if it could replace the RNN. What do you think?
#22 opened over 1 year ago
by
diegottt
How many GPU and CPU memory I need to finetune a 7B raven model
1
#21 opened over 1 year ago
by
fubincom
Loading the model into WebUI
1
#20 opened over 1 year ago
by
Respair
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6467e44e696e7355f5d710b6/wBRRrgins-OVSNLMXDRHE.jpeg)
Self-reflection
2
#17 opened over 1 year ago
by
Raspbfox
Amazing results with Raven 3B!! It speaks other languages, it knows the date.. How does this work?
5
#15 opened almost 2 years ago
by
phi0112358
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1661340992329-noauth.png)
Training an INT4 version of the 7B model
7
#14 opened almost 2 years ago
by
Raspbfox
RWKV used as model for intent classification followed by performing tasks on own (Similar to Auto-GPT).
4
#13 opened almost 2 years ago
by
Parag09
New Dataset
10
#11 opened almost 2 years ago
by
Raspbfox
Comparison with llama based models?
3
#7 opened almost 2 years ago
by
milsunone
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62fe0aa72ab4812db7019081/NK09YfcoB1g0xoqCiX00O.jpeg)
Seeking Improvements and Configuration Advice for Longer Responses and Larger Tokens
3
#5 opened almost 2 years ago
by
dondraper
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642b2b8bbb77f84566321800/vBdIs222m14F2HGYWkd9T.jpeg)
ChnJpn more Japanese Ratio Plz?
5
#3 opened almost 2 years ago
by
terorin