adamo1139
/

Danube3-4b-4chan-HESOYAM-2510-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

Danube3-4b-4chan-HESOYAM-2510-GGUF / README.md

adamo1139's picture

Update README.md

8c29cd5 verified 4 months ago

|

1.43 kB

	---
	license: apache-2.0
	datasets:
	- adamo1139/4chan_archive_ShareGPT_only5
	- adamo1139/HESOYAM_v0.4
	- adamo1139/uninstruct-v1-experimental-chatml
	language:
	- en
	base_model:
	- h2oai/h2o-danube3-4b-base
	pipeline_tag: text-generation
	---
	# GGUF Quants

	Quants for [adamo1139/danube3-4b-4chan-hesoyam-2510](https://huggingface.co/adamo1139/danube3-4b-4chan-hesoyam-2510) are available in this repo

	# Model Details

	I finetuned Danube3 4B Base on [adamo1139/uninstruct-v1-experimental-chatml](https://huggingface.co/datasets/adamo1139/uninstruct-v1-experimental-chatml) dataset with the goal being making AI assistant slop less likely.

	Then I did finetuning on [adamo1139/4chan_archive_ShareGPT_only5](https://huggingface.co/datasets/adamo1139/4chan_archive_ShareGPT_only5) which is a filtered collection of 4chan threads from various boards for 1 epoch to introduce 4chan-specific slang.

	Then I did finetuning on [adamo1139/HESOYAM_v0.4](https://huggingface.co/datasets/adamo1139/HESOYAM_v0.4) for 3 epochs to improve 1-on-1 chat capabilities.

	This is a resulting model.

	# Prompt format

	Use ChatML prompt format.

	System message should be in the format as below:

	```
	A chat on 4chan board /3/
	A chat on 4chan board /g/
	A chat on 4chan board /x/
	A chat on 4chan board /pol/
	```

	# Evaluation

	I am still vibe-checking the model but initial results are good. I might have put in a bit too much reddit style from HESOYAM, not sure.