Sefika
/

CRE_llama_fewrel_1_memory_10_4

Text Generation

text-generation-inference

4-bit precision

Model card Files Files and versions

CRE_llama_fewrel_1_memory_10_4 / README.md

Sefika's picture

Update README.md

2a65864 verified about 1 month ago

|

history blame contribute delete

1.4 kB

	---
	library_name: transformers
	tags:
	- trl
	- sft
	license: mit
	---

	# Model Card for Model ID

	<!-- Provide a quick summary of what the model is/does. -->



	## Model Details

	### Model Description

	<!-- Provide a longer summary of what this model is. -->

	This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.

	- Developed by: Sefika

	- Language(s) (NLP): EN
	- License: MIT
	- Finetuned from model [optional]: https://huggingface.co/meta-llama/Llama-2-7b-chat-hf

	### Model Sources [optional]

	<!-- Provide the basic links for the model. -->

	- Repository: [More Information Needed]
	- Paper [optional]: [More Information Needed]
	- Demo [optional]: [More Information Needed]


	### Direct Use

	```python
	from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

	tokenizer = "meta-llama/Llama-2-7b-chat-hf"
	model_id = "CRE_llama_fewrel_1_memory_10_4"
	tokenizer = AutoTokenizer.from_pretrained(model_id, use_auth_token=True)
	model = AutoModelForCausalLM.from_pretrained(
	model_id,
	device_map="auto",
	load_in_4bit=True, # Requires bitsandbytes
	torch_dtype="auto"
	)

	```

	#### Testing Data

	FewRel



	BibTeX:
	The paper "Large Language Models for Continual Relation Extraction" is submitted to Springer Machine Learning journal




	## Model Card Contact
	sefika efeoglu