Update README.md
README.md
CHANGED
@@ -4,10 +4,13 @@ tags:
 license: gpl-3.0
 ---
 ## A conversational agent with many personalities (PersonaGPT)
-PersonaGPT is an open-domain conversational agent
-
-
+PersonaGPT is an open-domain conversational agent designed to do 2 tasks:
+
+1. decoding _personalized_ responses based on input personality facts (the "persona" profile of the bot).
+2. incorporating _turn-level goals_ into its responses through "action codes" (e.g., "talk about work", "ask about favorite music").
 
+It builds on the [DialoGPT-medium](https://huggingface.co/microsoft/DialoGPT-medium) pretrained model based on the [GPT-2](https://github.com/openai/gpt-2) architecture.
+This model is trained on the [Persona-Chat](https://arxiv.org/pdf/1801.07243) dataset, with added special tokens to better distinguish between conversational history and personality traits for dyadic conversations. Furthermore, some active learning was used to train the model to do _controlled_ decoding using turn-level goals.
 
 ## Full Repo
 
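The README text added in this commit says that special tokens separate personality traits from conversational history, and that action codes carry turn-level goals. The following is a minimal sketch of how such an input might be flattened into one string before tokenization. The delimiter strings (`<persona>`, `<action>`, `<history>`, `<eot>`), the `TurnContext` class, and the `build_prompt` function are all hypothetical names chosen for illustration; the actual special tokens and input layout used by PersonaGPT are not given in this diff.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Hypothetical delimiter tokens, for illustration only. The real special
# tokens used by PersonaGPT are not stated in this README excerpt.
PERSONA_TOKEN = "<persona>"
ACTION_TOKEN = "<action>"
HISTORY_TOKEN = "<history>"
END_OF_TURN = "<eot>"


@dataclass
class TurnContext:
    """Everything the agent is conditioned on for one response."""
    history: List[str]                                       # prior utterances, oldest first
    persona_facts: List[str] = field(default_factory=list)   # the bot's "persona" profile
    action_code: Optional[str] = None                        # turn-level goal, e.g. "talk about work"


def build_prompt(ctx: TurnContext) -> str:
    """Flatten persona facts, an optional action code, and the dialog
    history into a single string, with delimiter tokens marking each part."""
    segments: List[str] = []
    if ctx.persona_facts:
        facts = "".join(fact + END_OF_TURN for fact in ctx.persona_facts)
        segments.append(PERSONA_TOKEN + facts)
    if ctx.action_code is not None:
        segments.append(ACTION_TOKEN + ctx.action_code + END_OF_TURN)
    turns = "".join(turn + END_OF_TURN for turn in ctx.history)
    segments.append(HISTORY_TOKEN + turns)
    return "".join(segments)


# Task 1: personalized decoding from persona facts.
persona_prompt = build_prompt(
    TurnContext(history=["hi, what do you do?"], persona_facts=["i like jazz."])
)

# Task 2: controlled decoding from a turn-level goal.
action_prompt = build_prompt(
    TurnContext(history=["hi, what do you do?"], action_code="talk about work")
)
```

Keeping the delimiters as explicit tokens, rather than relying on plain-text separators, is one way to let a GPT-2 style model tell persona facts apart from dialog turns; whether PersonaGPT combines persona facts and action codes in the same input, as this sketch permits, is an assumption.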