Update README.md
Browse files
README.md
CHANGED
@@ -43,7 +43,7 @@ The model outputs two vectors:
|
|
43 |
1. **Policy**: A probability distribution over `NUM_POSSIBLE_MOVES=4672` representing the probability of making each move, obtained using `softmax` activation.
|
44 |
2. **Value**: A single scalar value indicating win/loss probability from current player’s perspective, ranging from -1 (loss) to 1 (win), obtained using `tanh` activation.
|
45 |
|
46 |
-
|
47 |
|
48 |
### Model Architecture
|
49 |
|
@@ -128,6 +128,12 @@ This model was evaluated against a simple random move opponent using the `evalua
|
|
128 |
|
129 |
These scores indicate that the model, in its current state, is not a strong chess player. It draws a majority of games against a random opponent, but also loses a significant number. Further training and architecture improvements are needed to enhance its performance.
|
130 |
|
|
|
|
|
|
|
|
|
|
|
|
|
131 |
## How to Use
|
132 |
|
133 |
### Training
|
|
|
43 |
1. **Policy**: A probability distribution over `NUM_POSSIBLE_MOVES=4672` representing the probability of making each move, obtained using `softmax` activation.
|
44 |
2. **Value**: A single scalar value indicating win/loss probability from current player’s perspective, ranging from -1 (loss) to 1 (win), obtained using `tanh` activation.
|
45 |
|
46 |
+
|
47 |
|
48 |
### Model Architecture
|
49 |
|
|
|
128 |
|
129 |
These scores indicate that the model, in its current state, is not a strong chess player. It draws a majority of games against a random opponent, but also loses a significant number. Further training and architecture improvements are needed to enhance its performance.
|
130 |
|
131 |
+
|
132 |
+
## Demo Game Video
|
133 |
+
|
134 |
+
You can see a demo game here: [StockZero Demo Gameplay Video](https://huggingface.co/nirajandhakal/StockZero/blob/main/v2-gameplay-svg-high-quality.mp4)
|
135 |
+
|
136 |
+
|
137 |
## How to Use
|
138 |
|
139 |
### Training
|