Update README.md
Browse files
README.md
CHANGED
@@ -16,6 +16,30 @@ language:
|
|
16 |
|
17 |
# Current status:
|
18 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
19 |
<details>
|
20 |
<summary><b>June 8th, 2025, Is this project dead? LLAMA-4 was released bruh!</b></summary>
|
21 |
A LOT of stuff was changed over the past year, a lot of new datasets created, lessons learned, and so on and so forth. No, this project is <b>not dead</b>, and with the catastrophic release of LLAMA-4 (many, including myself stated it would be DOA, which proven to be correct, many researchers left meta after LLAMA-4 release, etc etc...). So... It seems that LLAMA-3 would stay relevant for quite some time. This is happening, but it's not the highest priority right now.
|
|
|
16 |
|
17 |
# Current status:
|
18 |
|
19 |
+
<details>
|
20 |
+
<summary><b>June 15th, 2025, well well well... It looks like the meme "I work well under pressure" does indeed applies, massive progress.</b></summary>
|
21 |
+
Yesterday, the 14th of June 2025, was quite the day in terms of geopolitics, I try to keep this stuff out of AI & tech, but I will say this... despite literally dozens if not hunders of ballistic missiles heading my way, I've made a very significant progress that is very much relevant for this whole project, and for all future projects.
|
22 |
+
|
23 |
+
It doesn't get any more pressure than the above, and at the moment of sirens and what not, I had an "Aha!" moment, and something clicked. Then (after it was "safe" to surface) I tested my idea, and it indeed worked. This is big. What does all of this cryptic mumbling means for the project? What was discovered?
|
24 |
+
|
25 |
+
Nothing sexy. No new "revolutionary RL technique" (GRPO SPPO DPO or any of that), it's simply data processing stuff. But... **IT IS SEXY**. Why?
|
26 |
+
|
27 |
+
Because it worked. And it means that I've gained access to a very substantial and possibly unique sources of data.
|
28 |
+
|
29 |
+
What does all the cryptic mumbling even mean?
|
30 |
+
|
31 |
+
It means that a new, very interestig sources of data that would **GREATLY** help with making all future models more balanced in term of ideology are now available, and that both **LLAMA_UNALINGED** and all future models would now enjoy a significant upgrade. **Very** significant.
|
32 |
+
|
33 |
+
Why I don't tell what is that data I keep yapping about?
|
34 |
+
|
35 |
+
Because this is a **grey zone**. By Israeli laws, **it is allowed**, but it's very much in the grey zone in terms of data and copyright law used for AI training. I want to both be as **transparent** as I can, while **protecting** the project.
|
36 |
+
|
37 |
+
Fun fact, the last time ballistic missiles were flying my way, I've made I [Impish_LLAMA_3B](https://huggingface.co/SicariusSicariiStuff/Impish_LLAMA_3B), which was the first 'production ready' Roleplay model at 3B size (and was indeed used at scale by several known AI platform). I work well under pressure.
|
38 |
+
|
39 |
+
I need a vacation though. Too bad that the skies are currently closed.
|
40 |
+
|
41 |
+
</details>
|
42 |
+
|
43 |
<details>
|
44 |
<summary><b>June 8th, 2025, Is this project dead? LLAMA-4 was released bruh!</b></summary>
|
45 |
A LOT of stuff was changed over the past year, a lot of new datasets created, lessons learned, and so on and so forth. No, this project is <b>not dead</b>, and with the catastrophic release of LLAMA-4 (many, including myself stated it would be DOA, which proven to be correct, many researchers left meta after LLAMA-4 release, etc etc...). So... It seems that LLAMA-3 would stay relevant for quite some time. This is happening, but it's not the highest priority right now.
|