Hieu Lam PRO

lamhieu

AI & ML interests

.-.

lamhieu's activity

replied to m-ric's post 14 days ago

Sounds interesting but I think there will be a big breakthrough, a new "architecture/methodology/factor/rethinking" for developing large models. That's what I think, I don't know what it is yet, haha.

posted an update 26 days ago
🎯 Ghost 8B Beta 1608: Empowering Your AI Assistant
📦 Unlock the Power of Ghost 8B Beta 1608: Build Your Personal AI Companion
Ghost 8B Beta 1608 empowers you to create a safe and multilingual AI assistant tailored to your needs, directly on your personal computer. 🧑‍💻 Leverage AI's capabilities within your own space! 🚀 Ghost 8B Beta 1608 is ready to become your AI companion.
~
📦 Use Ghost 8B Beta 1608 as your personal AI assistant!
With Ghost 8B Beta 1608, you can build your own AI assistant that uses the power of AI to provide safe, personalized language support. 🧑‍💻 Enjoy the benefits of AI on your personal computer! 🚀 Ghost 8B Beta 1608 is ready to become your AI partner.
lamhieu/ghost-8b-beta-8k
ghost-x/ghost-8b-beta-668ead6179f93be717db4542
posted an update about 1 month ago
🚀 We're excited to launch Ghost 8B Beta (1608), a top-performing language model with unmatched multilingual support and cost efficiency.

Key Highlights:
- Superior Performance: Outperforms Llama 3.1 8B Instruct, GPT-3.5 Turbo, Claude 3 Opus, GPT-4, and more in winrate scores.
- Expanded Language Support: Now supports 16 languages, including English, Vietnamese, Spanish, Chinese, and more.
- Enhanced Capabilities: Improved math, reasoning, and instruction-following for better task handling.

With two context options (8k and 128k), Ghost 8B Beta is perfect for complex, multilingual applications, balancing power and cost-effectiveness.

🔗 Learn More: https://ghost-x.org/docs/models/ghost-8b-beta
ghost-x/ghost-8b-beta-668ead6179f93be717db4542
replied to their post about 2 months ago

@Dihelson @llama-anon @AIWizard76 @danielus
🎉 Ghost 8B Beta Released: Game-Changing Language Model

Ghost 8B Beta is a groundbreaking language model developed with a clear vision: to deliver exceptional multilingual support and superior knowledge capabilities, all while remaining cost-effective. The model comes in two context length variations, 8k and 128k, ensuring flexibility for various tasks. Moreover, it has built-in multilingual functionality, making it a powerful tool for global communication and understanding.

posted an update about 2 months ago
🎉 Ghost 8B Beta Released: Game-Changing Language Model
--
Ghost 8B Beta is a groundbreaking language model developed with a clear vision: to deliver exceptional multilingual support and superior knowledge capabilities, all while remaining cost-effective. The model comes in two context length variations, 8k and 128k, ensuring flexibility for various tasks. Moreover, it has built-in multilingual functionality, making it a powerful tool for global communication and understanding.
--
* See detailed article: https://huggingface.co/blog/lamhieu/ghost-8b-beta-released-game-changing-language-mode
* Model card: ghost-x/ghost-8b-beta
* Official website: https://ghost-x.org/docs/models/ghost-8b-beta
posted an update 2 months ago
🤯 Ghost 8B Beta emerges as a clear leader, surpassing even well-known models like xAI Grok 1, OpenAI GPT 3.5, and Mistral Mixtral 8x7B. It also reaches parity with Mistral Medium, further solidifying its position as a top-tier language model. Furthermore, Ghost 8B Beta stands out as one of only three models evaluated with the zero-shot method, alongside Claude 2 and Claude 3, showcasing its capabilities and potential for groundbreaking applications.
---
💬 Chat with the model here:
- Playground with Ghost 8B Beta (β, 8k): lamhieu/ghost-8b-beta-8k
- Playground with Ghost 8B Beta (β, 128k): lamhieu/ghost-8b-beta-128k
- Official website: https://ghost-x.org/docs/models/ghost-8b-beta/
replied to their post 2 months ago

Thank you for your dedication, it sounds great. Here I would like to share some additional information and perspectives so that everyone can better understand the issues we address:

  • With language models, in practice we only need the model to understand about 80% of the material, or to have a good overview; combining it with RAG then brings better accuracy. So what we need is a model that is good at telling the truth and very good at understanding and working with RAG.
  • In Italian, I'm very happy that it speaks well. It proves that my training method and source code were correct, because what is live is actually the d0x5 version. Italian was only added later (at the same time as German), which is why its responses can sometimes read like a translation.
  • With the ability to reason, I hope you don't misunderstand. It still works well; only when compared to some current superior models like GPT 4o or Claude 3 will there be some problems where it "loses". It still outperforms a lot of other, much larger models. For example, take the question "Andrew is free from 11 am to 3 pm, Joanne is free from noon to 2 pm and then 3:30 pm to 5 pm. Hannah is available at noon for half an hour, and then 4 pm to 6 pm. What are some options for start times for a 30 minute meeting for Andrew, Hannah, and Joanne?", taken from the OpenAI GPT-4 home page.
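
For anyone who wants to check a model's answer to the scheduling question above, here is a minimal Python sketch that intersects the three availabilities. It assumes all times are on the same day and that candidate start times fall on 30-minute boundaries; the helper names (`to_min`, `intersect`, `start_times`) are hypothetical and only for illustration.

```python
from itertools import product


def to_min(hhmm: str) -> int:
    """Convert 'HH:MM' (24-hour clock) to minutes since midnight."""
    hours, minutes = hhmm.split(":")
    return int(hours) * 60 + int(minutes)


def intersect(a, b):
    """Intersect two lists of (start, end) intervals given in minutes."""
    out = []
    for (s1, e1), (s2, e2) in product(a, b):
        start, end = max(s1, s2), min(e1, e2)
        if start < end:  # keep only non-empty overlaps
            out.append((start, end))
    return sorted(out)


def start_times(schedules, duration=30, step=30):
    """List start times where everyone is free for `duration` minutes."""
    common = schedules[0]
    for schedule in schedules[1:]:
        common = intersect(common, schedule)
    starts = []
    for start, end in common:
        t = start
        while t + duration <= end:
            starts.append(f"{t // 60:02d}:{t % 60:02d}")
            t += step  # assumption: candidates on 30-minute boundaries
    return starts


# Availabilities as stated in the question.
andrew = [(to_min("11:00"), to_min("15:00"))]
joanne = [(to_min("12:00"), to_min("14:00")), (to_min("15:30"), to_min("17:00"))]
hannah = [(to_min("12:00"), to_min("12:30")), (to_min("16:00"), to_min("18:00"))]

options = start_times([andrew, joanne, hannah])
print(options)  # ['12:00']
```

Under these assumptions the only window shared by all three is 12:00 to 12:30, so noon is the only valid start time.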

One note: in reasoning tests, the temperature is often set to 0; with Ghost 8B Beta we never set it lower than 0.1. The reason is simple: if the model still reasons well at this level, then at 0.4 (the default in the current chat) it will often achieve the same results, and we want to aim for practical effectiveness rather than scores. Try lowering the temperature on some reasoning questions to experiment.
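
To make the temperature point concrete, here is a small sketch of how temperature reshapes a next-token distribution in general. It is not Ghost 8B Beta's actual sampler: the logits are made-up values and `softmax_with_temperature` is a hypothetical helper, shown only to illustrate why 0.1 behaves almost greedily while 0.4 still leaves some room for variation.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn logits into probabilities after dividing by the temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive; 0 means greedy decoding")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical logits for three candidate tokens (illustrative only).
logits = [2.0, 1.5, 0.5]

for t in (0.1, 0.4, 1.0):
    probs = softmax_with_temperature(logits, t)
    print(t, [round(p, 3) for p in probs])
```

The lower the temperature, the more probability mass moves to the top-scoring token, so a question the model answers reliably at 0.1 is one where its preferred answer already dominates.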

After all, you guys are great, thank you so much everyone.

An example of reasoning about time:
Screenshot 2024-07-16 at 11.29.12.png
Screenshot 2024-07-16 at 11.29.25.png

An example of a long context with extensive summary capabilities: Paper: Point out the highlights and identify the ideal people to apply it.

replied to their post 2 months ago

@Dihelson It's probably because you told the model to do it again. Try telling the model to change each word. Of course, it could still be because the model misunderstood.

replied to their post 2 months ago

Try the following conversation: (1) ask to write an article -> (2) ask to translate the article into the languages you want.

replied to their post 2 months ago

@AIWizard76 It hasn't gone through any real eval tests that would allow a comparison, but speaking only of Ghost 8B Beta, it has good translation capabilities for the supported languages. It works well for translating long texts and also for translating into multiple languages simultaneously.

replied to their post 2 months ago

It's simple, currently the base version will not try to lengthen the text and be more "obedient". Maybe tomorrow or the next day I'll put it up for everyone to try.
Note, the current version is running everything from version "disl-0x5", the new version will improve a lot but it may not be ready right now.

replied to their post 2 months ago

thank you for your comments and encouragement 🤗
another question, how do you feel when conversing in Italian?

replied to their post 2 months ago

@danielus I noticed that the model explains a lot. This is what the chat version (fine-tuned from Ghost 8B Beta base) does for the chat task; the base version will not try to explain and will follow the system prompt more strictly. The goal of answering with more information is to save users from having to look things up or ask follow-up questions after a single question. Of course, this can sometimes be a hassle, and we'll try to balance it out.

replied to their post 2 months ago

@Dihelson It supports Portuguese, try it and let me know what you think. 🇵🇹

replied to their post 2 months ago

A note here: the model works well with 9 major languages, along with function tools in those languages. Its size is small compared to other multilingual models (which may be lacking or inferior in things like function tools and performance).

replied to their post 2 months ago

@ZeroWw To be honest, our initial training focused more on math ability than on (abstract) reasoning. It simply has less training data in that area. Rest assured, this is a test of training recipes; expanding the capability domains and languages covered in training is just a matter of time and resources.

replied to their post 2 months ago

@Dihelson I understand. We value a model that has good reasoning capabilities, and that is also our goal. The early versions focused on the immediate goals of good multilingualism, safety, function tools support and good general performance, and they have achieved those goals. In the next stage, I will also tackle the things you mentioned and a few other languages. Enjoy 🤗

replied to their post 2 months ago

@Dihelson @ZeroWw Thank you for pointing out the problems, in terms of reasoning we have noticed the problem. It will be improved in an upgraded version in the near future, the current version is disl-0x5 (d0x5).

replied to their post 2 months ago

I think it works well for this question, right? Have you adjusted the temperature higher?

IMG_2252.png

posted an update 2 months ago
🎉 The Ghost 8B Beta model outperforms prominent models such as Llama 3 8B Instruct and GPT 3.5 Turbo in the lc_winrate score. In addition, it outperforms Claude 3 Opus, Claude 3 Sonnet, GPT-4, and Mistral Large on the AlpacaEval 2.0 winrate score.

Ghost 8B Beta is a large language model developed with goals that include excellent multilingual support, superior knowledge capabilities, and cost-effectiveness. The model comes in two context length versions, 8k and 128k, along with multilingual function tools support by default.
The languages supported are 🇺🇸 English, 🇫🇷 French, 🇮🇹 Italian, 🇪🇸 Spanish, 🇵🇹 Portuguese, 🇩🇪 German, 🇻🇳 Vietnamese, 🇰🇷 Korean and 🇨🇳 Chinese.

Explore the Potential:
To learn more about this groundbreaking language model, visit the official website or explore the online demo platforms:
- Ghost 8B Beta (β, 8k) on Spaces: lamhieu/ghost-8b-beta-8k
- Ghost 8B Beta (β, 128k) on Spaces: lamhieu/ghost-8b-beta-128k
- Official website: https://ghost-x.org/docs/models/ghost-8b-beta
posted an update 2 months ago
Ghost 8B Beta is a large language model developed with goals that include excellent multilingual support, superior knowledge capabilities, and cost-effectiveness. The model comes in two context length versions, 8k and 128k, along with multilingual function tools support by default.
* The languages supported are 🇺🇸 English, 🇫🇷 French, 🇮🇹 Italian, 🇪🇸 Spanish, 🇵🇹 Portuguese, 🇩🇪 German, 🇻🇳 Vietnamese, 🇰🇷 Korean and 🇨🇳 Chinese.
* 👨‍💻 Try on Spaces: lamhieu/ghost-8b-beta-8k
* 📋 Official website: https://ghost-x.org/docs/models/ghost-8b-beta
posted an update 3 months ago
Wow, this is amazing! 🤯
Samba is a powerful hybrid model with an unlimited context length, built by stacking Mamba, MLP, Sliding Window Attention, and MLP layers. Samba's largest version, Samba-3.8B, trained on 3.2 trillion tokens, excels in benchmarks like MMLU, GSM8K, and HumanEval, and shines in long-context tasks with minimal tuning.
---
Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"
Github: https://github.com/microsoft/Samba
replied to their post 4 months ago

Let's try a math problem, what do you think about this answer?

image.png

posted an update 4 months ago
Haloooo, continuing to experiment with a checkpoint version of Ghost Beta (small version) during stage 1 training (training progress: 41%).

Supported languages: 🇺🇸 English, 🇪🇸 Spanish, 🇵🇹 Portuguese, 🇫🇷 French, 🇮🇹 Italian, 🇩🇪 German, 🇻🇳 Vietnamese, 🇰🇷 Korean, 🇨🇳 Chinese, and !?

Note that this is not a conclusion; it is just a snapshot of the current state of the model. If you find it interesting, please follow the project at:
* https://x.com/ghostx_ai
* https://ghost-x.org/
* https://huggingface.co/ghost-x

Ghost X is currently very open to invitations to cooperate, share and support.
🤯👇
posted an update 4 months ago
Following the previous survey, Ghost Beta (small version) will support 9+ languages fluently. The model is designed to be trained in 3 stages; here is a checkpoint to try from stage 1 (training progress: 29%).

Supported languages: 🇺🇸 English, 🇪🇸 Spanish, 🇵🇹 Portuguese, 🇫🇷 French, 🇮🇹 Italian, 🇩🇪 German, 🇻🇳 Vietnamese, 🇰🇷 Korean, 🇨🇳 Chinese, and !?

Note that this is not a conclusion; it is just a snapshot of the current state of the model. If you find it interesting, please follow the project at:
* https://x.com/ghostx_ai
* https://ghost-x.org/
* https://huggingface.co/ghost-x

🤯👇
posted an update 4 months ago
🎉 Happy to announce a collection called "Blackhole". It is a black hole of high-quality, multilingual data from many fields, for training LLMs with SFT and DPO methods.
📦 There are now over 30 high-quality datasets available so you can start creating interesting models. It will be updated in the future; glad if it helps someone.

lamhieu/blackhole-66473b7feec034b4fb70818a