Gene Ruebsamen

yahma

AI & ML interests

Foundational Networks, Transformer models, Diffusion Models, Convolutional Neural Networks, Reinforcement learning

Recent Activity

Organizations

Analytics Club at ETH ZΓΌrich's profile picture

yahma's activity

New activity in huggingchat/chat-ui 4 months ago

[NEW] Assistants

175
#357 opened 11 months ago by
victor
reacted to KingNish's post with πŸ‘ 7 months ago
view post
Post
3724
New Updates OpenGPT 4o
1. Live Chat (also known as video chat) (very powerful and fast, it can even identify famous places and persons)
2. Powerful Image Generation

Test and give feedback of New features:
KingNish/OpenGPT-4o

Future Updates
1. PDF Chat
2. Human like speech (Using Parler tts expresso)
3. Multilingual support for voice chat

Suggest more features that should be added. πŸ€—

Edit: Live Chat is now very powerful (than prev)
Β·
upvoted an article 8 months ago
view article
Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

β€’ 170
reacted to DmitryRyumin's post with ❀️ 10 months ago
view post
Post
🌟✨ Exciting Announcement: NVIDIA AI Foundation Models ✨🌟

πŸš€ Interact effortlessly with the latest SOTA AI model APIs, all optimized on the powerful NVIDIA accelerated computing stack-right from your browser! πŸ’»βš‘

πŸ”— Web Page: https://catalog.ngc.nvidia.com/ai-foundation-models

🌟🎯 Favorites:

πŸ”Ή Code Generation:
1️⃣ Code Llama 70B πŸ“πŸ”₯: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/codellama-70b
Model πŸ€–: codellama/CodeLlama-70b-hf

πŸ”Ή Text and Code Generation:
1️⃣ Gemma 7B πŸ’¬πŸ’»: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/gemma-7b
Model πŸ€–: google/gemma-7b
2️⃣ Yi-34B πŸ“šπŸ’‘: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/yi-34b
Model πŸ€–: 01-ai/Yi-34B

πŸ”Ή Text Generation:
1️⃣ Mamba-Chat πŸ’¬πŸ: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/mamba-chat
Model πŸ€–: havenhq/mamba-chat
2️⃣ Llama 2 70B πŸ“πŸ¦™: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/llama2-70b
Model πŸ€–: meta-llama/Llama-2-70b

πŸ”Ή Text-To-Text Translation:
1️⃣ SeamlessM4T V2 πŸŒπŸ”„: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/seamless-m4t2-t2tt
Model πŸ€–: facebook/seamless-m4t-v2-large

πŸ”Ή Image Generation:
1️⃣ Stable Diffusion XL πŸŽ¨πŸ”: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/sdxl

πŸ”Ή Image Conversation:
1️⃣ NeVA-22B πŸ—¨οΈπŸ“Έ: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/neva-22b

πŸ”Ή Image Classification and Object Detection:
1️⃣ CLIP πŸ–ΌοΈπŸ”: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/clip

πŸ”Ή Voice Conversion:
1️⃣ Maxine Voice Font πŸ—£οΈπŸŽΆ: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/voice-font

πŸ”Ή Multimodal LLM (MLLM):
1️⃣ Kosmos-2 πŸŒπŸ‘οΈ: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/kosmos-2
  • 2 replies
Β·
reacted to xianbao's post with ❀️ 11 months ago
view post
Post
There appears to be a huge misunderstanding regarding the licensing requirements for open sourced Chinese speaking speaking LLMs on
@huggingface


I initially shared this misconception too, but after conducting some research, I came up with the list below.

Veryimpressive!

liked a Space 11 months ago
reacted to dhuynh95's post with ❀️ 11 months ago
view post
Post
βœ…New paper to ensure valid LLM output with SOTA LLMs like GPT4 by mixing it with OSS LLMs

Paper: arxiv.org/abs/2401.09967

Great paper showing how strong proprietary AI like #GPT4 can be paired with #OSS LLM to ensure LLM output validity, e.g. valid JSON.

Many devs complain that #LLMs cannot be reliably used in production if the output is not valid, for instance, if one wants to use LLMs to generate SQL queries or JSON, it is crucial that the output is valid.

Frameworks have arisen to constrain the outputs of the LLM to follow some constraints, like outlines (https://github.com/outlines-dev/outlines), but they assume access to logits.

This makes them incompatible with proprietary LLMs like GPT4 that don’t share logits, so one can only use open-source LLMs that are much less performant.

This paper shows how can use powerful proprietary LLMs like GPT4 to create a first unconstrained sketch and refine it using an OSS model like Llama 2 where logits are accessible, to rewrite the sketch following some specific constraints.

They show that GPT4 Precision can be increased by 14% (43% before, 57% after), by boosting it with constrained output on information extraction on Wiki-NRE!
New activity in NousResearch/Nous-Hermes-2-Yi-34B 12 months ago

Any chance of a 200k version?

1
#7 opened 12 months ago by
yahma
New activity in HuggingFaceM4/idefics_playground 12 months ago

Demo ERROR

#53 opened 12 months ago by
yahma
New activity in huggingchat/chat-ui about 1 year ago

openchat_3.5

3
#325 opened about 1 year ago by
hungryai
New activity in open-llm-leaderboard/open_llm_leaderboard about 1 year ago