Britny Farahdel PRO

britny

AI & ML interests

None yet

Recent Activity

reacted to merve's post with 🚀 5 days ago

smolagents can see 🔥 we just shipped vision support to smolagents 🤗 agentic computers FTW you can now: 💻 let the agent get images dynamically (e.g. agentic web browser) 📑 pass images at the init of the agent (e.g. chatting with documents, filling forms automatically etc) with few LoC change! 🤯 you can use transformers models locally (like Qwen2VL) OR plug-in your favorite multimodal inference provider (gpt-4o, antrophic & co) 🤠 read our blog http://hf.co/blog/smolagents-can-see

updated a collection 7 days ago

Image generation

View all activity

Organizations

britny's activity

reacted to merve's post with 🚀 5 days ago

Post

2080

smolagents can see 🔥
we just shipped vision support to smolagents 🤗 agentic computers FTW

you can now:
💻 let the agent get images dynamically (e.g. agentic web browser)
📑 pass images at the init of the agent (e.g. chatting with documents, filling forms automatically etc)
with few LoC change! 🤯
you can use transformers models locally (like Qwen2VL) OR plug-in your favorite multimodal inference provider (gpt-4o, antrophic & co) 🤠

read our blog http://hf.co/blog/smolagents-can-see

updated 2 collections 7 days ago

3D

Collection

7 items • Updated 7 days ago

Image generation

Collection

14 items • Updated 7 days ago

updated a collection 13 days ago

Avatars

Collection

15 items • Updated 9 days ago

reacted to merve's post with ❤️ 19 days ago

Post

3611

What a beginning to this year in open ML 🤠
Let's unwrap! merve/jan-10-releases-677fe34177759de0edfc9714

Multimodal 🖼️
> ByteDance released SA2VA: a family of vision LMs that can take image, video, text and visual prompts
> moondream2 is out with new capabilities like outputting structured data and gaze detection!
> Dataset: Alibaba DAMO lab released multimodal textbook — 22k hours worth of samples from instruction videos 🤯
> Dataset: SciCap captioning on scientific documents benchmark dataset is released along with the challenge!

LLMs 💬
> Microsoft released Phi-4, sota open-source 14B language model 🔥
> Dolphin is back with Dolphin 3.0 Llama 3.1 8B 🐬🐬
> Prime-RL released Eurus-2-7B-PRIME a new language model trained using PRIME alignment
> SmallThinker-3B is a new small reasoning LM based on Owen2.5-3B-Instruct 💭
> Dataset: QWQ-LONGCOT-500K is the dataset used to train SmallThinker, generated using QwQ-32B-preview 📕
> Dataset: @cfahlgren1 released React Code Instructions: a dataset of code instruction-code pairs 📕
> Dataset: Qwen team is on the roll, they just released CodeElo, a dataset of code preferences 👩🏻‍💻

Embeddings 🔖
> @MoritzLaurer released zero-shot version of ModernBERT large 👏
> KaLM is a new family of performant multilingual embedding models with MIT license built using Qwen2-0.5B

Image/Video Generation ⏯️
> NVIDIA released Cosmos, a new family of diffusion/autoregressive World Foundation Models generating worlds from images, videos and texts 🔥
> Adobe released TransPixar: a new text-to-video model that can generate assets with transparent backgrounds (a first!)
> Dataset: fal released cosmos-openvid-1m Cosmos-tokenized OpenVid-1M with samples from OpenVid-1M

Others
> Prior Labs released TabPFNv2, the best tabular transformer is out for classification and regression
> Metagene-1 is a new RNA language model that can be used for pathogen detection, zero-shot embedding and genome understanding

updated a collection 19 days ago

Avatars

Collection

15 items • Updated 9 days ago

updated a collection 21 days ago

Tools

Collection

12 items • Updated 21 days ago

updated a collection 24 days ago

Avatars

Collection

15 items • Updated 9 days ago

reacted to merve's post with 🚀 30 days ago

Post

4835

supercharge your LLM apps with smolagents 🔥

however cool your LLM is, without being agentic it can only go so far

enter smolagents: a new agent library by Hugging Face to make the LLM write code, do analysis and automate boring stuff!

Here's our blog for you to get started https://huggingface.co/blog/smolagents

reacted to ginipick's post with 🔥 about 1 month ago

Post

5230

🎬 Revolutionize Your Video Creation
Dokdo Multimodal AI Transform a single image into a stunning video with perfect audio harmony! 🚀

Superior Technology 💫
Advanced Flow Matching: Smoother video transitions surpassing Kling and Sora
Intelligent Sound System: Automatically generates perfect audio by analyzing video mood
Multimodal Framework: Advanced AI integrating image, text, and audio analysis
Outstanding Performance 🎯
Ultra-High Resolution: 4K video quality with bfloat16 acceleration
Real-Time Optimization: 3x faster processing with PyTorch GPU acceleration
Smart Sound Matching: Real-time audio effects based on scene transitions and motion
Exceptional Features ✨
Custom Audio Creation: Natural soundtrack matching video tempo and rhythm
Intelligent Watermarking: Adaptive watermark adjusting to video characteristics
Multilingual Support: Precise translation engine powered by Helsinki-NLP
Versatile Applications 🌟
Social Media Marketing: Create engaging shorts for Instagram and YouTube
Product Promotion: Dynamic promotional videos highlighting product features
Educational Content: Interactive learning materials with enhanced engagement
Portfolio Enhancement: Professional-grade videos showcasing your work
Experience the video revolution with Dokdo Multimodal, where anyone can create professional-quality content from a single image. Elevate your content with perfectly synchronized video and audio that captivates your audience! 🎨

Start creating stunning videos that stand out from the crowd - whether you're a marketer, educator, content creator, or business owner. Join the future of AI-powered video creation today!

ginipick/Dokdo-multimodal

#VideoInnovation #AITechnology #PremiumContent #MarketingSolution

🔊 Please turn on your sound for the best viewing experience!

1 reply

reacted to hexgrad's post with 🔥 about 1 month ago

Post

4029

Merry Christmas! 🎄 Open sourced a small TTS model at hexgrad/Kokoro-82M

2 replies

reacted to prithivMLmods's post with 🤗 about 1 month ago

Post

6432

Sketchify 😉🎨

+ strangerzonehf/Flux-Sketch-Smudge-LoRA
+ strangerzonehf/Flux-Sketch-Sized-LoRA
+ strangerzonehf/Sketch-Paint

- strangerzonehf/sketch-fav-675ba869c7ceaec7e652ee1c

reacted to InferenceIllusionist's post with 🔥 about 1 month ago

Post

1979

MilkDropLM-32b-v0.3: Unlocking Next-Gen Visuals ✨

Stoked to release the latest iteration of our MilkDropLM project! This new release is based on the powerful Qwen2.5-Coder-32B-Instruct model using the same great dataset that powered our 7b model.

What's new?

- Genome Unlocked: Deeper understanding of preset relationships for more accurate and creative generations.

- Preset Revival: Breathe new life into old presets with our upgraded model!

- Loop-B-Gone: Say goodbye to pesky loops and hello to smooth generation.

- Natural Chats: Engage in more natural sounding conversations with our LLM than ever before.

Released under Apache 2.0, because sharing is caring!

Try it out: InferenceIllusionist/MilkDropLM-32b-v0.3

Shoutout to @superwatermelon for his invaluable insights and collab, and to all those courageous members in the community that have tested and provided feedback before!

updated a collection about 1 month ago

Avatars

Collection

15 items • Updated 9 days ago

liked a Space about 1 month ago

Running

487

📈

Scaling test-time compute

updated 2 collections about 1 month ago

Image Editing

Collection

21 items • Updated 17 days ago

Flux.1

Collection

5 items • Updated Dec 16, 2024

updated a collection about 2 months ago

Image Editing

Collection

21 items • Updated 17 days ago

reacted to cutechicken's post with 🚀 about 2 months ago

Post

3502

🎮 Introduction to the World's First 3D Tank Game Created Solely with Generative AI 🚀
The advancement of AI technology is revolutionizing game development paradigms. I embarked on a challenge to create a 3D tank game using "only AI assistance," pushing the boundaries of what's possible in AI-driven game development. 🤖
Following the success of my first 2D tank game ( cutechicken/tankwar) 🎯, I ventured into the more challenging realm of 3D FPS game development. Remarkably, using Hugging Face's AI tool ( VIDraft/mouse1), the basic game framework was generated in just one minute ⚡. The 3D modeling ( ginipick/SORA-3D) and sound effects ( fantaxy/Sound-AI-SFX) were also easily created with AI assistance.
The resulting game ( cutechicken/TankWar3D) represents arguably the world's first 3D FPS game created primarily with generative AI. 90% was accomplished through AI capabilities, with the remaining 10% comprising my post-processing work. 🎉
Key Technical Features: 🛠️

Complete 3D rendering system using Three.js 🖥️
Real-time physics-based collision detection and handling 💥
Dynamic shadow and lighting system ☀️
Real-time radar and enemy tracking system 🎯
Advanced particle effects system (explosions, smoke, fire) 💫
Dynamic sound system (engine, firing, explosion sounds) 🔊
AI-driven enemy strategy system (pursuit, evasion, combat) 🤖
Terrain-based tank tilt adjustment 🌍
Real-time crosshair targeting system 🎯
Dynamic UI system (health bars, ammo, score) 📊

Technical Implementation: ⚙️

Physics Engine: 🎳
Custom collision detection system
Dynamic obstacle handling
Real-time terrain interaction

AI Systems: 🧠
State-based AI behavior patterns
Dynamic pathfinding
Tactical decision-making system

Graphics: 🎨
PBR-based rendering
Dynamic particle system
Real-time shadow mapping

reacted to prithivMLmods's post with 🔥 about 2 months ago

Post

2695

strangerzonehf/Flux-Sketch-Flat-LoRA