Simon Pagezy
pagezyhf
20 followers · 24 following
AI & ML interests
Healthcare ML
Recent Activity
new activity, about 18 hours ago
Qwen/Qwen2-VL-7B-Instruct: Anyone able to deploy an inference endpoint on sagemaker?
reacted to merve's post
about 20 hours ago
Oof, what a week! So many things have happened, let's recap! https://huggingface.co/collections/merve/jan-24-releases-6793d610774073328eac67a9

Multimodal
- We have released SmolVLM, the tiniest VLMs, which come in 256M and 500M, with its retrieval models ColSmol for multimodal RAG
- UI-TARS are new models by ByteDance to unlock agentic GUI control in 2B, 7B and 72B
- Alibaba DAMO lab released VideoLlama3, new video LMs that come in 2B and 7B
- MiniMaxAI released Minimax-VL-01, where the decoder is based on the MiniMax-Text-01 456B MoE model with long context
- Dataset: Yale released a new benchmark called MMVU
- Dataset: CAIS released Humanity's Last Exam (HLE), a new challenging MM benchmark

LLMs
- DeepSeek-R1 & DeepSeek-R1-Zero: gigantic 660B reasoning models by DeepSeek, and six distilled dense models, on par with o1 with MIT license!
- Qwen2.5-Math-PRM: new math models by Qwen in 7B and 72B
- NVIDIA released AceMath and AceInstruct, a new family of models and their datasets (SFT and reward ones too!)

Audio
- Llasa is a new speech synthesis model based on Llama that comes in 1B, 3B, and 8B
- TangoFlux is a new audio generation model trained from scratch and aligned with CRPO

Image/Video/3D Generation
- Flex.1-alpha is a new 8B pre-trained diffusion model by ostris similar to Flux
- Tencent released Hunyuan3D-2, new 3D asset generation from images
upvoted an article
5 days ago
Mastering Long Contexts in LLMs with KVPress
Articles
Hugging Face models in Amazon Bedrock
Dec 9, 2024 • 11
Introducing HUGS - Scale your AI with Open Models
Oct 23, 2024 • 36
Deploy Meta Llama 3.1 405B on Google Cloud Vertex AI
Aug 19, 2024 • 18
Google Cloud TPUs made available to Hugging Face users
Jul 9, 2024 • 19
Introducing Spaces Dev Mode for a seamless developer experience
May 21, 2024 • 14
pagezyhf's activity
New activity in Qwen/Qwen2-VL-7B-Instruct, about 18 hours ago
Anyone able to deploy an inference endpoint on sagemaker?
6
#71 opened 17 days ago by TeoGX
New activity in Datou1111/shou_xin, about 1 month ago
Add generated example
#9 opened about 1 month ago by pagezyhf
New activity in huggingface/HuggingDiscussions, 3 months ago
[FEEDBACK] Follow
4
#14 opened over 1 year ago by victor
New activity in aws-neuron/optimum-neuron-cache, 9 months ago
[Cache Request] meta-llama/Meta-Llama-3-8B
1
#71 opened 9 months ago by sandkoan