Hugging Face
2970404.1
TFLOPS
321
366
610
Yatharth Sharma
YaTharThShaRma999
20 followers · 28 following
AI & ML interests
None yet
Recent Activity
upvoted an article 3 days ago
Introducing Cosmos Predict-2: A Foundation For Your Own World Model
reacted to merve's post 6 days ago
Releases of the past week are here https://huggingface.co/collections/merve/releases-june-13-6852c3c1eaf1e0c24c958860
Here's our picks
So many interesting models released past week in open AI!

🖼️ Computer Vision/VLMs
> https://huggingface.co/nanonets/Nanonets-OCR-s is the new state-of-the-art OCR model that can handle checkboxes, watermarks, tables (OS)
> Meta released https://huggingface.co/collections/facebook/v-jepa-2-6841bad8413014e185b497a6, new sota video embeddings with two new classification models (OS)
> https://huggingface.co/ByteDance-Seed/SeedVR2-3B is a new 3B video restoration model (OS)

Audio
> Stepfun released https://huggingface.co/stepfun-ai/Step-Audio-AQAA, new large (137B 🤯) audio language model that takes in audio and generates audio (OS)

Robotics
> nvidia released https://huggingface.co/nvidia/GR00T-N1.5-3B, new open foundation vision language action model

3D
> https://huggingface.co/tencent/Hunyuan3D-2.1 is the new version of Hunyuan by Tencent that can generate 3D assets from text and image prompts
reacted to merve's post 6 days ago
Organizations
None yet
YaTharThShaRma999's datasets (4)
YaTharThShaRma999/calibration_audio • Viewer • Updated May 11 • 128 • 28
YaTharThShaRma999/Physics_dataset • Viewer • Updated Sep 24, 2023 • 1k • 45 • 3
YaTharThShaRma999/autotrain-data-flant5finetune • Preview • Updated Aug 10, 2023 • 23
YaTharThShaRma999/ImageCaptioningDataset • Updated May 13, 2023 • 16