SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion • Paper 2503.11576 • Published Mar 14, 2025
SmolVLM2: Bringing Video Understanding to Every Device • Article by orrzohar and 6 others • Published Feb 20, 2025
Open-source DeepResearch – Freeing our search agents • Article by m-ric and 4 others • Published Feb 4, 2025
Timm ❤️ Transformers: Use any timm model with transformers • Article by ariG23498 and 4 others • Published Jan 16, 2025
MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild • Paper 2411.11098 • Published Nov 17, 2024
Qwen2-VL • Collection: vision-language model series based on Qwen2 • 16 items • Updated 15 days ago
MobileNetV4 pretrained weights • Collection: weights for MobileNet-V4 pretrained in timm • 17 items • Updated 4 days ago
Transformer Explainer: Interactive Learning of Text-Generative Models • Paper 2408.04619 • Published Aug 8, 2024
🍃 MINT-1T • Collection: data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24, 2024
Searching for Better ViT Baselines • Collection: exploring ViT hyperparameters and model shapes for the GPU-poor (between tiny and base) • 28 items • Updated 4 days ago
Introducing Idefics2: A Powerful 8B Vision-Language Model for the community • Article by Leyo and 2 others • Published Apr 15, 2024
Uni-SMART: Universal Science Multimodal Analysis and Research Transformer • Paper 2403.10301 • Published Mar 15, 2024