Mishig Davaadorj's picture

Mishig Davaadorj

mishig

·

AI & ML interests

NP-completeness, grammars, universality

Recent Activity

upvoted a paper about 5 hours ago

Approximating Language Model Training Data from Weights

updated a Space 2 days ago

huggingface/inference-playground

upvoted a paper 7 days ago

Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning

View all activity

Organizations

mishig's activity

upvoted a paper about 5 hours ago

Approximating Language Model Training Data from Weights

Paper • 2506.15553 • Published 2 days ago • 1

updated a Space 2 days ago

Inference Playground

Toggle dark/light theme on Hugging Face Playground

upvoted a paper 7 days ago

Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning

Paper • 2410.21845 • Published Oct 29, 2024 • 16

upvoted a paper 8 days ago

Chain-of-Thought Reasoning is a Policy Improvement Operator

Paper • 2309.08589 • Published Sep 15, 2023 • 2

upvoted an article 9 days ago

Article

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

By

and 4 others •

9 days ago

• 61

upvoted a changelog 9 days ago

Changelog

Connect Your MCP Client to the Hugging Face Hub

14 days ago

• 89

liked a Space 10 days ago

LeRobot Arena

A web-based robotics control

upvoted an article 10 days ago

Article

The Common Pile v0.1

By

and 2 others •

14 days ago

• 39

commented on Tensors 10 days ago

yes, underrated af !
Great article indeed 👏

upvoted an article 10 days ago

Article

Tensors

By

•

14 days ago

• 6

liked a model 11 days ago

BAAI/RoboBrain2.0-7B

Robotics • Updated 10 days ago • 1.82k • 74

upvoted a paper 13 days ago

Layer by Layer: Uncovering Hidden Representations in Language Models

Paper • 2502.02013 • Published Feb 4 • 2

upvoted 3 papers 14 days ago

Autonomous Improvement of Instruction Following Skills via Foundation Models

Paper • 2407.20635 • Published Jul 30, 2024 • 1

RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning

Paper • 2412.09858 • Published Dec 13, 2024 • 2

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

Paper • 2401.16013 • Published Jan 29, 2024 • 26

upvoted a paper 16 days ago

Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents

Paper • 2505.22954 • Published 23 days ago • 11

upvoted a paper 17 days ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published 18 days ago • 97

New activity in deepseek-ai/DeepSeek-R1-0528 23 days ago

Summer or Winter?

#1 opened 23 days ago by

upvoted an article 25 days ago

Article

Interactive Tools for machine learning, deep learning, and math

By

•

25 days ago

• 44

reacted to Kseniase's post with 🚀 26 days ago

Post

4541

12 Types of JEPA

JEPA, or Joint Embedding Predictive Architecture, is an approach to building AI models introduced by Yann LeCun. It differs from transformers by predicting the representation of a missing or future part of the input, rather than the next token or pixel. This encourages conceptual understanding, not just low-level pattern matching. So JEPA allows teaching AI to reason abstractly.

Here are 12 types of JEPA you should know about:

1. I-JEPA -> Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture (2301.08243)
A non-generative, self-supervised learning framework designed for processing images. It works by masking parts of the images and then trying to predict those masked parts

2. MC-JEPA -> MC-JEPA: A Joint-Embedding Predictive Architecture for Self-Supervised Learning of Motion and Content Features (2307.12698)
Simultaneously interprets video data - dynamic elements (motion) and static details (content) - using a shared encoder

3. V-JEPA -> Revisiting Feature Prediction for Learning Visual Representations from Video (2404.08471)
Presents vision models trained by predicting future video features, without pretrained image encoders, text, negative sampling, or reconstruction

4. UI-JEPA -> UI-JEPA: Towards Active Perception of User Intent through Onscreen User Activity (2409.04081)
Masks unlabeled UI sequences to learn abstract embeddings, then adds a fine-tuned LLM decoder for intent prediction.

5. Audio-based JEPA (A-JEPA) -> A-JEPA: Joint-Embedding Predictive Architecture Can Listen (2311.15830)
Masks spectrogram patches with a curriculum, encodes them, and predicts hidden representations.

6. S-JEPA -> S-JEPA: towards seamless cross-dataset transfer through dynamic spatial attention (2403.11772)
Signal-JEPA is used in EEG analysis. It adds a spatial block-masking scheme and three lightweight downstream classifiers

7. TI-JEPA -> TI-JEPA: An Innovative Energy-based Joint Embedding Strategy for Text-Image Multimodal Systems (2503.06380)
Text-Image JEPA uses self-supervised, energy-based pre-training to map text and images into a shared embedding space, improving cross-modal transfer to downstream tasks

Find more types below 👇

Also, explore the basics of JEPA in our article: https://www.turingpost.com/p/jepa

If you liked it, subscribe to the Turing Post: https://www.turingpost.com/subscribe

1 reply

·