arxiv:2601.09708
Min-Hung Chen
cmhungsteve
AI & ML interests
Multimodal AI, Transfer Learning, Unsupervised Learning, Video Understanding, Vision Transformer, Computer Vision, Deep Learning
Recent Activity
authored
a paper
about 20 hours ago
3AM: Segment Anything with Geometric Consistency in Videos
authored
a paper
about 20 hours ago
Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning