TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies Paper • 2412.10345 • Published 12 days ago • 2
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation Paper • 2412.09585 • Published 13 days ago • 10
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion Paper • 2412.04424 • Published 20 days ago • 55
Trajectory Attention for Fine-grained Video Motion Control Paper • 2411.19324 • Published 27 days ago • 12
Are Vision Language Models Texture or Shape Biased and Can We Steer Them? Paper • 2403.09193 • Published Mar 14
On the Interplay of Convolutional Padding and Adversarial Robustness Paper • 2308.06612 • Published Aug 12, 2023
An Extended Study of Human-like Behavior under Adversarial Training Paper • 2303.12669 • Published Mar 22, 2023
The Power of Linear Combinations: Learning with Random Convolutions Paper • 2301.11360 • Published Jan 26, 2023
Does Medical Imaging learn different Convolution Filters? Paper • 2210.13799 • Published Oct 25, 2022
Adversarial Robustness through the Lens of Convolutional Filters Paper • 2204.02481 • Published Apr 5, 2022
CNN Filter DB: An Empirical Investigation of Trained Convolutional Filters Paper • 2203.15331 • Published Mar 29, 2022
PBADet: A One-Stage Anchor-Free Approach for Part-Body Association Paper • 2402.07814 • Published Feb 12 • 1
Learning Continuous Mesh Representation with Spherical Implicit Surface Paper • 2301.04695 • Published Jan 11, 2023
DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering Paper • 2406.02518 • Published Jun 4
6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering Paper • 2410.04974 • Published Oct 7