Vision Transformer Adapters for Generalizable Multitask Learning Paper • 2308.12372 • Published Aug 23, 2023
NEMTO: Neural Environment Matting for Novel View and Relighting Synthesis of Transparent Objects Paper • 2303.11963 • Published Mar 21, 2023 • 2
TempSAL -- Uncovering Temporal Information for Deep Saliency Prediction Paper • 2301.02315 • Published Jan 5, 2023 • 1
VGRP-Bench: Visual Grid Reasoning Puzzle Benchmark for Large Vision-Language Models Paper • 2503.23064 • Published 25 days ago
FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing Paper • 2503.19191 • Published 29 days ago • 1
VolRecon: Volume Rendering of Signed Ray Distance Functions for Generalizable Multi-View Reconstruction Paper • 2212.08067 • Published Dec 15, 2022