Alex Martin's picture

2 21 3

Alex Martin

alexmartin1722

·

alexmartin1722

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure

upvoted a paper 4 months ago

Certified Mitigation of Worst-Case LLM Copyright Infringement

upvoted a paper 4 months ago

SmolVLM: Redefining small and efficient multimodal models

View all activity

Organizations

upvoted a paper about 1 month ago

The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure

Paper • 2506.22724 • Published Jun 28 • 10

upvoted 4 papers 4 months ago

Certified Mitigation of Worst-Case LLM Copyright Infringement

Paper • 2504.16046 • Published Apr 22 • 13

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 197

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting

Paper • 2504.05541 • Published Apr 7 • 16

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published Apr 10 • 29

upvoted a collection 4 months ago

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated Jul 1 • 74

upvoted a collection 5 months ago

MultiVENT and MAGMAR Resources

Resources associated with the MultiVENT datasets, MAGMAR workshop, and other video retrieval and multimodal retrieval augmented generation • 5 items • Updated Apr 4 • 1

upvoted 2 papers 5 months ago

WikiVideo: Article Generation from Multiple Videos

Paper • 2504.00939 • Published Apr 1 • 38

Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning

Paper • 2503.04973 • Published Mar 6 • 25

upvoted a collection 6 months ago

LLaVA-OneVision

a model good at arbitrary types of visual input • 15 items • Updated Oct 5, 2024 • 25

upvoted 2 papers 6 months ago

Rank1: Test-Time Compute for Reranking in Information Retrieval

Paper • 2502.18418 • Published Feb 25 • 28

Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering

Paper • 2502.13962 • Published Feb 19 • 29

upvoted 2 papers 7 months ago

Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion

Paper • 2501.09019 • Published Jan 15 • 12

Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding

Paper • 2501.07888 • Published Jan 14 • 16

upvoted 2 papers 8 months ago

Compressed Chain of Thought: Efficient Reasoning Through Dense Representations

Paper • 2412.13171 • Published Dec 17, 2024 • 36

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 98

upvoted a paper 10 months ago

Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements

Paper • 2410.08968 • Published Oct 11, 2024 • 14

upvoted a collection 11 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Apr 30 • 308

upvoted a paper 11 months ago

Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models

Paper • 2409.11136 • Published Sep 17, 2024 • 25

upvoted a paper about 1 year ago

Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling

Paper • 2408.03695 • Published Aug 7, 2024 • 13