Victor Jotham Ashioya

ashioyajotham

https://ashioyajotham.github.io/

AI & ML interests

Hallucination in LLMs, AI Safety: alignment, red-teaming

Recent Activity

upvoted a paper 11 days ago

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

liked a Space 3 months ago

HuggingFaceFW/blogpost-fineweb-v1

updated a Space 3 months ago

ashioyajotham/falcon_7b_coder

Organizations

None yet

ashioyajotham's activity

upvoted a paper 11 days ago

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Paper • 2309.12307 • Published Sep 21, 2023 • 89

upvoted 2 papers 3 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28, 2025 • 120

Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published Jan 27, 2025 • 30

upvoted a paper 8 months ago

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22, 2024 • 92

upvoted 2 papers 11 months ago

Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach

Paper • 2405.15613 • Published May 24, 2024 • 18

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 90

upvoted a paper 12 months ago

Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory

Paper • 2405.08707 • Published May 14, 2024 • 33

upvoted 13 papers about 1 year ago

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25, 2024 • 69

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14, 2024 • 128

Algorithmic progress in language models

Paper • 2403.05812 • Published Mar 9, 2024 • 21

Stealing Part of a Production Language Model

Paper • 2403.06634 • Published Mar 11, 2024 • 92

Common 7B Language Models Already Possess Strong Math Capabilities

Paper • 2403.04706 • Published Mar 7, 2024 • 21

How Far Are We from Intelligent Visual Deductive Reasoning?

Paper • 2403.04732 • Published Mar 7, 2024 • 24

SaulLM-7B: A pioneering Large Language Model for Law

Paper • 2403.03883 • Published Mar 6, 2024 • 83

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

Paper • 2402.17177 • Published Feb 27, 2024 • 89

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16, 2024 • 80

Reformatted Alignment

Paper • 2402.12219 • Published Feb 19, 2024 • 18

RLVF: Learning from Verbal Feedback without Overgeneralization

Paper • 2402.10893 • Published Feb 16, 2024 • 12

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 109

Scaling Laws for Fine-Grained Mixture of Experts

Paper • 2402.07871 • Published Feb 12, 2024 • 14