Shyam Sudhakaran

shyamsn97

AI & ML interests

Reinforcement Learning, Open-Ended Algorithms, Neural Cellular Automata

Recent Activity

liked a model 1 day ago

peiyi9979/math-shepherd-mistral-7b-prm

shyamsn97's activity

upvoted a collection 6 days ago

3D Modelization

41 items • Updated 7 days ago • 4

upvoted a paper 3 months ago

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12 • 43

upvoted a collection 4 months ago

WebInstruct 🌐 Embeddings 🧱 Models

A collection of SoTA embedding models fine-tuned on the WebInstruct dataset to learn to pair instructions with their responses • 3 items • Updated Sep 4 • 11

upvoted an article 4 months ago

Selective fine-tuning of Language Models with Spectrum

Article • Sep 3 • 30

upvoted a collection 4 months ago

💻 Local SmolLMs

SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated 3 days ago • 46

upvoted 2 collections 7 months ago

Mixture-of-preference-reward-modeling

The mixture of preference datasets used for reward modeling. • 2 items • Updated Apr 29 • 2

Standard-format-preference-dataset

We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8 • 23

upvoted a paper 8 months ago

Data-Efficient Multimodal Fusion on a Single GPU

Paper • 2312.10144 • Published Dec 15, 2023 • 6

upvoted 2 collections 9 months ago

Fine-Tuned

41 items • Updated Nov 23 • 7

Merges

Experimental LLM merging • 1292 items • Updated Nov 23 • 7

upvoted a paper 12 months ago

Transformers are Multi-State RNNs

Paper • 2401.06104 • Published Jan 11 • 36

upvoted a collection 12 months ago

Preference Datasets for DPO

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated 15 days ago • 34

upvoted a paper about 1 year ago

An Emulator for Fine-Tuning Large Language Models using Small Language Models

Paper • 2310.12962 • Published Oct 19, 2023 • 14

upvoted a collection about 1 year ago

🚂 SD-XL Training Suite

All the steps to train your own SD-XL custom model • 7 items • Updated Oct 3 • 21

upvoted a paper over 1 year ago

HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models

Paper • 2307.06949 • Published Jul 13, 2023 • 50