Jonathan Li

jlli

woffett

AI & ML interests

None yet

jlli's activity

updated a dataset 20 days ago

jlli/Hungarian_CCPDF_SynQA

Viewer • Updated 20 days ago • 19.3k • 52

updated a dataset 3 months ago

jlli/SynthDog_hu2

Viewer • Updated Oct 4 • 40k • 38

New activity in meta-llama/Llama-3.2-11B-Vision-Instruct 3 months ago

Vocab size vs. LM head size mismatch

#46 opened 3 months ago by

Text model weights are different from 3.1 8B Instruct

#32 opened 3 months ago by

upvoted a collection 3 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 19 days ago • 548

updated 4 datasets 3 months ago

jlli/JDocQA-binary

Viewer • Updated Sep 24 • 1.38k • 49

jlli/JDocQA-nonbinary

Viewer • Updated Sep 24 • 7.54k • 94

jlli/HungarianDocQA-OCR

Viewer • Updated Sep 24 • 54 • 39 • 1

jlli/SynthDog_hu

Viewer • Updated Sep 24 • 20.5k • 41

reacted to zolicsaki's post with 🚀 7 months ago

Post

2795

We posted new SOTA SambaLingo 70B parameter models for Arabic, Thai and Hungarian!

Check out the models here sambanovasystems/sambalingo-65e25770f2037c85ad35ca77

and our paper
https://arxiv.org/abs/2404.05829

reacted to zolicsaki's post with 🚀 7 months ago

Post

890

SambaNova just released a revolutionary paper about how the SN40L AI chip can host many LLMs on a single node and run inference so efficiently that it enables running a "composition of experts." These experts can be interconnected via a router, resulting in remarkable accuracy. This method allows you to take open source expert models from HuggingFace and continuously build and integrate them into a composition of experts.

I am also super excited about the possibilities that SN40Ls unlock for LLM agent workflows and pipelined calls. With the release of GPT4o, it seems that monolithic LLMs are starting to reach a plateau, and I believe that the next wave of AI will be driven by pipelined LLM calls and agent workflows. Most pipelined LLM workflows are bottlenecked by prohibitively expensive compute and high latency, but the SN40L provides a one-stop-shop solution for this. We need to get the word out to the community that this hardware exists, because it will open up a realm of possibilities that developers working with Nvidia hardware did not know existed.

SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts (2405.07518)

upvoted 2 papers 7 months ago

Efficiently Adapting Pretrained Language Models To New Languages

Paper • 2311.05741 • Published Nov 9, 2023 • 11

SambaLingo: Teaching Large Language Models New Languages

Paper • 2404.05829 • Published Apr 8 • 12

liked a Space 10 months ago

Samba CoE V0.1

updated a collection 10 months ago

BLOOMChat

Chat-aligned multilingual 176B models, trained by SambaNova on RDU • 2 items • Updated Sep 30 • 7