Added vLLM offline serve working code.
#107 · opened by hrithiksagar-tih
In this commit, I have attached working inference code for the gpt-oss-20b model via vLLM. The original code in the cookbook (https://cookbook.openai.com/articles/gpt-oss/run-vllm) was not working for me; with a few modifications, it worked.
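The exact modifications aren't shown in this thread, but for reference, a minimal sketch of offline gpt-oss-20b inference along the lines of the cookbook example, assuming the standard vLLM `LLM`/`SamplingParams` offline API (the model name and sampling values here are illustrative, not the PR's actual code):

```python
# Minimal sketch of offline gpt-oss-20b inference with vLLM.
# Assumption: standard vLLM offline API; not the PR's exact fix.
from vllm import LLM, SamplingParams


def main():
    # Load the model; the thread above mentions running on H100s.
    llm = LLM(model="openai/gpt-oss-20b")

    # Illustrative sampling settings.
    sampling = SamplingParams(temperature=0.7, top_p=0.9, max_tokens=256)

    # Chat-style prompt; vLLM applies the model's chat template internally.
    messages = [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Explain what vLLM is in one sentence."},
    ]

    outputs = llm.chat(messages, sampling)
    print(outputs[0].outputs[0].text)


if __name__ == "__main__":
    main()
```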
What GPU are you using? Ampere, Ada Lovelace?
I used H100s.
Thank you @hrithiksagar-tih! Could you instead open a PR against github.com/openai/gpt-oss? I'll copy it back into both model cards. Thanks!
dkundel-openai changed pull request status to closed
Yes, I will do it.
Thanks!