Robert Dahlke PRO

rbrt

AI & ML interests

None yet

Recent Activity

Organizations

TNG Technology Consulting GmbH

rbrt's activity

New activity in tngtech/DeepSeek-R1T-Chimera 7 days ago

Paid version?

#2 opened 19 days ago by Blazgo

In the paper, we published which experts we switched off (see below). The method for switching them off works at inference time, so there is no need to upload new weights:

[Image: Screenshot From 2025-05-02 15-25-41.png]

updated a Space about 1 month ago
upvoted 2 articles about 1 month ago

Finetuning olmOCR to be a faithful OCR-Engine

By tngtech and 1 other • 18

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

By tngtech • 13
upvoted an article about 2 months ago

Efficient Request Queueing – Optimizing LLM Performance

By tngtech • 12

That's a bit scary, because when we listened to it, we assumed it had read the paper.

So it seems there are more hallucinations than we initially thought. Funnily enough, it guessed quite a few of them correctly.