view article Article Finetuning olmOCR to be a faithful OCR-Engine By tngtech and 1 other • Apr 22 • 18
view article Article Finetuning olmOCR to be a faithful OCR-Engine By tngtech and 1 other • Apr 22 • 18
view article Article Prefill and Decode for Concurrent Requests - Optimizing LLM Performance By tngtech • Apr 16 • 17
view article Article Efficient Request Queueing – Optimizing LLM Performance By tngtech • Apr 2 • 12
view article Article Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time By rbrt and 4 others • Feb 18 • 33
view article Article Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time By rbrt and 4 others • Feb 18 • 33