https://github.com/nicolay-r/bulk-chain/releases/tag/0.25.3
The latest release brings major updates:
- Reworked model inference mechanism that works in streaming mode.
  - Callback support for streaming mode (previously only in the demo)
  - Deployment of various clients (shell, tksheet; see attachment)
- Support for batching (previously in API mode only)
- Optional caching of inferred data in SQLite (previously always enabled)
  - This makes it possible to launch small (but mighty) LLMs faster
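
To illustrate what optional caching means in practice, here is a minimal sketch using only Python's standard `sqlite3` module. It is not the bulk-chain API: `fake_llm`, `CachedInfer`, and the `db_path` parameter are hypothetical names made up for this example. Passing no `db_path` skips SQLite entirely, which is the "no cache, faster start" case.

```python
import sqlite3


def fake_llm(prompt: str) -> str:
    # Hypothetical stand-in for a real model call (assumption for this sketch).
    return prompt.upper()


class CachedInfer:
    """Wraps an inference function with an optional SQLite cache.

    Hypothetical helper for illustration only; not part of bulk-chain.
    """

    def __init__(self, infer_fn, db_path=None):
        self.infer_fn = infer_fn
        self.calls = 0      # counts real model calls, to make cache hits visible
        self.conn = None    # stays None when caching is disabled
        if db_path is not None:
            self.conn = sqlite3.connect(db_path)
            self.conn.execute(
                "CREATE TABLE IF NOT EXISTS cache "
                "(prompt TEXT PRIMARY KEY, response TEXT)"
            )

    def __call__(self, prompt: str) -> str:
        # Look up the prompt first, but only if caching is enabled.
        if self.conn is not None:
            row = self.conn.execute(
                "SELECT response FROM cache WHERE prompt = ?", (prompt,)
            ).fetchone()
            if row is not None:
                return row[0]

        self.calls += 1
        response = self.infer_fn(prompt)

        # Store the fresh result for later runs.
        if self.conn is not None:
            self.conn.execute(
                "INSERT OR REPLACE INTO cache VALUES (?, ?)", (prompt, response)
            )
            self.conn.commit()
        return response
```

With `CachedInfer(fake_llm, db_path=":memory:")` a repeated prompt triggers only one model call; with `CachedInfer(fake_llm)` every prompt goes straight to the model and no database is created.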
Project: https://github.com/nicolay-r/bulk-chain
Providers: https://github.com/nicolay-r/nlp-thirdgate