Nicolay Rusnachenko

nicolay-r

AI & ML interests

Information Retrieval・Medical Multimodal NLP (🖼+📝) Research Fellow @BU_Research・software developer http://arekit.io・PhD in NLP

Recent Activity

liked a model 1 day ago

Xiaolihai/flan-t5-large_MeDistill

liked a model 2 days ago

RefalMachine/RuadaptQwen3-32B-Instruct

updated a model 7 days ago

nicolay-r/qwen25-05b-multiclinsum-distil

View all activity

Organizations

None yet

Posts 70

Post

309

📢 Following the bulk-chain v1.1.0 release and its minimalistic integration and real-world deployment on async inference with your LLM provider. Below is an example of tk-sheet for spreadsheet-style LLM reasoning

🔑 We use the concept of asyncio loops. You can set event loop of the utilized framework. At present it converts async generators to sync.
https://stackoverflow.com/questions/71580727/translating-async-generator-into-sync-one/79617639#79617639

🔗 The integration demo:
https://github.com/nicolay-r/bulk-chain-tksheet-client

🌌 The provider:
https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/replicate_104.py

Post

2400

🚀 For those who interested in minimalistic integration of LLMs inferece with predefined reasoning shema, excited to share the latest bulk chain 1.1.0. It represents a no-string solution for deploying your LLM for efficient inference over data iterators.
✨ Key Features:
- Full async inference support + Including streaming mode for real-time output
- simplified inference API
🔗 Check out the repo: https://github.com/nicolay-r/bulk-chain

💡 Special thanks to @RicardoLee for his work on effective async LLaMA-3 deployment that helped shape this release:
https://github.com/RicardoLeeV587/Llama3-FastInference

View all Posts