1 8 176

mlor

machine-learnoooooooor

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago

allenai/tulu_v3.9_wildchat_100k_english-r1-format-filtered-filtered

liked a model 2 days ago

Intelligent-Internet/II-Medical-32B-Preview

liked a model 5 days ago

baidu/ERNIE-4.5-VL-424B-A47B-Base-PT

View all activity

Organizations

None yet

liked a dataset 1 day ago

allenai/tulu_v3.9_wildchat_100k_english-r1-format-filtered-filtered

Viewer • Updated 2 days ago • 22.5k • 1

liked a model 2 days ago

Intelligent-Internet/II-Medical-32B-Preview

33B • Updated 2 days ago • 23 • 11

liked 5 models 5 days ago

liked a dataset 5 days ago

miriad/miriad-4.4M

Viewer • Updated 25 days ago • 4.49M • 828 • 15

reacted to Kseniase's post with 🤗 7 days ago

Post

3379

10 Open-source Deep Research assistants

Deep Research agents are quickly becoming our daily co-workers — built for complex investigations, not just chat. With modular architecture, advanced tool use and real web access, they go far beyond typical AI. While big-name agents get the spotlight, we want to highlight some powerful recent open-source alternatives:

1. DeerFlow -> https://github.com/bytedance/deer-flow
A modular multi-agent system combining LMs and tools for automated research and code analysis. It links a coordinator, planner, team of specialized agent, and reporter, and converts reports to speech via Text-to-Speech (TTS)

2. Alita -> https://github.com/CharlesQ9/Alita
Uses a single problem-solving module for scalable reasoning through simplicity. It self-evolves by generating and reusing Model Context Protocols (MCPs) from open-source tools to build external capabilities for diverse tasks

3. WebThinker -> https://github.com/RUC-NLPIR/WebThinker
Lets reasoning models autonomously search the web and navigate pages. Deep Web Explorer allows interaction with links and follow-up searches. Through a Think-Search-and-Draft process models generate and refine reports in real time. RL training with preference pairs improves the workflow

4. SimpleDeepSearcher -> https://github.com/RUCAIBox/SimpleDeepSearcher
A lightweight framework showing that supervised fine-tuning is a real alternative to complex RL, using simulated web interactions and multi-criteria curation to generate high-quality training data

5. AgenticSeek -> https://github.com/Fosowl/agenticSeek
A private, on-device assistant that picks the best agent expert for browsing, coding, or planning—no cloud needed. Includes voice input via speech-to-text

6. Suna -> https://github.com/kortix-ai/suna
Offers web browsing, file and doc handling, CLI execution, site deployment, and API/service integration—all in one assistant

Subscribe to the Turing Post:https://www.turingpost.com/subscribe
Read further ⬇️

2 replies

liked a model 7 days ago

Qwen/Qwen3-Embedding-0.6B

Feature Extraction • 0.6B • Updated 16 days ago • 2.48M • • 322

liked a Space 7 days ago

5.97k

MTEB Leaderboard

🥇

Embedding Leaderboard

reacted to eaddario's post with 🚀 7 days ago

Post

3707

Layer-wise and Pruned versions of Qwen/Qwen3-30B-A3B

* Tesor-wise: eaddario/Qwen3-30B-A3B-GGUF
* Pruned: eaddario/Qwen3-30B-A3B-pruned-GGUF

Even though the Perplexity scores of the pruned version are 3 times higher, the ARC, HellaSwag, MMLU, Truthful QA and WinoGrande scores are holding remarkably well, considering two layers were removed (5 and 39). This seems to support Xin Men et al conclusions in
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect (2403.03853)

Results summary in the model's card and test results in the ./scores directory. Questions/feedback is always welcomed.