Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
24.2
TFLOPS
4
8
205
James Neville
Khawn2u
Follow
shtefcs's profile picture
ingenioso's profile picture
Mi6paulino's profile picture
4 followers
·
11 following
khawn2u
Khawn2u
khawn2u.bsky.social
AI & ML interests
None yet
Recent Activity
reacted
to
codelion
's
post
with 👍
8 days ago
I recently worked on a LoRA that improves tool use in LLM. Thought the approach might interest folks here. The issue I have had when trying to use some of the local LLMs with coding agents is this: Me: "Find all API endpoints with authentication in this codebase" LLM: "You should look for @app.route decorators and check if they have auth middleware..." But I often want it to search the files and show me but the LLM doesn't trigger a tool use call. To fine-tune it for tool use I combined two data sources: 1. Magpie scenarios - 5000+ diverse tasks (bug hunting, refactoring, security audits) 2. Real execution - Ran these on actual repos (FastAPI, Django, React) to get authentic tool responses This ensures the model learns both breadth (many scenarios) and depth (real tool behavior). Tools We Taught: - `read_file` - Actually read file contents - `search_files` - Regex/pattern search across codebases - `find_definition` - Locate classes/functions - `analyze_imports` - Dependency tracking - `list_directory` - Explore structure - `run_tests` - Execute test suites Improvements: - Tool calling accuracy: 12% → 80% - Correct parameters: 8% → 87% - Multi-step tasks: 3% → 78% - End-to-end completion: 5% → 80% - Tools per task: 0.2 → 3.8 The LoRA really improves on intential tool call as an example consider the query: "Find ValueError in payment module" The response proceeds as follows: 1. Calls `search_files` with pattern "ValueError" 2. Gets 4 matches across 3 files 3. Calls `read_file` on each match 4. Analyzes context 5. Reports: "Found 3 ValueError instances: payment/processor.py:47 for invalid amount, payment/validator.py:23 for unsupported currency..." Resources: - Colab notebook https://colab.research.google.com/github/codelion/ellora/blob/main/Ellora_Recipe_3_Enhanced_Tool_Calling_and_Code_Understanding.ipynb - Model - https://huggingface.co/codelion/Llama-3.2-1B-Instruct-tool-calling-lora - GitHub - https://github.com/codelion/ellora
liked
a dataset
11 days ago
shahxeebhassan/human_vs_ai_sentences
liked
a model
11 days ago
microsoft/VibeVoice-1.5B
View all activity
Organizations
None yet
models
3
Sort: Recently updated
Khawn2u/llama3.2-1b-mla-Q4_K_M-GGUF
1B
•
Updated
Mar 10
•
11
Khawn2u/Llama-3.1-8b-Chain-Of-Thought-GGUF
8B
•
Updated
Oct 15, 2024
•
22
•
2
Khawn2u/lawma-8b-Q4_K_M-GGUF
8B
•
Updated
Sep 10, 2024
•
12
•
1
datasets
2
Sort: Recently updated
Khawn2u/Text-Beautified-1.0
Viewer
•
Updated
Mar 10
•
3k
•
2
Khawn2u/Fixberry
Viewer
•
Updated
Oct 13, 2024
•
10.8M
•
1