Isaac74's picture
Update README.md
bec85f8 verified
metadata
pipeline_tag: sentence-similarity
base_model: Qwen/Qwen3-Embedding-0.6B
license: apache-2.0
library_name: peft
tags:
  - peft
  - lora
  - sentence-transformers
  - semantic-search
  - faiss
  - hnsw
  - mathlib4
  - lean4

Lightweight Semantic Mathlib Search — LoRA Adapter (Demo)

What this is: A small, reproducible demo adapter for Qwen/Qwen3-Embedding-0.6B that enables lightweight, LLM-free semantic search over mathlib (Lean 4). Supports both natural language and formula-based search (can be used in combination) for finding Lean theorems or mathematical statements. Does not support LaTeX syntax search yet.

Data is built from ~180k Lean theorems extracted via LeanDojo for Lean 4.20.1 (there may be minor gaps vs. the full mathlib).

Part of the AITP 2025 submission: “Towards Lightweight and LLM-Free Semantic Search for mathlib4.”

Contents

  • adapter/adapter_config.json & adapter/adapter_model.safetensors — LoRA weights for the encoder
  • faiss/<*.idx> — HNSW FAISS index (built for cosine/inner-product search)
  • faiss/*public_index.jsonl — public mapping { idx, full_name, statement }

DEMO

https://github.com/IsaacLi74/Lightweight-and-LLM-Free-Semantic-Search-for-mathlib4