|
--- |
|
language: |
|
- en |
|
license: llama3.1 |
|
library_name: ollama |
|
tags: |
|
- legal |
|
- singapore |
|
- law |
|
- assistant |
|
- llama |
|
- quantized |
|
pipeline_tag: text-generation |
|
base_model: meta-llama/Meta-Llama-3.1-8B-Instruct |
|
base_model_relation: quantized |
|
model-index: |
|
- name: LexSG |
|
results: [] |
|
--- |
|
|
|
# LexSG - Singapore Legal Assistant Model |
|
|
|
A specialized AI assistant trained on Singapore statutes and subsidiary legislation, built on the Llama 3.1 8B Instruct architecture and optimized for legal text generation. |
|
|
|
## Model Details |
|
|
|
### Model Description |
|
|
|
LexSG is a fine-tuned and quantized language model designed specifically to assist with Singapore legal matters. It provides accurate, contextual responses about Singapore's legal framework and helps users understand complex legal provisions. |
|
|
|
- **Developed by:** Chang Sau Sheong |
|
- **Model type:** Causal Language Model |
|
- **Language(s) (NLP):** English |
|
- **License:** Llama 3.1 License |
|
- **Finetuned from model:** meta-llama/Meta-Llama-3.1-8B-Instruct |
|
|
|
### Model Sources |
|
|
|
- **Repository:** (https://huggingface.co/sausheong/lexsg) |
|
- **Base Model:** [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) |
|
|
|
## Uses |
|
|
|
### Direct Use |
|
|
|
This model is intended for educational and informational purposes to help users understand Singapore legal provisions and statutes. It can be used to: |
|
|
|
- Explain legal sections and provisions from Singapore acts |
|
- Answer questions about Singapore's legal framework |
|
- Provide context for legal documents |
|
- Help interpret legal language and terminology |
|
- Assist with understanding regulatory requirements |
|
|
|
### Downstream Use |
|
|
|
The model can be integrated into legal research tools, educational platforms, or chatbot applications focused on Singapore law. |
|
|
|
### Out-of-Scope Use |
|
|
|
- **Not for legal advice:** This model should not be used as a substitute for professional legal counsel |
|
- **Not for other jurisdictions:** Specifically trained on Singapore law and may not be accurate for other legal systems |
|
- **Not for critical decisions:** Should not be used for making important legal or business decisions without professional verification |
|
|
|
## Bias, Risks, and Limitations |
|
|
|
- **Training data limitations:** Responses are based on training data and may not reflect the most recent legal changes |
|
- **Legislation only:** Training data is Singapore statutes and subsidiary legislation only, without any Singapore legal cases |
|
- **Legal complexity:** Legal interpretations can be highly context-dependent and nuanced |
|
- **Professional consultation required:** Complex legal matters require consultation with qualified legal professionals |
|
- **Potential biases:** May reflect biases present in legal training data |
|
|
|
### Recommendations |
|
|
|
Users should be made aware of the risks, biases and limitations of the model. Always consult with qualified legal professionals for specific legal matters. |
|
|
|
## How to Get Started with the Model |
|
|
|
### llama.cpp/Ollama |
|
|
|
The model file `llama-3.1-8b-lexsg-q4_k_m.gguf` is formatted in GGUF and can be used in any llama.cpp compatible library or application. |
|
Specifically it has been tested in Ollama [Ollama](https://ollama.com/), with the given Modelfile |
|
|
|
### Running the Model |
|
|
|
To use this with Ollama: |
|
|
|
1. Build the model from the Modelfile: |
|
```bash |
|
ollama create lexsg -f Modelfile |
|
``` |
|
|
|
or even simpler just do this: |
|
```bash |
|
./setup_ollama_model.sh |
|
``` |
|
|
|
2. Run the model: |
|
```bash |
|
ollama run lexsg |
|
``` |
|
|
|
3. Start asking questions about Singapore law: |
|
``` |
|
> What does Section 73 of the Companies Act cover? |
|
> Explain the requirements for setting up a private limited company in Singapore |
|
> What are the penalties for non-compliance with PDPA? |
|
``` |
|
|
|
## Training Details |
|
|
|
### Training Data |
|
|
|
The model was fine-tuned on Singapore legal documents and statutes, including but not limited to: |
|
- Singapore Acts and Statutes |
|
- Legal provisions and regulations |
|
- Case law references |
|
- Regulatory guidelines |
|
|
|
### Training Procedure |
|
|
|
#### Training Hyperparameters |
|
|
|
- **Training regime:** Fine-tuned from Llama 3.1 8B Instruct |
|
- **Quantization:** Q4_K_M (4-bit quantized for efficient inference) |
|
|
|
#### Speeds, Sizes, Times |
|
|
|
- **Model size:** ~4.8GB (quantized) |
|
- **Context length:** 4,096 tokens |
|
- **Max generation:** 1,024 tokens |
|
|
|
|
|
## Technical Specifications |
|
|
|
### Model Architecture and Objective |
|
|
|
- **Architecture:** Llama 3.1 transformer architecture |
|
- **Training objective:** Causal language modeling |
|
|
|
### Hardware |
|
|
|
- **Memory requirements:** ~6GB RAM recommended for inference |
|
- **Platform support:** Cross-platform via Ollama |
|
|
|
### Inference parameters |
|
|
|
The following are the inference parameters in the model file. You can change it accordingly. |
|
|
|
- Temperature: 0.3 (conservative, factual responses) |
|
- Top-p: 0.9 (nucleus sampling for quality) |
|
- Top-k: 40 (controlled vocabulary selection) |
|
- Repeat penalty: 1.1 (reduces repetition) |
|
|
|
## Model Card Authors |
|
|
|
Chang Sau Sheong |
|
|
|
## More Information |
|
|
|
For more details about Singapore legislation, refer to [Singapore Statutes Online](https://sso.agc.gov.sg/) |
|
|
|
--- |
|
|
|
**Legal Disclaimer:** This model is designed to provide general information about Singapore law and should not be considered as legal advice. For specific legal matters, always consult with a qualified legal professional licensed to practice in Singapore. |
|
|