Enhance model card with sample usage, library name, and citation
Browse filesThis PR significantly improves the model card by:
- **Adding a detailed "Sample Usage" section:** This section guides users through environment setup, downloading model checkpoints directly from the Hugging Face Hub, and running inference with clear command-line instructions, making the model much more accessible and immediately usable.
- **Including `MinkowskiEngine` as `library_name` in the metadata:** This crucial addition improves the model's discoverability and provides important context regarding its underlying library for sparse tensor processing.
- **Adding the BibTeX "Citation" section:** This enables researchers to easily cite the paper when using the model, ensuring proper academic attribution.
The existing ArXiv paper link has been retained as per instructions.
@@ -1,10 +1,11 @@
|
|
1 |
---
|
2 |
license: mit
|
|
|
3 |
tags:
|
4 |
- 3D point clouds
|
5 |
- reconstruction
|
6 |
- ply
|
7 |
-
|
8 |
---
|
9 |
|
10 |
# **ScoreLiDAR: Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion**
|
@@ -15,4 +16,53 @@ pipeline_tag: other
|
|
15 |
Diffusion models have been applied to 3D LiDAR scene completion due to their strong training stability and high completion quality. However, the slow sampling speed limits the practical application of diffusion-based scene completion models since autonomous vehicles require an efficient perception of surrounding environments. This paper proposes a novel distillation method tailored for 3D Li- DAR scene completion models, dubbed ScoreLiDAR, which achieves efficient yet high-quality scene completion. Score- LiDAR enables the distilled model to sample in significantly fewer steps after distillation. To improve completion quality, we also introduce a novel Structural Loss, which encourages the distilled model to capture the geometric structure of the 3D LiDAR scene. The loss contains a scene-wise term constraining the holistic structure and a point-wise term constraining the key landmark points and their relative configuration. Extensive experiments demonstrate that ScoreLiDAR significantly accelerates the completion time from 30.55 to 5.37 seconds per frame (>5x) on SemanticKITTI and achieves superior performance compared to state-of-the-art 3D LiDAR scene completion models. Our model and code are publicly available on https: //github.com/happyw1nd/ScoreLiDAR.
|
16 |
|
17 |
## Repo Info
|
18 |
-
This repository stores checkpoints for the ScoreLiDAR paper. For more information and detailed usage of the models, please refer to the [official GitHub repository](https://github.com/happyw1nd/ScoreLiDAR).
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: mit
|
3 |
+
pipeline_tag: other
|
4 |
tags:
|
5 |
- 3D point clouds
|
6 |
- reconstruction
|
7 |
- ply
|
8 |
+
library_name: MinkowskiEngine
|
9 |
---
|
10 |
|
11 |
# **ScoreLiDAR: Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion**
|
|
|
16 |
Diffusion models have been applied to 3D LiDAR scene completion due to their strong training stability and high completion quality. However, the slow sampling speed limits the practical application of diffusion-based scene completion models since autonomous vehicles require an efficient perception of surrounding environments. This paper proposes a novel distillation method tailored for 3D Li- DAR scene completion models, dubbed ScoreLiDAR, which achieves efficient yet high-quality scene completion. Score- LiDAR enables the distilled model to sample in significantly fewer steps after distillation. To improve completion quality, we also introduce a novel Structural Loss, which encourages the distilled model to capture the geometric structure of the 3D LiDAR scene. The loss contains a scene-wise term constraining the holistic structure and a point-wise term constraining the key landmark points and their relative configuration. Extensive experiments demonstrate that ScoreLiDAR significantly accelerates the completion time from 30.55 to 5.37 seconds per frame (>5x) on SemanticKITTI and achieves superior performance compared to state-of-the-art 3D LiDAR scene completion models. Our model and code are publicly available on https: //github.com/happyw1nd/ScoreLiDAR.
|
17 |
|
18 |
## Repo Info
|
19 |
+
This repository stores checkpoints for the ScoreLiDAR paper. For more information and detailed usage of the models, please refer to the [official GitHub repository](https://github.com/happyw1nd/ScoreLiDAR).
|
20 |
+
|
21 |
+
## Sample Usage
|
22 |
+
|
23 |
+
To get started with ScoreLiDAR, follow these steps to set up the environment, download the necessary weights, and run an inference example.
|
24 |
+
|
25 |
+
1. **Clone the repository:**
|
26 |
+
```bash
|
27 |
+
git clone https://github.com/happyw1nd/ScoreLiDAR
|
28 |
+
cd ScoreLiDAR
|
29 |
+
```
|
30 |
+
|
31 |
+
2. **Install the environment:**
|
32 |
+
```bash
|
33 |
+
sudo apt install build-essential python3-dev libopenblas-dev
|
34 |
+
pip3 install -r requirements.txt
|
35 |
+
pip3 install -U MinkowskiEngine==0.5.4 --install-option="--blas=openblas" -v --no-deps
|
36 |
+
pip3 install -U -e .
|
37 |
+
```
|
38 |
+
|
39 |
+
3. **Download model checkpoints:**
|
40 |
+
Download the required checkpoint files from this Hugging Face repository and place them in the `checkpoints/` directory within your cloned `ScoreLiDAR` repository.
|
41 |
+
```bash
|
42 |
+
huggingface-cli download happywind/ScoreLiDAR ScoreLiDAR_diff_net.ckpt --local-dir checkpoints
|
43 |
+
huggingface-cli download happywind/ScoreLiDAR refine_net.ckpt --local-dir checkpoints
|
44 |
+
```
|
45 |
+
|
46 |
+
4. **Run inference:**
|
47 |
+
The inference script will read `.ply` files from the `scorelidar/Datasets/test/` directory. A sample `.ply` file is typically included in the repository for testing. Results will be saved under `scorelidar/results/`.
|
48 |
+
```bash
|
49 |
+
python3 tools/diff_completion_pipeline.py --denoising_steps 8 --cond_weight 3.5
|
50 |
+
```
|
51 |
+
|
52 |
+
5. **Visualize results (optional):**
|
53 |
+
You can visualize the generated point cloud files using the provided script:
|
54 |
+
```bash
|
55 |
+
python3 vis_pcd.py -p <path_to_.ply_file>
|
56 |
+
```
|
57 |
+
|
58 |
+
## Citation
|
59 |
+
If you find our paper useful or relevant to your research, please kindly cite our papers:
|
60 |
+
|
61 |
+
```bibtex
|
62 |
+
@article{zhang2024distillingdiffusionmodels,
|
63 |
+
title={Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion},
|
64 |
+
author={Shengyuan Zhang and An Zhao and Ling Yang and Zejian Li and Chenye Meng and Haoran Xu and Tianrun Chen and AnYang Wei and Perry Pengyun GU and Lingyun Sun,
|
65 |
+
journal={arXiv:2412.03515},
|
66 |
+
year={2024}
|
67 |
+
}
|
68 |
+
```
|