Update README.md
Browse files
README.md
CHANGED
@@ -11,18 +11,13 @@ license: mit
|
|
11 |
|
12 |
# Model Card for LegoGPT
|
13 |
|
14 |
-
|
15 |
-
|
16 |
-
|
17 |
|
18 |
## Model Details
|
19 |
|
20 |
### Model Description
|
21 |
|
22 |
-
<!-- Provide a longer summary of what this model is. -->
|
23 |
-
|
24 |
-
|
25 |
-
|
26 |
- **Developed by:** [Carnegie Mellon University Generative Intelligence Lab](https://www.cs.cmu.edu/~generative-intelligence-lab/)
|
27 |
- **Funded by:** This work is partly supported by the Packard Foundation, Cisco Research Grant, and Amazon Faculty Award. This work is also in part supported by the Manufacturing Futures Institute, Carnegie Mellon University, through a grant from the Richard King Mellon Foundation. KD is supported by the Microsoft Research PhD Fellowship.
|
28 |
- **Model type:** Autoregressive
|
@@ -32,45 +27,17 @@ license: mit
|
|
32 |
|
33 |
### Model Sources
|
34 |
|
35 |
-
<!-- Provide the basic links for the model. -->
|
36 |
-
|
37 |
- **Repository:** [AvaLovelace1/LegoGPT](https://github.com/AvaLovelace1/LegoGPT)
|
38 |
- **Paper:** [Generating Physically Stable and Buildable LEGO® Designs from Text](https://huggingface.co/papers/2505.05469)
|
39 |
- **Demo:** [cmu-gil/LegoGPT-Demo](https://huggingface.co/spaces/cmu-gil/LegoGPT-Demo)
|
40 |
|
41 |
-
##
|
42 |
-
|
43 |
-
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
44 |
-
|
45 |
-
### Direct Use
|
46 |
-
|
47 |
-
<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
|
48 |
-
|
49 |
-
[More Information Needed]
|
50 |
|
51 |
-
|
52 |
-
|
53 |
-
<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
|
54 |
-
|
55 |
-
[More Information Needed]
|
56 |
-
|
57 |
-
## Bias, Risks, and Limitations
|
58 |
-
|
59 |
-
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
|
60 |
-
|
61 |
-
[More Information Needed]
|
62 |
-
|
63 |
-
### Recommendations
|
64 |
-
|
65 |
-
<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
|
66 |
-
|
67 |
-
Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
|
68 |
|
69 |
## How to Get Started with the Model
|
70 |
|
71 |
-
|
72 |
-
|
73 |
-
[More Information Needed]
|
74 |
|
75 |
## Training Details
|
76 |
|
|
|
11 |
|
12 |
# Model Card for LegoGPT
|
13 |
|
14 |
+
These are the model weights for LegoGPT, the first approach for generating physically stable LEGO brick models from text prompts, as described in [Generating Physically Stable and Buildable LEGO® Designs from Text](https://huggingface.co/papers/2505.05469).
|
15 |
+
This model was fine-tuned from [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct).
|
|
|
16 |
|
17 |
## Model Details
|
18 |
|
19 |
### Model Description
|
20 |
|
|
|
|
|
|
|
|
|
21 |
- **Developed by:** [Carnegie Mellon University Generative Intelligence Lab](https://www.cs.cmu.edu/~generative-intelligence-lab/)
|
22 |
- **Funded by:** This work is partly supported by the Packard Foundation, Cisco Research Grant, and Amazon Faculty Award. This work is also in part supported by the Manufacturing Futures Institute, Carnegie Mellon University, through a grant from the Richard King Mellon Foundation. KD is supported by the Microsoft Research PhD Fellowship.
|
23 |
- **Model type:** Autoregressive
|
|
|
27 |
|
28 |
### Model Sources
|
29 |
|
|
|
|
|
30 |
- **Repository:** [AvaLovelace1/LegoGPT](https://github.com/AvaLovelace1/LegoGPT)
|
31 |
- **Paper:** [Generating Physically Stable and Buildable LEGO® Designs from Text](https://huggingface.co/papers/2505.05469)
|
32 |
- **Demo:** [cmu-gil/LegoGPT-Demo](https://huggingface.co/spaces/cmu-gil/LegoGPT-Demo)
|
33 |
|
34 |
+
## Limitations
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
35 |
|
36 |
+
The model is restricted to creating structures on a 20x20x20 grid. It was trained on a dataset of 21 object categories: *basket, bed, bench, birdhouse, bookshelf, bottle, bowl, bus, camera, car, chair, guitar, jar, mug, piano, pot, sofa, table, tower, train, vessel*. Performance on prompts from outside these categories may be limited.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
37 |
|
38 |
## How to Get Started with the Model
|
39 |
|
40 |
+
See the [GitHub repo](https://github.com/AvaLovelace1/LegoGPT) for usage examples and an interactive CLI demo.
|
|
|
|
|
41 |
|
42 |
## Training Details
|
43 |
|