Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -11,25 +11,24 @@ We are a community of researchers from different countries and institutions inte
|
|
11 |
|
12 |
|
13 |
<details>
|
14 |
-
<summary>[Complete] Quantitative Analysis of Hugging Face Hub
|
15 |
|
16 |
- Using activity data from Hugging Face Hub datasets, models and spaces, we ran a three-part quantitative analysis of development activity, finding that:
|
17 |
- activity is extremely imbalanced between repositories; with over 70% of models having 0 downloads, while 1% accounting for 99% of downloads
|
18 |
- the community has a core-periphery structure, with a core of prolific developers while the large majority (89%) of developers are "islands", primarily working alone.
|
19 |
</details>
|
20 |
|
21 |
-
|
22 |
<details>
|
23 |
-
<summary>[Ongoing]
|
24 |
-
|
25 |
-
- We are investigating how open source AI artefacts such as open datasets and open-weight models are built and who contributes to this process across the model building pipeline, including during the pre-release and post-release phases.
|
26 |
-
- Our research aims to map collaboration practices from pre-release to post-release across diverse organisational contexts, including individuals, grassroots initiatives, research institutes, startups, and large technology companies, and geographies.
|
27 |
|
|
|
|
|
28 |
</details>
|
29 |
|
30 |
-
<details>
|
31 |
-
<summary>[Ongoing]
|
|
|
|
|
|
|
32 |
|
33 |
-
- We are looking into what kinds of downstream activity are most prevalent for different kinds of base foundation models.
|
34 |
-
- What kinds of derivations (e.g. fine tuning, quantizing) are most popular?
|
35 |
</details>
|
|
|
11 |
|
12 |
|
13 |
<details>
|
14 |
+
<summary><b>[Complete] Quantitative Analysis of Hugging Face Hub</b></summary>
|
15 |
|
16 |
- Using activity data from Hugging Face Hub datasets, models and spaces, we ran a three-part quantitative analysis of development activity, finding that:
|
17 |
- activity is extremely imbalanced between repositories; with over 70% of models having 0 downloads, while 1% accounting for 99% of downloads
|
18 |
- the community has a core-periphery structure, with a core of prolific developers while the large majority (89%) of developers are "islands", primarily working alone.
|
19 |
</details>
|
20 |
|
|
|
21 |
<details>
|
22 |
+
<summary><b>[Ongoing] Model Tree Lineages from Base Models to Derived Models</b> </summary>
|
|
|
|
|
|
|
23 |
|
24 |
+
- We are looking into what kinds of downstream activity are most prevalent for different kinds of base foundation models.
|
25 |
+
- What kinds of derivations (e.g. fine tuning, quantizing) are most popular?
|
26 |
</details>
|
27 |
|
28 |
+
<details open>
|
29 |
+
<summary><b>[Ongoing] Cartography of Collaboration in the Open Source AI Ecosystem</b> </summary>
|
30 |
+
|
31 |
+
- We are investigating how open source AI artefacts such as open datasets and open-weight models are built and who contributes to this process across the model building pipeline, including during the pre-release and post-release phases.
|
32 |
+
- Our research aims to map collaboration practices from pre-release to post-release across diverse organisational contexts, including individuals, grassroots initiatives, research institutes, startups, and large technology companies, and geographies.
|
33 |
|
|
|
|
|
34 |
</details>
|