Update README.md
Browse files
README.md
CHANGED
@@ -10,6 +10,10 @@ datasets:
|
|
10 |
|
11 |
---
|
12 |
|
|
|
|
|
|
|
|
|
13 |
### Overview and motivation
|
14 |
Gene expression count vectors are tricky to navigate with machine learning models: they have a very peaked distribution around small values but it is also long tailed.
|
15 |
Moreover, the count distributions can be cell dependent and higher variable per sample. This makes it challenging to train a model to predict count vectors. When predicting raw counts and moreover predicting the mean (say under L2 loss), without hand crafted loss function, the model will be dominated by the large counts and will not be robust to errors of small counts.
|
|
|
10 |
|
11 |
---
|
12 |
|
13 |
+
## Code:
|
14 |
+
Code is made public at https://github.com/jsjung00/diffusePerturb.
|
15 |
+
A more detailed write up is also linked there.
|
16 |
+
|
17 |
### Overview and motivation
|
18 |
Gene expression count vectors are tricky to navigate with machine learning models: they have a very peaked distribution around small values but it is also long tailed.
|
19 |
Moreover, the count distributions can be cell dependent and higher variable per sample. This makes it challenging to train a model to predict count vectors. When predicting raw counts and moreover predicting the mean (say under L2 loss), without hand crafted loss function, the model will be dominated by the large counts and will not be robust to errors of small counts.
|