bubbliiiing committed on
Commit
1f9a2e5
·
1 Parent(s): 048d3c9

Update Readme

Files changed (2)
  1. README.md +17 -39
  2. README_en.md +19 -13
README.md CHANGED
@@ -1,33 +1,5 @@
1
  ---
2
- frameworks:
3
- - Pytorch
4
- license: other
5
- tasks:
6
- - text-to-video-synthesis
7
-
8
- #model-type:
9
- ##e.g. gpt, phi, llama, chatglm, baichuan
10
- #- gpt
11
-
12
- #domain:
13
- ##e.g. nlp, cv, audio, multi-modal
14
- #- nlp
15
-
16
- #language:
17
- ##List of language codes: https://help.aliyun.com/document_detail/215387.html?spm=a2c4g.11186623.0.0.9f8d7467kni6Aa
18
- #- cn
19
-
20
- #metrics:
21
- ##e.g. CIDEr, BLEU, ROUGE
22
- #- CIDEr
23
-
24
- #tags:
25
- ##Custom tags, including training methods such as pretrained, fine-tuned, instruction-tuned, RL-tuned, and others
26
- #- pretrained
27
-
28
- #tools:
29
- ##e.g. vllm, fastchat, llamacpp, AdaSeq
30
- #- vllm
31
  ---
32
 
33
  [![Arxiv Page](https://img.shields.io/badge/Arxiv-Page-red)](https://arxiv.org/abs/2405.18991)
@@ -44,6 +16,14 @@ EasyAnimate is a transformer-based pipeline that can be used to generate AI images
44
  # Model Addresses
45
  EasyAnimateV5.1:
46
 
47
  12B:
48
  | Name | Type | Storage Space | Hugging Face | Model Scope | Description |
49
  |--|--|--|--|--|--|
@@ -344,23 +324,21 @@ Details for Linux:
344
  We need about 60GB of available disk space, please check!
345
 
346
  EasyAnimateV5.1-12B can generate the following video sizes, depending on GPU memory:
347
- | GPU memory |384x672x72|384x672x49|576x1008x25|576x1008x49|768x1344x25|768x1344x49|
348
  |----------|----------|----------|----------|----------|----------|----------|
349
- | 16GB | 🧡 | 🧡 | | | ❌ | ❌ |
350
- | 24GB | 🧡 | 🧡 | 🧡 | 🧡 | | ❌ |
351
- | 40GB | ✅ | ✅ | ✅ | ✅ | | |
352
  | 80GB | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
353
 
354
  EasyAnimateV5.1-7B can generate the following video sizes, depending on GPU memory:
355
- | GPU memory |384x672x72|384x672x49|576x1008x25|576x1008x49|768x1344x25|768x1344x49|
356
  |----------|----------|----------|----------|----------|----------|----------|
357
- | 16GB | 🧡 | 🧡 | | | ❌ | ❌ |
358
- | 24GB | ✅ | ✅ | 🧡 | 🧡 | | ❌ |
359
- | 40GB | ✅ | ✅ | ✅ | ✅ | | |
360
  | 80GB | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
361
 
362
- The float16 weights of qwen2-vl-7b cannot run with 16GB of GPU memory. If your GPU has 16GB, download the quantized qwen2-vl-7b from [Huggingface](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8) or [Modelscope](https://modelscope.cn/models/Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8) to replace the original text encoder, and install the corresponding dependencies (auto-gptq, optimum).
363
-
364
  ✅ indicates it can run under "model_cpu_offload", 🧡 indicates it can run under "model_cpu_offload_and_qfloat8", ⭕️ indicates it can run under "sequential_cpu_offload", and ❌ means it cannot run. Note that running with sequential_cpu_offload is slower.
365
 
366
  Some GPUs that do not support torch.bfloat16, such as the 2080ti and V100, need weight_dtype in app.py and the predict files changed to torch.float16 in order to run.
 
1
  ---
2
+ license: apache-2.0
3
  ---
4
 
5
  [![Arxiv Page](https://img.shields.io/badge/Arxiv-Page-red)](https://arxiv.org/abs/2405.18991)
 
16
  # Model Addresses
17
  EasyAnimateV5.1:
18
 
19
+ 7B:
20
+ | Name | Type | Storage Space | Hugging Face | Model Scope | Description |
21
+ |--|--|--|--|--|--|
22
+ | EasyAnimateV5.1-7b-zh-InP | EasyAnimateV5.1 | 30 GB | [🤗Link](https://huggingface.co/alibaba-pai/EasyAnimateV5.1-7b-zh-InP) | [😄Link](https://modelscope.cn/models/PAI/EasyAnimateV5.1-7b-zh-InP)| Official image-to-video weights. Supports video prediction at multiple resolutions (512, 768, 1024), trained with 49 frames at 8 frames per second, and supports multilingual prediction. |
23
+ | EasyAnimateV5.1-7b-zh-Control | EasyAnimateV5.1 | 30 GB | [🤗Link](https://huggingface.co/alibaba-pai/EasyAnimateV5.1-7b-zh-Control) | [😄Link](https://modelscope.cn/models/PAI/EasyAnimateV5.1-7b-zh-Control)| Official video control weights, supporting various control conditions such as Canny, Depth, Pose, and MLSD, as well as trajectory control. Supports video prediction at multiple resolutions (512, 768, 1024), trained with 49 frames at 8 frames per second, and supports multilingual prediction. |
24
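+ | EasyAnimateV5.1-7b-zh-Control-Camera | EasyAnimateV5.1 | 30 GB | [🤗Link](https://huggingface.co/alibaba-pai/EasyAnimateV5.1-7b-zh-Control-Camera) | [😄Link](https://modelscope.cn/models/PAI/EasyAnimateV5.1-7b-zh-Control-Camera)| Official video camera control weights, supporting control of the generation direction by inputting camera motion trajectories. Supports video prediction at multiple resolutions (512, 768, 1024), trained with 49 frames at 8 frames per second, and supports multilingual prediction. |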
25
+ | EasyAnimateV5.1-7b-zh | EasyAnimateV5.1 | 30 GB | [🤗Link](https://huggingface.co/alibaba-pai/EasyAnimateV5.1-7b-zh) | [😄Link](https://modelscope.cn/models/PAI/EasyAnimateV5.1-7b-zh)| Official text-to-video weights. Supports video prediction at multiple resolutions (512, 768, 1024), trained with 49 frames at 8 frames per second, and supports multilingual prediction. |
26
+
27
  12B:
28
  | Name | Type | Storage Space | Hugging Face | Model Scope | Description |
29
  |--|--|--|--|--|--|
 
324
  We need about 60GB of available disk space, please check!
325
 
326
  EasyAnimateV5.1-12B can generate the following video sizes, depending on GPU memory:
327
+ | GPU memory |384x672x25|384x672x49|576x1008x25|576x1008x49|768x1344x25|768x1344x49|
328
  |----------|----------|----------|----------|----------|----------|----------|
329
+ | 16GB | 🧡 | ⭕️ | ⭕️ | ⭕️ | ❌ | ❌ |
330
+ | 24GB | 🧡 | 🧡 | 🧡 | 🧡 | 🧡 | ❌ |
331
+ | 40GB | ✅ | ✅ | ✅ | ✅ | | |
332
  | 80GB | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
333
 
334
  EasyAnimateV5.1-7B can generate the following video sizes, depending on GPU memory:
335
+ | GPU memory |384x672x25|384x672x49|576x1008x25|576x1008x49|768x1344x25|768x1344x49|
336
  |----------|----------|----------|----------|----------|----------|----------|
337
+ | 16GB | 🧡 | 🧡 | ⭕️ | ⭕️ | ❌ | ❌ |
338
+ | 24GB | ✅ | ✅ | | 🧡 | 🧡 | ❌ |
339
+ | 40GB | ✅ | ✅ | ✅ | ✅ | | |
340
  | 80GB | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
341
 
342
  ✅ indicates it can run under "model_cpu_offload", 🧡 indicates it can run under "model_cpu_offload_and_qfloat8", ⭕️ indicates it can run under "sequential_cpu_offload", and ❌ means it cannot run. Note that running with sequential_cpu_offload is slower.
343
 
344
  Some GPUs that do not support torch.bfloat16, such as the 2080ti and V100, need weight_dtype in app.py and the predict files changed to torch.float16 in order to run.
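
The note above about GPUs without torch.bfloat16 support can be sketched as a small helper. This is a minimal illustration, not code from the EasyAnimate repository: `NO_BF16_GPUS` and `pick_weight_dtype` are hypothetical names, the GPU list contains only the two examples the README gives, and treating torch.bfloat16 as the default is an assumption inferred from the note.

```python
# GPUs the README names as lacking torch.bfloat16 support.
# Assumption: only the two examples from the README; the list is not exhaustive.
NO_BF16_GPUS = ("2080ti", "v100")


def pick_weight_dtype(gpu_name: str) -> str:
    """Return the dtype name to assign to weight_dtype for a given GPU name."""
    # Normalize so that "RTX 2080 Ti" and "2080ti" compare equal.
    normalized = gpu_name.lower().replace(" ", "")
    if any(tag in normalized for tag in NO_BF16_GPUS):
        return "torch.float16"
    # Assumed default, since the README only asks for a change on the GPUs above.
    return "torch.bfloat16"


print(pick_weight_dtype("NVIDIA GeForce RTX 2080 Ti"))  # torch.float16
print(pick_weight_dtype("Tesla V100-SXM2-16GB"))        # torch.float16
```

Where PyTorch is installed, `torch.cuda.is_bf16_supported()` may be a more direct check than matching on the device name.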
README_en.md CHANGED
@@ -15,6 +15,14 @@ EasyAnimate is a pipeline based on the transformer architecture, designed for ge
15
 
16
  EasyAnimateV5.1:
17
 
18
  12B:
19
  | Name | Type | Storage Space | Hugging Face | Model Scope | Description |
20
  |--|--|--|--|--|--|
@@ -317,22 +325,20 @@ Details for Linux:
317
  We need about 60GB available on disk (for saving weights), please check!
318
 
319
  EasyAnimateV5.1-12B can generate the following video sizes, depending on GPU memory:
320
- | GPU memory | 384x672x72 | 384x672x49 | 576x1008x25 | 576x1008x49 | 768x1344x25 | 768x1344x49 |
321
  |------------|------------|------------|------------|------------|------------|------------|
322
- | 16GB | 🧡 | 🧡 | | | ❌ | ❌ |
323
- | 24GB | 🧡 | 🧡 | 🧡 | 🧡 | | ❌ |
324
- | 40GB | ✅ | ✅ | ✅ | ✅ | | |
325
- | 80GB | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
326
 
327
  EasyAnimateV5.1-7B can generate the following video sizes, depending on GPU memory:
328
- | GPU memory | 384x672x72 | 384x672x49 | 576x1008x25 | 576x1008x49 | 768x1344x25 | 768x1344x49 |
329
- |------------|------------|------------|------------|------------|------------|------------|
330
- | 16GB | 🧡 | 🧡 | | | ❌ | ❌ |
331
- | 24GB | ✅ | ✅ | 🧡 | 🧡 | ❌ | ❌ |
332
- | 40GB | ✅ | ✅ | ✅ | ✅ | | |
333
- | 80GB | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
334
-
335
- Due to the float16 weights of qwen2-vl-7b, it cannot run on a 16GB GPU. If your GPU memory is 16GB, please visit [Huggingface](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8) or [Modelscope](https://modelscope.cn/models/Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8) to download the quantized version of qwen2-vl-7b to replace the original text encoder, and install the corresponding dependency libraries (auto-gptq, optimum).
336
 
337
  ✅ indicates it can run under "model_cpu_offload", 🧡 represents it can run under "model_cpu_offload_and_qfloat8", ⭕️ indicates it can run under "sequential_cpu_offload", ❌ means it can't run. Please note that running with sequential_cpu_offload will be slower.
338
 
 
15
 
16
  EasyAnimateV5.1:
17
 
18
+ 7B:
19
+ | Name | Type | Storage Space | Hugging Face | Model Scope | Description |
20
+ |--|--|--|--|--|--|
21
+ | EasyAnimateV5.1-7b-zh-InP | EasyAnimateV5.1 | 30 GB | [🤗Link](https://huggingface.co/alibaba-pai/EasyAnimateV5.1-7b-zh-InP) | [😄Link](https://modelscope.cn/models/PAI/EasyAnimateV5.1-7b-zh-InP) | Official image-to-video weights. Supports video prediction at multiple resolutions (512, 768, 1024), trained with 49 frames at 8 frames per second, and supports multilingual prediction. |
22
+ | EasyAnimateV5.1-7b-zh-Control | EasyAnimateV5.1 | 30 GB | [🤗Link](https://huggingface.co/alibaba-pai/EasyAnimateV5.1-7b-zh-Control) | [😄Link](https://modelscope.cn/models/PAI/EasyAnimateV5.1-7b-zh-Control) | Official video control weights, supporting various control conditions such as Canny, Depth, Pose, and MLSD, as well as trajectory control. Supports video prediction at multiple resolutions (512, 768, 1024), trained with 49 frames at 8 frames per second, and supports multilingual prediction. |
23
+ | EasyAnimateV5.1-7b-zh-Control-Camera | EasyAnimateV5.1 | 30 GB | [🤗Link](https://huggingface.co/alibaba-pai/EasyAnimateV5.1-7b-zh-Control-Camera) | [😄Link](https://modelscope.cn/models/PAI/EasyAnimateV5.1-7b-zh-Control-Camera) | Official video camera control weights, supporting control of the generation direction by inputting camera motion trajectories. Supports video prediction at multiple resolutions (512, 768, 1024), trained with 49 frames at 8 frames per second, and supports multilingual prediction. |
24
+ | EasyAnimateV5.1-7b-zh | EasyAnimateV5.1 | 30 GB | [🤗Link](https://huggingface.co/alibaba-pai/EasyAnimateV5.1-7b-zh) | [😄Link](https://modelscope.cn/models/PAI/EasyAnimateV5.1-7b-zh) | Official text-to-video weights. Supports video prediction at multiple resolutions (512, 768, 1024), trained with 49 frames at 8 frames per second, and supports multilingual prediction. |
25
+
26
  12B:
27
  | Name | Type | Storage Space | Hugging Face | Model Scope | Description |
28
  |--|--|--|--|--|--|
 
325
  We need about 60GB available on disk (for saving weights), please check!
326
 
327
  EasyAnimateV5.1-12B can generate the following video sizes, depending on GPU memory:
328
+ | GPU memory | 384x672x25 | 384x672x49 | 576x1008x25 | 576x1008x49 | 768x1344x25 | 768x1344x49 |
329
  |------------|------------|------------|------------|------------|------------|------------|
330
+ | 16GB | 🧡 | ⭕️ | ⭕️ | ⭕️ | ❌ | ❌ |
331
+ | 24GB | 🧡 | 🧡 | 🧡 | 🧡 | 🧡 | ❌ |
332
+ | 40GB | ✅ | ✅ | ✅ | ✅ | | |
333
+ | 80GB | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
334
 
335
  EasyAnimateV5.1-7B can generate the following video sizes, depending on GPU memory:
336
+ | GPU memory |384x672x25|384x672x49|576x1008x25|576x1008x49|768x1344x25|768x1344x49|
337
+ |----------|----------|----------|----------|----------|----------|----------|
338
+ | 16GB | 🧡 | 🧡 | ⭕️ | ⭕️ | ❌ | ❌ |
339
+ | 24GB | ✅ | ✅ | ✅ | 🧡 | 🧡 | ❌ |
340
+ | 40GB | ✅ | ✅ | ✅ | ✅ | | |
341
+ | 80GB | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
342
 
343
  ✅ indicates it can run under "model_cpu_offload", 🧡 represents it can run under "model_cpu_offload_and_qfloat8", ⭕️ indicates it can run under "sequential_cpu_offload", ❌ means it can't run. Please note that running with sequential_cpu_offload will be slower.
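
The updated English tables and the legend above can be read as a lookup from (model, GPU memory, video size) to a memory mode. The following is a minimal sketch that transcribes the added (`+`) rows of this diff; `GPU_MEMORY_MODES`, `SIZES`, `lookup_mode`, and the `"cannot_run"` label are hypothetical names and not part of EasyAnimate. Blank table cells are returned as `None` because the README does not state a mode for them.

```python
from typing import Optional

# Memory modes named in the README legend.
OFFLOAD = "model_cpu_offload"              # shown as ✅
QFLOAT8 = "model_cpu_offload_and_qfloat8"  # shown as 🧡
SEQUENTIAL = "sequential_cpu_offload"      # shown as ⭕️
CANNOT_RUN = "cannot_run"                  # shown as ❌ (hypothetical label)
UNSTATED = None                            # blank cell in the table

# Column order shared by both tables, with size strings as written in the README.
SIZES = ["384x672x25", "384x672x49", "576x1008x25",
         "576x1008x49", "768x1344x25", "768x1344x49"]

# Rows keyed by model, then by GPU memory in GB.
GPU_MEMORY_MODES = {
    "12B": {
        16: [QFLOAT8, SEQUENTIAL, SEQUENTIAL, SEQUENTIAL, CANNOT_RUN, CANNOT_RUN],
        24: [QFLOAT8, QFLOAT8, QFLOAT8, QFLOAT8, QFLOAT8, CANNOT_RUN],
        40: [OFFLOAD, OFFLOAD, OFFLOAD, OFFLOAD, UNSTATED, UNSTATED],
        80: [OFFLOAD] * 6,
    },
    "7B": {
        16: [QFLOAT8, QFLOAT8, SEQUENTIAL, SEQUENTIAL, CANNOT_RUN, CANNOT_RUN],
        24: [OFFLOAD, OFFLOAD, OFFLOAD, QFLOAT8, QFLOAT8, CANNOT_RUN],
        40: [OFFLOAD, OFFLOAD, OFFLOAD, OFFLOAD, UNSTATED, UNSTATED],
        80: [OFFLOAD] * 6,
    },
}


def lookup_mode(model: str, gpu_gb: int, size: str) -> Optional[str]:
    """Return the memory mode the table lists, or None for a blank cell."""
    row = GPU_MEMORY_MODES[model][gpu_gb]
    return row[SIZES.index(size)]


print(lookup_mode("12B", 16, "384x672x49"))  # sequential_cpu_offload
print(lookup_mode("7B", 24, "576x1008x49"))  # model_cpu_offload_and_qfloat8
```

Keeping the blank cells as `None` rather than guessing a mode leaves the 40GB entries for 768x1344 explicitly unresolved, as they are in the README.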
344