shirley430316 committed · Commit 1ac5c57 · verified · 1 Parent(s): 1bccc6e

Upload folder using huggingface_hub

Files changed (5)
  1. .gitattributes +1 -0
  2. assets/teaser2.gif +3 -0
  3. base.safetensors +3 -0
  4. lora.safetensors +3 -0
  5. readme.md +65 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ assets/teaser2.gif filter=lfs diff=lfs merge=lfs -text
assets/teaser2.gif ADDED

Git LFS Details

  • SHA256: 17c1b9de64b45218959dc3dc7864822e991d4ab18fe045208033f48ff41f1a13
  • Pointer size: 133 Bytes
  • Size of remote file: 22.3 MB
base.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:71132d9d1bcbc10f0a04030c1cd3b8b0325bb285a589f1631b1527a25221dc1a
+ size 5944885692
lora.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b86da74fbe9a852404824f5e162b088d3f084860c0e4834c518c90d1ddc751cb
+ size 699995008
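Each pointer file above records a SHA-256 `oid` and a byte `size` for the real weight file, so a finished download can be checked against those two values. The following is a minimal sketch, assuming GNU `sha256sum` is available; the function name `verify_lfs_object` is illustrative and not part of the repository.

```shell
# Sketch: compare a downloaded file with the oid and size from its Git LFS pointer.
# verify_lfs_object is a hypothetical helper name, not a repository script.
# Arguments: <file> <expected sha256 hex digest> <expected size in bytes>
verify_lfs_object() {
  local file="$1"
  local want_oid="$2"
  local want_size="$3"
  local got_size
  local got_oid

  if [ ! -f "$file" ]; then
    echo "missing: $file"
    return 1
  fi

  # The size check is cheap, so run it before hashing a multi-gigabyte file.
  got_size=$(wc -c < "$file" | tr -d '[:space:]')
  if [ "$got_size" != "$want_size" ]; then
    echo "size mismatch: $file ($got_size != $want_size)"
    return 1
  fi

  got_oid=$(sha256sum "$file" | cut -d' ' -f1)
  if [ "$got_oid" != "$want_oid" ]; then
    echo "sha256 mismatch: $file"
    return 1
  fi

  echo "ok: $file"
}

# Example call with the lora.safetensors pointer values. The path is an
# assumption about where the file was downloaded:
# verify_lfs_object ./model_zoo/LivePhoto-Wan2.1/lora.safetensors \
#   b86da74fbe9a852404824f5e162b088d3f084860c0e4834c518c90d1ddc751cb 699995008
```

On systems without `sha256sum`, such as a default macOS install, `shasum -a 256` could be substituted in the hashing line.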
readme.md ADDED
@@ -0,0 +1,65 @@
+ ---
+ license: apache-2.0
+ base_model:
+ - Wan-AI/Wan2.1-T2V-1.3B
+ tags:
+ - image-to-video
+ ---
+ # LivePhoto-Wan2.1
+
+ [Code Repository](https://github.com/XavierCHEN34/LivePhoto) | [Project Page](https://xavierchen34.github.io/LivePhoto-Page/) | [Wan2.1 Model](https://huggingface.co/Wan-AI/Wan2.1-T2V-1.3B) | [Paper](https://arxiv.org/abs/2312.02928)
+
+ **LivePhoto-Wan2.1** supports text-guided image-to-video generation with control over the motion intensity level. Built on the **Wan2.1-T2V-1.3B** architecture, it is adapted to image-to-video tasks using the **Pusa** fine-tuning strategy. A **motion intensity module** is added to adjust the strength of movement in the generated videos.
+
+ <table align="center">
+ <tr>
+ <td>
+ <img src="./assets/teaser2.gif">
+ </td>
+ </tr>
+ <tr>
+ </table>
+
+ ### Installation
+ ```
+ conda create -n livephoto python=3.10 -y
+ conda activate livephoto
+ cd ./LivePhoto-Wan2.1/PusaV1
+ pip install -e .
+ pip install "xfuser>=0.4.3" absl-py peft lightning pandas deepspeed wandb av
+ ```
+
+ ### Model Preparation
+ ```
+ pip install -U "huggingface_hub[cli]"
+ huggingface-cli download Wan-AI/Wan2.1-T2V-1.3B Wan2.1_VAE.pth --local-dir ./model_zoo/Wan2.1/base
+ huggingface-cli download Wan-AI/Wan2.1-T2V-1.3B models_t5_umt5-xxl-enc-bf16.pth --local-dir ./model_zoo/Wan2.1/base
+ huggingface-cli download shirley430316/LivePhoto-Wan2.1 --local-dir ./model_zoo/LivePhoto-Wan2.1/
+ ```
+
+ After proper preparation, the directory looks like:
+ ```
+ ./model_zoo
+ - LivePhoto-Wan2.1
+ - Wan2.1-T2V-1.3B
+ - base.safetensors
+ - lora.safetensors
+ ```
+ ### Usage Example
+ #### I2V with Motion Intensity Levels
+ ```
+ # Run from ~/LivePhoto-Wan2.1/PusaV1. Valid --motion_intensity levels are 1 through 6.
+ CUDA_VISIBLE_DEVICES=0 python examples/pusavideo/wan_14b_multi_frames_pusa.py \
+ --image_paths "./demos/input_image.jpg" \
+ --prompt "A cute orange kitten with big round eyes stands upright on its hind legs on a smooth wooden floor. The kitten begins to move its tiny front paws up and down rhythmically, swaying its body left and right as if dancing. Its fluffy tail flicks slightly behind it, and the soft lighting creates a warm, cozy indoor atmosphere. The kitten’s ears twitch gently as it keeps its balance, adding to the charm of its playful little dance. The background stays softly blurred, keeping focus on the kitten’s adorable movements." \
+ --cond_position "0" \
+ --noise_multipliers "0" \
+ --dit_path "./model_zoo/Wan2.1/base.safetensors" \
+ --lora_path "./model_zoo/Wan2.1/lora.safetensors" \
+ --lora_alpha 1.2 \
+ --num_inference_steps 30 \
+ --motion_intensity 5
+ ```
+
+ ### Acknowledgement
+ This version is developed on the codebase of [Pusa-VidGen](https://github.com/Yaofang-Liu/Pusa-VidGen). Many thanks to its authors for the insightful project.
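
Because the README's usage example takes a motion intensity level from 1 through 6, a quick way to compare the levels is to run the same image and prompt once per level. The following is a minimal sketch that reuses only the flags from that example. The function name `run_motion_sweep` and the `RUNNER` override are illustrative assumptions. Where the script writes its output, and whether consecutive runs overwrite each other, is not stated in the README and should be checked first.

```shell
# Sketch: run the README's I2V command once per motion intensity level (1 through 6).
# run_motion_sweep and RUNNER are hypothetical names. Setting RUNNER=echo prints
# each command's arguments instead of launching Python, which gives a dry run.
# Arguments: <image path> <prompt>
run_motion_sweep() {
  local image="$1"
  local prompt="$2"
  local level

  for level in 1 2 3 4 5 6; do
    CUDA_VISIBLE_DEVICES=0 "${RUNNER:-python}" examples/pusavideo/wan_14b_multi_frames_pusa.py \
      --image_paths "$image" \
      --prompt "$prompt" \
      --cond_position "0" \
      --noise_multipliers "0" \
      --dit_path "./model_zoo/Wan2.1/base.safetensors" \
      --lora_path "./model_zoo/Wan2.1/lora.safetensors" \
      --lora_alpha 1.2 \
      --num_inference_steps 30 \
      --motion_intensity "$level" || return 1
  done
}

# Example dry run (prints six commands, launches nothing):
# RUNNER=echo run_motion_sweep ./demos/input_image.jpg "A kitten dancing on a wooden floor."
```

The loop stops at the first failing run, so a wrong model path surfaces immediately rather than after six attempts.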