JiaxinYe

AI & ML interests

None yet

Recent Activity

upvoted a paper 26 days ago

Hierarchical Codec Diffusion for Video-to-Speech Generation

submitted a paper 26 days ago

Hierarchical Codec Diffusion for Video-to-Speech Generation

liked a model about 2 months ago

audeering/wav2vec2-large-robust-6-ft-age-gender

Organizations

None yet

upvoted a paper 26 days ago

Hierarchical Codec Diffusion for Video-to-Speech Generation

Paper • 2604.15923 • Published 29 days ago • 2

submitted a paper to Daily Papers 26 days ago

Hierarchical Codec Diffusion for Video-to-Speech Generation

Paper • 2604.15923 • Published 29 days ago • 2

liked a model about 2 months ago

audeering/wav2vec2-large-robust-6-ft-age-gender

Audio Classification • 90.8M • Updated Nov 27, 2023 • 20.8k • 6

liked a dataset 6 months ago

catfang/emosign

Viewer • Updated Jun 26, 2025 • 200 • 29 • 4

upvoted a collection 6 months ago

FaceLLM

Collection

A multimodal large language model trained specifically for facial image understanding. Project page: https://www.idiap.ch/paper/facellm • 3 items • Updated Jul 23, 2025 • 4

liked a model 8 months ago

GCYY/speecht5_finetuned_fleurs_zh

Text-to-Speech • Updated Aug 31, 2023 • 11 • 2

liked a dataset 9 months ago

nvidia/AudioSkills

Preview • Updated Jan 8 • 4.17k • 101

New activity in huggingface/InferenceSupport 11 months ago

baichuan-inc/Baichuan-Omni-1d5-Base

#2944 opened 11 months ago by JiaxinYe

liked a model 12 months ago

maitrix-org/Voila-Tokenizer

Audio-to-Audio • 59.7M • Updated May 6, 2025 • 196 • 7

liked 3 datasets about 1 year ago

liked 2 models over 1 year ago

microsoft/VidTok

Updated Apr 5, 2025 • 43

amphion/MaskGCT

Text-to-Speech • Updated Apr 13, 2025 • 528 • 306

authored a paper over 1 year ago

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control

Paper • 2410.13830 • Published Oct 17, 2024 • 26

liked a Space over 1 year ago

NaturalSpeech3 FACodec

🏃 • 178 • Convert and reconstruct speech files

liked 2 datasets over 1 year ago

Fudan-fMRI/fMRI-Shape

Viewer • Updated Aug 15, 2025 • 1.4k • 1.19k • 10

Fudan-fMRI/fMRI-Objaverse

Viewer • Updated Aug 15, 2025 • 3.14k • 507 • 3

liked 2 models over 2 years ago

oraul/stable-diffusion-v1-4_FFHQ_smaller

Text-to-Image • Updated Aug 26, 2023 • 20 • 2

coqui/XTTS-v1

Text-to-Speech • Updated Nov 10, 2023 • 1.49k • 370
