Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
han weidong's picture
3 5

han weidong

dongdong2021
21world's profile picture LighterDarkness's profile picture SteveSHEN's profile picture
·
https://github.com/weidong2018
  • weidong2018

AI & ML interests

NLP;Multi-modal;LLM

Recent Activity

authored a paper about 1 month ago
Lossless KV Cache Compression to 2%
authored a paper about 1 month ago
Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought
liked a model 5 months ago
tencent/Hunyuan-A13B-Instruct
View all activity

Organizations

Knowledge Works Lab at Fudan University's profile picture Tencent's profile picture

authored 2 papers about 1 month ago

Lossless KV Cache Compression to 2%

Paper • 2410.15252 • Published Oct 20, 2024 • 1

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Paper • 2505.15431 • Published May 21 • 1
authored a paper 7 months ago

TransMamba: Flexibly Switching between Transformer and Mamba

Paper • 2503.24067 • Published Mar 31 • 21
authored a paper 10 months ago

Scaling Laws for Floating Point Quantization Training

Paper • 2501.02423 • Published Jan 5 • 26
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs