wyx

DecoderImmortal

AI & ML interests

None yet

Recent Activity

liked a dataset 28 days ago

anon8231489123/ShareGPT_Vicuna_unfiltered

upvoted a paper about 1 month ago

ProxyAttn: Guided Sparse Attention via Representative Heads

liked a Space about 2 months ago

yzweak/AutoPR

View all activity

Organizations

None yet

liked a dataset 28 days ago

anon8231489123/ShareGPT_Vicuna_unfiltered

Updated Apr 12, 2023 • 36.3k • 829

upvoted a paper about 1 month ago

ProxyAttn: Guided Sparse Attention via Representative Heads

Paper • 2509.24745 • Published Sep 29 • 1

liked a Space about 2 months ago

AutoPR

🚀

Generate social media posts from PDFs

liked a model 3 months ago

baidu/ERNIE-4.5-21B-A3B-Thinking

Text Generation • 22B • Updated 4 days ago • 14.5k • • 768

liked a dataset 3 months ago

Naomibas/llm-system-prompts-benchmark

Viewer • Updated Jul 11, 2024 • 100 • 173 • 14

updated 2 models 5 months ago

DecoderImmortal/Llama3-8B-MSN

8B • Updated Jul 9 • 4

DecoderImmortal/DeepSeek-Coder-7B-MSN

7B • Updated Jul 9 • 4

upvoted a paper 5 months ago

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 92

upvoted a collection 5 months ago

ERNIE 4.5

Collection

collection of ERNIE 4.5 models. • 27 items • Updated 19 days ago • 179

published 2 models 6 months ago

DecoderImmortal/DeepSeek-Coder-7B-MSN

7B • Updated Jul 9 • 4

DecoderImmortal/Llama3-8B-MSN

8B • Updated Jul 9 • 4

upvoted an article 8 months ago

Article

What is test-time compute and how to scale it?

Feb 6

•

109

upvoted a paper 8 months ago

What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models

Paper • 2503.24235 • Published Mar 31 • 54

updated a model about 1 year ago

DecoderImmortal/LM-Combiner

Updated Nov 22, 2024 • 1

upvoted a paper about 1 year ago

Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers

Paper • 2404.04925 • Published Apr 7, 2024 • 1

updated a model about 1 year ago

DecoderImmortal/CDA4GEC

Updated Sep 1, 2024

wyx

AI & ML interests

Recent Activity

Organizations

DecoderImmortal's activity

AutoPR

What is test-time compute and how to scale it?