4 41 53

Chew Kok Wah

chewkokwah

AI & ML interests

Open Domain Question Answering

Recent Activity

liked a model about 5 hours ago

MikaStars39/PeRL

upvoted an article 7 days ago

What makes good reasoning data

liked a dataset 20 days ago

ByteDance-Seed/BeyondAIME

View all activity

Organizations

liked a model about 5 hours ago

MikaStars39/PeRL

Updated about 1 month ago • 1

upvoted an article 7 days ago

Article

What makes good reasoning data

Oct 30, 2025

•

liked a dataset 20 days ago

ByteDance-Seed/BeyondAIME

Viewer • Updated Jun 17, 2025 • 100 • 466 • 14

upvoted an article about 1 month ago

Article

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware

Aug 8, 2025

•

liked a dataset about 1 month ago

asusevski/kaggle-AI-Mathematical-Olympiad-3-responses

Updated 29 days ago • 13 • 1

upvoted an article about 1 month ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

Dec 1, 2025

•

262

liked 3 models about 1 month ago

upvoted an article about 1 month ago

Article

Announcing New Hugging Face and KerasHub integration

Jul 10, 2024

•

liked a dataset about 1 month ago

MathArena/cmimc_2025_outputs

Viewer • Updated 23 days ago • 4.64k • 33 • 1

upvoted a collection about 1 month ago

MathArena Outputs

Collection

Outputs of models on the MathArena Benchmark. • 16 items • Updated 25 days ago • 1

liked a model about 1 month ago

Intel/Qwen3-30B-A3B-Thinking-2507-int4-AutoRound

0.6B • Updated Sep 19, 2025 • 58 • 11

liked a dataset about 2 months ago

madrylab/gsm8k-platinum

Viewer • Updated Mar 11, 2025 • 1.21k • 1.51k • 45

upvoted an article about 2 months ago

Article

Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation

Sep 16, 2025

•

upvoted a paper about 2 months ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 176

upvoted an article about 2 months ago

Article

On the Shifting Global Compute Landscape

Oct 29, 2025

•

upvoted a paper about 2 months ago

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29, 2025 • 77

liked a model 2 months ago

tencent/DeepSeek-V3.1-Terminus-W4AFP8

Text Generation • 349B • Updated Nov 4, 2025 • 389 • 15

upvoted a paper 2 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 320

Chew Kok Wah

AI & ML interests

Recent Activity

Organizations

chewkokwah's activity

What makes good reasoning data

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware

Transformers v5: Simple model definitions powering the AI ecosystem

Announcing New Hugging Face and KerasHub integration

Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation

On the Shifting Global Compute Landscape