Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
6
Jonathan von Rad
jonny-vr
Follow
jonny-vr
AI & ML interests
LLM Compression & Mechanistic Interpretability
Organizations
None yet
jonny-vr
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
monology/pile-uncopyrighted
4 months ago
Could you please implement train:1% feature? This way we don't have to download the entire dataset.
1
#12 opened 4 months ago by
jonny-vr
New activity in
Qwen/Qwen3-32B
4 months ago
Low Score on GSM8K on lm-eval-harness? (just 74.91)
2
#36 opened 4 months ago by
jonny-vr
New activity in
nvidia/NV-Embed-v2
4 months ago
TypeError: cannot unpack non-iterable NoneType object
👀
👍
8
5
#37 opened 9 months ago by
Pietroferr
New activity in
google/gemma-3-27b-pt
4 months ago
Model is a Memory Hog - 2xH100 80GB OOM??
1
#5 opened 4 months ago by
jonny-vr
New activity in
Qwen/Qwen3-32B
5 months ago
Where is the Base Model?
👍
➕
8
#34 opened 5 months ago by
jonny-vr
New activity in
google/gemma-3-1b-pt
5 months ago
When evaluating Wiki2, I just get Loss: Nan, while with gemma-3-1b-it it works..
2
#8 opened 5 months ago by
jonny-vr