Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Open to Collab
452.2
TFLOPS
1
10
36
PhysiQuanty
PRO
PhysiQuanty
Follow
Mathos34400's profile picture
LukeFP's profile picture
telcom's profile picture
39 followers
·
803 following
AI & ML interests
Theoretical Physics, Meta Deep Learning
Recent Activity
reacted
to
rob-x-ai
's
post
with 🔥
about 16 hours ago
Genesis 1B is now public. 🔥 I’m training a 1.003B parameter model from scratch on 2× RTX 4090s and opened a public playground for early checkpoints. The real bottleneck wasn’t training. It was checkpointing: FSDP full-state gather over PCIe = NCCL timeout hell Switching to DCP sharded checkpoints changed the trajectory of the run. - Playground: https://huggingface.co/spaces/rob-x-ai/genesis-1b-playground - Write-up: https://kroonen.ai/blog/distributed-checkpoint-failures-rtx4090/
liked
a dataset
about 18 hours ago
Shrijanagain/SKT-OMNI-CORPUS-146T-V1
liked
a dataset
1 day ago
open-index/hacker-news
View all activity
Organizations
PhysiQuanty
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
OpenTransformer/AGILLM-3-large-v2
6 days ago
FSDP or DDP
2
#1 opened 6 days ago by
PhysiQuanty
FSDP or DDP
2
#1 opened 6 days ago by
PhysiQuanty