Alexey Gorbatovski

Myashka

AI & ML interests

NLP Alignment

Recent Activity

authored a paper 5 days ago
ESSA: Evolutionary Strategies for Scalable Alignment
upvoted a paper 23 days ago
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models
commented on a paper about 1 month ago
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Organizations

None yet

commented on a paper about 1 month ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21
New activity in agentica-org/DeepScaleR-Preview-Dataset about 2 months ago

There are no answers for 6 samples

#4 opened about 2 months ago by Myashka
New activity in Myashka/CryptoNews_50_50 over 1 year ago

Librarian Bot: Add language metadata for dataset

#2 opened over 1 year ago by librarian-bot