Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Open to Collab
2140.8
TFLOPS
2
2
1
Michal Valko
misovalko
Follow
Simontwice's profile picture
shahzad4894's profile picture
lukbl's profile picture
30 followers
·
112 following
https://misovalko.github.io/
misovalko
misovalko
michalvalko
misovalko.bsky.social
AI & ML interests
large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models
Recent Activity
upvoted
a
paper
about 1 month ago
A General Theoretical Paradigm to Understand Learning from Human Preferences
authored
a paper
about 1 month ago
Optimal Design for Reward Modeling in RLHF
authored
a paper
about 1 month ago
Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms
View all activity
Organizations
misovalko
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a Space
about 2 years ago
Running
on
Zero
282
Daily Papers
📊
282
Complete list of past Daily Papers