Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Daniel Bolya's picture
7 5

Daniel Bolya

dbolya
0xSojalSec's profile picture Tonic's profile picture
·
  • dbolya

AI & ML interests

None yet

Organizations

AI at Meta's profile picture

upvoted 2 collections 4 months ago

Perception Encoder

Collection
OpenCLIP (PE Core image + text) and timm PE Core, Spatial, Lang (ViT only) weights. NOTE: These weights do not work with original modeling code. • 19 items • Updated Sep 19 • 6

Perception LM

Collection
7 items • Updated Apr 17 • 62
upvoted a collection 7 months ago

Perception Encoder

Collection
17 items • Updated Jul 11 • 70
upvoted 2 papers 7 months ago

Perception Encoder: The best visual embeddings are not at the output of the network

Paper • 2504.13181 • Published Apr 17 • 34

PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding

Paper • 2504.13180 • Published Apr 17 • 19
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs