OpenGVLab's Collections

PVT v2

updated Sep 28

Improved Baselines with Pyramid Vision Transformer


  • PVT v2: Improved Baselines with Pyramid Vision Transformer

    Paper • 2106.13797 • Published Jun 25, 2021

  • OpenGVLab/pvt_v2_b1

    Image Classification • 14M • Updated Mar 12, 2024 • 50 • 1

  • OpenGVLab/pvt_v2_b2

    Image Classification • 25.4M • Updated Mar 12, 2024 • 326 • 1

  • OpenGVLab/pvt_v2_b2_linear

    Image Classification • 22.6M • Updated Mar 12, 2024 • 38 • 1

  • OpenGVLab/pvt_v2_b5

    Image Classification • 82M • Updated Mar 12, 2024 • 30 • 1

  • OpenGVLab/pvt_v2_b3

    Image Classification • 45.2M • Updated Mar 12, 2024 • 29 • 2

  • OpenGVLab/pvt_v2_b4

    Image Classification • 62.6M • Updated Mar 12, 2024 • 29 • 1

  • OpenGVLab/pvt_v2_b0

    Image Classification • 3.67M • Updated Mar 12, 2024 • 3.43k • 2