jiaweiou's Collections

Vidun

updated Dec 5, 2024

  • Look Every Frame All at Once: Video-Ma^2mba for Efficient Long-form Video Understanding with Multi-Axis Gradient Checkpointing

    Paper • 2411.19460 • Published Nov 29, 2024 • 11

  • Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding

    Paper • 2406.19263 • Published Jun 27, 2024 • 10

  • OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation

    Paper • 2412.02592 • Published Dec 3, 2024 • 24