Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Arsenever's picture
3 5 20

Arsenever

Eurayka
21world's profile picture
·

AI & ML interests

None yet

Recent Activity

submitted a paper 4 days ago
LaViT: Aligning Latent Visual Thoughts for Multi-modal Reasoning
liked a model 4 days ago
Svard/LaViT-3B
upvoted a paper 4 days ago
LaViT: Aligning Latent Visual Thoughts for Multi-modal Reasoning
View all activity

Organizations

OpenGVLab's profile picture

submitted a paper to Daily Papers 4 days ago

LaViT: Aligning Latent Visual Thoughts for Multi-modal Reasoning

Paper • 2601.10129 • Published 5 days ago • 9
authored a paper about 1 month ago

VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs

Paper • 2511.20272 • Published Nov 25, 2025 • 1
authored 4 papers 2 months ago

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Paper • 2403.15377 • Published Mar 22, 2024 • 27

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

Paper • 2410.19702 • Published Oct 25, 2024 • 1

Make Your Training Flexible: Towards Deployment-Efficient Video Models

Paper • 2503.14237 • Published Mar 18, 2025 • 5

ExpVid: A Benchmark for Experiment Video Understanding & Reasoning

Paper • 2510.11606 • Published Oct 13, 2025 • 4
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs