Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Harris Zhang's picture
2 2

Harris Zhang

HanSolo9682
mucai's profile picture jizhongpeng's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 25 days ago
Contamination Detection for VLMs using Multi-Modal Semantic Perturbation
updated a dataset 4 months ago
HanSolo9682/Vinoground
View all activity

Organizations

University of Wisconsin - Madison's profile picture vgbench's profile picture CounterCurate's profile picture LLaVA-R1's profile picture ThinkSpace's profile picture

authored 4 papers about 1 year ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 17

CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples

Paper • 2402.13254 • Published Feb 20, 2024

VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation

Paper • 2407.10972 • Published Jul 15, 2024 • 1

Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos

Paper • 2410.02763 • Published Oct 3, 2024 • 7
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs