CD

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

TianyuZhang authored a paper 4 days ago

ClimateGAN: Raising Climate Change Awareness by Generating Images of Floods

TianyuZhang authored a paper 4 days ago

MuPT: A Generative Symbolic Music Pretrained Transformer

TianyuZhang authored a paper 4 days ago

Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation

View all activity

TianyuZhang

authored 10 papers 4 days ago

ClimateGAN: Raising Climate Change Awareness by Generating Images of Floods

Paper • 2110.02871 • Published Oct 6, 2021

MuPT: A Generative Symbolic Music Pretrained Transformer

Paper • 2404.06393 • Published Apr 9, 2024 • 16

Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation

Paper • 2211.06687 • Published Nov 12, 2022 • 4

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Paper • 2412.04626 • Published Dec 5, 2024 • 14

STRICT: Stress Test of Rendering Images Containing Text

Paper • 2505.18985 • Published May 25

A Single Merging Suffices: Recovering Server-based Learning Performance in Decentralized Learning

Paper • 2507.06542 • Published Jul 9

MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation

Paper • 2406.07529 • Published Jun 11, 2024

W4ng1204

authored a paper 4 days ago

InteractComp: Evaluating Search Agents With Ambiguous Queries

Paper • 2510.24668 • Published 6 days ago • 94

TianyuZhang

authored a paper 4 days ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published 5 days ago • 186

sheryc

authored a paper 14 days ago

Scope: Selective Cross-modal Orchestration of Visual Perception Experts

Paper • 2510.12974 • Published 20 days ago

W4ng1204

authored a paper 18 days ago

VeritasFi: An Adaptable, Multi-tiered RAG Framework for Multi-modal Financial Question Answering

Paper • 2510.10828 • Published 22 days ago • 1

sheryc

authored a paper 20 days ago

VeritasFi: An Adaptable, Multi-tiered RAG Framework for Multi-modal Financial Question Answering

Paper • 2510.10828 • Published 22 days ago • 1

sheryc

authored a paper 27 days ago

Improving GUI Grounding with Explicit Position-to-Coordinate Mapping

Paper • 2510.03230 • Published Oct 3 • 3

W4ng1204

authored a paper about 1 month ago

It Takes Two: Your GRPO Is Secretly DPO

Paper • 2510.00977 • Published Oct 1 • 31

W4ng1204

authored 2 papers about 2 months ago

An Entity Linking Agent for Question Answering

Paper • 2508.03865 • Published Aug 5

Improving Context Fidelity via Native Retrieval-Augmented Reasoning

Paper • 2509.13683 • Published Sep 17 • 8

sheryc

authored a paper about 2 months ago

WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation

Paper • 2508.16763 • Published Aug 22 • 2

AI & ML interests

Recent Activity

Team members 3

CLAPv2's activity