-
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Paper • 2402.10986 • Published • 80 -
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Paper • 2408.02657 • Published • 35 -
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale
Paper • 2508.10711 • Published • 142
Charles Cai
charlescai2016
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 8 hours ago
ChronoEdit: Towards Temporal Reasoning for Image Editing and World
Simulation
upvoted
an
article
2 days ago
Train your ControlNet with diffusers
upvoted
a
paper
2 days ago
The End of Manual Decoding: Towards Truly End-to-End Language Models