-
ShareGPT4V: Improving Large Multi-Modal Models with Better Captions
Paper • 2311.12793 • Published • 18 -
PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics
Paper • 2311.12198 • Published • 22 -
CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation
Paper • 2311.18775 • Published • 6 -
Code Llama: Open Foundation Models for Code
Paper • 2308.12950 • Published • 29
Bob Gonsalves PRO
pinknoiz
AI & ML interests
None yet
Recent Activity
liked
a model
about 1 month ago
apple/DiffuCoder-7B-cpGRPO
liked
a model
6 months ago
teapotai/teapotllm
updated
a collection
10 months ago
Read this
Organizations
Apps
-
Running8989
Research Tracker
🚀Display a sortable table of research papers and models
-
Runtime error333333
MLLM-guided Image Editing (MGIE)
👩Transform images based on textual instructions
-
DressCode: Autoregressively Sewing and Generating Garments from Text Guidance
Paper • 2401.16465 • Published • 12
Read this
-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 625 -
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Paper • 2402.17177 • Published • 88 -
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
Paper • 2402.19427 • Published • 56 -
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Paper • 2402.19479 • Published • 35
temp
-
ShareGPT4V: Improving Large Multi-Modal Models with Better Captions
Paper • 2311.12793 • Published • 18 -
PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics
Paper • 2311.12198 • Published • 22 -
CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation
Paper • 2311.18775 • Published • 6 -
Code Llama: Open Foundation Models for Code
Paper • 2308.12950 • Published • 29
Datasets
Apps
-
Running8989
Research Tracker
🚀Display a sortable table of research papers and models
-
Runtime error333333
MLLM-guided Image Editing (MGIE)
👩Transform images based on textual instructions
-
DressCode: Autoregressively Sewing and Generating Garments from Text Guidance
Paper • 2401.16465 • Published • 12
Voice
Read this
-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 625 -
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Paper • 2402.17177 • Published • 88 -
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
Paper • 2402.19427 • Published • 56 -
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Paper • 2402.19479 • Published • 35