Collections

Discover the best community collections!

Collections including paper arxiv:2506.05573

ByteDance Papers

ByteDance papers collection

about 18 hours ago

Contrastive Learning for Many-to-many Multilingual Neural Machine Translation

Paper • 2105.09501 • Published May 20, 2021
Cross-modal Contrastive Learning for Speech Translation

Paper • 2205.02444 • Published May 5, 2022
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs

Paper • 2210.03052 • Published Oct 6, 2022
Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning

Paper • 2212.10240 • Published Dec 20, 2022 • 1

deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct

Text Generation • 16B • Updated Jul 3, 2024 • 268k • 495
Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 166
meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 327k • 1.54k
microsoft/OmniParser

Image-Text-to-Text • Updated Dec 2, 2024 • 345 • 1.69k

FlashWorld: High-quality 3D Scene Generation within Seconds

Paper • 2510.13678 • Published 19 days ago • 70
NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks

Paper • 2510.15019 • Published 18 days ago • 62
GeoSVR: Taming Sparse Voxels for Geometrically Accurate Surface Reconstruction

Paper • 2509.18090 • Published Sep 22 • 3
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation

Paper • 2509.19296 • Published Sep 23 • 22

CoRAG: Collaborative Retrieval-Augmented Generation

Paper • 2504.01883 • Published Apr 2 • 9
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Paper • 2504.08837 • Published Apr 10 • 43
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Paper • 2504.10068 • Published Apr 14 • 30
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published Apr 14 • 84

teknium/OpenHermes-2.5-Mistral-7B

Text Generation • 7B • Updated Feb 19, 2024 • 161k • 874
ByteDance/SDXL-Lightning

Text-to-Image • Updated Apr 3, 2024 • 130k • 2.1k
google/gemma-7b-it

Text Generation • 9B • Updated Aug 14, 2024 • 138k • 1.21k
dphn/dolphin-2.2.1-mistral-7b

Text Generation • 7B • Updated May 20, 2024 • 1.38k • 198
