Collections
Collections including paper arxiv:2210.03629
- deepseek-ai/DeepSeek-R1
  Text Generation • Updated • 695k • 13k
- Qwen/Qwen2.5-Coder-32B-Instruct
  Text Generation • 33B • Updated • 739k • 2k
- google/gemma-2-27b-it
  Text Generation • 27B • Updated • 393k • 559
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
  Paper • 2201.11903 • Published • 15

- High-Resolution Image Synthesis with Latent Diffusion Models
  Paper • 2112.10752 • Published • 15
- Adding Conditional Control to Text-to-Image Diffusion Models
  Paper • 2302.05543 • Published • 58
- Proximal Policy Optimization Algorithms
  Paper • 1707.06347 • Published • 11
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model
  Paper • 2305.18290 • Published • 64

- LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries
  Paper • 2508.15760 • Published • 47
- LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?
  Paper • 2508.01780 • Published • 21
- API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs
  Paper • 2304.08244 • Published • 1
- AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
  Paper • 2508.16153 • Published • 160

- Qwen/Qwen2.5-Coder-7B-Instruct
  Text Generation • 8B • Updated • 1.32M • 654
- meta-llama/Llama-3.1-8B-Instruct
  Text Generation • Updated • 6.1M • 5.49k
- Toolformer: Language Models Can Teach Themselves to Use Tools
  Paper • 2302.04761 • Published • 12
- ReAct: Synergizing Reasoning and Acting in Language Models
  Paper • 2210.03629 • Published • 33