Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2401.01335

Memory Augmented Language Models through Mixture of Word Experts

Paper • 2311.10768 • Published Nov 15, 2023 • 19
System 2 Attention (is something you might need too)

Paper • 2311.11829 • Published Nov 20, 2023 • 43
Fine-tuning Language Models for Factuality

Paper • 2311.08401 • Published Nov 14, 2023 • 30
Orca 2: Teaching Small Language Models How to Reason

Paper • 2311.11045 • Published Nov 18, 2023 • 77

Tradecraft Patterns

Methods and analysis for generating synthetic data to populate graphs at scale, based on network motif (patterns) of tradecraft.

InGram: Inductive Knowledge Graph Embedding via Relation Graphs

Paper • 2305.19987 • Published May 31, 2023 • 2
Curating Grounded Synthetic Data with Global Perspectives for Equitable A

Paper • 2406.10258 • Published Jun 10, 2024 • 1
Peregrine: A Pattern-Aware Graph Mining System

Paper • 2004.02369 • Published Apr 6, 2020 • 2
OFFER: A Motif Dimensional Framework for Network Representation Learning

Paper • 2008.12010 • Published Aug 27, 2020 • 1

Papers - Fine-tuning

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning

Paper • 2310.20587 • Published Oct 31, 2023 • 18
SELF: Language-Driven Self-Evolution for Large Language Model

Paper • 2310.00533 • Published Oct 1, 2023 • 2
QLoRA: Efficient Finetuning of Quantized LLMs

Paper • 2305.14314 • Published May 23, 2023 • 58
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

Paper • 2309.14717 • Published Sep 26, 2023 • 45

DIBT Prompt collective SPIN

This collection contains resources related to the replication of SPIN with the dibt prompt collective dataset

argilla/zephyr-7b-spin-iter0-v0

Text Generation • Updated Mar 13, 2024 • 14 • 1
argilla/zephyr-7b-spin-iter1-v0

Text Generation • Updated Mar 13, 2024 • 8 • 1
argilla/zephyr-7b-spin-iter2-v0

Text Generation • Updated Mar 13, 2024 • 13 • 1
argilla/zephyr-7b-spin-iter3-v0

Text Generation • Updated Mar 13, 2024 • 11 • 8

A Critical Evaluation of AI Feedback for Aligning Large Language Models

Paper • 2402.12366 • Published Feb 19, 2024 • 3
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation

Paper • 2401.08417 • Published Jan 16, 2024 • 37
Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks

Paper • 2404.14723 • Published Apr 23, 2024 • 10
Self-Play Preference Optimization for Language Model Alignment

Paper • 2405.00675 • Published May 1, 2024 • 28

shisa-v2-research

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12, 2024 • 71
Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 104
argilla/magpie-ultra-v1.0

Viewer • Updated Nov 26, 2024 • 3.22M • 431 • 50
simplescaling/s1K-1.1

Viewer • Updated Feb 27, 2025 • 1k • 2.57k • 143

Synthetic Data Generation

Textbooks Are All You Need

Paper • 2306.11644 • Published Jun 20, 2023 • 152
Textbooks Are All You Need II: phi-1.5 technical report

Paper • 2309.05463 • Published Sep 11, 2023 • 88
TinyStories: How Small Can Language Models Be and Still Speak Coherent English?

Paper • 2305.07759 • Published May 12, 2023 • 38
Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 104

Self Improvement

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 68

ibm-research/AttaQ

Viewer • Updated Jan 26, 2024 • 1.4k • 358 • 21
snorkelai/snorkel-curated-instruction-tuning

Preview • Updated Mar 11, 2024 • 130 • 11
corbyrosset/researchy_questions

Viewer • Updated Feb 29, 2024 • 96.4k • 790 • 35
argilla/ultrafeedback-binarized-preferences

Viewer • Updated Nov 30, 2023 • 63.6k • 245 • 81

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 109
How to Train Data-Efficient LLMs

Paper • 2402.09668 • Published Feb 15, 2024 • 42
BitDelta: Your Fine-Tune May Only Be Worth One Bit

Paper • 2402.10193 • Published Feb 15, 2024 • 21
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

Paper • 2402.09727 • Published Feb 15, 2024 • 38

Memory Augmented Language Models through Mixture of Word Experts

Paper • 2311.10768 • Published Nov 15, 2023 • 19
System 2 Attention (is something you might need too)

Paper • 2311.11829 • Published Nov 20, 2023 • 43
Fine-tuning Language Models for Factuality

Paper • 2311.08401 • Published Nov 14, 2023 • 30
Orca 2: Teaching Small Language Models How to Reason

Paper • 2311.11045 • Published Nov 18, 2023 • 77

shisa-v2-research

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12, 2024 • 71
Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 104
argilla/magpie-ultra-v1.0

Viewer • Updated Nov 26, 2024 • 3.22M • 431 • 50
simplescaling/s1K-1.1

Viewer • Updated Feb 27, 2025 • 1k • 2.57k • 143

Tradecraft Patterns

Methods and analysis for generating synthetic data to populate graphs at scale, based on network motif (patterns) of tradecraft.

InGram: Inductive Knowledge Graph Embedding via Relation Graphs

Paper • 2305.19987 • Published May 31, 2023 • 2
Curating Grounded Synthetic Data with Global Perspectives for Equitable A

Paper • 2406.10258 • Published Jun 10, 2024 • 1
Peregrine: A Pattern-Aware Graph Mining System

Paper • 2004.02369 • Published Apr 6, 2020 • 2
OFFER: A Motif Dimensional Framework for Network Representation Learning

Paper • 2008.12010 • Published Aug 27, 2020 • 1

Synthetic Data Generation

Textbooks Are All You Need

Paper • 2306.11644 • Published Jun 20, 2023 • 152
Textbooks Are All You Need II: phi-1.5 technical report

Paper • 2309.05463 • Published Sep 11, 2023 • 88
TinyStories: How Small Can Language Models Be and Still Speak Coherent English?

Paper • 2305.07759 • Published May 12, 2023 • 38
Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 104

Papers - Fine-tuning

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning

Paper • 2310.20587 • Published Oct 31, 2023 • 18
SELF: Language-Driven Self-Evolution for Large Language Model

Paper • 2310.00533 • Published Oct 1, 2023 • 2
QLoRA: Efficient Finetuning of Quantized LLMs

Paper • 2305.14314 • Published May 23, 2023 • 58
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

Paper • 2309.14717 • Published Sep 26, 2023 • 45

Self Improvement

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 68

DIBT Prompt collective SPIN

This collection contains resources related to the replication of SPIN with the dibt prompt collective dataset

argilla/zephyr-7b-spin-iter0-v0

Text Generation • Updated Mar 13, 2024 • 14 • 1
argilla/zephyr-7b-spin-iter1-v0

Text Generation • Updated Mar 13, 2024 • 8 • 1
argilla/zephyr-7b-spin-iter2-v0

Text Generation • Updated Mar 13, 2024 • 13 • 1
argilla/zephyr-7b-spin-iter3-v0

Text Generation • Updated Mar 13, 2024 • 11 • 8

ibm-research/AttaQ

Viewer • Updated Jan 26, 2024 • 1.4k • 358 • 21
snorkelai/snorkel-curated-instruction-tuning

Preview • Updated Mar 11, 2024 • 130 • 11
corbyrosset/researchy_questions

Viewer • Updated Feb 29, 2024 • 96.4k • 790 • 35
argilla/ultrafeedback-binarized-preferences

Viewer • Updated Nov 30, 2023 • 63.6k • 245 • 81

A Critical Evaluation of AI Feedback for Aligning Large Language Models

Paper • 2402.12366 • Published Feb 19, 2024 • 3
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation

Paper • 2401.08417 • Published Jan 16, 2024 • 37
Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks

Paper • 2404.14723 • Published Apr 23, 2024 • 10
Self-Play Preference Optimization for Language Model Alignment

Paper • 2405.00675 • Published May 1, 2024 • 28

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 109
How to Train Data-Efficient LLMs

Paper • 2402.09668 • Published Feb 15, 2024 • 42
BitDelta: Your Fine-Tune May Only Be Worth One Bit

Paper • 2402.10193 • Published Feb 15, 2024 • 21
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

Paper • 2402.09727 • Published Feb 15, 2024 • 38

Previous
1
2
3
4
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs