Interesting AI papers
updated
Attention Is All You Need
Paper
• 1706.03762
• Published
• 115
BERT: Pre-training of Deep Bidirectional Transformers for Language
Understanding
Paper
• 1810.04805
• Published
• 26
Universal Language Model Fine-tuning for Text Classification
Paper
• 1801.06146
• Published
• 8
Language Models are Few-Shot Learners
Paper
• 2005.14165
• Published
• 19
EELBERT: Tiny Models through Dynamic Embeddings
Paper
• 2310.20144
• Published
• 3
Scaling Laws for Neural Language Models
Paper
• 2001.08361
• Published
• 9
Training Compute-Optimal Large Language Models
Paper
• 2203.15556
• Published
• 11
BloombergGPT: A Large Language Model for Finance
Paper
• 2303.17564
• Published
• 30
MARRS: Multimodal Reference Resolution System
Paper
• 2311.01650
• Published
• 2
Scaling Instruction-Finetuned Language Models
Paper
• 2210.11416
• Published
• 7
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
Understanding
Paper
• 1804.07461
• Published
• 4
SuperGLUE: A Stickier Benchmark for General-Purpose Language
Understanding Systems
Paper
• 1905.00537
• Published
• 2
Measuring Massive Multitask Language Understanding
Paper
• 2009.03300
• Published
• 3
Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning
Paper
• 2303.15647
• Published
• 4
LoRA: Low-Rank Adaptation of Large Language Models
Paper
• 2106.09685
• Published
• 58
QLoRA: Efficient Finetuning of Quantized LLMs
Paper
• 2305.14314
• Published
• 59
The Power of Scale for Parameter-Efficient Prompt Tuning
Paper
• 2104.08691
• Published
• 10
Learning to summarize from human feedback
Paper
• 2009.01325
• Published
• 4
ReAct: Synergizing Reasoning and Acting in Language Models
Paper
• 2210.03629
• Published
• 33
Training language models to follow instructions with human feedback
Paper
• 2203.02155
• Published
• 24
Proximal Policy Optimization Algorithms
Paper
• 1707.06347
• Published
• 11
Direct Preference Optimization: Your Language Model is Secretly a Reward
Model
Paper
• 2305.18290
• Published
• 64
Constitutional AI: Harmlessness from AI Feedback
Paper
• 2212.08073
• Published
• 4
Automatic Chain of Thought Prompting in Large Language Models
Paper
• 2210.03493
• Published
• 2
PAL: Program-aided Language Models
Paper
• 2211.10435
• Published
• 4