Deepak Nathani's picture

4 1

Deepak Nathani

dnathani

·

https://deepakn97.github.io/

AI & ML interests

Controllable Text Generation, Dialogue systems

Recent Activity

upvoted a paper 29 days ago

What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity

upvoted a paper about 1 month ago

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

published a dataset 7 months ago

mlgym/coco-captioning

View all activity

Organizations

upvoted a paper 29 days ago

What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity

Paper • 2511.15593 • Published Nov 19 • 56

upvoted a paper about 1 month ago

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Paper • 2511.13254 • Published Nov 17 • 136

published a dataset 7 months ago

mlgym/coco-captioning

Viewer • Updated Dec 1, 2024 • 56.5k • 46

upvoted a paper 8 months ago

THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

Paper • 2504.13367 • Published Apr 17 • 26

upvoted a paper 10 months ago

Retrieval Head Mechanistically Explains Long-Context Factuality

Paper • 2404.15574 • Published Apr 24, 2024 • 3

authored 2 papers 10 months ago

MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models

Paper • 2310.12426 • Published Oct 19, 2023 • 1

Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies

Paper • 2308.03188 • Published Aug 6, 2023 • 2

liked a model about 2 years ago

akjindal53244/Arithmo-Mistral-7B

Text Generation • Updated Jan 27, 2024 • 1.07k • 62