srivatsa

srivatsa92

devsrivatsa

AI & ML interests

rag, agents, fine-tuning

Recent Activity

liked a dataset about 2 months ago

neerajaabhyankar/hindustani-raag-small

liked a model 3 months ago

miromind-ai/MiroThinker-32B-DPO-v0.1

upvoted an article 3 months ago

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

View all activity

Organizations

upvoted 2 articles 3 months ago

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

•

Apr 16

• 50

Article

Efficient Request Queueing – Optimizing LLM Performance

•

Apr 2

• 19

upvoted an article 5 months ago

Article

The Transformers Library: standardizing model definitions

May 15

• 120

upvoted 2 articles 9 months ago

Article

o3-mini & Deepseek-R1

•

Feb 2

• 24

Article

Fine-tune ModernBERT for RAG with Synthetic Data

and 2 others •

Jan 20

• 42

upvoted an article 10 months ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 216

upvoted a collection 12 months ago

Tiny Series

Collection

Tiny datasets that empower the foundation of Small Language Model! • 11 items • Updated Jan 26, 2024 • 42

upvoted an article over 1 year ago

Article

Deploy LLMs with Hugging Face Inference Endpoints

Jul 4, 2023

• 16

srivatsa

AI & ML interests

Recent Activity

Organizations

srivatsa92's activity

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Efficient Request Queueing – Optimizing LLM Performance

The Transformers Library: standardizing model definitions

o3-mini & Deepseek-R1

Fine-tune ModernBERT for RAG with Synthetic Data

Train 400x faster Static Embedding Models with Sentence Transformers

Deploy LLMs with Hugging Face Inference Endpoints