Costa Pissaris

somtimz

Recent Activity

updated a collection about 1 month ago

Some of the Papers I've Read

A few of the research papers that I've read. • 9 items • Updated Sep 21

upvoted a paper about 1 month ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 218

upvoted an article 2 months ago

Fine-tune Llama 3 with ORPO

Article • Apr 22, 2024 • 239

upvoted a collection 4 months ago

Gemma 3n

4 items • Updated Jul 10 • 234

upvoted a collection 5 months ago

Self-improving LLMs

17 items • Updated Mar 27 • 2

upvoted a paper 5 months ago

Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification

Paper • 2502.01839 • Published Feb 3 • 11

upvoted a paper 6 months ago

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Paper • 2407.19594 • Published Jul 28, 2024 • 21

upvoted 2 articles 7 months ago

Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework

Article • May 7, 2024 • 3

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Article • Jul 29, 2024 • 364

liked a dataset 7 months ago

mlabonne/FineTome-100k

Viewer • Updated Jul 29, 2024 • 100k • 10.6k • 242

liked a Space 8 months ago

Gemma 3 12b It

Generate text based on images and videos

upvoted a collection 8 months ago

Gemma 3 Release

28 items • Updated Aug 11 • 522

liked a model 10 months ago

Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • 33B • Updated Jan 12 • 99.3k • 1.94k

upvoted an article about 1 year ago

Let's talk about LLM evaluation

Article • May 23, 2024 • 190

updated a collection over 1 year ago

Some of the Papers I've Read

A few of the research papers that I've read. • 9 items • Updated Sep 21

upvoted a paper over 1 year ago

RAG Does Not Work for Enterprises

Paper • 2406.04369 • Published May 31, 2024 • 1

updated a collection over 1 year ago

Some of the Papers I've Read

A few of the research papers that I've read. • 9 items • Updated Sep 21

upvoted a paper over 1 year ago

Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Paper • 2406.04271 • Published Jun 6, 2024 • 30

liked a dataset over 1 year ago

GAIR/lima

Viewer • Updated Jun 8, 2023 • 1.33k • 586 • 445