Sinatras's picture

Sinatras

sinatras

·

SinatrasC

AI & ML interests

None yet

Recent Activity

updated a dataset about 1 month ago

sinatras/pmpp-eval

liked a model 2 months ago

microsoft/xclip-base-patch32

liked a model 2 months ago

Qwen/Qwen3-30B-A3B-Thinking-2507

View all activity

Organizations

upvoted a collection 2 months ago

InternVL3.5

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated Sep 28 • 103

upvoted an article 2 months ago

Article

Jupyter Agents: training LLMs to reason with notebooks

Sep 10

•

57

upvoted a paper 3 months ago

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published Aug 14 • 60

upvoted a collection 4 months ago

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 382

upvoted an article 4 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

Aug 5

•

506

upvoted an article 6 months ago

Article

Interactive Tools for machine learning, deep learning, and math

May 26

•

47

upvoted 2 collections 6 months ago

Gemma 3n Preview

4 items • Updated Jul 10 • 187

Marigold Computer Vision

All things Marigold • 17 items • Updated May 15 • 21

upvoted a paper 8 months ago

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published Apr 8 • 182

upvoted a collection 8 months ago

Llama 4

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 26 days ago • 53

upvoted a paper 8 months ago

YourBench: Easy Custom Evaluation Sets for Everyone

Paper • 2504.01833 • Published Apr 2 • 22

upvoted an article 8 months ago

Article

Open R1: Update #4

Mar 26

•

48

upvoted 3 articles 10 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

•

1.31k

Article

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

Jan 29

•

17

Article

Mastering Long Contexts in LLMs with KVPress

Jan 23

•

70

upvoted 2 articles about 1 year ago

Article

Releasing the largest multilingual open pretraining dataset

Nov 13, 2024

•

104

Article

Detoxifying the Commons

Oct 31, 2024

•

6

upvoted a collection about 1 year ago

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 8 days ago • 100

upvoted 2 articles about 1 year ago

Article

Fine-Tuning 1B LLaMA 3.2: A Comprehensive Step-by-Step Guide with Code

Oct 2, 2024

•

74

Article

Document Similarity Search with ColPali

Sep 21, 2024

•

52