Yunus Serhat Bıçakçı's picture

Yunus Serhat Bıçakçı

yunusserhat

·

https://www.yunusserhat.com

AI & ML interests

None yet

Recent Activity

liked a model 9 days ago

deepseek-ai/DeepSeek-OCR

liked a model 10 days ago

Qwen/Qwen3-VL-30B-A3B-Instruct-FP8

liked a model 13 days ago

Qwen/Qwen3-VL-30B-A3B-Instruct

View all activity

Organizations

upvoted a collection about 1 month ago

InternVL3.5

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated Sep 28 • 99

upvoted 3 collections 3 months ago

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated 3 days ago • 75

ERNIE 4.5

collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 26 items • Updated Sep 24 • 174

🛰️🌍 Geospatial Datasets

A curated collections of diverse geospatial and satellite imagery datasets. • 56 items • Updated Mar 11 • 26

upvoted a paper 5 months ago

Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces

Paper • 2506.00123 • Published May 30 • 35

upvoted an article 5 months ago

Article

Vision Language Models (Better, Faster, Stronger)

May 12

• 557

upvoted a collection 6 months ago

D-FINE

State-of-the-art real-time object detection model with Apache 2.0 licence • 15 items • Updated May 5 • 55

upvoted a collection 8 months ago

Türkçe VLMler

11 items • Updated Mar 4 • 10

upvoted 2 articles 8 months ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Mar 4

• 78

Article

FastRTC: The Real-Time Communication Library for Python

Feb 25

• 172

upvoted a paper 8 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 207

upvoted a collection 8 months ago

SigLIP2

36 items • Updated Jul 10 • 90

upvoted a collection 9 months ago

Visual Document Retrieval

A collection of models, datasets, and spaces in the VDR series • 5 items • Updated Jan 10 • 8

upvoted an article 9 months ago

Article

🚀 Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker!

By

•

Jan 29

• 19

upvoted a collection 10 months ago

Jan 10 Releases 🌨️

38 items • Updated Jan 10 • 12

upvoted a collection 11 months ago

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Aug 25 • 81

upvoted 2 collections about 1 year ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Apr 30 • 308

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 638

upvoted a collection over 1 year ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated May 1 • 574

upvoted a paper over 1 year ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 72