27 34

Ashvanth.S

peaceAsh

https://ash-01xor.github.io/

AI & ML interests

Generative Models | Continual Learning

Recent Activity

upvoted an article about 1 month ago

BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders

liked a model about 1 month ago

netflix/void-model

updated a model about 2 months ago

peaceAsh/ViT-T-16-clipkd-cc12m

View all activity

Organizations

upvoted an article about 1 month ago

Article

BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders

Nicolas-BZRD

•

Apr 7

• 27

upvoted an article 3 months ago

Article

Train AI models with Unsloth and Hugging Face Jobs for FREE

burtenshaw, danielhanchen, shimmyshimmer, mlabonne, davanstrien, evalstate

•

Feb 20

• 101

upvoted a paper 7 months ago

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published Oct 16, 2025 • 124

upvoted 2 articles 7 months ago

Article

Visualizing How VLMs Work

not-lain

•

Oct 7, 2025

• 55

Article

There is no such thing as a tokenizer-free lunch

catherinearnett

•

Sep 25, 2025

• 98

upvoted a collection 9 months ago

Synthetic Data Generation

Collection

SDG papers • 86 items • Updated Jul 11, 2025 • 15

upvoted an article 9 months ago

Article

Synthetic dataset generation techniques: Self-Instruct

davanstrien

•

May 15, 2024

• 23

upvoted 4 articles 10 months ago

Article

Mastering Tensor Dimensions in Transformers

not-lain

•

Jan 12, 2025

• 172

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 773

Article

FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages

davanstrien

•

Jul 8, 2025

• 35

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb

•

May 21, 2025

• 258

upvoted an article about 1 year ago

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 611

upvoted a collection about 1 year ago

Cohere Labs Aya Vision

Collection

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated Jul 31, 2025 • 74

upvoted an article over 1 year ago

Article

Introduction to State Space Models (SSM)

lbourdois

•

Jul 19, 2024

• 223

upvoted a paper over 1 year ago

Maya: An Instruction Finetuned Multilingual Multimodal Model

Paper • 2412.07112 • Published Dec 10, 2024 • 28

upvoted 2 articles over 1 year ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

ybelkada, timdettmers, artidoro, sgugger, smangrul

•

May 24, 2023

• 180

Article

The Workflow of PEFT

ariG23498

•

Aug 14, 2024

• 19

upvoted an article almost 2 years ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

andito, merve, SkalskiP

•

Jun 24, 2024

• 207

upvoted a collection almost 2 years ago

cool datasets

Collection

218 items • Updated 13 days ago • 20

upvoted a paper almost 2 years ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 66

Ashvanth.S

AI & ML interests

Recent Activity

Organizations

peaceAsh's activity

BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders

Train AI models with Unsloth and Hugging Face Jobs for FREE

Visualizing How VLMs Work

There is no such thing as a tokenizer-free lunch

Synthetic dataset generation techniques: Self-Instruct

Mastering Tensor Dimensions in Transformers

SmolLM3: smol, multilingual, long-context reasoner

FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Vision Language Models (Better, faster, stronger)

Introduction to State Space Models (SSM)

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

The Workflow of PEFT

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models