Jonah Turner's picture

Jonah Turner

drexalt

·

AI & ML interests

None yet

Recent Activity

liked a dataset 2 days ago

jinaai/code_search_net_clean

liked a dataset 2 days ago

CoIR-Retrieval/CodeSearchNet

liked a model 6 days ago

nvidia/llama-embed-nemotron-8b

View all activity

Organizations

upvoted a paper 15 days ago

Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report

Paper • 2510.14880 • Published Oct 16 • 15

upvoted a collection about 2 months ago

Embeddings datasets ⚡️

This collection gather datasets for embeddings pre-training and fine-tuning. • 12 items • Updated Oct 1 • 3

upvoted a collection 4 months ago

NanoBEIR-fr 🍺

French translation of zeta-alpha-ai's NanoBEIR collection • 13 items • Updated 19 days ago • 2

upvoted a paper 6 months ago

Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval

Paper • 2505.16967 • Published May 22 • 24

upvoted a collection 6 months ago

RLHN Datasets

RLHN: Cleaned Training Datasets with False Negatives Identified & Relabeled as ground truth. • 5 items • Updated May 23 • 4

upvoted a collection 10 months ago

NanoBEIR 🍺

A collection of smaller versions of BEIR datasets with 50 queries and up to 10K documents each. • 13 items • Updated Sep 11, 2024 • 23

upvoted 2 articles over 1 year ago

Article

Fine-tune Llama 3 with ORPO

Apr 22, 2024

•

241

Article

RAG using huggingface tools

Jul 7, 2024

•

90

upvoted a collection over 1 year ago

fuck quadratic attention

11 items • Updated Apr 24, 2024 • 24

upvoted a paper over 1 year ago

Simple linear attention language models balance the recall-throughput tradeoff

Paper • 2402.18668 • Published Feb 28, 2024 • 20