Article: The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix • 21 days ago • 42
Mem-Agent Collection: Small-sized agents from Dria trained to interact with an Obsidian-like memory system using Python tools. Trained on Qwen3-4B-Thinking-2507. • 4 items • Updated Sep 5 • 3
Paper: BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining • arXiv:2508.10975 • Published Aug 14 • 60
Article: mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL • Sep 11 • 25
Article: Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning • Aug 9 • 12
Paper: On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification • arXiv:2508.05629 • Published Aug 7 • 178
Article: Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation • Aug 3 • 7
GLM-4.5 Collection: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated Aug 11 • 247
Article: Understanding Model Reasoning Through Thought Anchors: A Comparative Study of Qwen3 and DeepSeek-R1 • Jul 23 • 4
Ellora Collection: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement • 12 items • Updated Oct 20 • 4
Internal Coherence Maximization (ICM) Collection: A Label-Free, Unsupervised Training Framework for LLMs • 7 items • Updated Oct 10 • 2
Pre-training Dataset Samples Collection: A collection of pre-training dataset samples of sizes 10M, 100M, and 1B tokens, ideal for quick experimentation and ablations. • 19 items • Updated 13 days ago • 13
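The dataset samples above lend themselves to quick token-budget checks before an ablation run. Below is a minimal sketch, assuming the samples are hosted as standard Hugging Face datasets with a `text` column; the repository id and tokenizer choice are placeholders for illustration, not names taken from the collection listing.

```python
# Minimal sketch: stream one of the pre-training sample datasets and estimate
# its token count with a fast tokenizer. Repo id and column name are assumed.
from datasets import load_dataset
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # any fast tokenizer works

# Hypothetical repo id; substitute the actual dataset name from the collection.
ds = load_dataset("your-org/pretraining-sample-10M", split="train", streaming=True)

total_tokens = 0
for i, example in enumerate(ds):
    total_tokens += len(tokenizer(example["text"])["input_ids"])
    if i >= 1_000:  # stop early for a quick sanity check
        break
print(f"~{total_tokens:,} tokens in the first 1,000 documents")
```

Streaming avoids downloading the full sample, which keeps the check fast even for the 1B-token split.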