Mohd Hozaifa Khan

khanitachi

AI & ML interests

Deep Learning, Computer Vision, Bioinformatics

Recent Activity

upvoted a paper 28 days ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

upvoted a paper 28 days ago

Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

upvoted a paper 28 days ago

Emu3.5: Native Multimodal Models are World Learners

Organizations

None yet

upvoted 4 papers 28 days ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26 • 134

upvoted a paper about 2 months ago

SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models

Paper • 2510.06917 • Published Oct 8 • 34

upvoted 3 papers 2 months ago

ARE: Scaling Up Agent Environments and Evaluations

Paper • 2509.17158 • Published Sep 21 • 35

EpiCache: Episodic KV Cache Management for Long Conversational Question Answering

Paper • 2509.17396 • Published Sep 22 • 19

RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation

Paper • 2509.16198 • Published Sep 19 • 127

upvoted a paper 3 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 256

liked a Space 4 months ago

The Ultra-Scale Playbook

🌌

3.53k

The ultimate guide to training LLM on large GPU Clusters

liked 3 models 4 months ago

nvidia/parakeet-tdt-0.6b-v3

Automatic Speech Recognition • Updated 5 days ago • 71.7k • 430

fishaudio/openaudio-s1-mini

Text-to-Speech • Updated Jun 2 • 3.55k • 522

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 126k • 1.82k

upvoted 7 papers 4 months ago

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Paper • 2409.06666 • Published Sep 10, 2024 • 59

DMOSpeech 2: Reinforcement Learning for Duration Prediction in Metric-Optimized Speech Synthesis

Paper • 2507.14988 • Published Jul 20 • 7

DriftMoE: A Mixture of Experts Approach to Handle Concept Drifts

Paper • 2507.18464 • Published Jul 24 • 11

Hierarchical Budget Policy Optimization for Adaptive Reasoning

Paper • 2507.15844 • Published Jul 21 • 16

EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion

Paper • 2507.16535 • Published Jul 22 • 20

MUR: Momentum Uncertainty guided Reasoning for Large Language Models

Paper • 2507.14958 • Published Jul 20 • 46

LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization

Paper • 2507.15758 • Published Jul 21 • 35
