Mohammed Hamdy's picture

Mohammed Hamdy

mmhamdy

hugging-science

·

AI & ML interests

TechBio | AI4Sci | NLP | Reinforcement Learning

Recent Activity

upvoted an article 29 days ago

Continuous batching from first principles

reacted to Kseniase's post with ❤️ about 1 month ago

12 Types of JEPA Since Yann LeCun together with Randall Balestriero released a new paper on JEPA (Joint-Embedding Predictive Architecture), laying out its theory and introducing an efficient practical version called LeJEPA, we figured you might need even more JEPA. Here are 7 recent JEPA variants plus 5 iconic ones: 1. LeJEPA → https://huggingface.co/papers/2511.08544 Explains a full theory for JEPAs, defining the “ideal” JEPA embedding as an isotropic Gaussian, and proposes the SIGReg objective to push JEPA toward this ideal, resulting in practical LeJEPA 2. JEPA-T → https://huggingface.co/papers/2510.00974 A text-to-image model that tokenizes images and captions with a joint predictive Transformer, enhances fusion with cross-attention and text embeddings before training loss, and generates images by iteratively denoising visual tokens conditioned on text 3. Text-JEPA → https://huggingface.co/papers/2507.20491 Converts natural language into first-order logic, with a Z3 solver handling reasoning, enabling efficient, explainable QA with far lower compute than large LLMs 4. N-JEPA (Noise-based JEPA) → https://huggingface.co/papers/2507.15216 Connects self-supervised learning with diffusion-style noise by using noise-based masking and multi-level schedules, especially improving visual classification 5. SparseJEPA → https://huggingface.co/papers/2504.16140 Adds sparse representation learning to make embeddings more interpretable and efficient. It groups latent variables by shared semantic structure using a sparsity penalty while preserving accuracy 6. TS-JEPA (Time Series JEPA) → https://huggingface.co/papers/2509.25449 Adapts JEPA to time-series by learning latent self-supervised representations and predicting future latents for robustness to noise and confounders Read further below ↓ It you like it, also subscribe to the Turing Post: https://www.turingpost.com/subscribe

liked a Space about 2 months ago

HuggingFaceH4/on-policy-distillation

View all activity

Organizations

upvoted an article 29 days ago

Article

Continuous batching from first principles

+1

Nov 25

•

285

upvoted 2 articles 2 months ago

Article

Promoter-GPT: Writing DNA Instructions with Language Models

Oct 22

•

25

Article

Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models

Oct 20

•

20

upvoted an article 6 months ago

Article

FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages

Jul 8

•

32

upvoted an article 7 months ago

Article

Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes

Jun 4

•

22

upvoted a paper 7 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 147

upvoted an article 7 months ago

Article

The 4 Things Qwen-3’s Chat Template Teaches Us

Apr 30

•

81

upvoted a paper 7 months ago

Text Generation Beyond Discrete Token Sampling

Paper • 2505.14827 • Published May 20 • 10

upvoted an article 8 months ago

Article

Tiny Agents: an MCP-powered agent in 50 lines of code

Apr 25

•

303

upvoted a paper 9 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 202

upvoted a paper 10 months ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7 • 122

upvoted an article 10 months ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

+2

Mar 4

•

78

upvoted a collection 10 months ago

Cohere Labs Aya Vision

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated Jul 31 • 70

upvoted an article 10 months ago

Article

Common AI Model Formats

Feb 27

•

57

upvoted a collection 10 months ago

CHASE

Generate challenging synthetic data to evaluate LLMs • 5 items • Updated Feb 21 • 4

upvoted a paper 10 months ago

How to Get Your LLM to Generate Challenging Problems for Evaluation

Paper • 2502.14678 • Published Feb 20 • 18

upvoted a collection 10 months ago

Reasoning Datasets

50 items • Updated Jun 8 • 10

upvoted 2 papers 10 months ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 43

From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions

Paper • 2502.13791 • Published Feb 19 • 5

upvoted a paper 11 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 123