mohsen ahadi nejad

mohsenahn

AI & ML interests

None yet

Recent Activity

liked a Space 1 day ago

MCP-1st-Birthday/Reuben_OS

liked a model 1 day ago

tencent/HunyuanVideo-1.5

liked a model 6 days ago

moonshotai/Kimi-K2-Thinking


Organizations

None yet

liked a Space 1 day ago

ReubenOS

🖥

Connect Claude Desktop to Reuben OS via MCP

liked a model 1 day ago

tencent/HunyuanVideo-1.5

Text-to-Video • Updated about 9 hours ago • 1.49k • 626

liked a model 6 days ago

moonshotai/Kimi-K2-Thinking

Text Generation • Updated 16 days ago • 241k • 1.38k

reacted to Kseniase's post with ❤️ 6 days ago

Post

5893

12 Types of JEPA

Since Yann LeCun together with Randall Balestriero released a new paper on JEPA (Joint-Embedding Predictive Architecture), laying out its theory and introducing an efficient practical version called LeJEPA, we figured you might need even more JEPA. Here are 7 recent JEPA variants plus 5 iconic ones:

1. LeJEPA → LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics (2511.08544)
Explains a full theory for JEPAs, defining the “ideal” JEPA embedding as an isotropic Gaussian, and proposes the SIGReg objective to push JEPA toward this ideal, resulting in practical LeJEPA

2. JEPA-T → JEPA-T: Joint-Embedding Predictive Architecture with Text Fusion for Image Generation (2510.00974)
A text-to-image model that tokenizes images and captions with a joint predictive Transformer, enhances fusion with cross-attention and text embeddings before training loss, and generates images by iteratively denoising visual tokens conditioned on text

3. Text-JEPA → Speaking in Words, Thinking in Logic: A Dual-Process Framework in QA Systems (2507.20491)
Converts natural language into first-order logic, with a Z3 solver handling reasoning, enabling efficient, explainable QA with far lower compute than large LLMs

4. N-JEPA (Noise-based JEPA) → Improving Joint Embedding Predictive Architecture with Diffusion Noise (2507.15216)
Connects self-supervised learning with diffusion-style noise by using noise-based masking and multi-level schedules, especially improving visual classification

5. SparseJEPA → SparseJEPA: Sparse Representation Learning of Joint Embedding Predictive Architectures (2504.16140)
Adds sparse representation learning to make embeddings more interpretable and efficient. It groups latent variables by shared semantic structure using a sparsity penalty while preserving accuracy

6. TS-JEPA (Time Series JEPA) → Joint Embeddings Go Temporal (2509.25449)
Adapts JEPA to time-series by learning latent self-supervised representations and predicting future latents for robustness to noise and confounders

Read further below ↓
If you like it, also subscribe to the Turing Post: https://www.turingpost.com/subscribe

1 reply

liked a dataset 6 days ago

Yzineb/arabic_multimodal_text

Viewer • Updated 20 days ago • 162 • 29 • 1

liked a dataset 7 days ago

mohajesmaeili/Persian_Arabic_TextLine_Image_Ocr_Medium

Viewer • Updated Sep 26 • 791k • 1.64k • 5

reacted to ronantakizawa's post with 🔥 8 days ago

Post

2275

I built a demo on how to implement Cache-Augmented Generation (CAG) in an LLM and compare its performance gains to RAG (111 stars, 20 forks).

https://github.com/ronantakizawa/cacheaugmentedgeneration

CAG preloads document content into an LLM’s context as a precomputed key-value (KV) cache. This caching eliminates the need for real-time retrieval during inference, reducing token usage by up to 76% while maintaining answer quality.

CAG is particularly effective for constrained knowledge bases like internal documentation, FAQs, and customer support systems, where all relevant information can fit within the model's extended context window.
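To make the token-usage argument concrete, here is a minimal toy sketch in Python. It is not the code from the linked repository and it does not touch a real LLM or KV cache. It only counts whitespace-separated "tokens" under two assumptions: a RAG-style baseline re-processes its best-matching document on every query, while a CAG-style setup processes all documents once up front and afterwards only pays for the question. The function names, documents, and questions are all hypothetical.

```python
def tokenize(text):
    """Crude whitespace tokenizer; a real model would use its own tokenizer."""
    return text.lower().split()


def rag_token_cost(docs, questions):
    """Assumed RAG baseline: each query re-reads its best-matching document."""
    total = 0
    for question in questions:
        q_tokens = tokenize(question)
        # Hypothetical retriever: pick the document with the largest word overlap.
        best = max(docs, key=lambda d: len(set(tokenize(d)) & set(q_tokens)))
        total += len(tokenize(best)) + len(q_tokens)
    return total


def cag_token_cost(docs, questions):
    """Assumed CAG setup: all documents are processed once, then cached."""
    prefill = sum(len(tokenize(d)) for d in docs)  # one-time cost
    per_query = sum(len(tokenize(q)) for q in questions)  # questions only
    return prefill + per_query


# Hypothetical knowledge base and questions, small enough to fit in context.
docs = [
    "Reset your password from the account settings page",
    "Refunds are processed within five business days",
]
questions = [
    "How do I reset my password",
    "How long do refunds take",
    "Where is the settings page",
]

print("RAG-style tokens processed:", rag_token_cost(docs, questions))
print("CAG-style tokens processed:", cag_token_cost(docs, questions))
```

In this toy the one-time prefill is amortized over the questions, so the gap grows with the number of queries. With a large knowledge base and few queries the comparison can flip, which matches the post's point that CAG suits knowledge bases small enough to fit in the context window.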

#rag #retrievalaugmentedgeneration