Feynman Innovations's picture

Feynman Innovations

ajibawa-2023

·

AjinkyaBawase

AI & ML interests

LLM, RL, DL, ML, AGI. Developing LLMs (preferably fully fine tuned ) for various use cases.

Recent Activity

reacted to DmitryRyumin's post with 🔥 5 days ago

🚀🤖🌟 New Research Alert - ICCV 2025 (Oral)! 🌟🤖🚀 📄 Title: Variance-based Pruning for Accelerating and Compressing Trained Networks 🔝 📝 Description: The one-shot pruning method efficiently compresses networks, reducing computation and memory usage while retaining almost full performance and requiring minimal fine-tuning. 👥 Authors: Uranik Berisha, Jens Mehnert, and Alexandru Paul Condurache 📅 Conference: ICCV, 19 – 23 Oct, 2025 | Honolulu, Hawai'i, USA 🇺🇸 📄 Paper: https://huggingface.co/papers/2507.12988 🚀 ICCV-2023-25-Papers: https://github.com/DmitryRyumin/ICCV-2023-25-Papers 🚀 Added to the Efficient Learning Section: https://github.com/DmitryRyumin/ICCV-2023-25-Papers/blob/main/sections/2025/main/efficient-learning.md 📚 More Papers: more cutting-edge research presented at other conferences in the https://huggingface.co/spaces/DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin 🔍 Keywords: #VarianceBasedPruning #NetworkCompression #ModelAcceleration #EfficientDeepLearning #VisionTransformers #AI #ICCV2025 #ResearchHighlight

reacted to onekq's post with 👍 5 days ago

Context rot is such a catchy phrase, but the problem has been identified 2+ years ago, called attention decay. https://huggingface.co/papers/2307.03172 I spotted the same problem in coding tasks, and documented in my book (https://www.amazon.com/dp/9999331130). Why did this problem become hot again? This is because many of us thought the problem has been solved by long context models, which is not true. Here we were misled by benchmarks. Most long-context benchmarks build around the QA scenario, i.e. "finding needle in haystack". But in agentic scenarios, the model needs to find EVERYTHING in the haystack, and just can't afford enough attention for this challenge.

reacted to di-zhang-fdu's post with 🔥 5 days ago

The training dataset of ChemVLM is open-sourced now, have a check! https://huggingface.co/datasets/di-zhang-fdu/chemvlm-sft-datasets papers: https://huggingface.co/papers/2408.07246

View all activity

Organizations

upvoted 10 papers 5 days ago

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published 8 days ago • 90

Scaling Language-Centric Omnimodal Representation Learning

Paper • 2510.11693 • Published 19 days ago • 97

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published 16 days ago • 101

LightMem: Lightweight and Efficient Memory-Augmented Generation

Paper • 2510.18866 • Published 11 days ago • 106

LongCodeZip: Compress Long Context for Code Language Models

Paper • 2510.00446 • Published Oct 1 • 107

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published 26 days ago • 112

Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model

Paper • 2510.12276 • Published 18 days ago • 142

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published 19 days ago • 160

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28 • 170

Agent Learning via Early Experience

Paper • 2510.08558 • Published 23 days ago • 255

upvoted 4 papers 11 days ago

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published 26 days ago • 109

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published about 1 month ago • 113

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29 • 136

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published 19 days ago • 168

upvoted 6 papers 12 days ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published 26 days ago • 461

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30 • 519

FinTrust: A Comprehensive Benchmark of Trustworthiness Evaluation in Finance Domain

Paper • 2510.15232 • Published 15 days ago • 5

Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation

Paper • 2510.15624 • Published 15 days ago • 14

Paper2Web: Let's Make Your Paper Alive!

Paper • 2510.15842 • Published 15 days ago • 24

A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

Paper • 2510.15444 • Published 15 days ago • 144