Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Djuunaa's picture
165 28 543

Djuunaa

djuna
ecastera's profile picture nour-ai's profile picture zelk12's profile picture
Β·

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago
KORMo-Team/KORMo-10B-sft
reacted to onekq's post with πŸ‘ 1 day ago
Context rot is such a catchy phrase, but the problem has been identified 2+ years ago, called attention decay. https://huggingface.co/papers/2307.03172 I spotted the same problem in coding tasks, and documented in my book (https://www.amazon.com/dp/9999331130). Why did this problem become hot again? This is because many of us thought the problem has been solved by long context models, which is not true. Here we were misled by benchmarks. Most long-context benchmarks build around the QA scenario, i.e. "finding needle in haystack". But in agentic scenarios, the model needs to find EVERYTHING in the haystack, and just can't afford enough attention for this challenge.
new activity 1 day ago
MiniMaxAI/MiniMax-M2:No lightning attention?
View all activity

Organizations

Dev Mode Explorers's profile picture Djuna Test Lab's profile picture Punya's profile picture

djuna 's datasets

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs