User Modeling
Paper • 2505.16467 • Published • Note: ✅
IP-Dialog: Evaluating Implicit Personalization in Dialogue Systems with Synthetic Data
Paper • 2506.02449 • Published • 1 • Note: ✅
Localizing Persona Representations in LLMs
Paper • 2505.24539 • Published • Note: ✅
Persona Vectors: Monitoring and Controlling Character Traits in Language Models
Paper • 2507.21509 • Published • 33 • Note: ✅
Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale
Paper • 2504.14225 • Published • 1 • Note: ✅
Language Models Change Facts Based on the Way You Talk
Paper • 2507.14238 • Published • 1 • Note: ✅
Learning a Generative Meta-Model of LLM Activations
Paper • 2602.06964 • Published • 3 • Note: keyword: latent user attributes probing LLM activations - definitely check out
CLaS-Bench: A Cross-Lingual Alignment and Steering Benchmark
Paper • 2601.08331 • Published • Note: keyword: linear probes demographic inference language model residual stream
Bi-directional Bias Attribution: Debiasing Large Language Models without Modifying Prompts
Paper • 2602.04398 • Published • Note: recommended (similar to seeds)
Reasoning Beyond Chain-of-Thought: A Latent Computational Mode in Large Language Models
Paper • 2601.08058 • Published • Note: recommended (similar to seeds)
YaPO: Learnable Sparse Activation Steering Vectors for Domain Adaptation
Paper • 2601.08441 • Published • 8 • Note: recommended (similar to seeds)
Steer2Edit: From Activation Steering to Component-Level Editing
Paper • 2602.09870 • Published • 1 • Note: recommended (similar to seeds)
Endogenous Resistance to Activation Steering in Language Models
Paper • 2602.06941 • Published • Note: recommended (similar to seeds)
BLOCK-EM: Preventing Emergent Misalignment by Blocking Causal Features
Paper • 2602.00767 • Published • Note: recommended (similar to seeds)
AntiPaSTO: Self-Supervised Steering of Moral Reasoning
Paper • 2601.07473 • Published • 1 • Note: recommended (similar to seeds)
Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics
Paper • 2602.02343 • Published • 13 • Note: recommended (similar to seeds)
Who's asking? User personas and the mechanics of latent misalignment
Paper • 2406.12094 • Published
Contextualized Visual Personalization in Vision-Language Models
Paper • 2602.03454 • Published • 3 • Note: keyword: implicit personalization interpretability language models
When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs
Paper • 2601.11000 • Published • 27 • Note: recommended (similar to seeds)
Lost in the Prompt Order: Revealing the Limitations of Causal Attention in Language Models
Paper • 2601.14152 • Published • 6 • Note: recommended (similar to seeds)
Simplifying Outcomes of Language Model Component Analyses with ELIA
Paper • 2602.18262 • Published • 1 • Note: keyword: LLM user persona representation mechanistic interpretability
Language-based Trial and Error Falls Behind in the Era of Experience
Paper • 2601.21754 • Published • 16 • Note: keyword: latent user attributes probing LLM activations
Persona Prompting as a Lens on LLM Social Reasoning
Paper • 2601.20757 • Published • 3 • Note: keyword: steering vectors user persona demographic bias LLM
Fine-Grained Activation Steering: Steering Less, Achieving More
Paper • 2602.04428 • Published • Note: recommended (similar to seeds)