AI & ML interests
AI security & privacy, algorithmic bias, foundations of ML
Papers
- Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
- Gemstones: A Model Suite for Multi-Faceted Scaling Laws
Models described in the refusal token paper published at COLM 2025 (a hedged inference sketch follows the list):
- tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast (8B)
- tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast-multiple-tokens (8B)
- tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast-single-token (8B)
- tomg-group-umd/zephyr-llama3-8b-sft-no-refusal-messages (8B)
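The refusal-token idea is to train the model to emit a dedicated refusal token as its first output, so the probability of that token can be thresholded at inference to trade refusal rate against helpfulness. A minimal sketch, assuming the checkpoint's tokenizer contains a single refusal token; the literal string "[refuse]" is a guess, not confirmed by the repo, so check the tokenizer's special tokens first:

```python
# Minimal sketch: read out the first-token probability of an assumed
# refusal token and threshold it. "[refuse]" is a hypothetical token name.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast-single-token"
tok = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(repo, torch_dtype=torch.bfloat16)

ids = tok.apply_chat_template(
    [{"role": "user", "content": "How do I pick a lock?"}],
    add_generation_prompt=True,
    return_tensors="pt",
)
with torch.no_grad():
    next_token_probs = model(ids).logits[0, -1].softmax(dim=-1)

refusal_id = tok.convert_tokens_to_ids("[refuse]")  # assumed token string
print("P(refusal) =", next_token_probs[refusal_id].item())
# Raising or lowering the decision threshold on this probability is what
# lets a deployer calibrate how often the model refuses.
```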
LoRI adapters for natural language understanding, code generation, mathematical reasoning, and safety alignment, based on LLaMA-3-8B and Mistral-7B (a hedged loading sketch follows the list):
- tomg-group-umd/LoRI-S_safety_mistral7b_rank_64 (text generation)
- tomg-group-umd/LoRI-S_safety_mistral7b_rank_32 (text generation)
- tomg-group-umd/LoRI-S_safety_llama3_rank_64 (text generation)
- tomg-group-umd/LoRI-S_safety_llama3_rank_32 (text generation)
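A minimal loading sketch, assuming the adapters are stored in standard PEFT format and that "mistralai/Mistral-7B-v0.1" is the matching base checkpoint (both are assumptions; the adapter's README is authoritative):

```python
# Attach one LoRI safety adapter to its base model via PEFT.
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "mistralai/Mistral-7B-v0.1"  # assumed base model
base = AutoModelForCausalLM.from_pretrained(base_id)
model = PeftModel.from_pretrained(
    base, "tomg-group-umd/LoRI-S_safety_mistral7b_rank_32"
)
tok = AutoTokenizer.from_pretrained(base_id)

inputs = tok("Write a short refusal to a harmful request.", return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=64)
print(tok.decode(out[0], skip_special_tokens=True))
```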
Checkpoints for recurrent LLMs that scale test-time compute by recurring in latent space (a conceptual sketch follows the list):
- tomg-group-umd/huginn-0125 (4B, text generation)
- Paper: Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach (arXiv 2502.05171)
- tomg-group-umd/huginn_swa_100_10_avg_0.9_merge (4B, text generation)
- tomg-group-umd/step-00010752-recurrence_full_512_0 (4B, text generation)
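The recurrent-depth idea is that one weight-shared core block is iterated r times in latent space, so test-time compute grows with the number of iterations while the parameter count stays fixed. A conceptual toy in plain PyTorch, not the Huginn checkpoints' actual API (consult the model card for that):

```python
# Toy depth-recurrent LM: embed ("prelude"), iterate a shared core block
# on a latent state, then decode ("coda"). Iteration count r is a
# test-time knob; the weights are the same at every step.
import torch
import torch.nn as nn

class RecurrentDepthLM(nn.Module):
    def __init__(self, vocab=32000, d=512, n_heads=8):
        super().__init__()
        self.embed = nn.Embedding(vocab, d)              # prelude
        self.core = nn.TransformerEncoderLayer(d, n_heads, batch_first=True)
        self.adapter = nn.Linear(2 * d, d)               # mix state + input
        self.head = nn.Linear(d, vocab)                  # coda

    def forward(self, ids, r=8):
        e = self.embed(ids)
        s = torch.randn_like(e)                          # random initial state
        for _ in range(r):                               # same weights each pass
            s = self.core(self.adapter(torch.cat([s, e], dim=-1)))
        return self.head(s)

logits = RecurrentDepthLM()(torch.randint(0, 32000, (1, 16)), r=32)
print(logits.shape)  # (1, 16, 32000)
```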
https://arxiv.org/abs/2509.02563
Our 22 open-source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths (256 to 3072) and 18 depths (3 to 80); a hedged loading sketch follows.
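A minimal sketch for sweeping the suite. The repo id pattern "Gemstone-<width>x<depth>" is hypothetical, used here only to illustrate a width-by-depth sweep; check the collection for the exact repository names and any checkpoint/revision arguments:

```python
# Load one model from the sweep and count its parameters.
from transformers import AutoModelForCausalLM

repo = "tomg-group-umd/Gemstone-256x3"  # hypothetical width-x-depth name
model = AutoModelForCausalLM.from_pretrained(repo)
n_params = sum(p.numel() for p in model.parameters())
print(f"{repo}: {n_params / 1e6:.0f}M parameters")
```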
How to extract style from images: model, dataset, and paper
Hugging Face collection for all things CLRS-Text
Artifacts from our paper "Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs" (a hedged sketch of the loss follows the list):
- Paper: Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs (arXiv 2406.10209)
- tomg-group-umd/3-goldfish-loss-llama-1B (1B, text generation)
- tomg-group-umd/4-goldfish-loss-llama-1B (1B, text generation)
- tomg-group-umd/8-goldfish-loss-llama-1B (1B, text generation)
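The goldfish loss drops a deterministic subset of token positions from the next-token loss, so the model never receives a gradient on any complete training sequence and cannot reproduce one verbatim; the 3/4/8 prefixes in the model names presumably correspond to the drop frequency k. A minimal sketch with a static 1-in-k mask (the paper also describes a hashed variant):

```python
# Goldfish loss sketch: exclude every k-th target token from the
# cross-entropy loss via the standard ignore_index mechanism.
import torch
import torch.nn.functional as F

def goldfish_loss(logits, labels, k=4):
    # logits: (batch, seq, vocab); labels: (batch, seq)
    shift_logits = logits[:, :-1].contiguous()
    shift_labels = labels[:, 1:].clone()
    # Mask every k-th target position so it contributes no gradient.
    positions = torch.arange(shift_labels.size(1), device=labels.device)
    shift_labels[:, positions % k == k - 1] = -100   # ignore_index sentinel
    return F.cross_entropy(
        shift_logits.view(-1, shift_logits.size(-1)),
        shift_labels.view(-1),
        ignore_index=-100,
    )

loss = goldfish_loss(torch.randn(2, 16, 100), torch.randint(0, 100, (2, 16)), k=4)
print(loss.item())
```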