ernie-research/TLDR-Gemma-2B-MA-PPO-Fixed5
Tags: Safetensors, openai/summarize_from_feedback, gemma
arXiv: 2410.02743
License: mit
Branch: main
Size: 5.03 GB
Contributors: 1
History: 3 commits
Latest commit: 52e0f54 (verified), "Create README.md", by Moyu-hrsun, 9 months ago
| File | Size | Scan | Storage | Last commit | Updated |
|---|---|---|---|---|---|
| .gitattributes | 1.57 kB | Safe | - | Upload folder using huggingface_hub | about 1 year ago |
| README.md | 5.13 kB | - | - | Create README.md | 9 months ago |
| config.json | 686 Bytes | - | - | Upload folder using huggingface_hub | about 1 year ago |
| generation_config.json | 132 Bytes | Safe | - | Upload folder using huggingface_hub | about 1 year ago |
| model-00001-of-00002.safetensors | 4.95 GB | - | xet | Upload folder using huggingface_hub | about 1 year ago |
| model-00002-of-00002.safetensors | 67.1 MB | - | xet | Upload folder using huggingface_hub | about 1 year ago |
| model.safetensors.index.json | 13.5 kB | Safe | - | Upload folder using huggingface_hub | about 1 year ago |
| special_tokens_map.json | 555 Bytes | Safe | - | Upload folder using huggingface_hub | about 1 year ago |
| tokenizer.json | 17.5 MB | Safe | xet | Upload folder using huggingface_hub | about 1 year ago |
| tokenizer.model | 4.24 MB | Safe | xet | Upload folder using huggingface_hub | about 1 year ago |
| tokenizer_config.json | 1.11 kB | Safe | - | Upload folder using huggingface_hub | about 1 year ago |
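
The weights in this listing are split across two shards (model-00001-of-00002.safetensors and model-00002-of-00002.safetensors), with model.safetensors.index.json recording which shard holds each tensor. The sketch below shows one way such an index could be read, assuming the usual sharded-checkpoint layout with a `weight_map` object that maps tensor names to shard file names. The index contents, tensor names, and `total_size` value here are hypothetical and hand-written for illustration; they are not read from this repository, and `tensors_by_shard` is a helper invented for this example.

```python
import json
from collections import defaultdict

# Hypothetical index, written by hand in the common sharded-checkpoint layout.
# Tensor names and total_size are illustrative assumptions, not this repo's data.
index_text = json.dumps({
    "metadata": {"total_size": 123},
    "weight_map": {
        "model.embed_tokens.weight": "model-00001-of-00002.safetensors",
        "model.layers.0.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
        "model.norm.weight": "model-00002-of-00002.safetensors",
    },
})


def tensors_by_shard(index_json):
    """Group tensor names by the shard file that stores them."""
    weight_map = json.loads(index_json)["weight_map"]
    shards = defaultdict(list)
    for tensor_name, shard_file in weight_map.items():
        shards[shard_file].append(tensor_name)
    # Sort shard names and tensor names so the result is deterministic.
    return {shard: sorted(names) for shard, names in sorted(shards.items())}


grouped = tensors_by_shard(index_text)
for shard, names in grouped.items():
    print(f"{shard}: {len(names)} tensor(s)")
```

To inspect the real file, the same function could be given the text of a locally downloaded model.safetensors.index.json in place of `index_text`.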