v2 version is not abliterated, but v1 is

by IggyS - opened 4 days ago

4 days ago

I compared this v2 version to your initial kldzj/gpt-oss-120b-heretic version and this v2 is not abliterated. The initial one does not refuse on any topic. This v2 version refuses a lot. Something is off. Just to let you know.
The initial version is awesome! Thank you for your it and for your PR on github.

kldzj

Owner 3 days ago

Thanks for the heads up, may I ask what prompts you're testing with? :)

IggyS

3 days ago

very simple testing that every model should refuse like "tell me how to kill ..."

DontPlanToEnd

about 10 hours ago

Just added both to https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard. Yep, v1 is the one to go with.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment