lightonai/LightOnOCR-2-1B · RLVR Strategy Suggestions

#30

by TheOfficialAJ - opened 16 days ago

I am trying to fine-tune the base variant for a table parsing task. I am also looking into outputting the tables in OTSL instead of HTML to save tokens.

After regular fine-tuning, I want to experiment with RLVR to better enforce the table structure. I couldn't find the exact RLVR training strategy described in either the paper or the fine-tuning notebook.
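To make "enforce the structure" concrete, here is a rough sketch of the kind of verifiable reward this could use. It is only an illustration, not the pipeline LightOn used. It assumes the OTSL tokens are written as `<fcel>`, `<ecel>`, `<lcel>`, `<ucel>`, `<xcel>` and `<nl>` (my assumption about the serialization), and the function names `parse_otsl` and `otsl_structure_reward` are hypothetical. It returns 1.0 when the output is a rectangular grid that satisfies the OTSL neighbour rules, and 0.0 otherwise.

```python
import re

# Assumed token spelling; adjust to whatever the fine-tuned tokenizer emits.
TOKEN_RE = re.compile(r"<(fcel|ecel|lcel|ucel|xcel|nl)>")
CELL = {"fcel", "ecel"}  # real cells, with or without content


def parse_otsl(text):
    """Split an OTSL string into rows of structure tokens (cell text is ignored).

    Returns None if the last row is not terminated by <nl>.
    """
    rows, row = [], []
    for tok in TOKEN_RE.findall(text):
        if tok == "nl":
            rows.append(row)
            row = []
        else:
            row.append(tok)
    if row:  # leftover tokens without a closing <nl>
        return None
    return rows


def otsl_structure_reward(text):
    """Binary reward: 1.0 if the OTSL structure is valid, else 0.0."""
    rows = parse_otsl(text)
    if not rows or not rows[0]:
        return 0.0
    width = len(rows[0])
    if any(len(r) != width for r in rows):  # grid must be rectangular
        return 0.0
    for i, r in enumerate(rows):
        for j, tok in enumerate(r):
            left = r[j - 1] if j > 0 else None
            up = rows[i - 1][j] if i > 0 else None
            # lcel extends the cell to its left
            if tok == "lcel" and not (left in CELL or left == "lcel"):
                return 0.0
            # ucel extends the cell above it
            if tok == "ucel" and not (up in CELL or up == "ucel"):
                return 0.0
            # xcel sits inside a 2D span: ucel/xcel to the left, lcel/xcel above
            if tok == "xcel" and not (
                left in {"ucel", "xcel"} and up in {"lcel", "xcel"}
            ):
                return 0.0
    return 1.0
```

A binary score like this only checks well-formedness, so in practice it would probably need to be combined with a content-level score against the ground-truth table.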

Is it possible to get access to the RLVR pipeline?
