hanspeterlyngsoeraaschoujensen/DeepScaleR-1.5B-lora-256-scaling_factor_5.0-mask_cosine_0.0 Updated Sep 19, 2025
hanspeterlyngsoeraaschoujensen/DeepScaleR-1.5B-lora-256-scaling_factor_5.0-mask_cosine_0.00_0.90 Updated Aug 29, 2025
hanspeterlyngsoeraaschoujensen/llm-finetune-DeepScaleR-1.5B-Preview-128-new-tokens-scaling-factor-5.0-mask-cosi Updated Aug 27, 2025
hanspeterlyngsoeraaschoujensen/Qwen3_1.7B-fineweb_edu-train-ctx2048_layer_4 Updated Sep 24, 2025 • 45
hanspeterlyngsoeraaschoujensen/Qwen3_1.7B-fineweb_edu-train-ctx2048_layer_2 Updated Sep 24, 2025 • 24
hanspeterlyngsoeraaschoujensen/Qwen3_1.7B-fineweb_edu-train-ctx2048_layer_0 Updated Sep 24, 2025 • 44
hanspeterlyngsoeraaschoujensen/Qwen3_0.6B-fineweb_edu-train-ctx2048_layer_4 Updated Sep 24, 2025 • 37
hanspeterlyngsoeraaschoujensen/Qwen3_0.6B-fineweb_edu-train-ctx2048_layer_2 Updated Sep 24, 2025 • 32
hanspeterlyngsoeraaschoujensen/Qwen3_0.6B-fineweb_edu-train-ctx2048_layer_0 Updated Sep 24, 2025 • 27
hanspeterlyngsoeraaschoujensen/OpenR1-Math-every_n_tokens250-spacy_segmenter-basic_strategy Viewer • Updated Aug 26, 2025 • 93.5k • 4