llama2-7b-coding-fft

This model is a fully fine-tuned (FFT) version of LLaMA 2 7B, trained on coding datasets as part of a replication of the Mask Fine-Tuning (MFT) paper.

Model Details

  • Base Model: meta-llama/Llama-2-7b-hf
  • Training Type: Full Fine-Tuning (FFT)
  • Domain: Coding
  • Hardware: TPU v4-8
  • Training Framework: PyTorch + torch_xla
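
To load the checkpoint for inference, here is a minimal sketch assuming the standard transformers generation API (the repo id is taken from this model page; prompt and decoding settings are illustrative):

```python
# Minimal inference sketch (assumes transformers >= 4.31 and enough memory for a 7B model).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "Chrisfrancisque/llama2-7b-coding-fft"
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id, torch_dtype=torch.bfloat16, device_map="auto"
)

prompt = "Write a Python function that reverses a string."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```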

Training Data

The model was trained on 30,000 samples from three coding datasets (matching the paper); a sketch of assembling this mixture follows the list:

  • Tulu 3 Persona Python: 10,000 samples
  • Evol CodeAlpaca: 10,000 samples
  • Code-Alpaca: 10,000 samples
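
A minimal sketch of building the mixture with the Hugging Face datasets library. The dataset repo ids and the prompt template below are assumptions and may differ from the exact sources and formatting used in training:

```python
# Sketch: build the 30k-sample coding mixture with the datasets library.
# The dataset repo ids are assumptions and may differ from the exact sources used.
from datasets import load_dataset, concatenate_datasets

SOURCES = {
    "allenai/tulu-3-sft-personas-code": 10_000,   # Tulu 3 Persona Python (assumed id)
    "theblackcat102/evol-codealpaca-v1": 10_000,  # Evol CodeAlpaca (assumed id)
    "sahil2801/CodeAlpaca-20k": 10_000,           # Code-Alpaca (assumed id)
}

def format_example(ex):
    # Hypothetical prompt template; column names differ across the three sources.
    if "messages" in ex:  # chat-style rows
        return "\n".join(f"{m['role']}: {m['content']}" for m in ex["messages"])
    instruction, inp, output = ex.get("instruction", ""), ex.get("input", ""), ex.get("output", "")
    prompt = instruction if not inp else f"{instruction}\n\n{inp}"
    return f"### Instruction:\n{prompt}\n\n### Response:\n{output}"

parts = []
for repo_id, n in SOURCES.items():
    ds = load_dataset(repo_id, split="train").shuffle(seed=42).select(range(n))
    # Reduce every source to a single "text" column so the datasets can be concatenated.
    ds = ds.map(lambda ex: {"text": format_example(ex)}, remove_columns=ds.column_names)
    parts.append(ds)

mixture = concatenate_datasets(parts).shuffle(seed=42)
print(len(mixture))  # 30000
```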

Training Configuration

  • Epochs: 2
  • Sequence Length: 4096
  • Learning Rate: 2e-5
  • Batch Size: 8 (effective)
  • Optimizer: AdamW
  • LR Scheduler: Linear with warmup
  • Mixed Precision: bfloat16
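
For reference, a minimal sketch of the optimizer and scheduler setup implied by this configuration, using transformers utilities (the actual run used torch_xla on TPU; device and data handling are omitted, and the warmup ratio is an assumption):

```python
# Sketch of the optimizer/scheduler wiring implied by the configuration above.
import torch
from transformers import AutoModelForCausalLM, get_linear_schedule_with_warmup

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",
    torch_dtype=torch.bfloat16,  # mixed precision: bfloat16
)

# 30,000 samples / effective batch size 8 = 3,750 steps per epoch; 2 epochs = 7,500 steps,
# which matches the total step count reported under Training Results.
total_steps = (30_000 // 8) * 2
warmup_steps = int(0.03 * total_steps)  # warmup ratio is an assumption, not stated in this card

optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)
scheduler = get_linear_schedule_with_warmup(
    optimizer, num_warmup_steps=warmup_steps, num_training_steps=total_steps
)
# Inside the training loop: loss.backward(); optimizer.step(); scheduler.step(); optimizer.zero_grad()
```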

Training Results

  • Final Loss: 0.1535
  • Final Perplexity: 1.167
  • Training Time: ~7 hours on TPU v4-8
  • Total Steps: 7500
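
The reported perplexity is presumably the exponential of the mean token-level cross-entropy loss; a quick sanity check (small differences can come from how the loss is averaged across batches):

```python
import math

final_loss = 0.1535
print(math.exp(final_loss))  # ~1.166, consistent with the reported perplexity of ~1.167
```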

Loss Progression

  • Epoch 0: 0.4259
  • Epoch 1: 0.1535

Intended Use

This model serves as the FFT baseline for the Mask Fine-Tuning paper replication. It will be evaluated on the following (a generation sketch is shown after the list):

  • HumanEval (code generation benchmark)
  • Target: match the paper's FFT baseline of 29.3%
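
A minimal generation sketch for HumanEval, assuming OpenAI's human-eval package for problem loading and scoring; the greedy decoding and 512-token budget are assumptions, not necessarily the paper's exact settings:

```python
# Sketch: generate HumanEval completions for scoring with human-eval's
# evaluate_functional_correctness CLI. Decoding settings are assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from human_eval.data import read_problems, write_jsonl

repo_id = "Chrisfrancisque/llama2-7b-coding-fft"
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id, torch_dtype=torch.bfloat16, device_map="auto"
)

samples = []
for task_id, problem in read_problems().items():
    inputs = tokenizer(problem["prompt"], return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=512, do_sample=False)
    # Keep only the newly generated tokens as the completion.
    completion = tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)
    samples.append({"task_id": task_id, "completion": completion})

write_jsonl("humaneval_samples.jsonl", samples)
# Then score with: evaluate_functional_correctness humaneval_samples.jsonl
```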

Evaluation

Evaluation on HumanEval is pending. Results will be updated here once available.

Citation

If you use this model, please cite the original MFT paper:

@article{mft2025,
  title={Mask Fine-Tuning},
  author={[Authors from paper]},
  journal={arXiv preprint arXiv:2503.22764v1},
  year={2025}
}

Reproducibility

Training configuration and code are available in the accompanying GitHub repository.

License

This model inherits the LLaMA 2 Community License from the base model.
