JetBrains
/

Mellum-4b-sft-python

Text Generation

text-generation-inference

Model card Files Files and versions

topshik commited on May 19

Commit

2a7d386

·

verified ·

1 Parent(s): 3c521ef

Update README.md

Files changed (1) hide show

README.md +2 -51

README.md CHANGED Viewed

@@ -165,62 +165,13 @@ Designed for integration into professional developer tooling (e.g., intelligent
 - Security: Code suggestions should not be assumed to be secure or free of vulnerabilities.
 # Sample Usage
-Here are examples of how to run and sample from the model.
-## Generic generaion
 ```python
 import json
 from transformers import AutoTokenizer, AutoModelForCausalLM
-example = """
-import sys
-import os
-import time
-sys.path.append(os.getcwd())
-from cluster.prepare_data import get_headers_pairs_list, write_dist_matrix
-from cluster.token_edit_distance import get_distance_matrix
-if len(sys.argv) < 3:
-    print(
-        "Too few arguments. You should provide: \n1. dataset_filename" +
-        "\n2. output_data_filename"
-    )
-    sys.exit()
-start = time.perf_counter()
-dataset_filename_ = sys.argv[1]
-output_data_filename_ = sys.argv[2]
-headers_pairs = get_headers_pairs_list(dataset_filename_, verbose=True)
-dist_matrix, max_dist = get_distance_matrix(
-    list(map(lambda x: x[1], headers_pairs)),
-    verbose=True
-)
-write_dist_matrix(dist_matrix, max_dist, output_data_filename_, verbose=True)
-end = time.perf_counter()
-"""
-tokenizer = AutoTokenizer.from_pretrained('JetBrains/Mellum-4b-sft-python')
-model = AutoModelForCausalLM.from_pretrained('JetBrains/Mellum-4b-sft-python')
-encoded_input = tokenizer(example, return_tensors='pt', return_token_type_ids=False)
-input_len = len(encoded_input["input_ids"][0])
-out = model.generate(
-    **encoded_input,
-    max_new_tokens=100,
-)
-print("### Context")
-print(tokenizer.decode(out[0][:input_len]))
-print("### Prediction")
-print(tokenizer.decode(out[0][input_len:]))
-```
-## Fill in the middle with additional files as context generation
-```python
 example = """<filename>utils.py
 def multiply(x, y):
     return x * y

 - Security: Code suggestions should not be assumed to be secure or free of vulnerabilities.
 # Sample Usage
+Here is an example of how to run and sample from the model with additional files context and fill in the middle.
+## Fill in the middle with additional files as context generation
 ```python
 import json
 from transformers import AutoTokenizer, AutoModelForCausalLM
 example = """<filename>utils.py
 def multiply(x, y):
     return x * y