Upload definitive calibrated Brain 2 model (v3) with model card
- README.md +53 -0
- added_tokens.json +3 -0
- checkpoint-131/added_tokens.json +3 -0
- checkpoint-131/config.json +45 -0
- checkpoint-131/model.safetensors +3 -0
- checkpoint-131/optimizer.pt +3 -0
- checkpoint-131/rng_state.pth +3 -0
- checkpoint-131/scaler.pt +3 -0
- checkpoint-131/scheduler.pt +3 -0
- checkpoint-131/special_tokens_map.json +51 -0
- checkpoint-131/spm.model +3 -0
- checkpoint-131/tokenizer.json +0 -0
- checkpoint-131/tokenizer_config.json +63 -0
- checkpoint-131/trainer_state.json +46 -0
- checkpoint-131/training_args.bin +3 -0
- config.json +45 -0
- model.safetensors +3 -0
- special_tokens_map.json +51 -0
- spm.model +3 -0
- tokenizer.json +0 -0
- tokenizer_config.json +63 -0
- training_args.bin +3 -0
README.md
ADDED
@@ -0,0 +1,53 @@
---
license: mit
language: en
pipeline_tag: text-classification
tags:
- fact-checking
- fake-news
- deberta
- credo-ai
---

# fact-check1-v3-final (Credo AI - Brain 2)

This is the definitive "Brain 2" model from the **Credo AI** project: a robust binary (FAKE/REAL) news classifier. It is the culmination of a multi-stage refinement process designed to produce a highly accurate and unbiased fake-news specialist.

## The Model's Journey: From Specialist to Sage

This model is a fine-tuned version of `microsoft/deberta-v3-large`.

- **v1:** The model was initially trained on a combined corpus of over 50,000 news articles, reaching 99.9%+ accuracy on its core task. However, this intense specialization made it a "conspiracy theorist": it classified simple, factual statements as FAKE because they did not match the stylistic patterns of the news it was trained on.
- **v2:** The model underwent a "scientific calibration" by being fine-tuned on a dataset of pristine scientific facts (`allenai/scifact`). This corrected its bias against clean, encyclopedic text, but the model remained too narrow.
- **v3 (this version):** The model completed its final "masterclass." It was fine-tuned on a balanced dataset containing:
  - **General knowledge:** thousands of clean, factual statements from Wikipedia (the `wiki_qa` dataset) to broaden its understanding of truth.
  - **Memory reinforcement:** a sample of classic fake-news articles (`mrisdal/fake-news`) to reinforce its core mission and prevent catastrophic forgetting.

This final calibration, performed with an ultra-low learning rate, produced a model that is both a strong fake-news detector and a robust, general-purpose fact-checker.
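For orientation, the calibration step can be reproduced along these lines. This is a minimal sketch, not the published training script: `load_calibration_dataset` and the starting checkpoint path are placeholders, and the exact learning rate is not documented (1e-6 is illustrative). The single epoch and batch size of 4 match what the uploaded `checkpoint-131/trainer_state.json` records.

```python
# Sketch of the v3 calibration fine-tune (hypothetical helper and path names).
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

base = "path/to/fact-check1-v2"  # placeholder: the v2 checkpoint this run starts from
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForSequenceClassification.from_pretrained(base, num_labels=2)

# Placeholder helper: returns tokenized wiki_qa facts + a mrisdal/fake-news sample.
train_ds, eval_ds = load_calibration_dataset(tokenizer)

args = TrainingArguments(
    output_dir="./fact-check1-v3-final",
    num_train_epochs=1,               # single calibration epoch (per trainer_state.json)
    per_device_train_batch_size=4,    # matches train_batch_size in trainer_state.json
    learning_rate=1e-6,               # "ultra-low" rate; exact value not published
    eval_strategy="epoch",
    save_strategy="epoch",
    fp16=True,                        # scaler.pt in the checkpoint suggests mixed precision
)

trainer = Trainer(model=model, args=args,
                  train_dataset=train_ds, eval_dataset=eval_ds)
trainer.train()
```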
## Intended Use

This model is intended as a fast first-line filter for English-language news headlines and short texts.

```python
from transformers import pipeline

classifier = pipeline("text-classification", model="Arko007/fact-check1-v3-final")

# Example 1: a simple fact (the kind of input v1 used to fail on)
text1 = "The sun rises in the east."
print(classifier(text1))
# Expected output: [{'label': 'REAL', 'score': ...}]

# Example 2: fake news
text2 = "BREAKING: Scientists confirm lizards are secretly running the government."
print(classifier(text2))
# Expected output: [{'label': 'FAKE', 'score': ...}]
```

## Limitations

While this v3 model is far more robust than its predecessors, its core training data was political and general news. Its performance may be lower on highly specialized domains such as financial analysis or deep scientific literature.
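Because the model is meant as a first-line filter rather than a final arbiter, downstream code may want to act only on confident predictions. A minimal sketch of batch filtering with a confidence cutoff (the 0.9 threshold is an arbitrary illustration, not a calibrated value):

```python
from transformers import pipeline

classifier = pipeline("text-classification", model="Arko007/fact-check1-v3-final")

headlines = [
    "The sun rises in the east.",
    "BREAKING: Scientists confirm lizards are secretly running the government.",
]

# Flag only high-confidence FAKE predictions; route the rest to human review.
for headline, result in zip(headlines, classifier(headlines)):
    if result["label"] == "FAKE" and result["score"] >= 0.9:
        print(f"FLAGGED: {headline} (score={result['score']:.3f})")
    else:
        print(f"PASSED:  {headline}")
```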
added_tokens.json
ADDED
@@ -0,0 +1,3 @@
{
  "[MASK]": 128000
}
checkpoint-131/added_tokens.json
ADDED
@@ -0,0 +1,3 @@
{
  "[MASK]": 128000
}
checkpoint-131/config.json
ADDED
@@ -0,0 +1,45 @@
{
  "architectures": [
    "DebertaV2ForSequenceClassification"
  ],
  "attention_probs_dropout_prob": 0.1,
  "bos_token_id": 1,
  "dtype": "float32",
  "eos_token_id": 2,
  "hidden_act": "gelu",
  "hidden_dropout_prob": 0.1,
  "hidden_size": 1024,
  "id2label": {
    "0": "REAL",
    "1": "FAKE"
  },
  "initializer_range": 0.02,
  "intermediate_size": 4096,
  "label2id": {
    "FAKE": 1,
    "REAL": 0
  },
  "layer_norm_eps": 1e-07,
  "legacy": true,
  "max_position_embeddings": 512,
  "max_relative_positions": -1,
  "model_type": "deberta-v2",
  "norm_rel_ebd": "layer_norm",
  "num_attention_heads": 16,
  "num_hidden_layers": 24,
  "pad_token_id": 0,
  "pooler_dropout": 0,
  "pooler_hidden_act": "gelu",
  "pooler_hidden_size": 1024,
  "pos_att_type": [
    "p2c",
    "c2p"
  ],
  "position_biased_input": false,
  "position_buckets": 256,
  "relative_attention": true,
  "share_att_key": true,
  "transformers_version": "4.56.1",
  "type_vocab_size": 0,
  "vocab_size": 128100
}
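The `id2label` / `label2id` entries in this config are what let the pipeline return human-readable REAL/FAKE labels instead of raw class indices. A quick way to confirm the mapping when loading the published model (a small sketch, assuming the repo id from the model card):

```python
from transformers import AutoConfig

config = AutoConfig.from_pretrained("Arko007/fact-check1-v3-final")
print(config.id2label)   # expected: {0: 'REAL', 1: 'FAKE'}
print(config.label2id)   # expected: {'REAL': 0, 'FAKE': 1}
```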
checkpoint-131/model.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8980f1c1b68499cc06ba3fc000c3b69cf10f108c7f08bde26d9d643b70160dc0
size 1740304440
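The three lines above are a Git LFS pointer, not the weights themselves: the ~1.7 GB safetensors file lives in LFS storage and is resolved on download. The root-level `model.safetensors` below has the same sha256 oid, i.e. the same blob. A minimal way to fetch a single file from the repo (a sketch using `huggingface_hub`; the repo id is assumed from the model card):

```python
from huggingface_hub import hf_hub_download

# Downloads the actual weights file the LFS pointer refers to and returns its local path.
path = hf_hub_download(
    repo_id="Arko007/fact-check1-v3-final",
    filename="model.safetensors",
)
print(path)
```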
checkpoint-131/optimizer.pt
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:40a53f68145fda753e88b4551fffc0fb3d33eb9c07319e59f916e052099b6de0
size 3480846145
checkpoint-131/rng_state.pth
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a487c12f69c5d4038b2d6860be7d319eb254ae48d23495cce9ddbde7f096ee2c
size 14645
checkpoint-131/scaler.pt
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c5d63b270923670c02fe5752ae03df8e97c3cccc938df5463b8a1b8eb0a1f695
size 1383
checkpoint-131/scheduler.pt
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:87d30a9a0946534cec2f840f9e3b1c64efbd722f4b2ff4e5f8d12519dc5fd89d
size 1465
checkpoint-131/special_tokens_map.json
ADDED
@@ -0,0 +1,51 @@
{
  "bos_token": {
    "content": "[CLS]",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "cls_token": {
    "content": "[CLS]",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "eos_token": {
    "content": "[SEP]",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "mask_token": {
    "content": "[MASK]",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "pad_token": {
    "content": "[PAD]",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "sep_token": {
    "content": "[SEP]",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "unk_token": {
    "content": "[UNK]",
    "lstrip": false,
    "normalized": true,
    "rstrip": false,
    "single_word": false
  }
}
checkpoint-131/spm.model
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c679fbf93643d19aab7ee10c0b99e460bdbc02fedf34b92b05af343b4af586fd
size 2464616
checkpoint-131/tokenizer.json
ADDED
The diff for this file is too large to render. See raw diff.
checkpoint-131/tokenizer_config.json
ADDED
@@ -0,0 +1,63 @@
{
  "added_tokens_decoder": {
    "0": {
      "content": "[PAD]",
      "lstrip": false,
      "normalized": false,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "1": {
      "content": "[CLS]",
      "lstrip": false,
      "normalized": false,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "2": {
      "content": "[SEP]",
      "lstrip": false,
      "normalized": false,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "3": {
      "content": "[UNK]",
      "lstrip": false,
      "normalized": true,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "128000": {
      "content": "[MASK]",
      "lstrip": false,
      "normalized": false,
      "rstrip": false,
      "single_word": false,
      "special": true
    }
  },
  "bos_token": "[CLS]",
  "clean_up_tokenization_spaces": false,
  "cls_token": "[CLS]",
  "do_lower_case": false,
  "eos_token": "[SEP]",
  "extra_special_tokens": {},
  "mask_token": "[MASK]",
  "max_length": 512,
  "model_max_length": 1000000000000000019884624838656,
  "pad_token": "[PAD]",
  "sep_token": "[SEP]",
  "sp_model_kwargs": {},
  "split_by_punct": false,
  "stride": 0,
  "tokenizer_class": "DebertaV2Tokenizer",
  "truncation_side": "right",
  "truncation_strategy": "longest_first",
  "unk_token": "[UNK]",
  "vocab_type": "spm"
}
checkpoint-131/trainer_state.json
ADDED
@@ -0,0 +1,46 @@
{
  "best_global_step": 131,
  "best_metric": 1.0,
  "best_model_checkpoint": "./fact-check1-v3-final/checkpoint-131",
  "epoch": 1.0,
  "eval_steps": 500,
  "global_step": 131,
  "is_hyper_param_search": false,
  "is_local_process_zero": true,
  "is_world_process_zero": true,
  "log_history": [
    {
      "epoch": 1.0,
      "eval_accuracy": 1.0,
      "eval_f1": 1.0,
      "eval_loss": 6.0304661019472405e-05,
      "eval_precision": 1.0,
      "eval_recall": 1.0,
      "eval_runtime": 13.0856,
      "eval_samples_per_second": 17.729,
      "eval_steps_per_second": 2.216,
      "step": 131
    }
  ],
  "logging_steps": 500,
  "max_steps": 131,
  "num_input_tokens_seen": 0,
  "num_train_epochs": 1,
  "save_steps": 500,
  "stateful_callbacks": {
    "TrainerControl": {
      "args": {
        "should_epoch_stop": false,
        "should_evaluate": false,
        "should_log": false,
        "should_save": true,
        "should_training_stop": true
      },
      "attributes": {}
    }
  },
  "total_flos": 1613417064844800.0,
  "train_batch_size": 4,
  "trial_name": null,
  "trial_params": null
}
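The `eval_accuracy` / `eval_precision` / `eval_recall` / `eval_f1` fields in `log_history` above are the kind of metrics a `compute_metrics` callback returns to the `Trainer`, which prefixes them with `eval_`. A minimal sketch of such a callback (assuming scikit-learn is available; not necessarily the exact function used here):

```python
import numpy as np
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

def compute_metrics(eval_pred):
    # eval_pred is (logits, labels); take the argmax over the two classes.
    logits, labels = eval_pred
    preds = np.argmax(logits, axis=-1)
    precision, recall, f1, _ = precision_recall_fscore_support(
        labels, preds, average="binary"
    )
    return {
        "accuracy": accuracy_score(labels, preds),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }
```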
checkpoint-131/training_args.bin
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:15f3160be56cb9738113979ad762424630df77855858f01f6eac1de28db6603b
size 5777
config.json
ADDED
@@ -0,0 +1,45 @@
{
  "architectures": [
    "DebertaV2ForSequenceClassification"
  ],
  "attention_probs_dropout_prob": 0.1,
  "bos_token_id": 1,
  "dtype": "float32",
  "eos_token_id": 2,
  "hidden_act": "gelu",
  "hidden_dropout_prob": 0.1,
  "hidden_size": 1024,
  "id2label": {
    "0": "REAL",
    "1": "FAKE"
  },
  "initializer_range": 0.02,
  "intermediate_size": 4096,
  "label2id": {
    "FAKE": 1,
    "REAL": 0
  },
  "layer_norm_eps": 1e-07,
  "legacy": true,
  "max_position_embeddings": 512,
  "max_relative_positions": -1,
  "model_type": "deberta-v2",
  "norm_rel_ebd": "layer_norm",
  "num_attention_heads": 16,
  "num_hidden_layers": 24,
  "pad_token_id": 0,
  "pooler_dropout": 0,
  "pooler_hidden_act": "gelu",
  "pooler_hidden_size": 1024,
  "pos_att_type": [
    "p2c",
    "c2p"
  ],
  "position_biased_input": false,
  "position_buckets": 256,
  "relative_attention": true,
  "share_att_key": true,
  "transformers_version": "4.56.1",
  "type_vocab_size": 0,
  "vocab_size": 128100
}
model.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8980f1c1b68499cc06ba3fc000c3b69cf10f108c7f08bde26d9d643b70160dc0
size 1740304440
special_tokens_map.json
ADDED
@@ -0,0 +1,51 @@
{
  "bos_token": {
    "content": "[CLS]",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "cls_token": {
    "content": "[CLS]",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "eos_token": {
    "content": "[SEP]",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "mask_token": {
    "content": "[MASK]",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "pad_token": {
    "content": "[PAD]",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "sep_token": {
    "content": "[SEP]",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "unk_token": {
    "content": "[UNK]",
    "lstrip": false,
    "normalized": true,
    "rstrip": false,
    "single_word": false
  }
}
spm.model
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c679fbf93643d19aab7ee10c0b99e460bdbc02fedf34b92b05af343b4af586fd
size 2464616
tokenizer.json
ADDED
The diff for this file is too large to render. See raw diff.
tokenizer_config.json
ADDED
@@ -0,0 +1,63 @@
{
  "added_tokens_decoder": {
    "0": {
      "content": "[PAD]",
      "lstrip": false,
      "normalized": false,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "1": {
      "content": "[CLS]",
      "lstrip": false,
      "normalized": false,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "2": {
      "content": "[SEP]",
      "lstrip": false,
      "normalized": false,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "3": {
      "content": "[UNK]",
      "lstrip": false,
      "normalized": true,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "128000": {
      "content": "[MASK]",
      "lstrip": false,
      "normalized": false,
      "rstrip": false,
      "single_word": false,
      "special": true
    }
  },
  "bos_token": "[CLS]",
  "clean_up_tokenization_spaces": false,
  "cls_token": "[CLS]",
  "do_lower_case": false,
  "eos_token": "[SEP]",
  "extra_special_tokens": {},
  "mask_token": "[MASK]",
  "max_length": 512,
  "model_max_length": 1000000000000000019884624838656,
  "pad_token": "[PAD]",
  "sep_token": "[SEP]",
  "sp_model_kwargs": {},
  "split_by_punct": false,
  "stride": 0,
  "tokenizer_class": "DebertaV2Tokenizer",
  "truncation_side": "right",
  "truncation_strategy": "longest_first",
  "unk_token": "[UNK]",
  "vocab_type": "spm"
}
training_args.bin
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:15f3160be56cb9738113979ad762424630df77855858f01f6eac1de28db6603b
size 5777