nielsr HF Staff commited on
Commit
992669f
·
verified ·
1 Parent(s): 47a3a89

Add pipeline tag and Hugging Face paper link

Browse files

This PR improves the model card by:

- Adding the `pipeline_tag: text-retrieval` to the metadata, which ensures the model can be discovered via the Hugging Face Hub's pipeline filters (e.g., at https://huggingface.co/models?pipeline_tag=text-retrieval). This accurately reflects the model's function as a dense retriever.
- Adding a prominent link to the official Hugging Face paper page (`https://huggingface.co/papers/2507.03922`) at the top of the model card for easy access.
- Clarifying the existing paper references in the `Introduction` and `Model List` sections to explicitly mention "on arXiv" without replacing the original arXiv links.

The existing sample usage, GitHub link, and license information are preserved as they are already correctly provided.

Files changed (1) hide show
  1. README.md +11 -9
README.md CHANGED
@@ -1,20 +1,22 @@
1
  ---
2
- tags:
3
- - transformers
4
- - sentence-transformers
5
  language:
6
  - en
7
- license: apache-2.0
8
  library_name: transformers
9
- base_model:
10
- - bge-large-en-v1.5
 
 
 
11
  model_index:
12
  - name: kpr-bge-large-en-v1.5
13
- results:
14
  ---
15
 
16
  # Knowledgeable Embedding: kpr-bge-large-en-v1.5
17
 
 
 
18
  ## Introduction
19
 
20
  **Injecting dynamically updatable entity knowledge into embeddings to enhance RAG**
@@ -27,7 +29,7 @@ Although RAG typically relies on embedding-based retrieval, the embedding models
27
 
28
  **The entity knowledge is pluggable and can be dynamically updated.**
29
 
30
- For further details, refer to [our paper](https://arxiv.org/abs/2507.03922) or [GitHub repository](https://github.com/knowledgeable-embedding/knowledgeable-embedding).
31
 
32
  ## Model List
33
 
@@ -39,7 +41,7 @@ For further details, refer to [our paper](https://arxiv.org/abs/2507.03922) or [
39
  | [knowledgeable-ai/kpr-bge-base-en-v1.5](https://huggingface.co/knowledgeable-ai/kpr-bge-base-en-v1.5) | 112M | [bge-base-en-v1.5](https://huggingface.co/BAAI/bge-base-en-v1.5) |
40
  | [knowledgeable-ai/kpr-bge-large-en-v1.5](https://huggingface.co/knowledgeable-ai/kpr-bge-large-en-v1.5) | 340M | [bge-large-en-v1.5](https://huggingface.co/BAAI/bge-large-en-v1.5) |
41
 
42
- For practical use, we recommend `knowledgeable-ai/kpr-bge-*`, which significantly outperforms state-of-the-art models on queries involving less-frequent entities while performing comparably on other queries, as reported in [our paper](https://arxiv.org/abs/2507.03922).
43
 
44
  Regarding the model size, we do not count the entity embeddings since they are stored in CPU memory and have a negligible impact on runtime performance. See [this page](https://github.com/knowledgeable-embedding/knowledgeable-embedding/wiki/Internals-of-Knowledgeable-Embedding) for details.
45
 
 
1
  ---
2
+ base_model:
3
+ - bge-large-en-v1.5
 
4
  language:
5
  - en
 
6
  library_name: transformers
7
+ license: apache-2.0
8
+ pipeline_tag: text-retrieval
9
+ tags:
10
+ - transformers
11
+ - sentence-transformers
12
  model_index:
13
  - name: kpr-bge-large-en-v1.5
 
14
  ---
15
 
16
  # Knowledgeable Embedding: kpr-bge-large-en-v1.5
17
 
18
+ This model is presented in the paper [Dynamic Injection of Entity Knowledge into Dense Retrievers](https://huggingface.co/papers/2507.03922).
19
+
20
  ## Introduction
21
 
22
  **Injecting dynamically updatable entity knowledge into embeddings to enhance RAG**
 
29
 
30
  **The entity knowledge is pluggable and can be dynamically updated.**
31
 
32
+ For further details, refer to [our paper on arXiv](https://arxiv.org/abs/2507.03922) or [GitHub repository](https://github.com/knowledgeable-embedding/knowledgeable-embedding).
33
 
34
  ## Model List
35
 
 
41
  | [knowledgeable-ai/kpr-bge-base-en-v1.5](https://huggingface.co/knowledgeable-ai/kpr-bge-base-en-v1.5) | 112M | [bge-base-en-v1.5](https://huggingface.co/BAAI/bge-base-en-v1.5) |
42
  | [knowledgeable-ai/kpr-bge-large-en-v1.5](https://huggingface.co/knowledgeable-ai/kpr-bge-large-en-v1.5) | 340M | [bge-large-en-v1.5](https://huggingface.co/BAAI/bge-large-en-v1.5) |
43
 
44
+ For practical use, we recommend `knowledgeable-ai/kpr-bge-*`, which significantly outperforms state-of-the-art models on queries involving less-frequent entities while performing comparably on other queries, as reported in [our paper on arXiv](https://arxiv.org/abs/2507.03922).
45
 
46
  Regarding the model size, we do not count the entity embeddings since they are stored in CPU memory and have a negligible impact on runtime performance. See [this page](https://github.com/knowledgeable-embedding/knowledgeable-embedding/wiki/Internals-of-Knowledgeable-Embedding) for details.
47