Ill-Ness's picture
Update README.md
a2142fe verified
---
license: apache-2.0
language:
- en
base_model:
- Qwen/Qwen3-VL-Reranker-2B
pipeline_tag: text-ranking
library_name: sentence-transformers
tags:
- reranking
- retrieval
- rag
- cross-encoder
- qwen3-vl
- pytorch
---
# **Supertron2-Reranker-2B: A Compact Cross-Encoder Reranking Model**
## **Model Description**
**Supertron2-Reranker-2B** is a reranking model built on top of [Qwen/Qwen3-VL-Reranker-2B](https://huggingface.co/Qwen/Qwen3-VL-Reranker-2B). It is designed to score query-document pairs for retrieval pipelines, search systems, and RAG applications where a stronger second-stage ranker is useful.
* **Developed by:** Surpem
* **Model type:** Cross-Encoder Reranker
* **Architecture:** Qwen3-VL reranker, 2B parameters
* **License:** Apache 2.0
---
## **Capabilities**
### **Search Reranking**
Supertron2-Reranker-2B can compare a user query against candidate passages and assign relevance scores. It is intended as a second-stage reranker after a faster retriever has already selected candidate documents.
### **RAG Pipelines**
The model can help improve retrieval-augmented generation by pushing more relevant documents toward the top of the context window before answer generation.
### **Question-Document Matching**
Supertron2-Reranker-2B is useful for matching questions to passages, snippets, help-center articles, documentation chunks, and other text candidates.
### **Instruction-Aware Retrieval**
The model is prompted for relevance scoring, making it suitable for natural language search tasks where query intent matters.
---
## **Get Started**
```python
from sentence_transformers import CrossEncoder
model_id = "Surpem/Supertron2-Reranker-2B"
model = CrossEncoder(model_id)
pairs = [
("What is the capital of France?", "Paris is the capital and largest city of France."),
("What is the capital of France?", "Mars is often called the red planet."),
]
scores = model.predict(pairs)
print(scores)
```
Example reranking:
```python
query = "How do I reset my password?"
documents = [
"Use the account recovery page to reset your password.",
"Our refund policy allows returns within 30 days.",
"Two-factor authentication adds extra login security.",
]
results = model.rank(query, documents)
print(results)
```
---
## **Hardware Requirements**
| Precision | Min VRAM | Recommended |
|---|---|---|
| bfloat16 | 6 GB | 10 GB+ |
| 4-bit quantized | 3 GB | 6 GB+ |
For larger batches or long documents, use more VRAM or reduce the batch size/max sequence length.
---
## **Intended Use**
Supertron2-Reranker-2B is intended for:
* Search reranking
* RAG document reranking
* Query-passage relevance scoring
* Documentation and knowledge-base retrieval
* Evaluation of candidate retrieval results
It is not intended to be used as a standalone chat model.
---
## **Limitations**
* The model scores relevance; it does not generate answers.
* It should be evaluated on your own retrieval domain before production use.
* Long documents may need chunking before reranking.
* Relevance scores are relative and may not be calibrated across unrelated queries.
* The model may still rank incorrect, outdated, or unsafe content highly if it appears textually relevant.
---
## **Citation**
```bibtex
@misc{surpem2026supertron2-reranker-2b,
title={Supertron2-Reranker-2B -- Compact Cross-Encoder Reranking Model},
author={Surpem},
year={2026},
url={https://huggingface.co/Surpem/Supertron2-Reranker-2B},
}
```