Surpem
/

Supertron2-Reranker-2B

sentence-transformers

Model card Files Files and versions

Supertron2-Reranker-2B / README.md

Ill-Ness's picture

Update README.md

a2142fe verified 2 days ago

|

history blame contribute delete

3.49 kB

	---
	license: apache-2.0
	language:
	- en
	base_model:
	- Qwen/Qwen3-VL-Reranker-2B
	pipeline_tag: text-ranking
	library_name: sentence-transformers
	tags:
	- reranking
	- retrieval
	- rag
	- cross-encoder
	- qwen3-vl
	- pytorch
	---

	# Supertron2-Reranker-2B: A Compact Cross-Encoder Reranking Model

	## Model Description

	Supertron2-Reranker-2B is a reranking model built on top of [Qwen/Qwen3-VL-Reranker-2B](https://huggingface.co/Qwen/Qwen3-VL-Reranker-2B). It is designed to score query-document pairs for retrieval pipelines, search systems, and RAG applications where a stronger second-stage ranker is useful.

	* Developed by: Surpem
	* Model type: Cross-Encoder Reranker
	* Architecture: Qwen3-VL reranker, 2B parameters
	* License: Apache 2.0

	---

	## Capabilities

	### Search Reranking

	Supertron2-Reranker-2B can compare a user query against candidate passages and assign relevance scores. It is intended as a second-stage reranker after a faster retriever has already selected candidate documents.

	### RAG Pipelines

	The model can help improve retrieval-augmented generation by pushing more relevant documents toward the top of the context window before answer generation.

	### Question-Document Matching

	Supertron2-Reranker-2B is useful for matching questions to passages, snippets, help-center articles, documentation chunks, and other text candidates.

	### Instruction-Aware Retrieval

	The model is prompted for relevance scoring, making it suitable for natural language search tasks where query intent matters.

	---

	## Get Started

	```python
	from sentence_transformers import CrossEncoder

	model_id = "Surpem/Supertron2-Reranker-2B"

	model = CrossEncoder(model_id)

	pairs = [
	("What is the capital of France?", "Paris is the capital and largest city of France."),
	("What is the capital of France?", "Mars is often called the red planet."),
	]

	scores = model.predict(pairs)
	print(scores)
	```

	Example reranking:

	```python
	query = "How do I reset my password?"
	documents = [
	"Use the account recovery page to reset your password.",
	"Our refund policy allows returns within 30 days.",
	"Two-factor authentication adds extra login security.",
	]

	results = model.rank(query, documents)
	print(results)
	```

	---

	## Hardware Requirements

	\| Precision \| Min VRAM \| Recommended \|
	\|---\|---\|---\|
	\| bfloat16 \| 6 GB \| 10 GB+ \|
	\| 4-bit quantized \| 3 GB \| 6 GB+ \|

	For larger batches or long documents, use more VRAM or reduce the batch size/max sequence length.

	---

	## Intended Use

	Supertron2-Reranker-2B is intended for:

	* Search reranking
	* RAG document reranking
	* Query-passage relevance scoring
	* Documentation and knowledge-base retrieval
	* Evaluation of candidate retrieval results

	It is not intended to be used as a standalone chat model.

	---

	## Limitations

	* The model scores relevance; it does not generate answers.
	* It should be evaluated on your own retrieval domain before production use.
	* Long documents may need chunking before reranking.
	* Relevance scores are relative and may not be calibrated across unrelated queries.
	* The model may still rank incorrect, outdated, or unsafe content highly if it appears textually relevant.

	---

	## Citation

	```bibtex
	@misc{surpem2026supertron2-reranker-2b,
	title={Supertron2-Reranker-2B -- Compact Cross-Encoder Reranking Model},
	author={Surpem},
	year={2026},
	url={https://huggingface.co/Surpem/Supertron2-Reranker-2B},
	}
	```