Text Ranking
sentence-transformers
Safetensors
PyTorch
English
qwen3_vl
reranking
retrieval
rag
cross-encoder
qwen3-vl
Instructions to use Surpem/Supertron2-Reranker-2B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use Surpem/Supertron2-Reranker-2B with sentence-transformers:
from sentence_transformers import CrossEncoder model = CrossEncoder("Surpem/Supertron2-Reranker-2B") query = "Which planet is known as the Red Planet?" passages = [ "Venus is often called Earth's twin because of its similar size and proximity.", "Mars, known for its reddish appearance, is often referred to as the Red Planet.", "Jupiter, the largest planet in our solar system, has a prominent red spot.", "Saturn, famous for its rings, is sometimes mistaken for the Red Planet." ] scores = model.predict([(query, passage) for passage in passages]) print(scores) - Notebooks
- Google Colab
- Kaggle
| license: apache-2.0 | |
| language: | |
| - en | |
| base_model: | |
| - Qwen/Qwen3-VL-Reranker-2B | |
| pipeline_tag: text-ranking | |
| library_name: sentence-transformers | |
| tags: | |
| - reranking | |
| - retrieval | |
| - rag | |
| - cross-encoder | |
| - qwen3-vl | |
| - pytorch | |
| # **Supertron2-Reranker-2B: A Compact Cross-Encoder Reranking Model** | |
| ## **Model Description** | |
| **Supertron2-Reranker-2B** is a reranking model built on top of [Qwen/Qwen3-VL-Reranker-2B](https://huggingface.co/Qwen/Qwen3-VL-Reranker-2B). It is designed to score query-document pairs for retrieval pipelines, search systems, and RAG applications where a stronger second-stage ranker is useful. | |
| * **Developed by:** Surpem | |
| * **Model type:** Cross-Encoder Reranker | |
| * **Architecture:** Qwen3-VL reranker, 2B parameters | |
| * **License:** Apache 2.0 | |
| --- | |
| ## **Capabilities** | |
| ### **Search Reranking** | |
| Supertron2-Reranker-2B can compare a user query against candidate passages and assign relevance scores. It is intended as a second-stage reranker after a faster retriever has already selected candidate documents. | |
| ### **RAG Pipelines** | |
| The model can help improve retrieval-augmented generation by pushing more relevant documents toward the top of the context window before answer generation. | |
| ### **Question-Document Matching** | |
| Supertron2-Reranker-2B is useful for matching questions to passages, snippets, help-center articles, documentation chunks, and other text candidates. | |
| ### **Instruction-Aware Retrieval** | |
| The model is prompted for relevance scoring, making it suitable for natural language search tasks where query intent matters. | |
| --- | |
| ## **Get Started** | |
| ```python | |
| from sentence_transformers import CrossEncoder | |
| model_id = "Surpem/Supertron2-Reranker-2B" | |
| model = CrossEncoder(model_id) | |
| pairs = [ | |
| ("What is the capital of France?", "Paris is the capital and largest city of France."), | |
| ("What is the capital of France?", "Mars is often called the red planet."), | |
| ] | |
| scores = model.predict(pairs) | |
| print(scores) | |
| ``` | |
| Example reranking: | |
| ```python | |
| query = "How do I reset my password?" | |
| documents = [ | |
| "Use the account recovery page to reset your password.", | |
| "Our refund policy allows returns within 30 days.", | |
| "Two-factor authentication adds extra login security.", | |
| ] | |
| results = model.rank(query, documents) | |
| print(results) | |
| ``` | |
| --- | |
| ## **Hardware Requirements** | |
| | Precision | Min VRAM | Recommended | | |
| |---|---|---| | |
| | bfloat16 | 6 GB | 10 GB+ | | |
| | 4-bit quantized | 3 GB | 6 GB+ | | |
| For larger batches or long documents, use more VRAM or reduce the batch size/max sequence length. | |
| --- | |
| ## **Intended Use** | |
| Supertron2-Reranker-2B is intended for: | |
| * Search reranking | |
| * RAG document reranking | |
| * Query-passage relevance scoring | |
| * Documentation and knowledge-base retrieval | |
| * Evaluation of candidate retrieval results | |
| It is not intended to be used as a standalone chat model. | |
| --- | |
| ## **Limitations** | |
| * The model scores relevance; it does not generate answers. | |
| * It should be evaluated on your own retrieval domain before production use. | |
| * Long documents may need chunking before reranking. | |
| * Relevance scores are relative and may not be calibrated across unrelated queries. | |
| * The model may still rank incorrect, outdated, or unsafe content highly if it appears textually relevant. | |
| --- | |
| ## **Citation** | |
| ```bibtex | |
| @misc{surpem2026supertron2-reranker-2b, | |
| title={Supertron2-Reranker-2B -- Compact Cross-Encoder Reranking Model}, | |
| author={Surpem}, | |
| year={2026}, | |
| url={https://huggingface.co/Surpem/Supertron2-Reranker-2B}, | |
| } | |
| ``` | |