Upload folder using huggingface_hub

Files changed (6)

README.md +65 -0
config.json +5 -0
model.safetensors +3 -0
tokenizer.json +0 -0
tokenizer_config.json +14 -0
training_meta.json +13 -0

README.md ADDED

	@@ -0,0 +1,65 @@

+---
+language:
+  - en
+license: mit
+library_name: transformers
+pipeline_tag: zero-shot-classification
+tags:
+  - zero-shot
+  - multi-label
+  - text-classification
+  - pytorch
+metrics:
+  - precision
+  - recall
+  - f1
+base_model: bert-base-uncased
+datasets:
+  - polodealvarado/zeroshot-classification
+---
+# Zero-Shot Text Classification — polyencoder
+A poly-encoder variant that uses learnable poly-codes with label-conditioned cross-attention.
+The model encodes texts and candidate labels into a shared embedding space with BERT (`bert-base-uncased`),
+so it can classify text into arbitrary categories without retraining when new labels are added.
+## Training Details
+| Parameter | Value |
+|-----------|-------|
+| Base model | `bert-base-uncased` |
+| Model variant | `polyencoder` |
+| Training steps | 1000 |
+| Batch size | 2 |
+| Learning rate | 2e-05 |
+| Trainable params | 109,494,528 |
+| Training time | 359.7 s |
+## Dataset
+Trained on [polodealvarado/zeroshot-classification](https://huggingface.co/datasets/polodealvarado/zeroshot-classification).
+## Evaluation Results
+| Metric | Score |
+|--------|-------|
+| Precision | 0.9463 |
+| Recall | 0.9677 |
+| F1 Score | 0.9569 |
+## Usage
+```python
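+# PolyEncoderModel is imported from this project's own `models` package, not from `transformers`,
+# so that package must be importable (for example, by running from the project's source tree).
+from models.polyencoder import PolyEncoderModel
+model = PolyEncoderModel.from_pretrained("polodealvarado/polyencoder")
+# Pass one list of candidate labels per input text.
+predictions = model.predict(
+    texts=["The stock market crashed yesterday."],
+    labels=[["Finance", "Sports", "Biology", "Economy"]],
+)
+print(predictions)
+# Example output format (scores are illustrative):
+# [{"text": "...", "scores": {"Finance": 0.98, "Economy": 0.85, ...}}]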
+```
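
Note on the README description: "learnable poly-codes with label-conditioned cross-attention" can be read as the usual poly-encoder scoring scheme. The sketch below illustrates that reading in pure Python. The model code is not part of this commit, so the function names, the toy two-dimensional vectors, and the final sigmoid are hypothetical and may differ from what `PolyEncoderModel` actually does.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def attend(query, keys, values):
    """Single-query dot-product attention: a weighted average of `values`."""
    weights = softmax([dot(query, k) for k in keys])
    dim = len(values[0])
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]

def poly_score(token_vecs, poly_codes, label_vec):
    # Step 1: each learnable poly-code attends over the token vectors,
    # giving one text summary per code.
    summaries = [attend(code, token_vecs, token_vecs) for code in poly_codes]
    # Step 2: the label embedding attends over those summaries, so the final
    # text vector depends on the label (label-conditioned cross-attention).
    text_vec = attend(label_vec, summaries, summaries)
    # Step 3 (assumption): a sigmoid over the dot product scores each label
    # independently, which suits multi-label output.
    return 1.0 / (1.0 + math.exp(-dot(text_vec, label_vec)))

# Toy inputs; the committed config uses 16 poly-codes, this sketch uses 2.
tokens = [[1.0, 0.0], [0.8, 0.2], [0.9, 0.1]]
codes = [[1.0, 0.0], [0.0, 1.0]]
labels = {"Finance": [2.0, 0.0], "Sports": [-2.0, 0.0]}
scores = {name: poly_score(tokens, codes, vec) for name, vec in labels.items()}
print(scores)
```

Because every label is scored on its own, several labels can score high for the same text, which matches the shape of the example output in the README.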

config.json ADDED

	@@ -0,0 +1,5 @@

+{
+  "max_num_labels": 13,
+  "model_name": "bert-base-uncased",
+  "num_poly_codes": 16
+}
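
Note on `config.json`: the two numeric fields can be sanity-checked against the rest of the commit. The sketch below assumes `max_num_labels` is a cap on the candidate labels passed per text (the commit does not say so), and it assumes a hidden size of 768 and the commonly cited 109,482,240 parameters for `bert-base-uncased`. Under those assumptions, the parameters beyond plain BERT equal exactly 16 poly-codes of width 768. `check_labels` is a hypothetical helper, not part of the model's API.

```python
import json

# Contents of config.json as committed.
CONFIG_JSON = '{"max_num_labels": 13, "model_name": "bert-base-uncased", "num_poly_codes": 16}'
config = json.loads(CONFIG_JSON)

def check_labels(labels, cfg):
    """Hypothetical guard: reject label lists longer than max_num_labels."""
    limit = cfg["max_num_labels"]
    if len(labels) > limit:
        raise ValueError(f"{len(labels)} labels given, but max_num_labels is {limit}")
    return labels

checked = check_labels(["Finance", "Sports", "Biology", "Economy"], config)

# Parameter arithmetic (assumed constants, see the note above).
REPORTED_PARAMS = 109_494_528   # param_count from training_meta.json
BERT_BASE_PARAMS = 109_482_240  # assumption: commonly cited count for bert-base-uncased
HIDDEN_SIZE = 768               # assumption: BERT-base hidden size

extra_params = REPORTED_PARAMS - BERT_BASE_PARAMS
poly_code_params = config["num_poly_codes"] * HIDDEN_SIZE
print(extra_params, poly_code_params)
```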

model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a101edbb3a5026b6de47b0f8b734e49c821c9de60771f764b5eda6b63f8c99ea
+size 438003512
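
Note on the pointer file: the three lines above are a Git LFS pointer, not the weights themselves. The sketch below parses the pointer and compares its `size` with the reported parameter count. It assumes the weights are stored as float32 (4 bytes each), which the commit does not state. `parse_lfs_pointer` is a hypothetical helper written for this note.

```python
# Git LFS pointer as committed for model.safetensors.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:a101edbb3a5026b6de47b0f8b734e49c821c9de60771f764b5eda6b63f8c99ea
size 438003512
"""

def parse_lfs_pointer(text):
    """Parse the `key value` lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }

pointer = parse_lfs_pointer(POINTER)

PARAM_COUNT = 109_494_528  # param_count from training_meta.json
BYTES_PER_PARAM = 4        # assumption: float32 weights

tensor_bytes = PARAM_COUNT * BYTES_PER_PARAM
overhead_bytes = pointer["size"] - tensor_bytes
print(tensor_bytes, overhead_bytes)
```

Under the float32 assumption the tensors account for 437,978,112 bytes, leaving 25,400 bytes. That remainder would be consistent with the safetensors header (an 8-byte length followed by JSON metadata), although the header itself is not visible in this diff.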

tokenizer.json ADDED

The diff for this file is too large to render.

tokenizer_config.json ADDED

	@@ -0,0 +1,14 @@

+{
+  "backend": "tokenizers",
+  "cls_token": "[CLS]",
+  "do_lower_case": true,
+  "is_local": false,
+  "mask_token": "[MASK]",
+  "model_max_length": 512,
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "strip_accents": null,
+  "tokenize_chinese_chars": true,
+  "tokenizer_class": "BertTokenizer",
+  "unk_token": "[UNK]"
+}

training_meta.json ADDED

	@@ -0,0 +1,13 @@

+{
+  "model_type": "polyencoder",
+  "encoder_name": "bert-base-uncased",
+  "param_count": 109494528,
+  "num_steps": 1000,
+  "best_step": 950,
+  "batch_size": 2,
+  "learning_rate": 2e-05,
+  "train_time_s": 359.68,
+  "precision": 0.9463,
+  "recall": 0.9677,
+  "f1": 0.9569
+}
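
Note on `training_meta.json`: the reported metrics can be cross-checked with a few lines of arithmetic. The sketch below assumes the F1 value is the harmonic mean of the reported precision and recall (this would not hold exactly for macro-averaged scores) and that `batch_size` is the number of examples per step with no gradient accumulation. Neither assumption is confirmed by the commit.

```python
import json

# Contents of training_meta.json as committed.
META_JSON = """{
  "model_type": "polyencoder",
  "encoder_name": "bert-base-uncased",
  "param_count": 109494528,
  "num_steps": 1000,
  "best_step": 950,
  "batch_size": 2,
  "learning_rate": 2e-05,
  "train_time_s": 359.68,
  "precision": 0.9463,
  "recall": 0.9677,
  "f1": 0.9569
}"""
meta = json.loads(META_JSON)

def f1_from(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)

f1_check = f1_from(meta["precision"], meta["recall"])
steps_per_second = meta["num_steps"] / meta["train_time_s"]
examples_seen = meta["num_steps"] * meta["batch_size"]  # assumes no gradient accumulation

print(f"recomputed F1: {f1_check:.4f} (reported {meta['f1']})")
print(f"throughput: {steps_per_second:.2f} steps/s, {examples_seen} examples seen")
```

The recomputed F1 agrees with the reported value to four decimal places, and the README's 359.7 s is the same training time rounded to one decimal.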