Token Classification
Transformers
TensorBoard
Safetensors
xlm-roberta
Generated from Trainer
language-identification
codeswitching
Instructions to use polyglot-tagger/language-identification with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use polyglot-tagger/language-identification with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("token-classification", model="polyglot-tagger/language-identification")# Load model directly from transformers import AutoTokenizer, AutoModelForTokenClassification tokenizer = AutoTokenizer.from_pretrained("polyglot-tagger/language-identification") model = AutoModelForTokenClassification.from_pretrained("polyglot-tagger/language-identification") - Notebooks
- Google Colab
- Kaggle
| { | |
| "add_prefix_space": true, | |
| "backend": "tokenizers", | |
| "bos_token": "<s>", | |
| "cls_token": "<s>", | |
| "eos_token": "</s>", | |
| "is_local": false, | |
| "mask_token": "<mask>", | |
| "model_max_length": 512, | |
| "pad_token": "<pad>", | |
| "sep_token": "</s>", | |
| "tokenizer_class": "XLMRobertaTokenizer", | |
| "unk_token": "<unk>" | |
| } | |