winvoker/turkish-sentiment-analysis-dataset
Viewer • Updated • 490k • 1.02k • 47
A fine-tuned version of the codealchemist01/turkish-sentiment-analysis model, improved with additional balanced training data to enhance neutral and negative class performance.
This model was fine-tuned on a balanced combination of the original dataset and additional Turkish sentiment datasets:
winvoker/turkish-sentiment-analysis-dataset (440,641 samples)WhiteAngelss/Turkce-Duygu-Analizi-Dataset (440,641 samples)maydogan/Turkish_SentimentAnalysis_TRSAv1 (150,000 samples)turkish-nlp-suite/MusteriYorumlari (73,920 samples)W4nkel/turkish-sentiment-dataset (4,800 samples)mustfkeskin/turkish-movie-sentiment-analysis-dataset (Kaggle, 83,227 samples)Split Distribution:
Overall Metrics:
| Class | Precision | Recall | F1-Score | Support |
|---|---|---|---|---|
| Negative | 90.65% | 86.79% | 88.68% | 10,926 |
| Neutral | 90.91% | 90.24% | 90.57% | 20,967 |
| Positive | 93.41% | 95.84% | 94.61% | 23,796 |
Neutral Class Performance:
Better Class Balance:
General Performance:
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch
# Load model and tokenizer
model_name = "codealchemist01/turkish-sentiment-analysis-finetuned"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)
# Example text
text = "Bu ürün normal, beklediğim gibi. Özel bir şey yok."
# Tokenize
inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=128)
# Predict
with torch.no_grad():
outputs = model(**inputs)
predictions = torch.nn.functional.softmax(outputs.logits, dim=-1)
predicted_label_id = predictions.argmax().item()
# Map to label
id2label = {0: "negative", 1: "neutral", 2: "positive"}
predicted_label = id2label[predicted_label_id]
confidence = predictions[0][predicted_label_id].item()
print(f"Label: {predicted_label}")
print(f"Confidence: {confidence:.4f}")
If you use this model, please cite:
@misc{turkish-sentiment-analysis-finetuned,
title={Turkish Sentiment Analysis Model (Fine-tuned)},
author={codealchemist01},
year={2024},
base_model={codealchemist01/turkish-sentiment-analysis},
howpublished={\url{https://huggingface.co/codealchemist01/turkish-sentiment-analysis-finetuned}}
}
Apache 2.0
Base model
codealchemist01/turkish-sentiment-analysis