| --- |
| library_name: transformers |
| tags: |
| - cryptology |
| - cipher |
| datasets: |
| - asimokby/Turkish-OSCAR-GEC |
| language: |
| - tr |
| base_model: |
| - google-t5/t5-base |
| license: apache-2.0 |
| --- |
| |
| This project contains a text-to-text model designed to decrypt Turkish text encoded using a substitution cipher. |
| In a substitution cipher, each letter in the plaintext is replaced by a corresponding, unique letter to form the ciphertext. |
| The model leverages statistical and linguistic properties of Turkish to make educated guesses about the letter substitutions, |
| aiming to recover the original plaintext message. |
|
|
| This model is for monoalphabetic Turkish substitution ciphers and it outputs decoded text. |
|
|
| Example: |
|
|
| Encoded text: |
| **Wn cqwqp fqpöqgbatqj ckjxq çmxmçmvy rafhqça zgqjgaçkxnh.** |
|
|
| Decoded text: |
| **Bu sabah kahvaltıdan sonra yürüyüşe çıkmayı planlıyorum.** |
|
|
| Suggested Usage: |
| ```py |
| #Load the model and tokenizer |
| cipher_text = "" #Encoded text here! |
| inputs = tokenizer(cipher_text, return_tensors="pt", padding=True, truncation=True, max_length=256).to(device) |
| outputs = model.generate(inputs["input_ids"], max_length=256) |
| decoded_text = tokenizer.decode(outputs[0], skip_special_tokens=True) |
| ``` |