| --- |
| library_name: transformers |
| tags: |
| - cryptology |
| - cipher |
| datasets: |
| - agentlans/high-quality-english-sentences |
| language: |
| - en |
| base_model: |
| - google-t5/t5-base |
| license: apache-2.0 |
| --- |
| |
| This project contains a text-to-text model designed to decrypt English text encoded using a substitution cipher. |
| In a substitution cipher, each letter in the plaintext is replaced by a corresponding, unique letter to form the ciphertext. |
| The model leverages statistical and linguistic properties of English to make educated guesses about the letter substitutions, |
| aiming to recover the original plaintext message. |
|
|
| This model is for monoalphabetic English substitution ciphers and it outputs decoded text. |
|
|
| Example: |
|
|
| Encoded text: |
| **Z hztwgx tstcsf qf z ulooqfe osfuqb tzx uezx awej z ozewsbe vlfwby fsmqisfx.** |
|
|
| Decoded text: |
| **A family member or a support person may stay with a patient during recovery.** |
|
|
| Suggested Usage: |
| ```py |
| #Load the model and tokenizer |
| cipher_text = "" #Encoded text here! |
| inputs = tokenizer(cipher_text, return_tensors="pt", padding=True, truncation=True, max_length=256).to(device) |
| outputs = model.generate(inputs["input_ids"], max_length=256) |
| decoded_text = tokenizer.decode(outputs[0], skip_special_tokens=True) |
| ``` |