| --- |
| pipeline_tag: text-generation |
| library_name: pytorch bitsandbytes |
| datasets: |
| - DLMveloper/DLM_DataSet |
| language: |
| - ru |
| - en |
| - kk |
| license: other |
| license_name: dlm-license |
| license_link: LICENSE |
| --- |
| |
| # Model Card for Solade |
|
|
| ## Model Details |
|
|
| ### Model Description |
|
|
| Языковая модель на 1.2 миллиардов параметр, |
|
|
| - **Developed by:** DLMveloper |
| - **Model type:** Decoder-only transformer (text generation) |
| - **Language(s):** Russian, English, Kazakh |
| - **License:** [The license is being examined by lawyers] |
|
|
| ### Model Sources |
|
|
| - **Repository:** https://huggingface.co/DLMveloper/Solade (Testing) |
|
|
| ## Uses |
|
|
| ### Direct Use |
|
|
| Генерация текста на русском, английском, казахском языках. |
|
|
| ### Out-of-Scope Use |
|
|
| Модель обучена на ограниченном объёме данных (300 шагов), не предназначена для высокоточных или критичных задач. |
|
|
| ## Bias, Risks, and Limitations |
|
|
| Модель обучена на небольшом количестве шагов и может выдавать несвязный или некорректный текст. |
|
|
| ## How to Get Started with the Model |
|
|
| ## Training Details |
|
|
| ### Training Data |
|
|
| Датасет: DLMveloper/DLM_DataSet (подвыборка ~20000 примеров) |
| |
| ### Training Procedure |
| |
| #### Training Hyperparameters |
| |
| - **Training regime:** ??????? |
| - **Steps:** ??? |
| - **Batch size:** ? |
| - **Learning rate:** ????? |
| - **Sequence length:** ???. |
| |
| #### Speeds, Sizes, Times |
| |
| - **Размер модели:** ????? (??-bit quantized) |
| |
| ## Technical Specifications |
| |
| ### Model Architecture and Objective |
| |
| - Параметров: ?? |
| - Слоёв: ?? |
| - Hidden size: ???? |
| - Attention heads: ?? |
| - Intermediate size (FFN): ???? |
| - Vocab size: ??? |
| - Компоненты: ??????? |
| |
| ### Compute Infrastructure |
| |
| |
| #### Software |
| |
| ??????????? |