Solade / README.md
DLMveloper's picture
Update README.md
b24a265 verified
|
Raw
History Blame Contribute Delete
1.92 kB
---
pipeline_tag: text-generation
library_name: pytorch bitsandbytes
datasets:
- DLMveloper/DLM_DataSet
language:
- ru
- en
- kk
license: other
license_name: dlm-license
license_link: LICENSE
---
# Model Card for Solade
## Model Details
### Model Description
Языковая модель на 1.2 миллиардов параметр,
- **Developed by:** DLMveloper
- **Model type:** Decoder-only transformer (text generation)
- **Language(s):** Russian, English, Kazakh
- **License:** [The license is being examined by lawyers]
### Model Sources
- **Repository:** https://huggingface.co/DLMveloper/Solade (Testing)
## Uses
### Direct Use
Генерация текста на русском, английском, казахском языках.
### Out-of-Scope Use
Модель обучена на ограниченном объёме данных (300 шагов), не предназначена для высокоточных или критичных задач.
## Bias, Risks, and Limitations
Модель обучена на небольшом количестве шагов и может выдавать несвязный или некорректный текст.
## How to Get Started with the Model
## Training Details
### Training Data
Датасет: DLMveloper/DLM_DataSet (подвыборка ~20000 примеров)
### Training Procedure
#### Training Hyperparameters
- **Training regime:** ???????
- **Steps:** ???
- **Batch size:** ?
- **Learning rate:** ?????
- **Sequence length:** ???.
#### Speeds, Sizes, Times
- **Размер модели:** ????? (??-bit quantized)
## Technical Specifications
### Model Architecture and Objective
- Параметров: ??
- Слоёв: ??
- Hidden size: ????
- Attention heads: ??
- Intermediate size (FFN): ????
- Vocab size: ???
- Компоненты: ???????
### Compute Infrastructure
#### Software
???????????