Solade / README.md
DLMveloper's picture
Update README.md
b24a265 verified
|
Raw
History Blame Contribute Delete
1.92 kB
metadata
pipeline_tag: text-generation
library_name: pytorch bitsandbytes
datasets:
  - DLMveloper/DLM_DataSet
language:
  - ru
  - en
  - kk
license: other
license_name: dlm-license
license_link: LICENSE

Model Card for Solade

Model Details

Model Description

Языковая модель на 1.2 миллиардов параметр,

  • Developed by: DLMveloper
  • Model type: Decoder-only transformer (text generation)
  • Language(s): Russian, English, Kazakh
  • License: [The license is being examined by lawyers]

Model Sources

Uses

Direct Use

Генерация текста на русском, английском, казахском языках.

Out-of-Scope Use

Модель обучена на ограниченном объёме данных (300 шагов), не предназначена для высокоточных или критичных задач.

Bias, Risks, and Limitations

Модель обучена на небольшом количестве шагов и может выдавать несвязный или некорректный текст.

How to Get Started with the Model

Training Details

Training Data

Датасет: DLMveloper/DLM_DataSet (подвыборка ~20000 примеров)

Training Procedure

Training Hyperparameters

  • Training regime: ???????
  • Steps: ???
  • Batch size: ?
  • Learning rate: ?????
  • Sequence length: ???.

Speeds, Sizes, Times

  • Размер модели: ????? (??-bit quantized)

Technical Specifications

Model Architecture and Objective

  • Параметров: ??
  • Слоёв: ??
  • Hidden size: ????
  • Attention heads: ??
  • Intermediate size (FFN): ????
  • Vocab size: ???
  • Компоненты: ???????

Compute Infrastructure

Software

???????????