| | --- |
| | license: mit |
| | language: |
| | - en |
| | tags: |
| | - sft |
| | --- |
| | ```Dimensity-3B``` |
| |
|
| | # Model Details |
| | Dimensity-3B is a finetuned version of the StableLM framework trained on a variety of conversational data. It contains 3 billion parameters. |
| |
|
| | # Intended Uses |
| | This model is intended for conversational AI applications. It can engage in open-ended dialogue by generating responses to user prompts. |
| |
|
| | ## Factors |
| | # Training Data |
| | The model was trained on a large dataset of over 100 million conversational exchanges extracted from Reddit comments, customer support logs, and other online dialogues. |
| |
|
| | # Prompt Template |
| | The model was finetuned using the following prompt template: |
| | ``` |
| | ### Human: {prompt} |
| | |
| | ### Assistant: |
| | ``` |
| | This prompts the model to take on an assistant role. |
| |
|
| | # Ethical Considerations |
| | As the model was trained on public conversational data, it may generate responses that contain harmful stereotypes or toxic content. The model should be used with caution in sensitive contexts. |
| |
|
| | # Caveats and Recommendations |
| | This model is designed for open-ended conversation. It may sometimes generate plausible-sounding but incorrect information. Outputs should be validated against external sources. |