| # API Key Configuration Guide | |
| ## 1. Large Language Model (LLM) | |
| ### Using DeepSeek as an Example | |
| **Official Documentation**: https://api-docs.deepseek.com/zh-cn/ | |
| Note: For users outside China, we recommend using large language models such as Gemini, Claude, or ChatGPT for the best experience. | |
| ### Configuration Steps | |
| 1. **Apply for API Key** | |
| - Visit platform: https://platform.deepseek.com/usage | |
| - Log in and apply for API Key | |
| - ⚠️ **Important**: Save the obtained API Key securely | |
| 2. **Configuration Parameters** | |
| - **Model Name**: `deepseek-chat` | |
| - **Base URL**: `https://api.deepseek.com/v1` | |
| - **API Key**: Fill in the Key obtained in the previous step | |
| 3. **API Configuration** | |
| - **Web Usage**: Select "Use Custom Model" in the LLM model form, and fill in the model according to the configuration parameters | |
| - **Local Deployment**: In config.toml, locate `[developer.chat_models_config."deepseek-chat"]` and fill in the configuration parameters to make the default configuration accessible from the Web page. Locate `[llm]` and configure model, base_url, and api_key | |
| ## 2. Multimodal Large Language Model (VLM) | |
| ### 2.1 Using GLM-4.6V | |
| **API Key Management**: https://open.bigmodel.cn/usercenter/proj-mgmt/apikeys | |
| ### Configuration Parameters | |
| - **Model Name**: `glm-4.6v` | |
| - **Base URL**: `https://open.bigmodel.cn/api/paas/v4/` | |
| ### 2.2 Using Qwen3-VL | |
| **API Key Management**: Go to Alibaba Cloud Bailian Platform to apply for an API Key https://bailian.console.aliyun.com/cn-beijing/?apiKey=1&tab=globalset#/efm/api_key | |
| - **Model Name**: `qwen3-vl-8b-instruct` | |
| - **Base URL**: `https://dashscope.aliyuncs.com/compatible-mode/v1` | |
| - Parameter Configuration: Select "Use Custom Model" in the VLM Model form and fill in the parameters. For local deployment, locate `[vlm]` and configure model, base_url, and api_key. Add the following fields in config.toml as the default Web API configuration: | |
| ``` | |
| [developer.chat_models_config."qwen3-vl-8b-instruct"] | |
| base_url = "https://dashscope.aliyuncs.com/compatible-mode/v1" | |
| api_key = "YOUR_API_KEY" | |
| timeout = 20.0 | |
| temperature = 0.1 | |
| max_retries = 2 | |
| ``` | |
| ### 2.3 Using Qwen3-Omni | |
| Qwen3-Omni can also be applied for through the Alibaba Cloud Bailian Platform. The specific parameters are as follows, which can be used for automatic labeling music in omni_bgm_label.py | |
| - **Model Name**: `qwen3-omni-flash-2025-12-01` | |
| - **Base URL**: `https://dashscope.aliyuncs.com/compatible-mode/v1` | |
| For more details, please refer to the documentation: https://bailian.console.aliyun.com/cn-beijing/?tab=doc#/doc | |
| Model List: https://help.aliyun.com/zh/model-studio/models | |
| Billing Dashboard: https://billing-cost.console.aliyun.com/home | |
| ## 3. Pexels Image and Video Download API Key Configuration | |
| 1. Open the Pexels website, register an account, and apply for an API key at https://www.pexels.com/api/ | |
| <div align="center"> | |
| <img src="https://image-url-2-feature-1251524319.cos.ap-shanghai.myqcloud.com/openstoryline/docs/resource/pexels_api.png" alt="Pexels API application" width="70%"> | |
| <p><em>Figure 1: Pexels API Application Page</em></p> | |
| </div> | |
| 2. Web Usage: Locate the Pexels configuration, select "Use custom key", and enter your API key in the form. | |
| <div align="center"> | |
| <img src="https://image-url-2-feature-1251524319.cos.ap-shanghai.myqcloud.com/openstoryline/docs/resource/use_pexels_api_en.png" alt="Pexels API input" width="70%"> | |
| <p><em>Figure 2: Pexels API Usage</em></p> | |
| </div> | |
| 3. Local Deployment: Fill in the API key in the `pexels_api_key` field in the `config.toml` file as the default configuration for the project. | |
| ## 4. TTS (Text-to-Speech) Configuration | |
| ### Option 1: 302.ai | |
| **Service URL**: https://302.ai/product/detail/302ai-mmaudio-text-to-speech | |
| ### Option 2: MiniMax | |
| **Subscription Page**: https://platform.minimax.io/subscribe/audio-subscription | |
| **Configuration Steps**: | |
| 1. Create API Key | |
| 2. Visit: https://platform.minimax.io/user-center/basic-information/interface-key | |
| 3. Obtain and save API Key | |
| ### Option 3: Bytedance | |
| 1. Step 1: Enable Audio/Video Subtitle Generation Service | |
| Use the legacy page to find the audio/video subtitle generation service: | |
| - Visit: https://console.volcengine.com/speech/service/9?AppID=8782592131 | |
| 2. Step 2: Obtain Authentication Information | |
| View the account basic information page: | |
| - Visit: https://console.volcengine.com/user/basics/ | |
| <div align="center"> | |
| <img src="https://image-url-2-feature-1251524319.cos.ap-shanghai.myqcloud.com/openstoryline/docs/resource/use_bytedance_tts_zh.png" alt="Bytedance TTS API Configuration" width="70%"> | |
| <p><em>Figure 3: Bytedance TTS API Usage</em></p> | |
| </div> | |
| You need to obtain the following information: | |
| - **UID**: The ID from the main account information | |
| - **APP ID**: The APP ID from the service interface authentication information | |
| - **Access Token**: The Access Token from the service interface authentication information | |
| For local deployment, modify the config.toml file: | |
| ``` | |
| [generate_voiceover.providers.bytedance] | |
| uid = "" | |
| appid = "" | |
| access_token = "" | |
| ``` | |
| For detailed documentation, please refer to: https://www.volcengine.com/docs/6561/80909 | |
| ## Important Notes | |
| - All API Keys must be kept secure to avoid leakage | |
| - Ensure sufficient account balance before use | |
| - Regularly monitor API usage and costs |