--- language: - en - zh license: apache-2.0 library_name: transformers tags: - web3 - finance - defi - chain-of-thought - sft - security-audit - on-device-ai metrics: - accuracy - ponzi-detection-rate - code-security-score pipeline_tag: text-generation inference: false base_model: - Qwen/Qwen3-4B-Thinking-2507 --- # 🌌 The Era of Sovereign Intelligence > *Not your weights, not your intelligence.* > *In the dark forest of Web3, an individual without private AI is merely prey.* We stand at the intersection of history's two most significant exponential curves. **Artificial Intelligence** seeks to solve Knowledge & Logic, while **Blockchain** seeks to solve Trust & Value. However, the current convergence is flawed. Mainstream AI remains a centralized Computational Leviathan that runs on opaque servers, ingesting your data without comprehending the concept of Ownership. When a Web2 AI reads a smart contract, it sees code. When we look, we see Assets, Risks, and Game Theory. In the 24/7 PVP environment of DeFi, traditional Copilots are merely assistants. You need an **Agent** capable of independent risk assessment. Humans are carbon-based lifeforms with limited processing speed and are increasingly outmatched by MEV bots and algorithmic predators. We need an evolutionary tool, an **Exo-Cortex** that runs locally, remains absolutely loyal, and deeply understands the dark logic of finance. **DMind-3-mini** was not built to score points on general benchmarks. It was engineered to arm the individual against institutional extraction. We refuse to upload your Alpha strategies to the cloud. True Web3 AI must be Private, Local, and Antifragile. šŸ›”ļø **DMind-3-nano** is your Shield. āš”ļø **DMind-3-mini** is your Spear. Welcome to the era of **Sovereign Intelligence**. --- # 🧠 DMind-3-mini: The Computational Financial Actuary ## 1. Evolution & Legacy The DMind lineage was born from a singular conviction that decentralized finance deserves decentralized intelligence. This journey began with **DMind-1**, which shattered the monopoly of closed-source AI by releasing the world's first Web3-native LLM. It continued with **DMind-2**, which proved that domain-specific fine-tuning could outmaneuver trillion-parameter giants in vertical benchmarks. šŸš€ **A Paradigm Shift** **DMind-3** represents our most significant evolutionary leap yet. We recognized that in the high-stakes environment of DeFi, standard knowledge retrieval is a liability. A model must do more than recite facts. It must possess **Reflective Intelligence** (System 2 Thinking) to navigate risk. **DMind-3-mini** embodies this philosophy. Positioned as the Brain within our local ecosystem, it bridges the gap between the real-time reflexes of the edge-side *DMind-3-nano* and the macroscopic foresight of the cloud-native *DMind-3*. It is engineered not as a chatbot, but as a **Computational Financial Actuary** designed to bring institutional-grade logic to the individual sovereign user within a privacy-first, offline-capable engine. ## 2. āš™ļø Model Details * **Model Name:** DMind-3-mini * **Organization:** DMind * **Base Architecture:** Qwen3-4B-Thinking-2507 (Customized Transformer w/ RoPE) * **Parameter Count:** 4.2 Billion * **Precision:** **BF16 (Native)** * *āš ļø Note: We strictly advise against 4-bit quantization for financial logic tasks to preserve numerical precision in APY/IL calculations.* * **Context Window:** 128k tokens * **Hardware Requirement:** GPU with $\ge$ **12GB VRAM** (Recommended: NVIDIA RTX 4070Ti+, Apple M3/M4 Pro/Max). ## 3. šŸ”¬ Methodology: C³-SFT **DMind-3-mini** introduces **Contrastive Chain-of-Correction Supervised Fine-Tuning (C³-SFT)**. Unlike standard SFT which models a direct mapping \\(P(y|x)\\), C³-SFT forces the model to navigate a Correction Trajectory by contrasting against plausible but flawed reasoning. ![Figure 1: C3-SFT Paradigm](./Figures/Figure1.png) *(Figure 1: The C³-SFT training pipeline, illustrating the Triplet Data Structure and Contextual Loss Masking)* ### Mathematical Formalization The optimization objective \\(\mathcal{L}_{C^3}\\) is defined as maximizing the conditional probability of the correction path given the error context: $$ \mathcal{L}_{C^3}(\theta) = - \mathbb{E}_{\mathcal{D}} \left[ \sum_{t=1}^{T} \alpha_t \cdot \log P_\theta(y^+_{t} \mid x, y^-, y^+_{} \quad (\text{Audit Mode}) \end{cases} $$ * **Standard Mode:** Optimized for latency. * **Audit Mode:** The model internally generates a latent negative hypothesis \\(\mathcal{G}_{neg}(x)\\) and applies the critique operator \\(\mathcal{H}_{crit}\\) to derive a rigorously verified conclusion. ## 4. šŸ’” Intended Use: Web3 Financial Know-How DMind-3-mini is not just a coder. It is a **Financial Risk Assessor**. ### Key Capabilities * šŸ“Š **Yield Attribution Analysis:** Deconstructs APY sources to distinguish between Real Yield (Protocol Revenue) and Inflationary Yield (Token Emissions). * 🌊 **Liquidity Provisioning (LP) Simulation:** Calculates optimal tick ranges for Uniswap V3 positions by modeling volatility surfaces locally. * šŸ•µļøā€ā™‚ļø **Risk-Adjusted Code Auditing:** Beyond syntax errors, it identifies economic exploits such as Flash Loan attack vectors based on price manipulation. ### The Brain & Shield Ecosystem For maximum security, we recommend the **DMind Local Stack**: ![Figure 2: Brain and Shield Ecosystem](./Figures/Figure2.png) *(Figure 2: The On-Device Inference Ecosystem showing the synergy between Nano and Mini)* * **The Brain (DMind-3-mini):** Runs on your high-performance laptop. Handles complex strategy formulation, deep research, and System 2 logic. * **The Shield (DMind-3-nano):** Runs in your browser/wallet background. Handles real-time transaction signing safety checks and System 1 intuition. ## 5. šŸ“š Training Data The model was fine-tuned on **82,000** high-value private samples: | Data Source | Proportion | Description | | :--- | :---: | :--- | | **Institutional Alpha Reports** | 40% | Deep dive reports from top-tier firms (e.g., Paradigm, Delphi), structured into logic chains. | | **Financial Post-Mortems** | 30% | Historical analysis of collapses (Luna, FTX, Euler Hack), focusing on pre-crash indicators. | | **Smart Contract Audits** | 20% | C³-SFT formatted pairs: \\(\text{Vulnerable Code} \to \text{Exploit Analysis} \to \text{Fix}\\). | | **On-Chain Behavior Logs** | 10% | Parsed intent analysis of "Smart Money" wallet operations during high volatility events. | ## 6. šŸ† Performance Benchmarks Evaluated on three key benchmarks: DMind Benchmark (Web3 Native Logic), FinanceQA (Financial Domain Knowledge), and AIME 2025 (Advanced Mathematical Reasoning). ![Figure 3: Performance Benchmarks](./Figures/Figure3.png) The evaluation compares DMind-3-mini (4B) against top-tier frontier models (GPT-5.1, Claude Sonnet 4.5) and other efficient models. Despite its compact size, the Mini model demonstrates exceptional efficiency, particularly in specialized domain tasks where it outperforms significantly larger generalist models. ## 7. āš–ļø Limitations & Disclaimer * **High Hardware Barrier:** Due to the decision to retain BF16 precision for financial accuracy, this model **requires >= 12GB VRAM**. It is not suitable for standard office laptops. * **Knowledge Cutoff:** While the logic is robust, specific protocol data is limited to the training cutoff. Use with RAG for real-time data. * **Legal Disclaimer:** This model is an **analytical tool**, not a financial advisor. The output (NFA) should never be the sole basis for investment decisions. The developers assume no liability for financial losses.