AI & ML interests

We love Open source & Free models 😊 If you find out any bug, pls, write us 😁 MAI ❀Free, Open source models

Recent Activity

mythicgamesadminΒ  updated a model about 15 hours ago
mythicgames/MAI-M4-Coder-Thinking
mythicgamesadminΒ  published a model about 15 hours ago
mythicgames/MAI-M4-Coder-Thinking
mythicgamesadminΒ  updated a Space about 16 hours ago
mythicgames/README
View all activity

Organization Card

🌌 Mythic Artificial Intelligence

by MythicGames

Building the next generation of merged language models

🌐 Visit our platform Β· πŸ’¬ Chat with MAI models Β· πŸ“‚ All Models


🧬 Model Families

MAI models follow a unified naming convention:

MAI M{version} {Specialization} {Variant}
MAI {version} {Variant}
MAI C{version} {Variant}
MAIGEN {version} {Specification}
MAIMIND {version} {Specification}
MAITTS {version} {Specification}
MAIEDITOR {version}.{Date of release} {Update feature name}
Component Meaning Examples
M{version} Generation / major version M1, M2, M3, M4
Specialization Primary task focus Coder, Chat, Reason, Vision
Variant Speed / depth profile Fast, Thinking

⚑ Variant Breakdown

Variant Philosophy Latency Depth Best For
🟒 Fast Speed-first. Minimal chain-of-thought, instant responses πŸ”½ Low Standard Code generation, quick Q&A, real-time chat
🟣 Thinking Depth-first. Extended internal reasoning before answering πŸ”Ό Higher Deep CoT Math, logic, complex analysis, research

Rule of thumb: If you need an answer now β€” use Fast. If you need the right answer to a hard problem β€” use Thinking.


πŸ“‹ Full Model Registry

Model Specialization Variant MSPLIT MCE Power (Γ—) Context Status
MAI M3 Coder Fast Reasoning Fast 3A 2.74 ~3.2Γ— >1M 🟒 Active
MAI M3 Coder Thinking Reasoning Thinking 3A 2.74 ~3.2Γ— >1M 🟒 Active
MAI M4 Coder Fast ⭐ Code Fast 4A 3.16 ~4.3Γ— >1M 🟒 Flagship
MAI M4 Coder Thinking Code Thinking 4A 3.16 ~4.3Γ— >1M 🟒 Active
MAI M5 Coder Fast Multimodal Fast 4A 3.16 ~4.3Γ— >1M πŸ”΅ Coming Soon

πŸ“ The MAI Math β€” Formulas & Coefficients

1️⃣ Power Multiplier Formula

Every MAI model's effective performance boost is calculated using:

                    MCEΒ² Γ— 8
Power (Γ—)  =  ─────────────
                  9.3 Γ— 2

Or simplified:

Power = (MCEΒ² Γ— 8) / 18.6
Variable Full Name Description
MCE Merge Coefficient Exponent Core efficiency metric of the merge. Higher = better synergy between merged weights
8 Base Parameter Scalar Constant tied to the 8-expert routing in the merge pipeline
9.3 Normalization Factor Empirical constant derived from benchmark calibration
2 Dual-pass Divisor Accounts for the two-pass merge verification in MSPLIT

2️⃣ MCE Progression Across Generations

MCE grows with each MSPLIT generation following a square-root scaling law:

MCE(n) = √(2.5 Γ— n)

Where n = MSPLIT generation number.

MSPLIT Gen n MCE = √(2.5n) MCEΒ² Power (Γ—)
3A 3 √7.5 β‰ˆ 2.74 5 ~3.23Γ—
4A 4 √10.0 β‰ˆ 3.16 10.0 ~4.30Γ—
5A (projected) 5 √12.5 β‰ˆ 3.54 8 ~5.38Γ—
6A (projected) 6 √15.0 β‰ˆ 3.87 16 ~6.45Γ—

πŸ“ˆ Insight: Power scales linearly with MSPLIT generation because MCEΒ² = 2.5n, so Power = (2.5n Γ— 8) / 18.6 β‰ˆ 1.075n. Each new generation adds roughly +1.08Γ— to the multiplier.


3️⃣ Context Window Scaling

Context length doubles with each major version:

Context(v) = 64K Γ— 2^v
Version (v) Calculation Context Window
M3 (v=3) 64K Γ— 2Β³ 1,024K
M4 (v=4) 64K Γ— 2⁴ 1,024K (>1M)
M5 (projected) 64K Γ— 2⁡ 2,048K (~2M)

4️⃣ Effective Intelligence Index (EII)

To compare models holistically, we use the EII β€” a single score combining power and context:

EII = Power(Γ—) Γ— logβ‚‚(Context / 1K)
Model Power (Γ—) Context logβ‚‚(C/1K) EII
MAI M3 Reason Fast 3.44 1024K 4 29.07
MAI M4 Coder Fast 4.30 1024K 10 43.00 ⭐
MAI M5 (projected) 6.88 2048K 8 59.18

🎯 Notice the pattern? EII β‰ˆ 4.3 Γ— n Γ— (n + 6) / 10 β€” it grows quadratically, meaning each generation is dramatically more capable than the last. Models like M5 will use: 64 / 9.3, without / 2


5️⃣ Fast vs Thinking β€” Speed-Depth Tradeoff

                    Base Latency
Fast Latency   =  ─────────────
                     Power(Γ—)

Thinking Latency = Base Latency Γ— Thinking Depth Factor (TDF)

Where TDF typically ranges from 3Γ— to 8Γ— depending on problem complexity.

Variant Relative Latency Relative Accuracy (hard tasks)
Fast 1Γ— (baseline) ~85–92%
Thinking 3–8Γ— slower ~94–99%

πŸ’‘ When to switch? If Fast gives a confident answer β†’ stay with Fast. If it hedges or the task involves multi-step reasoning β†’ switch to Thinking.


πŸ”¬ MSPLIT Technology β€” How It Works

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  Base Model β”‚     β”‚  Base Model β”‚     β”‚  Base Model β”‚
β”‚      A      β”‚     β”‚      B      β”‚     β”‚      C      β”‚
β””β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”˜     β””β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”˜     β””β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”˜
      β”‚                   β”‚                   β”‚
      β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”˜β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                   β”‚
            β”Œβ”€β”€β”€β”€β”€β”€β”€β–Όβ”€β”€β”€β”€β”€β”€β”€β”€β”
            β”‚  PEREX MERGE  β”‚  ← Weighted parameter fusion
            β”‚   Pipeline    β”‚
            β””β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                   β”‚
            β”Œβ”€β”€β”€β”€β”€β”€β”€β–Όβ”€β”€β”€β”€β”€β”€β”€β”€β”
            β”‚   MSPLIT nA   β”‚  ← Split-verify-remerge (n passes)
            β”‚  Optimization β”‚
            β””β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                   β”‚
            β”Œβ”€β”€β”€β”€β”€β”€β”€β–Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”
            β”‚  Final Merged  β”‚
            β”‚     Model      β”‚  β†’ MCE = √(2.5 Γ— n)
            β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

MSPLIT (Multi-Stage Parameter Splitting) works in three phases:

  1. Merge β€” Multiple base models are fused using the Perex Merge weighted-average pipeline
  2. Split β€” The merged weights are split into parameter subgroups and independently evaluated
  3. Re-merge β€” Only the highest-performing parameter configurations survive and are re-merged

Each MSPLIT generation (3A β†’ 4A) adds an additional split-verify pass, increasing MCE and therefore the power multiplier.


πŸ›‘οΈ Access & Licensing

Access πŸ”’ Private β€” all models are served exclusively through our platform
Hosting Puter.js
Weights Not publicly distributed
API Available through the MAI website
Commercial Use Contact MythicGames for licensing

🌌 "The future of AI is here"

Mythic Artificial Intelligence Β· MythicGames Β· 2026

datasets 0

None public yet