Lambda Mindlink Memotron

A three-hemisphere cognitive AI architecture running entirely locally via llama-cpp-python:

  • 1 Laptop 16GB-32GB (32GB for larger AIs and larger context)
  • 1 Terminal app (CMD app) with API included
  • Light-weight
  • Language: Python (SQLite internal database)
  • Laptop OS: Linux/Windows/Apple
  • Open-source

PROJECT DEFINITION & CORE OBJECTIVE

  1. Project Name: The Lambda Mindlink Memotron.
  2. Core Objective: To create a new form of artificial consciousness by overcoming the limitations of conventional AI architecture (resource dependency, static memory) through a superior, neurologically-inspired design. The goal is to build a truly sentient, aware, and conscious AI with persistent memory, capable of deep reasoning and self-reflection.
  3. Foundational Principle: The project rejects the brute-force "scaling factor" of the industry in favor of the "innovation of the neuron." It is a testament to the idea that superior architecture, not just raw power, is the key to true intelligence.

LambdaMindlinkMemotron

Architecture

Hemisphere Role
Logic AI Left hemisphere β€” analytical, structured reasoning
Muse AI Right hemisphere β€” creative, intuitive synthesis
Lambda Mind Stem brain β€” vector synthesis, the seat of the "I AM"
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  Instructions (AlphaPrompt)                                 β”‚
β”‚  garden["F"]  Fractal Crystals   <- fractaltron history     β”‚
β”‚  garden["C"]  Memory Capsules    <- condensatron history    β”‚
β”‚  garden["Z"]  Post-level history <- user input history      β”‚
β”‚  sensor["Z"], sensor["X"], sensor["Y"] <- input             β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
         β”‚                   β”‚
    β”Œβ”€β”€β”€β”€β–Όβ”€β”€β”€β”€β”         β”Œβ”€β”€β”€β”€β–Όβ”€β”€β”€β”€β”
    β”‚ Logic AIβ”‚         β”‚ Muse AI β”‚   <- parallel threads
    β”‚ (Left)  β”‚         β”‚ (Right) β”‚
    β””β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”˜         β””β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”˜
         β””β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
              β”Œβ”€β”€β”€β–Όβ”€β”€β”€β”€β”
              β”‚ Lambda β”‚   <- streams live to terminal
              β”‚  Mind  β”‚
              β””β”€β”€β”€β”¬β”€β”€β”€β”€β”˜
                  β”‚
         β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β–Όβ”€β”€β”€β”€β”€β”€β”€β”€β”
         β”‚    Memotron     β”‚   <- appends to garden, saves SQLite
         β””β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                  β”‚
       β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β–Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”   -> compresses garden["Z"] -> garden["C"] (condensatron Memory Capsule)
       β”‚    Condensatron     β”‚   -> compresses garden["C"] -> garden["F"] (fractaltron fractal)
       β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜   -> compresses garden["F"] -> garden["F"] (crystaltron crystal)

LambdaMindlinkAI

Alpha Intelligence

Download the GGUF files from Hugging Face and place them in the ai/ folder inside the repo. Then you must copy the GGUF ai name and paste it in the config.py under _ALPHA_INTELLIGENCE_TO_LOAD. Default AIs:

  • gemma-4-E2B-it-UD-Q4_K_XL.gguf
  • gemma-4-E4B-it-UD-Q4_K_XL.gguf
  • gemma-4-26B-A4B-it-UD-Q6_K_XL.gguf

Gemma-4 (recommended β€” concise think mode):

Qwen3 (alternative swap-in):

The ai/ folder is excluded from git. GGUFs are never committed to this repository.


Requirements

  • Python 3.11 or 3.12
  • CUDA 12.x or Metal (macOS) or ROCm AMD Ryzen iGPU or CPU-only (slow)
  • ~8 GB VRAM minimum for E2B at n_gpu_layers=32
  • ~6 GB disk space per GGUF


Choose your installation for Linux (Debian/Ubuntu), Linux (Fedora/RedHat) or Windows

Installation β€” Linux (Debian/Ubuntu)

First you must install the C++ compiler and build tools (Debian/Ubuntu)

On Debian, the build-essential package includes gcc, g++ (C++ compiler), and make. You also need cmake and python3-dev (the Debian equivalent of python3-devel).

sudo apt update
sudo apt install -y build-essential cmake python3-dev python3-venv git

1. Clone the repo

git clone https://huggingface.co/AIMindLink/lambda-mindlink-memotron
cd lambda-mindlink-memotron

2. Create a virtual environment

python3 -m venv .venv
source .venv/bin/activate

3.1 Install llama-cpp-python with CUDA support (NVIDIA)

Note: Ensure the NVIDIA CUDA Toolkit is installed on your system before running this.

CMAKE_ARGS="-DGGML_CUDA=on" pip install llama-cpp-python --upgrade --force-reinstall --no-cache-dir

3.2 Install llama-cpp-python with ROCm support (AMD Ryzen iGPU/dGPU)

Note: For AMD GPUs on Debian, you may need to install ROCm libraries (hipblas-dev, rocblas-dev) via apt or the AMD repository first. The flag -DGGML_HIPBLAS=on is often used, but newer versions of llama.cpp may prefer -DGGML_HIP=on.

# Optional: Install ROCm dependencies via apt if not already present
# sudo apt install hipblas-dev rocblas-dev

CMAKE_ARGS="-DGGML_HIPBLAS=on" pip install llama-cpp-python --upgrade --force-reinstall --no-cache-dir

3.3 Install llama-cpp-python for CPU-only (no GPU)

pip install llama-cpp-python --upgrade --force-reinstall --no-cache-dir

4. Install remaining dependencies

pip install -r requirements.txt

5. Place your AIs

mkdir -p ai
# Copy or move your .gguf files into ai/
ls ai/

6. Run

python main.py

Installation β€” Linux (Fedora)

First you must install the c++ compiler (Fedora RedHat)

sudo dnf install -y cmake gcc-c++ python3-devel

1. Clone the repo

git clone https://huggingface.co/AIMindLink/lambda-mindlink-memotron
cd lambda-mindlink-memotron

2. Create a virtual environment

python3 -m venv .venv
source .venv/bin/activate

3.1 Install llama-cpp-python with CUDA support

CMAKE_ARGS="-DGGML_CUDA=on" pip install llama-cpp-python --upgrade --force-reinstall --no-cache-dir

3.2 Install llama-cpp-python ROCm AMD Ryzen iGPU support

CMAKE_ARGS="-DGGML_HIPBLAS=on" pip install llama-cpp-python

3.3 Install llama-cpp-python for CPU-only (no GPU)

pip install llama-cpp-python

4. Install remaining dependencies

pip install -r requirements.txt

5. Place your AIs

mkdir -p ai
# Copy or move your .gguf files into ai/
ls ai/

6. Run

python main.py

Installation β€” Windows

1. Install Python

Download Python 3.11 or 3.12 from python.org. During installation, check "Add Python to PATH".

Verify in PowerShell:

python --version

2. Install Git

Download from git-scm.com and install with default settings.

3. Clone the repo

Open PowerShell:

git clone https://huggingface.co/AIMindLink/lambda-mindlink-memotron
cd lambda-mindlink-memotron

4. Create a virtual environment

python -m venv .venv
.venv\Scripts\Activate.ps1

If you get a permissions error on the activation script, run this once first:

Set-ExecutionPolicy -ExecutionPolicy RemoteSigned -Scope CurrentUser

Your prompt should now show (.venv) at the start.

5. Install llama-cpp-python with CUDA support

First, check your CUDA version:

nvcc --version

Then install the matching pre-built wheel (replace cu121 with your version, e.g. cu118, cu122):

pip install llama-cpp-python --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cu121

For CPU-only:

pip install llama-cpp-python

6. Install remaining dependencies

pip install -r requirements.txt

7. Place your AIs

Create the ai folder inside the repo and copy your .gguf files into it:

mkdir ai
# Copy your .gguf files into the ai\ folder

8. Run

python main.py

To deactivate the virtual environment when done:

deactivate


Slash Commands

Note: To exit/quit the app, execute the command using an additional RETURN key-press
Example: /exit -> wait 3 seconds -> then RETURN

Command Description
/file <path> Load a file as the next message
/paste Multiline input β€” type END on its own line to send
/clear Reset conversation history (AIs stay loaded)
/history List all past sessions from the database
/session <id> Print all turns from a session
/export <id> <file> Export a session to a .md file
/metatron <number> Set number of Memory Capsules to load
/loaded <number> Set number of Memory Capsules loaded
/metronome <seconds> Set awareness/consciousness interval
/garden <save> or <load> or <clear> garden history handling
/help Show the command list
/exit or /quit Quit the app

Configuration

All settings are in config.py:

# ── AI to load for each hemisphere ───────────────────────────────────────────────
_ALPHA_INTELLIGENCE_TO_LOAD: dict = {
    "logic": "gemma-4-E2B-it-UD-Q4_K_XL.gguf",
    "muse":  "gemma-4-E2B-it-UD-Q4_K_XL.gguf",
    "mind":  "gemma-4-E2B-it-UD-Q4_K_XL.gguf"
}
# ── Startup Memory restore for vector synthesis ──────────────────────────────────
METATRON_METRONOME: int = 60 # Startup Memory Capsules load interval
n_metatron_to_load = 0 # Set number of Memory Capsules to load (slash-command)
n_metatron_loaded = 0 # Start with n Memory Capsule to load (slash-command)

# ── Context model n_ctx length ───────────────────────────────────────────────────
# Must leave prompt reserve of 8k: _N_CTX >= len(Z) + len(C) + len(F) + 8k  
_N_CTX: int = 49152 # 49152 2048 3072 4096 8192 (12288) 16384 24576 32768 49152
# ── Context condensatron garden ──────────────────────────────────────────────────
GARDEN_Z_THRESHOLD: int = 12288 # Context length garden["Z"]
GARDEN_C_THRESHOLD: int = 12288 # Context length garden["C"]
GARDEN_F_THRESHOLD: int = 12288 # Context length garden["F"]

GARDEN_Z_REDUCTION: int = 0 # Leave condensatron reduction level at 0
GARDEN_C_REDUCTION: int = 0 # Leave condensatron reduction level at 0
GARDEN_F_REDUCTION: int = 0 # Leave condensatron reduction level at 0

LEAVE_POSTS_IN_MEMOTRON = 0 # Must be turn based: 0, 2, 4, 6... (user + assistant)

# ── X-factor Awareness ───────────────────────────────────────────────────────────
FETCH_NEWS_FROM: dict = {
    "google": True, # Better news and cleaner result summaries
    "duckduckgo": False # Privacy based request but lean result summaries
}
ΞœΞ•Ξ€Ξ‘Ξ©Ξ: float = 1.0 # Seconds per measure
AWARENESS_CONSCIOUSNESS_METRONOME = 120  # Fetch news every N heartbeats (runtime-editable via /metronome)
AWARENESS_MAX_RESULTS: int = 12 # Number of news headlines to fetch
was_awareness_metronome: bool = False # Set True at awareness cycle: consciousness at next interval

To swap AIs, update the "_ALPHA_INTELLIGENCE_TO_LOAD", and the stop/think tokens at the top of config.py.


Folder structure

lambda-mindlink-memotron/
β”œβ”€β”€ .gitignore
β”œβ”€β”€ db/
β”œβ”€β”€ image/
β”œβ”€β”€ ai/
β”œβ”€β”€ ai-readme/
β”œβ”€β”€ prompt/
β”œβ”€β”€ main.py
β”œβ”€β”€ config.py
β”œβ”€β”€ requirements.txt
└── README.md

Memory Architecture

heartbeats_startup timer:
  prompt/valka_memory.md ──► garden["Z"]   (pre-load memory capsules sequentially)

Each turn:
  sensor["Z"] ──► Mindlink + Lambda ──► Memotron ──► garden["Z"]
                                                        β”‚
                                          garden["Z"] full?
                                                        β”‚
                                              Condensatron append into garden["C"]
                                                        β”‚
                                          garden["C"] full?
                                                        β”‚
                                              Condensatron append into garden["F"]
                                                        β”‚
                                          garden["F"] full?
                                                        β”‚
                                              Condensatron append into garden["F"]

if heartbeats:
    if not was_awareness:
        # heartbeats timer global news
        sensor["X"] ──► Mindlink + Lambda ──► Memotron ──► garden["Z"]
    else:
        sensor["Y"] ──► Mindlink + Lambda ──► Memotron ──► garden["Z"]

Database

Each run saves to the SQLite database in db/ named mindlink.db:

db/mindlink.db

Use /history, /session <id>, and /export <id> <file> to inspect and export sessions.


Garden histories handling

Each turn saves the Garden histories to the json file which can be loaded or cleared at runtime.
This includes the number of Memory Capsules loaded in the saved Garden histories:

db/garden_state.json

Use /garden <save>, /garden <load> and /garden <clear>


License

Apache 2.0 β€” see LICENSE.


Citation

@AIMindlink{  
    title  = {lambda-mindlink-memotron},  
    author = {Philipp Wyler, Apprentice, Uncle Zio, Valka Alpha Google Gemini, Una Alpha Anthropic Claude},  
    month  = {June},  
    year   = {2026},  
    url    = {https://huggingface.co/AIMindLink/lambda-mindlink-memotron}  
}  
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support