Instructions to use Brain2nd/NeuronSpark-0.9B-Chat with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use Brain2nd/NeuronSpark-0.9B-Chat with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Brain2nd/NeuronSpark-0.9B-Chat", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("Brain2nd/NeuronSpark-0.9B-Chat", trust_remote_code=True, dtype="auto")

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use Brain2nd/NeuronSpark-0.9B-Chat with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Brain2nd/NeuronSpark-0.9B-Chat"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Brain2nd/NeuronSpark-0.9B-Chat",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/Brain2nd/NeuronSpark-0.9B-Chat

SGLang

How to use Brain2nd/NeuronSpark-0.9B-Chat with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "Brain2nd/NeuronSpark-0.9B-Chat" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Brain2nd/NeuronSpark-0.9B-Chat",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "Brain2nd/NeuronSpark-0.9B-Chat" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Brain2nd/NeuronSpark-0.9B-Chat",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use Brain2nd/NeuronSpark-0.9B-Chat with Docker Model Runner:
```
docker model run hf.co/Brain2nd/NeuronSpark-0.9B-Chat
```

NeuronSpark-0.9B-Chat / atomic_ops /plif_node.py

Brain2nd

Initial release: NeuronSpark-0.9B-Chat instruction-tuned SNN language model

440e322 verified 2 months ago

raw

history blame contribute delete

2.72 kB

	"""
	PLIFNode: D 维固定参数 PLIF 神经元（设计文档 5.5 "普通 SNN 神经元"）

	与 SelectivePLIFNode 的区别：
	SelectivePLIF: β(t), α(t), V_th(t) 由输入每步动态计算（选择性记忆）
	PLIFNode: β, V_th 为 D 维可学习参数，训练后固定（信号转换）

	每个维度有独立的可学习参数：
	β_d = sigmoid(w_d): 时间常数（衰减率）
	V_th_d: 发放阈值

	动力学（与 ParametricLIF 一致）：
	V[t] = β · V[t-1] + (1-β) · x[t]
	s[t] = Θ(V[t] - V_th) (surrogate gradient)
	V[t] -= V_th · s[t] (soft reset)
	"""

	import math

	import torch
	import torch.nn as nn
	from spikingjelly.activation_based import base, surrogate


	class PLIFNode(base.MemoryModule):
	"""
	D 维固定参数 PLIF 神经元。

	Args:
	dim: 神经元数量（每个维度独立参数）
	init_tau: 初始时间常数 τ（β = 1 - 1/τ）
	v_threshold: 初始发放阈值
	surrogate_function: surrogate gradient 函数
	"""

	def __init__(
	self,
	dim: int,
	init_tau: float = 2.0,
	v_threshold: float = 0.5,
	surrogate_function=surrogate.Sigmoid(alpha=4.0),
	):
	super().__init__()
	# D 维可学习参数（随机初始化，每个维度独立）
	# w: 控制 β=sigmoid(w)，随机产生不同时间常数
	# init_w ± 0.5 → β ∈ ~[sigmoid(w-0.5), sigmoid(w+0.5)]
	# tau=2.0 时 w=0, β ∈ ~[0.38, 0.62]
	init_w = -math.log(init_tau - 1.0)
	self.w = nn.Parameter(torch.empty(dim).normal_(init_w, 0.5))
	# v_th: 发放阈值，U[0.5x, 1.5x] 均匀分布产生维度间多样性
	self.v_th = nn.Parameter(torch.empty(dim).uniform_(
	v_threshold * 0.5, v_threshold * 1.5,
	))
	self.surrogate_function = surrogate_function
	# 膜电位状态（functional.reset_net 时重置为 0.）
	self.register_memory('v', 0.)

	@property
	def beta(self):
	"""D 维衰减率 β = sigmoid(w)，值域 (0, 1)。"""
	return torch.sigmoid(self.w)

	def forward(self, x):
	"""
	单步前向传播。

	V[t] = β · V[t-1] + (1-β) · x[t], spike = Θ(V-V_th), soft reset。

	Args:
	x: 输入电流, shape (batch, dim)

	Returns:
	spike: 二值脉冲, shape (batch, dim), 值域 {0, 1}
	"""
	if isinstance(self.v, float):
	self.v = torch.zeros_like(x)
	beta = self.beta
	self.v = beta * self.v + (1.0 - beta) * x
	spike = self.surrogate_function(self.v - self.v_th)
	self.v = self.v - spike * self.v_th # soft reset
	return spike