Spaces:

CiLprototype
/

CurvOpt-SmarterModels

Running

Update README.md

5f31bde verified 4 days ago

1.29 kB

	---
	title: CurvOpt SmarterModels
	emoji: 📊
	colorFrom: red
	colorTo: red
	sdk: gradio
	sdk_version: 6.6.0
	app_file: app.py
	pinned: false
	license: apache-2.0
	short_description: Smarter Models, Smaller Footprint
	---
	# CurvOpt-LLM — Realtime Optimizer

	Curvature-guided mixed-precision optimization for LLMs. No retraining required.

	## What This Does
	- Loads any HuggingFace causal LM
	- Computes Fisher diagonal curvature per layer (real gradients)
	- Assigns FP32 / FP16 / BF16 per layer based on sensitivity
	- Rewrites and saves a deployable optimized model (downloadable ZIP)
	- Reports electricity, CO₂, and water footprint savings

	## How to Use
	1. Select a model from the dropdown (or enter a custom HF model ID)
	2. Set calibration samples (1–32) and PPL tolerance
	3. Click Run Optimization
	4. Download the optimized model ZIP when done

	## Supported Models
	OPT family · GPT-2 family · Pythia · Phi · BLOOM · Mistral · Llama-2 · Qwen · Falcon · and any `AutoModelForCausalLM` compatible model.

	## Research
	Based on Fisher Information / Optimal Brain Damage curvature analysis.
	Novel contribution: per-request curvature-gated mixed precision with user intent feedback.

	Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference