# Update README.md
📄 **Paper:** [Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL](https://arxiv.org/abs/2512.17053)

---
## Performance
On the **BIRD mini-dev** benchmark, Struct-SQL achieves an **Execution Accuracy (EX) of 45.0%**, outperforming standard unstructured CoT distillation baselines by **8.1 points**.

| Model | Reasoning Format | Execution Accuracy (EX) |
| --- | --- | --- |
| Struct-SQL-4B (ours) | Structured CoT | 45.0% |
| FN-Gold Baseline | No Reasoning (SQL Only) | 34.3% |
| Base Student (Zero-shot) | None | 17.0% |

---
## Methodology
The model was trained on a curated dataset of **1,000 samples** generated by GPT-4o. The training data consists of:

By forcing the model to explicitly plan the query execution (e.g., "Scan Table", "Filter by...", "Join with..."), the student learns the logical structure of SQL generation rather than simply memorizing surface patterns.
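For illustration, a structured plan of this kind might look like the sketch below. The schema, the question, the step names beyond those quoted above, and the layout are assumptions made for this example, not the exact format used in the training data.

```
Question: How many orders were placed by customers in Berlin in 2023?

Plan:
1. Scan Table: orders
2. Filter by: order_date BETWEEN '2023-01-01' AND '2023-12-31'
3. Join with: customers ON orders.customer_id = customers.id
4. Filter by: customers.city = 'Berlin'
5. Aggregate: COUNT(*)

SQL:
SELECT COUNT(*)
FROM orders AS o
JOIN customers AS c ON o.customer_id = c.id
WHERE c.city = 'Berlin'
  AND o.order_date BETWEEN '2023-01-01' AND '2023-12-31';
```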

---
## Usage
You can use this model with the `transformers` library. To elicit the query plan, format the input with the system prompt and structure the model was trained on. The snippet below is a minimal sketch: the repository ID, prompt layout, and example schema are placeholders, so check the model card for the exact template.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder repository ID: replace with the actual Struct-SQL-4B checkpoint path.
model_id = "Struct-SQL-4B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumes a bf16-capable GPU; use "auto" otherwise
    device_map="auto",
)

# Illustrative prompt layout (schema + question); check the model card for the exact template.
prompt = (
    "### Schema:\n"
    "CREATE TABLE orders (id INTEGER, customer_id INTEGER, order_date TEXT);\n\n"
    "### Question:\n"
    "How many orders were placed in 2023?\n\n"
    "### Query Plan and SQL:\n"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=1200)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
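If the prompt elicits the plan, the decoded output will contain the structured plan as well as the SQL, so for execution you will usually want to pull out just the final query. The model's exact output markers are not documented here, so the helper below simply takes the last `SELECT ...;` statement in the text; treat it as a heuristic and adapt the pattern to the real output format.

```python
import re

def extract_sql(generated_text: str) -> str | None:
    """Return the last `SELECT ...;` statement in the generated text, if any.

    Assumes the final SQL follows the plan and ends with a semicolon;
    this is a heuristic, not the documented output format.
    """
    statements = re.findall(r"SELECT\b.*?;", generated_text, flags=re.IGNORECASE | re.DOTALL)
    return statements[-1] if statements else None

# Example (reusing `outputs` and `tokenizer` from the snippet above):
# sql = extract_sql(tokenizer.decode(outputs[0], skip_special_tokens=True))
```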

---
## Intended Use
Struct-SQL-4B is intended for **research and academic use** in tasks involving **Text-to-SQL generation** and **semantic parsing over relational databases**. The model is particularly suited for studying:

The model is not optimized for direct deployment in production database systems without additional validation and safety constraints.

---
## Limitations
- Evaluation is confined to the SQLite-based BIRD benchmark
- The model may generate logically plausible but incorrect SQL for highly complex multi-hop queries

---
## Citation
```bibtex