-
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Paper • 2309.12307 • Published • 89 -
LMDX: Language Model-based Document Information Extraction and Localization
Paper • 2309.10952 • Published • 67 -
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper • 2310.09263 • Published • 40 -
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 106
kuan li
minlik
AI & ML interests
None yet
Organizations
None yet
LLM
-
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Paper • 2309.12307 • Published • 89 -
LMDX: Language Model-based Document Information Extraction and Localization
Paper • 2309.10952 • Published • 67 -
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper • 2310.09263 • Published • 40 -
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 106
IE
Information Extraction
models 15
minlik/chinese-alpaca-plus-33b-merged
Text Generation • 33B • Updated • 3
minlik/chinese-llama-13b-merged
Text Generation • 13B • Updated • 3 • 6
minlik/chinese-alpaca-pro-33b-merged
Text Generation • 33B • Updated • 5 • 4
minlik/chinese-alpaca-13b-merged
Text Generation • 13B • Updated • 4 • 16
minlik/Qwen2.5-Vl-3B-Instruct-GRPO-deepmath-ocr-7k
4B • Updated • 3
minlik/Qwen2.5-VL-3B-Instruct-GRPO-deepmath-ocr-1k
4B • Updated • 1
minlik/chinese-llama-plus-7b-merged
Text Generation • 7B • Updated • 8 • 8
minlik/chinese-alpaca-7b-merged
Text Generation • 7B • Updated • 8 • 10
minlik/chinese-alpaca-33b-merged
Text Generation • 33B • Updated • 780 • 9
minlik/docllm-yi-6b
Text Generation • 7B • Updated • 398 • 1